DNA, Words and Models


Statistics of Exceptional Words
S. Robin, Institut National de la Recherche Agronomique (INRA), Paris
F. Rodolphe, Institut National de la Recherche Agronomique (INRA), Paris
S. Schbath, Institut National de la Recherche Agronomique (INRA), Paris
November 2005
Hardback
9780521847292
$92.99
USD

    An important problem in computational biology is identifying short DNA sequences (mathematically, 'words') associated with a biological function. One approach consists of determining whether the occurrences of a particular word are consistent with chance or are statistically significant, for example, because of the word's frequency or location. This book introduces the mathematical and statistical ideas used in solving this so-called exceptional word problem. It begins with a detailed description of the principal models used in sequence analysis: Markovian models are central here and capture compositional information on the sequence being analysed. There follows an introduction to several statistical methods used for finding words that are exceptional with respect to the chosen model. The second half of the book is illustrated with numerous examples drawn from the analysis of bacterial genomes, making this a practical guide for users who face a real situation and need to choose an appropriate procedure.

    • Practical guide for biologists and bioinformaticians
    • Statistical analysis of DNA sequences
    • Illustrated with numerous examples provided from the analysis of several bacterial genomes

    Reviews & endorsements

    "For statisticians with a little background in biology, this book delivers a very readable presentation on the analysis of DNA sequences to determine whether a motif is of statistical significance due to its overabundance (or underabundance) in terms of frequencies or location. This book is concise but sufficiently detailed. Biologists without a background in mathematical statistics may find the learning curve a little steep but tractable. The authors' continuous use of practical examples will be greatly appreciated by biologists and statisticians interested in learning about DNA sequences and motifs."
    J. Wade Davis, University of Missouri-Columbia, Journal of the American Statistician

    "... A welcome introduction to word analysis..."
    Daniel M. Burns, Jr., Mathematical Reviews

    Product details

    November 2005
    Hardback
    9780521847292
    158 pages
    235 × 156 × 16 mm
    0.391kg
    28 b/w illus. 14 tables
    Available

    Table of Contents

    • Introduction
    • 1. Simple models for biological sequences
    • 2. Introduction to Markov chain models
    • 3. Taking heterogeneities into account
    • 4. Statistical properties of word occurrences
    • 5. Words with unexpected frequencies
    • 6. Words with unexpected locations.