Genome-Scale Algorithm Design
High-throughput sequencing has revolutionised the field of biological sequence analysis. Its application has enabled researchers to address important biological questions, often for the first time. This book provides an integrated presentation of the fundamental algorithms and data structures that power modern sequence analysis workflows. The topics covered range from the foundations of biological sequence analysis (alignments and hidden Markov models), to classical index structures (k-mer indexes, suffix arrays and suffix trees), Burrows–Wheeler indexes, graph algorithms and a number of advanced omics applications. The chapters feature numerous examples, algorithm visualisations, exercises and problems, each chosen to reflect the steps of large-scale sequencing projects, including read alignment, variant calling, haplotyping, fragment assembly, alignment-free genome comparison, transcript prediction and analysis of metagenomic samples. Each biological problem is accompanied by precise formulations, providing graduate students and researchers in bioinformatics and computer science with a powerful toolkit for the emerging applications of high-throughput sequencing.
- Provides an integrated picture of the fundamental algorithms and data structures that power modern sequence analysis, covering a range of topics including foundations, classical index structures and Burrows–Wheeler indexes
- Chapters feature numerous examples, algorithm visualisations, problems and end-of-chapter exercises, providing students with a powerful toolkit for the emerging applications of high-throughput sequencing
- Presents only the minimum data structures necessary so that students are not burdened with technical results and can also focus on more conceptual algorithm design questions
Reviews & endorsements
'Genome-Scale Algorithm Design is a well-thought-out … book that fills a gap in the recent literature … [on algorithms] for bioinformatics. It offers a sound, clear, and rich overview of computer science methods for the challenge of today's biological sequence analysis. I [recommend] it to students as well as to researchers in the field.' Nadia Pisanti, University of Pisa
'This book will be a new reference text for string data structures and algorithms, as well as their applications in computational molecular biology and genome analysis. The technically precise style, illustrated with a great collection of well-designed examples and many exercises, makes it an ideal resource for researchers, students, and teachers.' Jens Stoye, Universität Bielefeld
'I think the book is really great and could envision using it in courses in bioinformatics and data compression. The book's scope, clarity, and mathematically precise, compelling explanations make the advanced topics in genome-wide bioinformatics accessible to [a] wide audience.' Christina Boucher, Colorado State University
'This book is a timely, rigorous and comprehensive systematization of the concepts and tools at the core of post-genome bioinformatics. By choosing to incorporate the principles of algorithms design most pertinent to the topic, the authors have created a rare, self-contained reference that will smoothly introduce the neophyte and assist the seasoned researcher as well as anybody who needs to understand these issues at a fundamental level. The organization of the material, the clarity of exposition and rigor of treatment make the book an ideal textbook for courses directed at a mixed audience coming from diverse, even distant backgrounds.' Alberto Apostolico, Georgia Institute of Technology
'A very timely textbook covering many recently established techniques and applications at a level of rigor befitting a computer science audience.' Paul Medvedev, Pennsylvania State University
Product details
May 2015Adobe eBook Reader
9781316342978
0 pages
0kg
90 b/w illus. 4 tables 269 exercises
This ISBN is for an eBook version which is distributed on our behalf by a third party.
Table of Contents
- Notation
- Preface
- Part I. Preliminaries:
- 1. Molecular biology and high-throughput sequencing
- 2. Algorithm design
- 3. Data structures
- 4. Graphs
- 5. Network flows
- Part II. Fundamentals of Biological Sequence Analysis:
- 6. Alignments
- 7. Hidden Markov models (HMMs)
- Part III. Genome-Scale Index Structures:
- 8. Classical indexes
- 9. Burrows–Wheeler indexes
- Part IV. Genome-Scale Algorithms:
- 10. Read alignment
- 11. Genome analysis and comparison
- 12. Genome compression
- 13. Fragment assembly
- Part V. Applications:
- 14. Genomics
- 15. Transcriptomics
- 16. Metagenomics
- References
- Index.