Our systems are now restored following recent technical disruption, and we’re working hard to catch up on publishing. We apologise for the inconvenience caused. Find out more

Recommended product

Popular links

Popular links


Statistical Learning for Biomedical Data

Statistical Learning for Biomedical Data

Statistical Learning for Biomedical Data

James D. Malley, National Institutes of Health, Maryland
Karen G. Malley, Malley Research Programming, Maryland
Sinisa Pajevic, National Institutes of Health, Maryland
February 2011
Available
Paperback
9780521699099

    This book is for anyone who has biomedical data and needs to identify variables that predict an outcome, for two-group outcomes such as tumor/not-tumor, survival/death, or response from treatment. Statistical learning machines are ideally suited to these types of prediction problems, especially if the variables being studied may not meet the assumptions of traditional techniques. Learning machines come from the world of probability and computer science but are not yet widely used in biomedical research. This introduction brings learning machine techniques to the biomedical world in an accessible way, explaining the underlying principles in nontechnical language and using extensive examples and figures. The authors connect these new methods to familiar techniques by showing how to use the learning machine models to generate smaller, more easily interpretable traditional models. Coverage includes single decision trees, multiple-tree techniques such as Random Forests™, neural nets, support vector machines, nearest neighbors and boosting.

    • Free open-source computer code is available online
    • Brings valuable new ideas from probability and computer science into the biomedical world to provide more accurate predictions
    • Plain-language approach makes the techniques more accessible

    Reviews & endorsements

    'The book is well written and provides nice graphics and numerous applications.' Michael R. Chernick, Technometrics

    See more reviews

    Product details

    February 2011
    Paperback
    9780521699099
    298 pages
    245 × 175 × 11 mm
    0.6kg
    47 b/w illus. 25 tables
    Available

    Table of Contents

    • Preface
    • Acknowledgements
    • Part I. Introduction:
    • 1. Prologue
    • 2. The landscape of learning machines
    • 3. A mangle of machines
    • 4. Three examples and several machines
    • Part II. A Machine Toolkit:
    • 5. Logistic regression
    • 6. A single decision tree
    • 7. Random forests – trees everywhere
    • Part III. Analysis Fundamentals:
    • 8. Merely two variables
    • 9. More than two variables
    • 10. Resampling methods
    • 11. Error analysis and model validation
    • Part IV. Machine Strategies:
    • 12. Ensemble methods – let's take a vote
    • 13. Summary and conclusions
    • References
    • Index.
      Authors
    • James D. Malley , National Institutes of Health, Maryland

      James D. Malley is a Research Mathematical Statistician in the Mathematical and Statistical Computing Laboratory, Division of Computational Bioscience, Center for Information Technology, at the National Institutes of Health.

    • Karen G. Malley , Malley Research Programming, Maryland

      Karen G. Malley is president of Malley Research Programming, Inc. in Rockville, Maryland, providing statistical programming services to the pharmaceutical industry and the National Institutes of Health. She also serves on the global council of the Clinical Data Interchange Standards Consortium (CDISC) user network, and the steering committee of the Washington, DC area CDISC user network.

    • Sinisa Pajevic , National Institutes of Health, Maryland

      Sinisa Pajevic is a Staff Scientist in the Mathematical and Statistical Computing Laboratory, Division of Computational Bioscience, Center for Information Technology, at the National Institutes of Health.