Our systems are now restored following recent technical disruption, and we’re working hard to catch up on publishing. We apologise for the inconvenience caused. Find out more

Recommended product

Popular links

Popular links


Machine Learning for Speaker Recognition

Machine Learning for Speaker Recognition

Machine Learning for Speaker Recognition

Man-Wai Mak, The Hong Kong Polytechnic University
Jen-Tzung Chien, National Chiao Tung University, Taiwan
January 2021
Hardback
9781108428125
$117.00
USD
Hardback
eBook

    This book will help readers understand fundamental and advanced statistical models and deep learning models for robust speaker recognition and domain adaptation. This useful toolkit enables readers to apply machine learning techniques to address practical issues, such as robustness under adverse acoustic environments and domain mismatch, when deploying speaker recognition systems. Presenting state-of-the-art machine learning techniques for speaker recognition and featuring a range of probabilistic models, learning algorithms, case studies, and new trends and directions for speaker recognition based on modern machine learning and deep learning, this is the perfect resource for graduates, researchers, practitioners and engineers in electrical engineering, computer science and applied mathematics.

    • Presents the inference procedures from maximum likelihood to approximate Bayesian for linear and non-linear probabilistic models based on different types of latent variables
    • Features comprehensive treatments of noise robustness and domain adaptation in speaker recognition
    • Provides in-depth coverage of deep learning models, ranging from deep neural networks, and deep belief networks to variational autoencoders and generative adversarial networks, for feature representation and data augmentation in speaker recognition

    Reviews & endorsements

    'There is a need for an accessible textbook to help newcomers to enter the field [of automatic speaker recognition]. Machine Learning for Speaker Recognition by Man-Wai Mak and Jen-Tzung Chien serves such a need. Both authors are highly seasoned in the field. They cover both fundamental techniques and state-of-the-art methods at an accessible level using the language of modern probabilistic machine learning. The authors cover different components of speaker recognition systems including feature extraction, back-end modeling and scoring, along with various case studies. The book is well suited for the needs of graduate students and researchers in electrical engineering and computer science, along with practitioners. Apart from basic prerequisites in calculus, linear algebra, probabilities and statistics, the textbook provides a coherent and self-contained journey into what modern automatic speaker recognition is about.' Tomi Kinnunen, University of Eastern Finland

    'The topical coverage is spot-on, and the text discusses many key algorithms that support statistical learning approaches, including hybrid models, deep learning classification, and generative methods. In addition, the authors provide a deep mathematical exploration into versions of algorithms, optimization approaches, and domain adaptation statistics within the context of signal processing. The extensive diagrams, linear algebra notation, and mathematical calculus machinery will support developers who are building new implementations or need to look under the hood of existing systems. Highly Recommended.' J. Brzezinski, Choice

    See more reviews

    Product details

    February 2021
    Adobe eBook Reader
    9781108566094
    0 pages
    133 b/w illus. 4 tables
    This ISBN is for an eBook version which is distributed on our behalf by a third party.

    Table of Contents

    • Part I. Fundamental Theories:
    • 1. Introduction
    • 2. Learning algorithms
    • 3. Machine learning models
    • Part II. Advanced Studies:
    • 4. Deep learning models
    • 5. Robust speaker verification
    • 6. Domain adaptation
    • 7. Dimension reduction and data augmentation
    • 8. Future direction
    • Index.
    Resources for
    Type
    PPT.zip
    Size: 10.74 MB
    Type: application/zip
    JPEG.zip
    Size: 46.51 MB
    Type: application/zip
    Online_Resources.zip
    Size: 67.05 MB
    Type: application/zip