This book provides a comprehensive overview of speech enhancement techniques, focusing on the improvement of speech signals degraded by additive noise. It is structured into three main parts: Fundamentals, Algorithms, and Evaluation. The book is written by Philipos C. Loizou, a professor in the Department of Electrical Engineering at the University of Texas–Dallas, and is intended for graduate-level students and researchers in the field of speech processing. The book includes a DVD-ROM with a speech corpus and MATLAB code for implementing major speech enhancement algorithms.
The book begins with an introduction to speech enhancement, discussing the nature of noise and its impact on speech quality and intelligibility. It then covers the fundamentals of discrete-time signal processing and short-time Fourier analysis, which are essential for understanding speech enhancement algorithms. The next section presents various classes of speech enhancement algorithms, including spectral-subtractive algorithms, statistical-model-based algorithms, and subspace algorithms. Each of these classes is discussed in detail, along with their strengths and limitations.
The third part of the book focuses on the evaluation of speech enhancement algorithms. It discusses various methods for assessing the quality and intelligibility of enhanced speech, including subjective listening tests and objective measures such as segmental SNR and PESQ. The book also provides a comprehensive comparison of different speech enhancement algorithms, highlighting their performance in terms of quality and intelligibility.
The book is accompanied by a DVD-ROM containing a speech corpus and MATLAB code for implementing major speech enhancement algorithms. This resource is invaluable for researchers and students who wish to experiment with and understand the practical applications of speech enhancement techniques. The book is also suitable as a textbook for a one-semester graduate-level course on speech enhancement, with necessary prerequisites including a course on digital signal processing and fundamental knowledge of probability theory, random variables, and linear algebra.This book provides a comprehensive overview of speech enhancement techniques, focusing on the improvement of speech signals degraded by additive noise. It is structured into three main parts: Fundamentals, Algorithms, and Evaluation. The book is written by Philipos C. Loizou, a professor in the Department of Electrical Engineering at the University of Texas–Dallas, and is intended for graduate-level students and researchers in the field of speech processing. The book includes a DVD-ROM with a speech corpus and MATLAB code for implementing major speech enhancement algorithms.
The book begins with an introduction to speech enhancement, discussing the nature of noise and its impact on speech quality and intelligibility. It then covers the fundamentals of discrete-time signal processing and short-time Fourier analysis, which are essential for understanding speech enhancement algorithms. The next section presents various classes of speech enhancement algorithms, including spectral-subtractive algorithms, statistical-model-based algorithms, and subspace algorithms. Each of these classes is discussed in detail, along with their strengths and limitations.
The third part of the book focuses on the evaluation of speech enhancement algorithms. It discusses various methods for assessing the quality and intelligibility of enhanced speech, including subjective listening tests and objective measures such as segmental SNR and PESQ. The book also provides a comprehensive comparison of different speech enhancement algorithms, highlighting their performance in terms of quality and intelligibility.
The book is accompanied by a DVD-ROM containing a speech corpus and MATLAB code for implementing major speech enhancement algorithms. This resource is invaluable for researchers and students who wish to experiment with and understand the practical applications of speech enhancement techniques. The book is also suitable as a textbook for a one-semester graduate-level course on speech enhancement, with necessary prerequisites including a course on digital signal processing and fundamental knowledge of probability theory, random variables, and linear algebra.