Title Speech and Audio Signal Processing : Processing and Perception of Speech and Music.
Subject Speech processing systems.
Signal processing -- Digital techniques.
Electronic music.
Contents Introduction -- Synthetic audio : a brief history -- Speech analysis and synthesis overview -- Brief history of automatic speech recognition -- Speech recognition overview -- Digital signal processing -- Digital filters and discrete Fourier transform -- Pattern classification -- Statistical pattern classification -- Wave basics -- Acoustic tube modeling of speech production -- Musical instrument acoustics -- Room acoustics -- Ear physiology -- Psychoacoustics -- Models of pitch perception -- Speech perception -- Human speech recognition -- The auditory system as a filter bank -- The cepstrum as a spectral analyzer -- Linear prediction -- Feature extraction for ASR -- Linguistic categories for speech recognition -- Deterministic sequence recognition for ASR -- Statistical sequence recognition -- Statistical model training -- Discriminant acoustic probability estimation -- Acoustic model training : further topics -- Speech recognition and understanding -- Speech synthesis -- Pitch detection -- Vocoders -- Low-rate vocoders -- Medium rate and high rate vocoders -- Perceptual audio coding -- Some aspects of computer music synthesis -- Music signal analysis -- Music retrieval -- Source separation -- Speech transformations -- Speaker verification -- Speaker diarization.
Summary When Speech and Audio Signal Processing was published in 1999, it stood out from its competition in its breadth of coverage and its accessible, intuition-based style. The book was aimed at individual students and engineers excited about the broad span of audio processing and curious to understand the available techniques. Since then, with the advent of the iPod in 2001, the field of digital audio and music has exploded, leading to a much greater interest in the technical aspects of audio processing. This Second Edition will update and revise the original book to augment it with new material describing both the enabling technologies of digital music distribution (most significantly the MP3) and a range of exciting new research areas in automatic music content processing (such as automatic transcription and music similarity) that have emerged in the past five years, driven by the digital music revolution. New chapter topics include: Psychoacoustic Audio Coding, describing MP3 and related audio coding schemes based on psychoacoustic masking of quantization noise; Music Transcription, including automatically deriving notes, beats, and chords from music signals; Music Information Retrieval, primarily focusing on audio-based genre classification, artist/style identification, and similarity estimation; and Audio Source Separation, including multi-microphone beamforming, blind source separation, and the perception-inspired techniques usually referred to as Computational Auditory Scene Analysis (CASA).
Other Author Morgan, Nelson, author.
Ellis, Dan, author.