Digital Speech Processing : Synthesis, and Recognition, Second Edition.

Bibliographic Details
Main Author: Furui, Sadaoki
Corporate Author: Taylor & Francis
Format: eBook
Language:English
Published: Boca Raton : Chapman and Hall/CRC, 2017.
Edition:2nd ed.
Series:Signal Processing and Communications Ser.
Subjects:
Online Access:Connect to the full text of this electronic book
Table of Contents:
  • Cover; Half title; Series Editor; Title; Copyrights; Series Introduction; Preface to the Second Edition; Acknowledgments; Preface to the First Edition; Contents; Chapter 1. Introduction; Chapter 2. PRINCIPAL CHARACTERISTICS OF SPEECH; 2.1 Linguistic Information; 2.2 Speech and Hearing; 2.3 Speech Production Mechanism; 2.4 Acoustic Characteristics of Speech; 2.5 Statistical Characteristics of Speech; 2.5.1 Distribution of amplitude level; 2.5.2 Long-time averaged spectrum; 2.5.3 Variation in fundamental frequency; 2.5.4 Speech ratio; Chapter 3. SPEECH PRODUCTION MODELS.
  • 3.1 Acoustical Theory of Speech Production3.2 Linear Separable Equivalent Circuit Model; 3.3 Vocal Tract Transmission Model; 3.3.1 Progressing wave model; 3.3.2 Resonance model; 3.4 Vocal Cord Model; Chapter 4. SPEECH ANALYSIS AND ANAL YSIS-SYNTHESIS SYSTEMS; 4.1 Digitization; 4.1.1 Sampling; 4.1.2 Quantization and coding; 4.1.3 AjD and DjA conversion; 4.2 Spectral Analysis; 4.2.1 Spectral structure of speech; 4.2.2 Autocorrelation and Fourier transform; 4.2.3 Window function; 4.2.4 Sound spectrogram; 4.3 Cepstrum; 4.3.1 Cepstrum and its application.
  • 4.3.2 Homomorphic analysis and LPC cepstrum4.4 Filter Bank and Zero-Crossing Analysis; 4.4.1 Digital filter bank; 4.4.2 Zero-crossing analysis; 4.5 Analysis-by-Synthesis; 4.6 Analysis-Synthesis Systems; 4.6.1 Analysis-synthesis system structure; 4.6.2 Examples of analysis-synthesis systems; 4.7 Pitch Extraction; Chapter 5. LINEAR PREDICTIVE CODING (LPC) ANALYSIS; 5.1 Principles of LPC Analysis; 5.2 LPC Analysis Procedure; 5.3 Maximum Likelihood Spectral Estimation; 5.3.1 Formulation of maximum likelihood spectral estimation; 5.3.2 Physical meaning of maximum likelihood spectral estimation.
  • 5.4 Source Parameter Estimation from Residual Signals5.5 Speech Analysis-Synthesis System by LPC; 5.6 PARCOR Analysis; 5.6.1 Formulation of PARCOR analysis; 5.6.2 Relationship between PARCOR and LPC coefficients; 5.6.3 PARCOR synthesis filter; 5.6.4 Vocal tract area estimation based on PARCOR analysis; 5.7 Line Spectrum Pair (LSP) Analysis; 5.7.1 Principle of LSP analysis; 5.7.2 Solution of LSP analysis; 5.7.3 LSP synthesis filter; 5.7.4 Coding of LSP parameters; 5.7.5 Composite sinusoidal model; 5.7.6 Mutual relationships between LPC parameters; 5.8 Pole-Zero Analysis.
  • Chapter 6 SPEECH CODING6.1 Principal Techniques for Speech Coding; 6.1.1 Reversible coding; 6.1.2 Irreversible coding and information rate distortion theory; 6.1.3 Waveform coding and analysis-synthesis systems; 6.1.4 Basic techniques for waveform coding methods; 6.2 Coding in Time Domain; 6.2.1 Pulse code modulation (PCM); 6.2.2 Adaptive quantization; 6.2.3 Predictive coding; 6.2.4 Delta modulation; 6.2.5 Adaptive differential PCM (ADPCM); 6.2.6 Adaptive predictive coding (APC); 6.2.7 Noise shaping; 6.3 Coding in Frequency Domain; 6.3.1 Subband coding (SBC).