DFT-domain based single-microphone noise reduction for speech enhancement : a survey of the state-of-the-art /

Bibliographic Details
Main Author:	Hendriks, Richard C. (Richard Christian)
Other Authors:	Gerkmann, Timo, Jensen, Jesper (College teacher)
Format:	eBook
Language:	English
Published:	San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) : Morgan & Claypool, [2013]
Series:	Synthesis lectures on speech and audio processing ; #11.
Subjects:	Speech processing systems > Noise. COMPUTERS / Optical Data Processing. single-microphone speech enhancement noise reduction Electronic books.
Online Access:	Connect to the full text of this electronic book

MARC

Tag	First Indicator	Second Indicator	Subfields
LEADER	00000cam a2200000Ia 4500
001	in00003338121
005	20190404202455.0
006	m eo d
007	cr cn\|\|\|\|m\|\|\|a
008	130218s2013 caua fob 000 0 eng d
020			\|a 9781627051446 (electronic bk.)
020			\|a 1627051449 (electronic bk.)
020			\|z 9781627051439 (pbk.)
024	7		\|a 10.2200/S00473ED1V01Y201301SAP011 \|2 doi
035			\|a (OCoLC)ocn827937142
035			\|a (OCoLC)827937142
040			\|a CaBNvSL \|c J2I \|d J2I \|d WAU \|d OCLCO \|d N$T \|d E7B \|d AZS \|d UtOrBLW
049			\|a TXAM
050		4	\|a TK7882.S65 \|b H453 2013
072		7	\|a COM \|x 047000 \|2 bisacsh
082	0	4	\|a 006.454 \|2 23
100	1		\|a Hendriks, Richard C. \|q (Richard Christian)
245	1	0	\|a DFT-domain based single-microphone noise reduction for speech enhancement : \|b a survey of the state-of-the-art / \|c Richard C. Hendriks, Timo Gerkmann, Jesper Jensen.
246	3		\|a Discrete Fourier transform-domain based single-microphone noise reduction for speech enhancement
264		1	\|a San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) : \|b Morgan & Claypool, \|c [2013]
264		4	\|c ©2013
300			\|a 1 online resource (xii, 70 pages) : \|b illustrations
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
490	1		\|a Synthesis lectures on speech and audio processing, \|x 1932-1678 ; \|v #11
500			\|a Part of: Synthesis digital library of engineering and computer science.
588			\|a Description based on online resource; title from PDF t.p. (Morgan & Claypool, viewed on February 17, 2013).
500			\|a Series from website.
504			\|a Includes bibliographical references (pages 57-67).
505	0		\|a Acknowledgments -- Glossary -- 1. Introduction --
505	8		\|a 2. Single channel speech enhancement-general principles -- 2.1 Analysis-modification-synthesis (AMS) system -- 2.2 Finding the target estimate -- 2.3 A priori knowledge and assumptions -- 2.3.1 Taking speech signal characteristics into account -- 2.3.2 Taking noise process characteristics into account -- 2.3.3 Taking the human auditory system into account --
505	8		\|a 3. DFT-based speech enhancement methods-signal model and notation --
505	8		\|a 4. Speech DFT estimators -- 4.1 Statistical modeling assumptions -- 4.2 Spectral subtraction -- 4.3 Linear MMSE estimators -- 4.4 Non-linear MMSE estimators --
505	8		\|a 5. Speech presence probability estimation -- 5.1 A posteriori speech presence probability -- 5.2 Estimation of the model parameter H1 -- 5.2.1 Short-term adaptive estimate -- 5.2.2 Fixed optimal H1 -- 5.3 Choosing the prior probabilities -- 5.3.1 Adaptive prior probabilities -- 5.3.2 Fixed prior probabilities -- 5.4 Avoiding outliers --
505	8		\|a 6. Noise PSD estimation -- 6.1 Methods based on VAD -- 6.2 Methods based on minimum power level tracking -- 6.3 SPP-based noise PSD estimation -- 6.4 MMSE-based estimation of the noise PSD -- 6.5 DFT-subspace estimation of the noise PSD --
505	8		\|a 7. Speech PSD estimation -- 7.1 Maximum likelihood estimation and decision-directed approach -- 7.2 Kalman-type filtering, Garch modeling, and noncausal estimation -- 7.3 Temporal cepstrum smoothing -- 7.4 Comparison of the estimators --
505	8		\|a 8. Performance evaluation methods -- 8.1 Evaluating quality aspects of enhanced speech -- 8.1.1 Listening tests -- 8.1.2 Instrumental test methods -- 8.2 Evaluating intelligibility of enhanced speech -- 8.2.1 Listening tests -- 8.2.2 Instrumental test methods --
505	8		\|a 9. Simulation experiments with single-channel enhancement systems --
505	8		\|a 10. Future directions -- References -- Authors' biographies.
520	3		\|a As speech processing devices like mobile phones, voice controlled devices, and hearing aids have increased in popularity, people expect them to work anywhere and at any time without user intervention. However, the presence of acoustical disturbances limits the use of these applications, degrades their performance, or causes the user difficulties in understanding the conversation or appreciating the device. A common way to reduce the effects of such disturbances is through the use of single-microphone noise reduction algorithms for speech enhancement. The field of single-microphone noise reduction for speech enhancement comprises a history of more than 30 years of research. In this survey, we wish to demonstrate the significant advances that have been made during the last decade in the field of discrete Fourier transform domain-based single-channel noise reduction for speech enhancement. Furthermore, our goal is to provide a concise description of a state-of-the-art speech enhancement system, and demonstrate the relative importance of the various building blocks of such a system. This allows the non-expert DSP practitioner to judge the relevance of each building block and to implement a close-to-optimal enhancement system for the particular application at hand.
500			\|a Electronic resource.
650		0	\|a Speech processing systems \|x Noise.
650		7	\|a COMPUTERS / Optical Data Processing. \|2 bisacsh
653			\|a single-microphone
653			\|a speech enhancement
653			\|a noise reduction
655		7	\|a Electronic books. \|2 local
700	1		\|a Gerkmann, Timo.
700	1		\|a Jensen, Jesper \|c (College teacher)
730	0		\|a Synthesis digital library of engineering and computer science.
776	0	8	\|i Print version: \|z 9781627051439
830		0	\|a Synthesis lectures on speech and audio processing ; \|v #11. \|x 1932-121X
856	4	0	\|u http://proxy.library.tamu.edu/login?url=http://www.morganclaypool.com/doi/abs/10.2200/S00473ED1V01Y201301SAP011 \|z Connect to the full text of this electronic book \|t 0
994			\|a 92 \|b TXA
999			\|a MARS
999	f	f	\|s 95bc8022-5476-334d-af0f-322d01da7601 \|i 0a4ea72e-9f3d-39df-9832-5b832d9b8e01 \|t 0
952	f	f	\|a Texas A&M University \|b College Station \|c Electronic Resources \|d Available Online \|t 0 \|e TK7882.S65 H453 2013 \|h Library of Congress classification
998	f	f	\|a TK7882.S65 H453 2013 \|t 0 \|l Available Online