Book Chapter | Nonlinear adaptive speech enhancement inspired by early auditory processing

Book Chapter

Nonlinear adaptive speech enhancement inspired by early auditory processing

Details

Citation

Hussain A, Durrani T, Alkulaibi A & Mtetwa N (2005) Nonlinear adaptive speech enhancement inspired by early auditory processing. In: Chollet G, Esposito A, Faundez-Zanuy M & Marinaro M (eds.) Nonlinear Speech Modeling and Applications: Advanced Lectures and Revised Selected Papers. Lecture Notes in Computer Science, 3445. Berlin Heidelberg: Springer, pp. 291-316. http://link.springer.com/chapter/10.1007/11520153_13#

Abstract
This paper presents non-linear adaptive speech enhancement schemes inspired by features of early auditory processing. A generic multi-microphone sub-band adaptive (MMSBA) framework is described which allows for the manipulation of several factors that may influence the intelligibility and perceived quality of the processed speech. The proposed framework supports inclusion of: non-linear distribution of sub-bands (as in humans), cross-band effects such as lateral inhibition, and robust adaptive metrics for selecting an appropriate coherent or incoherent noise canceller for each sub-band, based on identified features of the band-limited signals from multiple-sensors during silence periods. An efficient higher order statistics (HOS) based speech/non-speech detector is proposed for enabling effective adaptive control of MMSBA filtering against the environment. New hybrid extensions of the MMSBA scheme incorporating neural networks and post-Weiner filtering are also described and their comparative performance assessed in real reverberant environments. Finally, some future research directions for MMSBA based speech enhancement are proposed including possible alternative strategies based on stochastic resonance.

Status	Published
Title of series	Lecture Notes in Computer Science
Number in series	3445
Publication date	31/12/2005
Publisher	Springer
Publisher URL
Place of publication	Berlin Heidelberg
ISSN of series	0302-9743
ISBN	978-3-540-27441-4

People (1)

Professor Tariq Durrani

Honorary Professor, Computing Science

我要吃瓜

Nonlinear adaptive speech enhancement inspired by early auditory processing

Details

People (1)