Book Chapter
Details
Citation
Hussain A, Durrani T, Alkulaibi A & Mtetwa N (2005) Nonlinear adaptive speech enhancement inspired by early auditory processing. In: Chollet G, Esposito A, Faundez-Zanuy M & Marinaro M (eds.) Nonlinear Speech Modeling and Applications: Advanced Lectures and Revised Selected Papers. Lecture Notes in Computer Science, 3445. Berlin Heidelberg: Springer, pp. 291-316. http://link.springer.com/chapter/10.1007/11520153_13#
Abstract
This paper presents non-linear adaptive speech enhancement schemes inspired by features of early auditory processing. A generic multi-microphone sub-band adaptive (MMSBA) framework is described which allows for the manipulation of several factors that may influence the intelligibility and perceived quality of the processed speech. The proposed framework supports inclusion of: non-linear distribution of sub-bands (as in humans), cross-band effects such as lateral inhibition, and robust adaptive metrics for selecting an appropriate coherent or incoherent noise canceller for each sub-band, based on identified features of the band-limited signals from multiple-sensors during silence periods. An efficient higher order statistics (HOS) based speech/non-speech detector is proposed for enabling effective adaptive control of MMSBA filtering against the environment. New hybrid extensions of the MMSBA scheme incorporating neural networks and post-Weiner filtering are also described and their comparative performance assessed in real reverberant environments. Finally, some future research directions for MMSBA based speech enhancement are proposed including possible alternative strategies based on stochastic resonance.
Status | Published |
---|---|
Title of series | Lecture Notes in Computer Science |
Number in series | 3445 |
Publication date | 31/12/2005 |
Publisher | Springer |
Publisher URL | |
Place of publication | Berlin Heidelberg |
ISSN of series | 0302-9743 |
ISBN | 978-3-540-27441-4 |
People (1)
Honorary Professor, Computing Science