Conference Paper (published)
Details
Citation
Chetouani M, Hussain A, Gas B & Zarader J (2006) Non-linear predictors based on the functionally expanded neural networks for speech feature extraction. In: IEEE International Conference on Engineering of Intelligent Systems, ICEIS 2006. 2006 IEEE International Conference on Engineering of Intelligent Systems, Islamabad, Pakistan, 22.04.2006-23.04.2006. Piscataway, NJ: IEEE, pp. 1-5. http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=1703129&abstractAccess=no&userType=inst; https://doi.org/10.1109/ICEIS.2006.1703129
Abstract
In this paper we focus on the design of the feature extractor stage of the speech recognition system which aims to compute optimal vectors for the next phoneme classification stage. We propose a new non-linear feature extraction method based on the linear-in-parameters functionally expanded neural network (FENN) model. The main idea is to design an improved and flexible feature extractor which can effectively account for some of the significant non-linear phenomena usually observed in the speech production process. The effectiveness of the proposed method is assessed on phoneme classification tasks. Specifically, we evaluate the performances on the telephone quality NTIMIT database, focusing the investigations on highly confusable phonemes such as front vowels: /ih/, /ey/, /eh/, /ae/. The results are compared with other widely used coding methods namely, the linear predictive coding (LPC) and the Mel frequency cepstral coding (MFCC). The experiments show a relative improvement in the rates through the use of our proposed non-linear feature extractor technique
Status | Published |
---|---|
Publication date | 31/12/2006 |
Publication date online | 30/04/2006 |
Publisher | IEEE |
Publisher URL | |
Place of publication | Piscataway, NJ |
ISBN | 1-4244-0456-8 |
Conference | 2006 IEEE International Conference on Engineering of Intelligent Systems |
Conference location | Islamabad, Pakistan |
Dates | – |