Book Chapter | Speaker Independent Vowel Recognition

Details

Citation

Smith L & Tang C (1992) Speaker Independent Vowel Recognition. In: Linggard R, Myers D & Nightingale C (eds.) Neural Networks for Vision, Speech and Natural Language, Part 2. BT Telecommunications Series, 1. Dordrecht: Springer, pp. 148-159. http://link.springer.com/chapter/10.1007/978-94-011-2360-0_11

Abstract
In designing artificial devices to perform human perceptual functions which map the initial sensory stimuli to their corresponding responses, there are at least three aspects to be considered: the representation of sensory input, the representation of the output or response, and the mechanism which maps the input to desired output. Since Dudley first invented his vocoder more than four decades ago, many vocoders have been designed to develop a representation of speech in an efficient way such that the representation contains all the information necessary for separating signals and at the same time has minimum redundancy [2]. Within the backprop learning connectionist framework, researchers have tried different network architectures - varying the number of layers of the network, and varying the connectivity, such as Harrison's experiment with single and multilayer perceptrons, and his use of zonal units instead of making the network fully connected between layers [3]. On the output level, McCulloch and Ainsworth tried two types of output representation in their attempt to recognize steady state vowels [2]. One is local representation in which each unit represents a vowel; the other is based on the vowel quadrilateral in which each vowel is represented by a pair of real numbers indicating the first two formant frequencies. The vowel quadrilateral is illustrated in Fig. 1.

Status	Published
Title of series	BT Telecommunications Series
Number in series	1
Publication date	31/12/1992
Publisher	Springer
Publisher URL	http://link.springer.com/chapter/10.1007/978-94-011-2360-0_11
Place of publication	Dordrecht
ISBN	978-94-010-5041-8

People (1)

Professor Leslie Smith

Emeritus Professor, Computing Science
