25 October 1994 Speaker/speech recognition using microphone arrays and neural networks
Author Affiliations +
Hands-free operation of speech processing equipment is sometimes desired so that the user is unencumbered by hand-held or body-worn microphones. This paper explores the use of array microphones and neural networks (MANN) for robust speech/speaker recognition in a reverberant and noisy environment. Microphone arrays provide high-quality, hands-free sound capture at distances, and neural network processors compensate for environmental interference by transforming speech features of the array input to those of close-talking microphone input. The MANN system is evaluated using both computer-simulated degraded speech and real- room collected speech. It is found that the MANN system is capable of elevating recognition accuracies under adverse conditions, such as room reverberation, noise interference, and mismatch between the training and testing conditions, to levels comparable to those obtained with close-talking microphone input under a matched training and testing condition.
© (1994) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Qiguang Lin, ChiWei Che, Ea-Ee Jan, James L. Flanagan, "Speaker/speech recognition using microphone arrays and neural networks", Proc. SPIE 2277, Automatic Systems for the Identification and Inspection of Humans, (25 October 1994); doi: 10.1117/12.191874; https://doi.org/10.1117/12.191874

Back to Top