Translator Disclaimer
Paper
26 February 2010 Comparative wavelet, PLP, and LPC speech recognition techniques on the Hindi speech digits database
Author Affiliations +
Proceedings Volume 7546, Second International Conference on Digital Image Processing; 754634 (2010) https://doi.org/10.1117/12.856318
Event: Second International Conference on Digital Image Processing, 2010, Singapore, Singapore
Abstract
In view of the growing use of automatic speech recognition in the modern society, we study various alternative representations of the speech signal that have the potential to contribute to the improvement of the recognition performance. In this paper wavelet based features using different wavelets are used for Hindi digits recognition. The recognition performance of these features has been compared with Linear Prediction Coefficients (LPC) and Perceptual Linear Prediction (PLP) features. All features have been tested using Hidden Markov Model (HMM) based classifier for speaker independent Hindi digits recognition. The recognition performance of PLP features is11.3% better than LPC features. The recognition performance with db10 features has shown a further improvement of 12.55% over PLP features. The recognition performance with db10 is best among all wavelet based features.
© (2010) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
A. N. Mishra, M. C. Shrotriya, and S. N. Sharan "Comparative wavelet, PLP, and LPC speech recognition techniques on the Hindi speech digits database", Proc. SPIE 7546, Second International Conference on Digital Image Processing, 754634 (26 February 2010); https://doi.org/10.1117/12.856318
PROCEEDINGS
6 PAGES


SHARE
Advertisement
Advertisement
RELATED CONTENT

Content-based retrieval of music and audio
Proceedings of SPIE (October 06 1997)
Change-point detection using wavelets
Proceedings of SPIE (June 07 1996)
Image retrieval based on wavelet vector quantization
Proceedings of SPIE (October 01 1998)
Robust speech recognition using time boundary detection
Proceedings of SPIE (April 01 2003)
Sparse representation in speech signal processing
Proceedings of SPIE (November 13 2003)

Back to Top