30 September 2011 Lip reading using neural networks
Proceedings Volume 8285, International Conference on Graphic and Image Processing (ICGIP 2011); 828519 (2011) https://doi.org/10.1117/12.913406
Event: 2011 International Conference on Graphic and Image Processing, 2011, Cairo, Egypt
Abstract
Computerized lip reading, or speech reading, is the difficult task of converting a video signal of a speaking person into written text. Its applications include teaching people who are deaf or mute to speak and communicate effectively with others, crime fighting, and speech recognition that is invariant to the acoustic environment. However, several factors make lip reading difficult: fast speech, bad pronunciation, poor illumination, movement of the face, and moustaches and beards. We convert video of a subject speaking vowels into images, which are then selected manually for processing. Contour tracking and template matching are used to extract the lips from the face. A K-nearest-neighbor (KNN) algorithm then classifies the images as 'speaking' or 'silent', and the image sequence is transformed into segments of utterances. A feature vector is calculated for each frame of every segment and stored in a database with its class label. Character recognition is performed using a modified KNN algorithm that assigns more weight to nearer neighbors. This paper reports the recognition of vowels using KNN algorithms.
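The abstract does not say how the modified KNN weights its neighbors, so the following is only a minimal sketch under an assumed scheme: each of the k nearest neighbors votes with weight 1/distance. The function name `weighted_knn_predict`, the two-dimensional feature vectors, and the vowel labels are hypothetical and chosen purely for illustration; they are not taken from the paper.

```python
import math
from collections import defaultdict


def weighted_knn_predict(train, query, k=3, eps=1e-9):
    """Classify `query` by a distance-weighted vote of its k nearest neighbors.

    train: list of (feature_vector, label) pairs.
    Assumption: the vote weight is 1 / (distance + eps); the paper's actual
    weighting scheme is not given in the abstract.
    """
    # Euclidean distance from the query to every stored feature vector.
    neighbors = sorted(
        ((math.dist(vec, query), label) for vec, label in train),
        key=lambda pair: pair[0],
    )
    votes = defaultdict(float)
    for distance, label in neighbors[:k]:
        # Nearer neighbors contribute a larger share of the vote.
        votes[label] += 1.0 / (distance + eps)
    return max(votes, key=votes.get)


# Hypothetical toy data: one close 'a' sample and two distant 'o' samples.
train = [((0.0, 0.0), "a"), ((1.0, 0.0), "o"), ((0.0, 1.0), "o")]
prediction = weighted_knn_predict(train, (0.1, 0.1), k=3)
```

With these toy values, a plain majority vote over the three neighbors would pick 'o' (two votes to one), whereas the distance-weighted vote picks 'a' because the single 'a' sample lies much closer to the query.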
© (2011) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Dhananjay Kalbande and Sanjivani Patil "Lip reading using neural networks", Proc. SPIE 8285, International Conference on Graphic and Image Processing (ICGIP 2011), 828519 (30 September 2011); https://doi.org/10.1117/12.913406
PROCEEDINGS
7 PAGES

