27 February 2008 Open source OCR framework using mobile devices
Abstract
Mobile phones have evolved from passive one-to-one communication devices into powerful handheld computing devices. Today most new mobile phones can capture images, record video, browse the internet, and do much more. Exciting new social applications are emerging on the mobile landscape, such as business card readers, sign detectors, and translators. These applications help people quickly gather information in digital form and interpret it without carrying laptops or tablet PCs. Despite these advances, very little open source software is available for mobile phones. For instance, there are currently many open source OCR engines for desktop platforms but, to our knowledge, none for mobile platforms. With this in mind, we propose a complete text detection and recognition system with speech synthesis capability, built from existing desktop technology. In this work we developed a complete OCR framework with subsystems from the open source desktop community: the popular open source OCR engine Tesseract for text detection and recognition, and the Flite speech synthesis module for text-to-speech.
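The two-stage pipeline described in the abstract (recognize text with Tesseract, then speak it with Flite) can be illustrated with a minimal desktop sketch. This is not the authors' mobile implementation: it assumes the `tesseract` and `flite` command-line tools are installed on the PATH, and the function names, the `ocr_out` output base, and the `speech.wav` file name are hypothetical choices made for illustration.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List


def build_ocr_command(image_path: str, output_base: str) -> List[str]:
    # `tesseract IMAGE OUTPUTBASE` writes the recognized text to OUTPUTBASE.txt
    return ["tesseract", image_path, output_base]


def build_tts_command(text: str, wav_path: str) -> List[str]:
    # `flite -t TEXT -o FILE` synthesizes TEXT into a WAV file
    return ["flite", "-t", text, "-o", wav_path]


def image_to_speech(image_path: str, work_dir: str = ".") -> str:
    """Run OCR, then speech synthesis; return the path of the WAV file.

    Illustrative only: file names are assumptions, not from the paper.
    """
    for tool in ("tesseract", "flite"):
        if shutil.which(tool) is None:
            raise RuntimeError(f"{tool} not found on PATH")

    # Stage 1: text detection and recognition
    out_base = str(Path(work_dir) / "ocr_out")
    subprocess.run(build_ocr_command(image_path, out_base), check=True)
    text = Path(out_base + ".txt").read_text(encoding="utf-8").strip()
    if not text:
        raise ValueError("OCR produced no text")

    # Stage 2: text-to-speech
    wav_path = str(Path(work_dir) / "speech.wav")
    subprocess.run(build_tts_command(text, wav_path), check=True)
    return wav_path
```

Calling `image_to_speech("card.tif")` would, under these assumptions, leave the recognized text in `ocr_out.txt` and the spoken version in `speech.wav`. Keeping the command construction in separate functions makes each stage easy to inspect without running either tool.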
© (2008) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Steven Zhiying Zhou, Syed Omer Gilani, Stefan Winkler, "Open source OCR framework using mobile devices", Proc. SPIE 6821, Multimedia on Mobile Devices 2008, 682104 (27 February 2008); doi: 10.1117/12.766588; https://doi.org/10.1117/12.766588
PROCEEDINGS
6 PAGES

