10 April 2018 Text extraction from images in the wild using the Viola-Jones algorithm
Author Affiliations +
Proceedings Volume 10615, Ninth International Conference on Graphic and Image Processing (ICGIP 2017); 106151H (2018) https://doi.org/10.1117/12.2303559
Event: Ninth International Conference on Graphic and Image Processing, 2017, Qingdao, China
Abstract
Text Localization and extraction is an important issue in modern applications of computer vision. Applications such as reading and translating texts in the wild or from videos are among the many applications that can benefit results of this field. In this work, we adopt the well-known Viola-Jones algorithm to enable text extraction and localization from images in the wild. The Viola-Jones is an efficient, and a fast image-processing algorithm originally used for face detection. Based on some resemblance between text and face detection tasks in the wild, we have modified the viola-jones to detect regions of interest where text may be localized. In the proposed approach, some modification to the HAAR like features and a semi-automatic process of data set generating and manipulation were presented to train the algorithm. A process of sliding windows with different sizes have been used to scan the image for individual letters and letter clusters existence. A post processing step is used in order to combine the detected letters into words and to remove false positives. The novelty of the presented approach is using the strengths of a modified Viola-Jones algorithm to identify many different objects representing different letters and clusters of similar letters and later combine them into words of varying lengths. Impressive results were obtained on the ICDAR contest data sets.
© (2018) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Raid M. Saabna, Raid M. Saabna, Eran Zingboim, Eran Zingboim, } "Text extraction from images in the wild using the Viola-Jones algorithm", Proc. SPIE 10615, Ninth International Conference on Graphic and Image Processing (ICGIP 2017), 106151H (10 April 2018); doi: 10.1117/12.2303559; https://doi.org/10.1117/12.2303559
PROCEEDINGS
10 PAGES


SHARE
RELATED CONTENT

ADA interpretative system for image algebra
Proceedings of SPIE (June 01 1992)
Some Experiments In Distributed Vision
Proceedings of SPIE (January 17 1985)
Supporting image algebra in the C++ language
Proceedings of SPIE (June 23 1993)
Research of SIFT matching algorithm in binocular vision
Proceedings of SPIE (December 02 2011)
Hexagonal domain transform for shape analysis
Proceedings of SPIE (February 01 1992)

Back to Top