1 July 2003 Automated system for text detection in individual video images
Author Affiliations +
J. of Electronic Imaging, 12(3), (2003). doi:10.1117/1.1584050
Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.
Yingze Du, Chein-I Chang, Paul D. Thouin, "Automated system for text detection in individual video images," Journal of Electronic Imaging 12(3), (1 July 2003). http://dx.doi.org/10.1117/1.1584050


Optical character recognition

Image segmentation

Image processing

Binary data

Edge detection


Back to Top