21 December 2000 Detection of text strings from mixed text/graphics images
A robust system for separating text strings from mixed text/graphics images is presented. Based on a union-find (region growing) strategy, the algorithm discriminates text from graphics and adapts to changes in document type, language category (e.g., English, Chinese, and Japanese), text font style and size, and text string orientation within digital images. In addition, it tolerates the skew that commonly occurs in documents without requiring skew correction prior to discrimination, a condition for which previously proposed methods such as projection profiles or run-length coding are not always suitable. The method has been tested on a variety of printed documents from different origins using one common set of parameters, and its performance in terms of computational efficiency is demonstrated on several images from the evaluation.
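The abstract does not describe the algorithm in detail. As a minimal sketch of the general union-find region-growing idea only, and not the authors' method, the Python below groups the foreground pixels of a small binary image into 4-connected regions. The function names (`find`, `union`, `connected_regions`), the list-of-lists image format, and the choice of 4-connectivity are all assumptions made for illustration.

```python
def find(parent, i):
    # Follow parent links to the root, shortening the path along the way.
    while parent[i] != i:
        parent[i] = parent[parent[i]]
        i = parent[i]
    return i


def union(parent, a, b):
    # Merge the sets containing a and b (no ranking, for brevity).
    root_a, root_b = find(parent, a), find(parent, b)
    if root_a != root_b:
        parent[root_b] = root_a


def connected_regions(image):
    """Group foreground pixels (truthy values) into 4-connected regions.

    `image` is a hypothetical list of equal-length rows of 0/1 values.
    Returns a list of regions, each a list of (row, col) tuples.
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    parent = list(range(rows * cols))

    # Grow regions by merging each foreground pixel with its
    # foreground neighbours above and to the left.
    for r in range(rows):
        for c in range(cols):
            if not image[r][c]:
                continue
            idx = r * cols + c
            if r > 0 and image[r - 1][c]:
                union(parent, idx - cols, idx)
            if c > 0 and image[r][c - 1]:
                union(parent, idx - 1, idx)

    # Collect pixels by the root of the set they ended up in.
    regions = {}
    for r in range(rows):
        for c in range(cols):
            if image[r][c]:
                root = find(parent, r * cols + c)
                regions.setdefault(root, []).append((r, c))
    return list(regions.values())
```

In a text/graphics separation setting, regions found this way could then be classified by properties such as size or aspect ratio; which properties the paper actually uses is not stated in the abstract.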
© (2000) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Chien-Hua Tsai, Christos A. Papachristou, "Detection of text strings from mixed text/graphics images", Proc. SPIE 4307, Document Recognition and Retrieval VIII, (21 December 2000); doi: 10.1117/12.410838; https://doi.org/10.1117/12.410838


