7 January 1999 Text segmentation for automatic document processing
Abstract
There is considerable interest in designing automatic systems that can scan a paper document and store it on electronic media for easier storage, manipulation, and access. Most documents contain graphics and images in addition to text, so the document image has to be segmented into text and image regions before appropriate techniques can be applied to each. In this paper, we present a new image segmentation technique that automatically identifies the text and image regions in a given document image. The technique is based on the differential processing text extraction concept and processes the document image using textural feature analysis. It is capable of analyzing complex document image layouts. Results on test images are presented and demonstrate the robustness of the technique.
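The abstract does not say which textural features are used, so the following is only a rough illustration of the general idea of separating text from image regions by texture, not the authors' method. It assumes a grayscale page held in a NumPy array and uses block-wise edge density (mean absolute horizontal intensity difference) as a stand-in feature; the function names, block size, and threshold are all hypothetical choices.

```python
import numpy as np

def block_edge_density(gray, block=16):
    """Mean absolute horizontal intensity difference for each block.

    Assumed stand-in texture feature: text tends to produce many sharp
    transitions, while smooth image regions produce few.
    """
    gray = np.asarray(gray, dtype=np.float64)
    h, w = gray.shape
    rows, cols = h // block, w // block
    # Horizontal first difference; repeat the last column so the
    # result has the same shape as the input.
    diff = np.abs(np.diff(gray, axis=1, append=gray[:, -1:]))
    # Crop to a whole number of blocks, then average inside each block.
    diff = diff[:rows * block, :cols * block]
    return diff.reshape(rows, block, cols, block).mean(axis=(1, 3))

def label_text_blocks(gray, block=16, threshold=40.0):
    """Return a boolean grid: True where a block looks text-like.

    The threshold is an arbitrary illustrative value, not a tuned one.
    """
    return block_edge_density(gray, block) > threshold

# Synthetic page: left half has alternating dark/light columns
# (a crude proxy for text strokes), right half is a smooth ramp
# (a crude proxy for a continuous-tone image).
page = np.zeros((32, 64))
page[:, :32] = 255 * (np.arange(32) % 2)
page[:, 32:] = np.linspace(100, 130, 32)

mask = label_text_blocks(page)
print(mask.astype(int))
```

On this synthetic page the striped half is flagged as text-like and the smooth half is not. A real document would need a richer feature set and a tuned or learned decision rule.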
© (1999) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Dinesh P. Mital, Wee Leng Goh, "Text segmentation for automatic document processing", Proc. SPIE 3651, Document Recognition and Retrieval VI (7 January 1999); https://doi.org/10.1117/12.335819
PROCEEDINGS
11 PAGES


