7 March 1996 Using domain knowledge to derive the logical structure of documents
Author Affiliations +
Proceedings Volume 2660, Document Recognition III; (1996); doi: 10.1117/12.234696
Event: Electronic Imaging: Science and Technology, 1996, San Jose, CA, United States
Abstract
An important aspect of document understanding is document logical structure derivation, which involves knowledge-based analysis of document images to derive a symbolic description of their structure and contents. Domain-specific as well as generic knowledge about document layout is used in order to classify, logically group, and determine the read-order of the individual blocks in the image, i.e., translate the physical structure of the document into a layout-independent logical structure. We have developed a computational model for the derivation of the logical structure of documents. Our model uses a rule-based control structure, as well as a hierarchical multi-level knowledge representation scheme in which knowledge about various types of documents is encoded into a document knowledge base and is used by reasoning processes to make inferences about the document. An important issue addressed in our research is the kind of domain knowledge that is required for such analysis. A document logical structure derivation system (DeLoS) has been developed based on the above model, and has achieved good results in deriving the logical structure of complex multi- articled documents such as newspaper pages. Applications of this approach include its use in information retrieval from digital libraries, as well as in comprehensive document understanding systems.
© (1996) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Debashish Niyogi, Sargur N. Srihari, "Using domain knowledge to derive the logical structure of documents", Proc. SPIE 2660, Document Recognition III, (7 March 1996); doi: 10.1117/12.234696; https://doi.org/10.1117/12.234696
PROCEEDINGS
12 PAGES


SHARE
KEYWORDS
Image processing

Rule based systems

Image segmentation

Bismuth

Control systems

Image analysis

Photography

RELATED CONTENT

Guideline for specifying layout knowledge
Proceedings of SPIE (January 07 1999)
Automative morphome analysis of medical-biological images
Proceedings of SPIE (February 05 2002)
Map-aided structural analysis of aerial images
Proceedings of SPIE (August 17 1994)
Understandiny Images Using Knowledge Based Approach
Proceedings of SPIE (May 05 1986)
The Fusion of Voice and Video
Proceedings of SPIE (March 01 1990)

Back to Top