Translator Disclaimer
7 March 1996 Using domain knowledge to derive the logical structure of documents
Author Affiliations +
Abstract
An important aspect of document understanding is document logical structure derivation, which involves knowledge-based analysis of document images to derive a symbolic description of their structure and contents. Domain-specific as well as generic knowledge about document layout is used in order to classify, logically group, and determine the read-order of the individual blocks in the image, i.e., translate the physical structure of the document into a layout-independent logical structure. We have developed a computational model for the derivation of the logical structure of documents. Our model uses a rule-based control structure, as well as a hierarchical multi-level knowledge representation scheme in which knowledge about various types of documents is encoded into a document knowledge base and is used by reasoning processes to make inferences about the document. An important issue addressed in our research is the kind of domain knowledge that is required for such analysis. A document logical structure derivation system (DeLoS) has been developed based on the above model, and has achieved good results in deriving the logical structure of complex multi- articled documents such as newspaper pages. Applications of this approach include its use in information retrieval from digital libraries, as well as in comprehensive document understanding systems.
© (1996) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Debashish Niyogi and Sargur N. Srihari "Using domain knowledge to derive the logical structure of documents", Proc. SPIE 2660, Document Recognition III, (7 March 1996); https://doi.org/10.1117/12.234696
PROCEEDINGS
12 PAGES


SHARE
Advertisement
Advertisement
RELATED CONTENT

Contextual image understanding of airport photographs
Proceedings of SPIE (April 30 1991)
Guideline for specifying layout knowledge
Proceedings of SPIE (January 06 1999)
Map-aided structural analysis of aerial images
Proceedings of SPIE (August 16 1994)
Understandiny Images Using Knowledge Based Approach
Proceedings of SPIE (May 04 1986)
The Fusion of Voice and Video
Proceedings of SPIE (February 28 1990)

Back to Top