Translator Disclaimer
30 March 1995 Stochastic attribute grammar model of document production and its use in document image decoding
Author Affiliations +
Abstract
Document image decoding (DID) refers to the process of document recognition within a communication theory framework. In this framework, a logical document structure is a message communicated by encoding the structure as an ideal image, transmitting the ideal image through a noisy channel, and decoding the degraded image into a logical structure as close to the original message as possible, on average. Thus document image decoding is document image recognition where the recognizer performs optimal reconstruction by explicitly modeling the source of logical structures, the encoding procedure, and the channel noise. In previous work, we modeled the source and encoder using probabilistic finite-state automata and transducers. In this paper, we generalize the source and encoder models using context-free attribute grammars. We employ these models in a document image decoder that uses a dynamic programming algorithm to minimize the probability of error between original and reconstructed structures. The dynamic programming algorithm is a generalization of the Cocke-Younger-Kasami parsing algorithm.
© (1995) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Philip A. Chou and Gary E. Kopec "Stochastic attribute grammar model of document production and its use in document image decoding", Proc. SPIE 2422, Document Recognition II, (30 March 1995); https://doi.org/10.1117/12.205842
PROCEEDINGS
8 PAGES


SHARE
Advertisement
Advertisement
RELATED CONTENT

Turbo recognition: a statistical approach to layout analysis
Proceedings of SPIE (December 20 2000)
A Microprogrammable Processor For Image Operations
Proceedings of SPIE (December 18 1985)
Rank-ordered error diffusion: method and applications
Proceedings of SPIE (January 28 2007)
Comparison of connected-component algorithms
Proceedings of SPIE (August 26 1999)
Communication theory framework for document recognition
Proceedings of SPIE (March 22 1994)
Mixed-language development tool for HAVNET research
Proceedings of SPIE (March 21 1999)

Back to Top