1 April 1998 Pattern matcher for OCR-corrupted documents and its evaluation
Stefan Agne, Hans-Guenther Hein
Proceedings Volume 3305, Document Recognition V; (1998) https://doi.org/10.1117/12.304629
Event: Photonics West '98 Electronic Imaging, 1998, San Jose, CA, United States
Abstract
Document classification is a fundamental technology that precedes document routing, document understanding, and information extraction. Pattern matchers with rule-based components are already in use at news agencies, where the input is electronic text. Classification of OCR documents, however, must deal with the ambiguities of the underlying OCR engine. Ambiguities in character segmentation and character classification mean that the OCR process yields a directed graph of characters, the so-called character hypothesis lattice (CHL), rather than a single string. This paper deals with techniques for enhancing the pattern matcher so that it can cope with CHLs.
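To make the idea of a character hypothesis lattice concrete, the following is a minimal sketch and not the authors' implementation. It assumes a lattice whose nodes are segmentation points and whose edges each carry one candidate reading; the `Lattice` class, its `add` and `contains` methods, and the "modern"/"modem" example are all hypothetical names and data chosen for illustration. It ignores recognition confidences and rule-based components, which a real matcher would likely need.

```python
class Lattice:
    """Hypothetical character hypothesis lattice (CHL).

    Nodes are segmentation points; each edge carries one candidate
    reading of the image region between two segmentation points.
    """

    def __init__(self):
        self.edges = {}     # node -> list of (label, next_node)
        self.nodes = set()

    def add(self, src, label, dst):
        self.edges.setdefault(src, []).append((label, dst))
        self.nodes.update((src, dst))

    def contains(self, word):
        """Return True if some path through the lattice spells `word`."""

        def walk(node, rest):
            if not rest:
                return True
            return any(
                rest.startswith(label) and walk(dst, rest[len(label):])
                for label, dst in self.edges.get(node, ())
            )

        # A match may start at any segmentation point.
        return any(walk(node, word) for node in self.nodes)


# Assumed example: the OCR engine cannot decide between "m" and "rn"
# at the start and at the end of the printed word "modern".
chl = Lattice()
chl.add(0, "m", 2)                    # one wide character ...
chl.add(0, "r", 1); chl.add(1, "n", 2)  # ... or two narrow ones
chl.add(2, "o", 3)
chl.add(3, "d", 4)
chl.add(4, "e", 5)
chl.add(5, "r", 6); chl.add(6, "n", 7)
chl.add(5, "m", 7)

print(chl.contains("modern"))
print(chl.contains("modem"))
print(chl.contains("modal"))
```

Because every alternative reading stays in the graph, a keyword is found as long as any path spells it, whereas matching against only the single best OCR string would miss the keyword whenever the engine's top choice was wrong.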
© (1998) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Stefan Agne and Hans-Guenther Hein "Pattern matcher for OCR-corrupted documents and its evaluation", Proc. SPIE 3305, Document Recognition V, (1 April 1998); https://doi.org/10.1117/12.304629
KEYWORDS
Optical character recognition
Acquisition tracking and pointing
Image classification
Ions
Classification systems
Focus stacking software
Tin