14 April 1993 Identification and correction of rejection and substitution errors in optical character recognition systems
Author Affiliations +
Abstract
We examine the use of character image analysis coupled with contextual information in complex data gathering forms to identify and correct optical character recognition (OCR) system rejection and substitution errors. Segmented characters from a complex data gathering form are initially classified using an OCR engine based on a combination of Karhunen-Loeve transforms and a back-propagation neural network. Systems of equations are derived from the data gathering form to determine the values of characters rejected by the OCR engine and to verify the consistency of the data captured. If the OCR results for a single form are determined to be inconsistent with respect to the form's data relationships, a set of decision algorithms which incorporates a second neural network and uses additional character features is used to tag characters according to their likelihood of substitution error. Potential substitution errors are incrementally added to the set of OCR reject errors and are processed through dynamically selected systems of equations and search techniques which correct both error classes. We provide experimental results and determine the extent to which errors can be detected and corrected for various OCR error rates.
© (1993) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Glenn S. Himes, Glenn S. Himes, Marty M. Scholl, Marty M. Scholl, Frank A. DeCosta, Frank A. DeCosta, "Identification and correction of rejection and substitution errors in optical character recognition systems", Proc. SPIE 1906, Character Recognition Technologies, (14 April 1993); doi: 10.1117/12.143616; https://doi.org/10.1117/12.143616
PROCEEDINGS
12 PAGES


SHARE
RELATED CONTENT


Back to Top