22 December 1999 Data extraction in form documents with gray-level background using morphological processing
Author Affiliations +
Abstract
Automatic reading of form documents with gray-level background needs a preliminary task of preparing a clean image of the data to be recognized using OCR. In this paper, we present a data extraction process for such documents. First, the preprinted background is removed by decomposing the histogram of the input image. Reference lines are then subtracted from this image. Finally, the lost parts of character images overlaying with reference lines can be restored. The experiments carried out with a bank cheque will be given to illustrate the usefulness of such an algorithm.
© (1999) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Rapeeporn Chamchong, Yuttapong Rangsanseri, Punya Thitimajshima, "Data extraction in form documents with gray-level background using morphological processing", Proc. SPIE 3967, Document Recognition and Retrieval VII, (22 December 1999); doi: 10.1117/12.373484; https://doi.org/10.1117/12.373484
PROCEEDINGS
5 PAGES


SHARE
Back to Top