Paper
28 January 2008 Form classification
Author Affiliations +
Proceedings Volume 6815, Document Recognition and Retrieval XV; 68150Y (2008) https://doi.org/10.1117/12.766737
Event: Electronic Imaging, 2008, San Jose, California, United States
Abstract
The problem of form classification is to assign a single-page form image to one of a set of predefined form types or classes. We classify the form images using low level pixel density information from the binary images of the documents. In this paper, we solve the form classification problem with a classifier based on the k-means algorithm, supported by adaptive boosting. Our classification method is tested on the NIST scanned tax forms data bases (special forms databases 2 and 6) which include machine-typed and handwritten documents. Our method improves the performance over published results on the same databases, while still using a simple set of image features.
© (2008) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
K. V. Umamaheswara Reddy and Venu Govindaraju "Form classification", Proc. SPIE 6815, Document Recognition and Retrieval XV, 68150Y (28 January 2008); https://doi.org/10.1117/12.766737
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image classification

Databases

Binary data

Library classification systems

Error analysis

Principal component analysis

Computer simulations

RELATED CONTENT


Back to Top