Volume Table of Contents

PROCEEDINGS VOLUME 3027

ELECTRONIC IMAGING '97 | 8-14 FEBRUARY 1997

Document Recognition IV

Editor(s): Luc M. Vincent, Jonathan J. Hull

Editor Affiliations +

IN THIS VOLUME

6 Sessions, 22 Papers, 0 Presentations, 0 Posters

Handprint Recognition (4)

OCR and Special Document Analysis Topics (4)

Document Matching (2)

Preprocessing and Engineering Drawings (3)

Benchmarking: OCR Error Analysis (4)

Segmentation/Document Structure Analysis/Compression (5)

ELECTRONIC IMAGING '97

8-14 February 1997

San Jose, CA, United States

View the SPIE Conference + Exhibitions Calendar

Subscribe to Digital Library

VIEW ALL ABSTRACTS +

Handprint Recognition

New method for generating handwritten Chinese character samples Paper

Mingmin Zhang, Zhigeng Pan, Jiaoying Shi

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270060

Read Abstract +

Comparison of class-selective rejection rules for OCR Paper

Thien M. Ha

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270069

Read Abstract +

Component-based handprint segmentation using adaptive writing style model Paper

Michael D. Garris

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270076

Read Abstract +

Building upon the utility of connected components, NIST has designed a new character segmentor based on statistically modeling the style of a person's handwriting. Simple spatial features capture the characteristics of a particular writer's style of handprint, enabling the new method to maintain a traditional character-level segmentation philosophy without the integration of recognition or the use of oversegmentation and linguistic postprocessing. Estimates for stroke width and character height are used to compute aspect ratio and standard stroke count features that adapt to the writer's style at the field level. The new method has been developed with a predetermined set of fuzzy rules making the segmentor much less fragile and much more adaptive, and the new method successfully reconstructs fragmented characters as well as splits touching characters. The new segmentor was integrated into the NIST public domain form-based handprint recognition systems and then tested on a set of 490 handwriting sample forms found in NIST special database 19. When compared to a simple component-based segmentor, the new adaptable method improved the overall recognition of handprinted digits by 3.4 percent and field level recognition by 6.9 percent, while effectively reducing deletion errors by 82 percent. The same program code and set of parameters successfully segments sequences of uppercase and lowercase characters without any context-based tuning. While not as dramatic as digits, the recognition of uppercase and lowercase characters improved by 1.7 percent and 1.3 percent respectively. The segmentor maintains a relatively straight-forward and logical process flow avoiding convolutions of encoded exceptions as is common in expert systems. As a result, the new segmentor operates very efficiently, and throughput as high as 362 characters per second can be achieved. Letters and numbers are constructed from a predetermined configuration of a relatively small number of strokes. Results in this paper show that capitalizing on this knowledge through the use of simple adaptable features can significantly improve segmentation, whereas recognition-based and oversegmentation methods fail to take advantage of these intrinsic qualities of handprinted characters.

Adaptive classifier based on K-means clustering and dynamic programing Paper

Antonio Navarro, Charles R. Allen

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270077

Read Abstract +

OCR and Special Document Analysis Topics

Gray-scale handwritten character recognition based on principal features Paper

Hee-Seon Park, Sang-Yup Kim, Seong-Whan Lee

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270078

Read Abstract +

Principal component analysis (PCA) has been a major field of study in image compression, coding technique, or pattern recognition, particularly for classification and feature subset selection. Based on its success in these domains, character recognition methods using PCA have attracted considerable attention in recent years. In this paper, we propose a novel scheme for gray-scale handwritten character recognition based on principal of training set are projected onto the subspaces defined by their most important eigenvectors. Here, the significant eigenvectors of each class are chosen as those with the largest associated eigenvalues. These eigenvectors can be thought of as a set of feature vectors, that is, principal features. In this paper, we consider the minimum error subspace classifier for classification. It is a discriminant function derived from the PCA. We discriminate an unknown test character during the recognition phase by projection and classification. The recognition is performed by projecting a test image onto the subspace defined by the dominant eigenvectors of each class and then choosing the class corresponding to the subspace with the minimum error as the class of the test character. In order to verify the performance of the proposed scheme for gray-scale handwritten character recognition, experiments with the IPTP CDROM1 database have been carried out. Of the 12,000 samples available on this CD, 9,000 and 3,000 have been sued for training and testing, respectively. In this paper, we investigated the influence of the number eigencharacters used to define the subspace as well as the number of training characters for each character. Experimental results reveal that the proposed scheme based on principal features has advantages over other character recognition approaches in its speed and simplicity, learning capacity, and insensitivity to variations in the handwritten character images.

Producing good font attribute determination using error-prone information Paper

Robert Cooperman

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270079

Read Abstract +

OCR for World Wide Web images Paper

Jiangying Zhou, Daniel P. Lopresti, Zhibin Lei

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270080

Read Abstract +

Embedding digital data on paper in iconic text Paper

Dan S. Bloomberg

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270081

Read Abstract +

Document Matching

Document matching on CCITT Group 4 compressed images Paper

Jonathan J. Hull

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270061

Read Abstract +

Duplicate document detection Paper

Larry Spitz

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270062

Read Abstract +

Preprocessing and Engineering Drawings

Locally adaptive document skew detection Paper

Jaakko J. Sauvola, David Scott Doermann, Matti Pietikaeinen

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270063

Read Abstract +

Document restoration and enhancement using optimal iterative and paired morphological filters Paper

Yeqing Zhang, Robert P. Loce, Edward R. Dougherty

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270064

Read Abstract +

Improving the arc detection method in the machine drawing understanding system Paper

Dov Dori, David Hubanks, Wenyin Liu

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270065

Read Abstract +

Benchmarking: OCR Error Analysis

Performance evaluation for line-drawing recognition systems Paper

Ihsin T. Phillips, Frank Chang, Kevin Chang, Bhavani Duggirala, Mike Logan, Joe Loughry, Jisheng Liang

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270066

Read Abstract +

Performance evaluation of document layout analysis algorithms on the UW data set Paper

Jisheng Liang, Ihsin T. Phillips, Robert M. Haralick

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270067

Read Abstract +

Automated system for numerically rating document image quality Paper

T. Michael Cannon, Patrick M. Kelly, S. Sitharama Iyengar, Nathan Brener

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270068

Read Abstract +

Multilevel character templates for document image decoding Paper

Gary E. Kopec

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270070

Read Abstract +

Segmentation/Document Structure Analysis/Compression

Color, complex document segmentation and compression Paper

Hei Tao Fung, Kevin J. Parker

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270071

Read Abstract +

Fast title extraction method for business documents Paper

Yutaka Katsuyama, Satoshi Naoi

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270072

Read Abstract +

Interactive form recognition for common use Paper

Sean Gugler

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270073

Read Abstract +

Use of document structure analysis to retrieve information from documents in digital libraries Paper

Debashish Niyogi, Sargur N. Srihari

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270074

Read Abstract +

Better PostScript than PostScript: portable self-extracting PostScript representation of scanned document images Paper

Qin Zhang, John M. Danskin

Proceedings Volume Document Recognition IV, (1997) https://doi.org/10.1117/12.270075

Read Abstract +

Keywords/Phrases

Search In:

Publication Years