Paper
25 February 1994 Machine-printed Arabic OCR
Khosrow M. Hassibi
Author Affiliations +
Proceedings Volume 2103, 22nd AIPR Workshop: Interdisciplinary Computer Vision: Applications and Changing Needs; (1994) https://doi.org/10.1117/12.169463
Event: 22nd Applied Imagery Pattern Recognition Workshop, 1993, Washington, DC, United States
Abstract
This paper presents a brief overview of our research in the development of an OCR system for recognition of machine-printed texts in languages that use the Arabic alphabet. The cursive nature of machine-printed Arabic makes the segmentation of words into letters a challenging problem. In our approach, through a novel preliminary segmentation technique, a word is broken into pieces where each piece may not represent a valid letter in general. Neural networks trained on a training sample set of about 500 Arabic text images are used for recognition of these pieces. The rules governing the alphabet and character-level contextual information are used for recombining these pieces into valid letters. Higher-level contextual analysis schemes including the use of an Arabic lexicon and n-grams is also under development and are expected to improve the word recognition accuracy. The segmentation, recognition, and contextual analysis processes are closely integrated using a feedback scheme. The details of preparation of the training set and some recent results on training of the networks will be presented.
© (1994) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Khosrow M. Hassibi "Machine-printed Arabic OCR", Proc. SPIE 2103, 22nd AIPR Workshop: Interdisciplinary Computer Vision: Applications and Changing Needs, (25 February 1994); https://doi.org/10.1117/12.169463
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Neural networks

Detection and tracking algorithms

Optical character recognition

Image processing algorithms and systems

Algorithm development

Raster graphics

RELATED CONTENT

Non-Manhattan layout extraction algorithm
Proceedings of SPIE (March 21 2013)
Archiving of line-drawing images
Proceedings of SPIE (November 21 1995)
New thinning algorithm using rough-set theory
Proceedings of SPIE (April 14 1993)

Back to Top