18 January 2010
General text line extraction approach based on locally orientation estimation
Nazih Ouwayed, Abdel Belaïd, François Auger
Proceedings Volume 7534, Document Recognition and Retrieval XVII; 75340B (2010) https://doi.org/10.1117/12.839518
Event: IS&T/SPIE Electronic Imaging, 2010, San Jose, California, United States
Abstract
This paper presents a novel approach to multi-oriented text line extraction from historical handwritten Arabic documents. Because the lines have multiple orientations and are dispersed across the page, we use an image paving algorithm that determines the lines progressively and locally. The paving algorithm is initialized with a small window, whose size is then corrected by extension until enough lines and connected components are found. We use the Snake for line extraction. Once the paving is established, the orientation is determined using the Wigner-Ville distribution on the projection profile histogram. This local orientation is then extended to constrain the orientation in the neighborhood. Afterwards, the text lines are extracted locally in each zone by following the baselines and using the proximity of connected components. Finally, connected components that overlap or touch in adjacent lines are separated, taking into account the morphology of the terminal letters of Arabic words. The proposed approach was tested on 100 documents and reached a separation accuracy of about 98.6%.
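The local orientation step can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract applies the Wigner-Ville distribution to the projection profile, whereas this sketch swaps in a much simpler criterion (the sum of squared bin counts of the profile) to pick the angle at which the profile is most sharply peaked. All function names, the candidate angle grid, the bin size, and the synthetic test data are assumptions made for illustration only.

```python
import math


def projection_profile(points, angle_deg, bin_size=1.0):
    """Count foreground pixels per bin after projecting them onto the
    axis perpendicular to a candidate line direction (angle_deg)."""
    theta = math.radians(angle_deg)
    s, c = math.sin(theta), math.cos(theta)
    profile = {}
    for x, y in points:
        # Signed distance of the pixel from a line through the origin
        # whose direction makes angle theta with the x axis.
        t = -x * s + y * c
        b = math.floor(t / bin_size)
        profile[b] = profile.get(b, 0) + 1
    return profile


def profile_energy(profile):
    """Sum of squared bin counts: large when pixels pile up in few bins.
    (Assumed stand-in for the Wigner-Ville based criterion.)"""
    return sum(v * v for v in profile.values())


def estimate_orientation(points, angles):
    """Return the candidate angle whose projection profile is sharpest."""
    return max(angles,
               key=lambda a: profile_energy(projection_profile(points, a)))


def synthetic_lines(angle_deg, n_lines=3, spacing=15, length=200):
    """Hypothetical test data: parallel one-pixel-thick lines at angle_deg."""
    slope = math.tan(math.radians(angle_deg))
    return [(x, round(x * slope) + k * spacing)
            for k in range(n_lines)
            for x in range(length)]


if __name__ == "__main__":
    window_pixels = synthetic_lines(20.0)
    candidates = range(-45, 46, 5)  # assumed search grid, in degrees
    print(estimate_orientation(window_pixels, candidates))
```

In a paving scheme like the one described, such an estimate would be computed separately for the foreground pixels of each window, so that different zones of the page can receive different orientations.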
© (2010) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Nazih Ouwayed, Abdel Belaïd, and François Auger "General text line extraction approach based on locally orientation estimation", Proc. SPIE 7534, Document Recognition and Retrieval XVII, 75340B (18 January 2010); https://doi.org/10.1117/12.839518
KEYWORDS
Hough transforms
Picosecond phenomena
Time-frequency analysis
Detection and tracking algorithms
Image segmentation
Signal analyzers
Current controlled current source