31 January 2013 Keywords image retrieval in historical handwritten Arabic documents
Author Affiliations +
J. of Electronic Imaging, 22(1), 013016 (2013). doi:10.1117/1.JEI.22.1.013016
Abstract
A system is presented for spotting and searching keywords in handwritten Arabic documents. A slightly modified dynamic time warping algorithm is used to measure similarities between words. Two sets of features are generated from the outer contour of the words/word-parts. The first set is based on the angles between nodes on the contour and the second set is based on the shape context features taken from the outer contour. To recognize a given word, the segmentation-free approach is partially adopted, i.e., continuous word parts are used as the basic alphabet, instead of individual characters or complete words. Additional strokes, such as dots and detached short segments, are classified and used in a postprocessing step to determine the final comparison decision. The search for a keyword is performed by the search for its word parts given in the correct order. The performance of the presented system was very encouraging in terms of efficiency and match rates. To evaluate the presented system its performance is compared to three different systems. Unfortunately, there are no publicly available standard datasets with ground truth for testing Arabic key word searching systems. Therefore, a private set of images partially taken from Juma’a Al-Majid Center in Dubai for evaluation is used, while using a slightly modified version of the IFN/ENIT database for training.
© 2013 SPIE and IS&T
Raid M. Saabni, Jihad A. El-Sana, "Keywords image retrieval in historical handwritten Arabic documents," Journal of Electronic Imaging 22(1), 013016 (31 January 2013). https://doi.org/10.1117/1.JEI.22.1.013016
JOURNAL ARTICLE
9 PAGES


SHARE
RELATED CONTENT

Color indexing with weak spatial constraints
Proceedings of SPIE (March 13 1996)
Line-based logo recognition through a web-camera
Proceedings of SPIE (November 15 2007)
Image retrieval with templates of arbitrary size
Proceedings of SPIE (January 15 1997)
Search engine for handwritten documents
Proceedings of SPIE (January 17 2005)
Vector-based approach to color image retrieval
Proceedings of SPIE (October 05 1998)

Back to Top