You have requested a machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Neither SPIE nor the owners and publishers of the content make, and they explicitly disclaim, any express or implied representations or warranties of any kind, including, without limitation, representations and warranties as to the functionality of the translation feature or the accuracy or completeness of the translations.
Translations are not retained in our system. Your use of this feature and the translations is subject to all use restrictions contained in the Terms and Conditions of Use of the SPIE website.
29 January 2007A statistical approach to line segmentation in handwritten documents
A new technique to segment a handwritten document into distinct lines of text is presented. Line segmentation
is the first and the most critical pre-processing step for a document recognition/analysis task. The proposed
algorithm starts, by obtaining an initial set of candidate lines from the piece-wise projection profile of the
document. The lines traverse around any obstructing handwritten connected component by associating it to the
line above or below. A decision of associating such a component is made by (i) modeling the lines as bivariate
Gaussian densities and evaluating the probability of the component under each Gaussian or (ii)the probability
obtained from a distance metric. The proposed method is robust to handle skewed documents and those with
lines running into each other. Experimental results show that on 720 documents (which includes English, Arabic
and children's handwriting) containing a total of 11, 581 lines, 97.31% of the lines were segmented correctly. On
an experiment over 200 handwritten images with 78, 902 connected components, 98.81% of them were associated
to the correct lines.
The alert did not successfully save. Please try again later.
Manivannan Arivazhagan, Harish Srinivasan, Sargur Srihari, "A statistical approach to line segmentation in handwritten documents," Proc. SPIE 6500, Document Recognition and Retrieval XIV, 65000T (29 January 2007); https://doi.org/10.1117/12.704538