23 January 2012 Using connected component decomposition to detect straight line segments in documents
Author Affiliations +
Straight line segment detection in digital documents has been studied extensively for the past few decades. One of the challenges is to detect line segments without priori information about document images and render good results without much parameter calibration. In this paper, we introduce a novel algorithm that is simple but effective in detecting straight line segments in scanned documents. Our Connected Component Decomposition (CCD) approach first decomposes the connected components based on the gradient direction of the edge contours, and then uses Chebyshev's inequality to statistically distinguish lines from characters, followed by a simple post processing step to examine straightness of remain segments. This CCD approach is simple to follow and fast in its implementation, and its high accuracy and usability are demonstrated empirically on a practical data set with large varieties.
© (2012) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Xiaofan Feng, Xiaofan Feng, Abdou Youssef, Abdou Youssef, } "Using connected component decomposition to detect straight line segments in documents", Proc. SPIE 8297, Document Recognition and Retrieval XIX, 82970Z (23 January 2012); doi: 10.1117/12.912049; https://doi.org/10.1117/12.912049

Back to Top