4 February 2013 Document segmentation via oblique cuts
Author Affiliations +
Abstract
This paper presents a novel solution for the layout segmentation of graphical elements in Business Intelligence documents. We propose a generalization of the recursive X-Y cut algorithm, which allows for cutting along arbitrary oblique directions. An intermediate processing step consisting of line and solid region removal is also necessary due to presence of decorative elements. The output of the proposed segmentation is a hierarchical structure which allows for the identification of primitives in pie and bar charts. The algorithm was tested on a database composed of charts from business documents. Results are very promising.
© (2013) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jeremy Svendsen, Jeremy Svendsen, Alexandra Branzan-Albu, Alexandra Branzan-Albu, } "Document segmentation via oblique cuts", Proc. SPIE 8658, Document Recognition and Retrieval XX, 86580T (4 February 2013); doi: 10.1117/12.2003351; https://doi.org/10.1117/12.2003351
PROCEEDINGS
8 PAGES


SHARE
Back to Top