4 March 2015 An approach to the segmentation of multi-page document flow using binary classification
Author Affiliations +
Proceedings Volume 9443, Sixth International Conference on Graphic and Image Processing (ICGIP 2014); 944311 (2015) https://doi.org/10.1117/12.2178778
Event: Sixth International Conference on Graphic and Image Processing (ICGIP 2014), 2014, Beijing, China
Abstract
In this paper, we present a method for segmentation of document page flow applied to heterogeneous real bank documents. The approach is based on the content of images and it also incorporates font based features inside the documents. Our method involves a bag of visual words (BoVW) model on the designed image based feature descriptors and a novel approach to combine the consecutive pages of a document into a single feature vector that represents the transition between these pages. The transitions here could be represented by one of the two different classes: continuity of the same document or beginning of a new document. Using the transition feature vectors, we utilize three different binary classifiers to make predictions on the relationship between consecutive pages. Our initial results demonstrate that the proposed method can exhibit promising performance for document flow segmentation at this stage.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Onur Agin, Onur Agin, Cagdas Ulas, Cagdas Ulas, Mehmet Ahat, Mehmet Ahat, Can Bekar, Can Bekar, "An approach to the segmentation of multi-page document flow using binary classification", Proc. SPIE 9443, Sixth International Conference on Graphic and Image Processing (ICGIP 2014), 944311 (4 March 2015); doi: 10.1117/12.2178778; https://doi.org/10.1117/12.2178778
PROCEEDINGS
7 PAGES


SHARE
Back to Top