1 November 2002 Document layout extraction using soft ordering
Author Affiliations +
Abstract
We present an algorithm that can determine the layout of an arbitrary document with great flexibility. The bottom-up approach of pattern extraction and classification provides good segmentation and is insensitive to skew. Soft ordering is a feature that improves segmentation by allowing distinct regions to physically overlap. It is also used to determine the correct order of the document regions. The algorithm can extract and place all the distinct document regions into a logical layout and column structure.
Phillip E. Mitchell, Hong Yan, "Document layout extraction using soft ordering," Optical Engineering 41(11), (1 November 2002). https://doi.org/10.1117/1.1512907
JOURNAL ARTICLE
13 PAGES


SHARE
Back to Top