1 November 2002 Document layout extraction using soft ordering
Author Affiliations +
We present an algorithm that can determine the layout of an arbitrary document with great flexibility. The bottom-up approach of pattern extraction and classification provides good segmentation and is insensitive to skew. Soft ordering is a feature that improves segmentation by allowing distinct regions to physically overlap. It is also used to determine the correct order of the document regions. The algorithm can extract and place all the distinct document regions into a logical layout and column structure.
© (2002) Society of Photo-Optical Instrumentation Engineers (SPIE)
Phillip E. Mitchell, Hong Yan, "Document layout extraction using soft ordering," Optical Engineering 41(11), (1 November 2002). https://doi.org/10.1117/1.1512907 . Submission:

Back to Top