Translator Disclaimer
21 December 2000 Layout and language: an efficient algorithm for detecting text blocks based on spatial and linguistic evidence
Author Affiliations +
Abstract
The ability to accurately detect those areas in plain text documents that consist of contiguous text is an important pre- process to many applications. This paper introduces a novel method that uses both spatial and linguistic knowledge in an accurate manner to provide an initial analysis of the document. This initial analysis may then be extended to provide a complete analysis of the text areas in the document.
© (2000) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Matthew Hurst "Layout and language: an efficient algorithm for detecting text blocks based on spatial and linguistic evidence", Proc. SPIE 4307, Document Recognition and Retrieval VIII, (21 December 2000); https://doi.org/10.1117/12.410860
PROCEEDINGS
12 PAGES


SHARE
Advertisement
Advertisement
Back to Top