The impact of running headers and footers on proximity searching
15 December 2003
Proceedings Volume 5296, Document Recognition and Retrieval XI; (2003) https://doi.org/10.1117/12.524820
Event: Electronic Imaging 2004, San Jose, California, United States
Abstract
Hundreds of experiments on the retrieval of OCR documents, performed over the last decade by the Information Science Research Institute, have shown that OCR errors do not significantly affect retrievability. We extend those results to proximity searching and show that removing running headers and footers from OCR text will not improve retrievability for such searches.
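For readers unfamiliar with the terms, the following is a minimal sketch of what a proximity query is and how a running header or footer can fall between two query terms at a page break. It is a toy illustration only, not the authors' experimental setup, and it says nothing about aggregate retrievability, which is what the paper measures. The page text, the `HEADER_FOOTER` pattern, and the helper names `tokenize`, `within`, and `flatten` are all hypothetical.

```python
import re

def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def within(tokens, a, b, k):
    """Return True if terms a and b occur at most k token positions apart."""
    pos_a = [i for i, t in enumerate(tokens) if t == a]
    pos_b = [i for i, t in enumerate(tokens) if t == b]
    return any(abs(i - j) <= k for i in pos_a for j in pos_b)

# Hypothetical two-page OCR output: a footer ends page 1, a header starts page 2.
page1 = ["the storage of nuclear", "Annual Report 1994  page 12"]
page2 = ["Annual Report 1994", "waste requires monitoring"]

# Assumed pattern for this toy document's running header and footer.
HEADER_FOOTER = re.compile(r"^annual report 1994(\s+page \d+)?$", re.IGNORECASE)

def flatten(pages, strip=False):
    """Join page lines into one token stream, optionally dropping header/footer lines."""
    lines = [ln for page in pages for ln in page]
    if strip:
        lines = [ln for ln in lines if not HEADER_FOOTER.match(ln.strip())]
    return tokenize(" ".join(lines))

raw = flatten([page1, page2])                 # header and footer tokens kept
clean = flatten([page1, page2], strip=True)   # header and footer lines removed

print(within(raw, "nuclear", "waste", 3))     # terms separated by header/footer tokens
print(within(clean, "nuclear", "waste", 3))   # terms adjacent after removal
```

In this contrived case the query "nuclear within 3 words of waste" matches only the stripped text. The abstract's finding is that, over a real collection and query set, such cases do not translate into a measurable improvement in retrievability.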
© (2003) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Kazem Taghva, Julie Borsack, Tom Nartker, Jeffrey Coombs, and Ron Young "The impact of running headers and footers on proximity searching", Proc. SPIE 5296, Document Recognition and Retrieval XI, (15 December 2003); https://doi.org/10.1117/12.524820
KEYWORDS
Optical character recognition
Information science
Scientific research
Licensing
Visualization
Error analysis
Lanthanum

RELATED CONTENT

Evaluation of an automatic markup system
Proceedings of SPIE (March 30 1995)
Evaluating text categorization in the presence of OCR errors
Proceedings of SPIE (December 21 2000)
Effectiveness of thesauri-aided retrieval
Proceedings of SPIE (January 07 1999)
Title extraction and generation from OCR'd documents
Proceedings of SPIE (January 29 2007)
