16 January 2006 Author name recognition in degraded journal images
Author Affiliations +
Abstract
A method for extracting names in degraded documents is presented in this article. The documents targeted are images of photocopied scientific journals from various scientific domains. Due to the degradation, there is poor OCR recognition, and pieces of other articles appear on the sides of the image. The proposed approach relies on the combination of a low-level textual analysis and an image-based analysis. The textual analysis extracts robust typographic features, while the image analysis selects image regions of interest through anchor components. We report results on the University of Washington benchmark database.
© (2006) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Aliette de Bodard de la Jacopière, Laurence Likforman-Sulem, "Author name recognition in degraded journal images", Proc. SPIE 6067, Document Recognition and Retrieval XIII, 60670L (16 January 2006); doi: 10.1117/12.643043; https://doi.org/10.1117/12.643043
PROCEEDINGS
9 PAGES


SHARE
Back to Top