24 January 2011 A framework to improve digital corpus uses: image-mode navigation
Author Affiliations +
In this paper, we propose a new system to enhance navigation inside digital corpora. This system is based on an automatic indexation in image mode and provides the user intuitive navigation in interactive time. Keywords and containers are extracted directly from the document images to create an Image Mode Index, which shows the keywords as cut-out images of their actual appearances. Our approach recreates a summary of the structured documents, following indications given by the creators of the document themselves. Our system is detailed in the general case and sample applications on a 19th century handwritten corpus and a 18th century machine printed text corpus are provided. This approach, developed for documents unreachable otherwise, can be applied on any corpus where keywords and containers can be identified.
© (2011) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Loris Eynard, Loris Eynard, Vincent Malleron, Vincent Malleron, Hubert Emptoz, Hubert Emptoz, } "A framework to improve digital corpus uses: image-mode navigation", Proc. SPIE 7874, Document Recognition and Retrieval XVIII, 78740X (24 January 2011); doi: 10.1117/12.873389; https://doi.org/10.1117/12.873389

Back to Top