24 January 2011 Automatic identification of ROI in figure images toward improving hybrid (text and image) biomedical document retrieval
Author Affiliations +
Abstract
Biomedical images are often referenced for clinical decision support (CDS), educational purposes, and research. They appear in specialized databases or in biomedical publications and are not meaningfully retrievable using primarily textbased retrieval systems. The task of automatically finding the images in an article that are most useful for the purpose of determining relevance to a clinical situation is quite challenging. An approach is to automatically annotate images extracted from scientific publications with respect to their usefulness for CDS. As an important step toward achieving the goal, we proposed figure image analysis for localizing pointers (arrows, symbols) to extract regions of interest (ROI) that can then be used to obtain meaningful local image content. Content-based image retrieval (CBIR) techniques can then associate local image ROIs with identified biomedical concepts in figure captions for improved hybrid (text and image) retrieval of biomedical articles. In this work we present methods that make robust our previous Markov random field (MRF)-based approach for pointer recognition and ROI extraction. These include use of Active Shape Models (ASM) to overcome problems in recognizing distorted pointer shapes and a region segmentation method for ROI extraction. We measure the performance of our methods on two criteria: (i) effectiveness in recognizing pointers in images, and (ii) improved document retrieval through use of extracted ROIs. Evaluation on three test sets shows 87% accuracy in the first criterion. Further, the quality of document retrieval using local visual features and text is shown to be better than using visual features alone.
© (2011) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Daekeun You, Daekeun You, Sameer Antani, Sameer Antani, Dina Demner-Fushman, Dina Demner-Fushman, Md Mahmudur Rahman, Md Mahmudur Rahman, Venu Govindaraju, Venu Govindaraju, George R. Thoma, George R. Thoma, } "Automatic identification of ROI in figure images toward improving hybrid (text and image) biomedical document retrieval", Proc. SPIE 7874, Document Recognition and Retrieval XVIII, 78740K (24 January 2011); doi: 10.1117/12.873434; https://doi.org/10.1117/12.873434
PROCEEDINGS
11 PAGES


SHARE
Back to Top