18 January 2010 A stacked sequential learning method for investigator name recognition from web-based medical articles
Author Affiliations +
Proceedings Volume 7534, Document Recognition and Retrieval XVII; 753404 (2010); doi: 10.1117/12.839141
Event: IS&T/SPIE Electronic Imaging, 2010, San Jose, California, United States
"Investigator Names" is a newly required field in MEDLINE citations. It consists of personal names listed as members of corporate organizations in an article. Extracting investigator names automatically is necessary because of the increasing volume of articles reporting collaborative biomedical research in which a large number of investigators participate. In this paper, we present an SVM-based stacked sequential learning method in a novel application - recognizing named entities such as the first and last names of investigators from online medical journal articles. Stacked sequential learning is a meta-learning algorithm which can boost any base learner. It exploits contextual information by adding the predicted labels of the surrounding tokens as features. We apply this method to tag words in text paragraphs containing investigator names, and demonstrate that stacked sequential learning improves the performance of a nonsequential base learner such as an SVM classifier.
© (2010) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Xiaoli Zhang, Jie Zou, Daniel X. Le, George Thoma, "A stacked sequential learning method for investigator name recognition from web-based medical articles", Proc. SPIE 7534, Document Recognition and Retrieval XVII, 753404 (18 January 2010); doi: 10.1117/12.839141; https://doi.org/10.1117/12.839141

Associative arrays

Feature extraction

Medical research

Detection and tracking algorithms

Binary data


Clinical medicine

Back to Top