19 January 2009 Learning approach for multicontent analysis of compound images
Author Affiliations +
Abstract
In the context of the European Cantata project (ITEA project, 2006-2009), within Barco, a complete Multi-Content Analysis framework was developed for detection and analysis of compound images. The framework consists of: a dataset, a Multi-Content Analysis (MCA) algorithm based on learning approaches, a Ground Truth, an evaluation module based on metrics and a presentation module. The aim of the MCA methodology presented here is to classify image content of computer screenshots into different categories such as: text; Graphical User Interface; Medical images and other complex images. The AdaBoost meta-algorithm was chosen, implemented and optimized for the classification method as it fitted the constraints (real-time and precision). A large dataset separated in training and testing subsets and their ground truth (with ViPER metadata format) was both collected and generated for the four different categories. The outcome of the MCA is a cascade of strong classifiers trained and tested on the different subsets. The obtained framework and its optimization (binary search, pre-computing of the features, pre-sorting) allow to re-train the classifiers as much as needed. The preliminary results are quite encouraging with a low false positive rate and close true positive rate in comparison with expectations. The re-injection of false negative examples from new testing subsets in the training phase resulted in better performances of the MCA.
© (2009) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Quentin Besnehard, Quentin Besnehard, Cedric Marchessoux, Cedric Marchessoux, Tom Kimpe, Tom Kimpe, Guillaume Spalla, Guillaume Spalla, Arnaud Joubel, Arnaud Joubel, François Boudet, François Boudet, } "Learning approach for multicontent analysis of compound images", Proc. SPIE 7255, Multimedia Content Access: Algorithms and Systems III, 72550E (19 January 2009); doi: 10.1117/12.805789; https://doi.org/10.1117/12.805789
PROCEEDINGS
12 PAGES


SHARE
Back to Top