19 December 2018 Improved classification approach for use with large-scale scene images in the Hadoop cluster environment
Jianfang Cao, Yanfei Li, Yan Wang, Hongyan Cui, Yun Tian
Author Affiliations +
Abstract
Faced with massive amounts of image data, the performance of classification algorithms based on traditional platforms with single-node architecture drops dramatically. We propose a classification method based on hybrid optimization and combination technology in a cluster environment that is suitable for use with large-scale scene images. Support vector machine (SVM) algorithms are optimized by the artificial bee colony and particle swarm optimization algorithms to produce weak classifiers; then, a strong classifier is constructed by combining the outputs from the 15 weak classifiers using the AdaBoost algorithm. The MapReduce parallel programming model in the Hadoop platform is used to parallelize the algorithm, and a parallel AdaBoost hybrid optimization (PAH)-SVM algorithm is proposed. Finally, a model is constructed for automatic classification of the large-scale scene images. Multiple sets of comparative experiments show that the average classification accuracy of the proposed algorithm when applied to the scene understanding (Caltech-256 and Pascal VOC 2012) database exceeds 85.0%, and its training time is <10  min when 170,000 images are used. Considering the cost of hardware, the execution time and accuracy of this algorithm are superior to those of mainstream classification algorithms, such as P-SVM and CNN. In addition, the speed of the system based on the proposed algorithm increases linearly, and the constructed Hadoop cluster shows good extensibility. The proposed algorithm is suitable for automatic classification and prediction using large-scale scene images.
© 2018 SPIE and IS&T 1017-9909/2018/$25.00 © 2018 SPIE and IS&T
Jianfang Cao, Yanfei Li, Yan Wang, Hongyan Cui, and Yun Tian "Improved classification approach for use with large-scale scene images in the Hadoop cluster environment," Journal of Electronic Imaging 27(6), 063027 (19 December 2018). https://doi.org/10.1117/1.JEI.27.6.063027
Received: 14 May 2018; Accepted: 27 November 2018; Published: 19 December 2018
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image classification

Scene classification

Data modeling

Optimization (mathematics)

Databases

Particle swarm optimization

Computer programming

Back to Top