29 August 2016 Multi-scale and multi-GMM pooling based on Fisher Kernel for image representation
Proceedings Volume 10033, Eighth International Conference on Digital Image Processing (ICDIP 2016); 100334V (2016) https://doi.org/10.1117/12.2245169
Event: Eighth International Conference on Digital Image Processing (ICDIP 2016), 2016, Chengdu, China
Abstract
Image representation is a key part of image classification, and the Fisher kernel is considered one of the most effective image feature coding methods. A critical limitation of the Fisher encoding method is that a single GMM models features only at a coarse granularity. In this paper, we propose a method named Multi-scale and Multi-GMM Pooling (MMP), which represents the image effectively at various granularities. We first conduct pooling using multiple GMMs instead of a single GMM. Then, we introduce multi-scale images to enrich the model’s inputs, which further improves performance. Finally, we validate our proposal on the PASCAL VOC2007 dataset, and the experimental results show a clear improvement over the basic Fisher model.
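The abstract does not spell out how the GMMs and scales are combined, so the following is only a minimal sketch of one plausible reading: compute a standard Fisher vector (gradients with respect to the means and variances of a diagonal-covariance GMM, followed by power and L2 normalization) for every (scale, GMM) pair and concatenate the results. The function names `fisher_vector` and `mmp_encode`, the GMM sizes, the descriptor dimension, and the random GMM parameters are all assumptions made for illustration; they are not taken from the paper, and in practice the GMMs would be fitted to training descriptors.

```python
import numpy as np

def fisher_vector(X, weights, means, variances):
    """Fisher vector of descriptors X (N, D) under a diagonal-covariance GMM.

    weights: (K,), means: (K, D), variances: (K, D).
    Returns a vector of length 2*K*D (gradients w.r.t. means and variances).
    """
    X = np.atleast_2d(X)
    N = X.shape[0]
    diff = X[:, None, :] - means[None, :, :]                 # (N, K, D)
    # Log-density of each descriptor under each component, shape (N, K).
    log_prob = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                       + np.sum(np.log(2 * np.pi * variances), axis=1))
    log_post = np.log(weights) + log_prob
    log_post -= log_post.max(axis=1, keepdims=True)          # numerical stability
    gamma = np.exp(log_post)
    gamma /= gamma.sum(axis=1, keepdims=True)                # soft assignments (N, K)

    u = diff / np.sqrt(variances)                            # normalized residuals
    g_mu = np.einsum('nk,nkd->kd', gamma, u) / (N * np.sqrt(weights)[:, None])
    g_var = (np.einsum('nk,nkd->kd', gamma, u ** 2 - 1)
             / (N * np.sqrt(2 * weights)[:, None]))
    fv = np.concatenate([g_mu.ravel(), g_var.ravel()])

    # Power and L2 normalization, as commonly applied to Fisher vectors.
    fv = np.sign(fv) * np.sqrt(np.abs(fv))
    norm = np.linalg.norm(fv)
    return fv / norm if norm > 0 else fv

def mmp_encode(descriptors_per_scale, gmms):
    """Assumed combination rule: concatenate one Fisher vector per (scale, GMM) pair."""
    parts = [fisher_vector(X, *gmm) for X in descriptors_per_scale for gmm in gmms]
    return np.concatenate(parts)

# Toy usage with hypothetical data: 8-D descriptors, three image scales,
# and two GMMs of different sizes (4 and 16 components).
rng = np.random.default_rng(0)
D = 8

def random_gmm(K):
    # Placeholder parameters; a real pipeline would fit these with EM.
    return np.full(K, 1.0 / K), rng.normal(size=(K, D)), np.ones((K, D))

gmms = [random_gmm(K) for K in (4, 16)]
descriptors_per_scale = [rng.normal(size=(n, D)) for n in (200, 100, 50)]
rep = mmp_encode(descriptors_per_scale, gmms)
print(rep.shape)  # (960,) = 3 scales * (2*4*8 + 2*16*8)
```

Using GMMs with different numbers of components is one way to obtain "various granularities": a small GMM captures coarse structure and a larger one captures finer structure. Whether the paper concatenates, sums, or otherwise pools the per-GMM and per-scale encodings cannot be determined from the abstract alone.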
© (2016) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yunhao Zhao, Shouhong Wan, Zhize Wu, Bangjie Yin, Lihua Yue, "Multi-scale and multi-GMM pooling based on Fisher Kernel for image representation", Proc. SPIE 10033, Eighth International Conference on Digital Image Processing (ICDIP 2016), 100334V (29 August 2016); doi: 10.1117/12.2245169; https://doi.org/10.1117/12.2245169
PROCEEDINGS
5 PAGES

