Paper
9 August 2018 Dynamic saliency detection via CNN and spatial-temporal fusion
Author Affiliations +
Proceedings Volume 10806, Tenth International Conference on Digital Image Processing (ICDIP 2018); 1080619 (2018) https://doi.org/10.1117/12.2503058
Event: Tenth International Conference on Digital Image Processing (ICDIP 2018), 2018, Shanghai, China
Abstract
Visual saliency prediction has obtained a significant popularity these years but the majority research is for static saliency prediction. An approach to detect dynamic saliency of videos is proposed in this paper, which exploits a spatial-temporal fusion way. Spatial saliency is detected by a trained convolutional neutral network, and we use a larger convolutional kernel for some layers in our network because saliency is influenced by global contrast according to visual psychology. While temporal saliency is extracted by optical flow and we combine it with K-means cluster, which brings a more accurate result. In addition, the two are fused in an optimal weighted way. Our experiments on DIEM datasets outperforms compared to four other dynamic saliency models on two metrics.
© (2018) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Qi Zhang and Dong Xu "Dynamic saliency detection via CNN and spatial-temporal fusion", Proc. SPIE 10806, Tenth International Conference on Digital Image Processing (ICDIP 2018), 1080619 (9 August 2018); https://doi.org/10.1117/12.2503058
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical flow

Video

Computer vision technology

Machine vision

Visualization

Network architectures

Pattern recognition

RELATED CONTENT


Back to Top