13 April 2018 Hierarchical vs non-hierarchical audio indexation and classification for video genres
Author Affiliations +
Proceedings Volume 10696, Tenth International Conference on Machine Vision (ICMV 2017); 1069621 (2018) https://doi.org/10.1117/12.2309852
Event: Tenth International Conference on Machine Vision, 2017, Vienna, Austria
Abstract
In this paper, Support Vector Machines (SVMs) are used for segmenting and indexing video genres based on only audio features extracted at block level, which has a prominent asset by capturing local temporal information. The main contribution of our study is to show the wide effect on the classification accuracies while using an hierarchical categorization structure based on Mel Frequency Cepstral Coefficients (MFCC) audio descriptor. In fact, the classification consists in three common video genres: sports videos, music clips and news scenes. The sub-classification may divide each genre into several multi-speaker and multi-dialect sub-genres. The validation of this approach was carried out on over 360 minutes of video span yielding a classification accuracy of over 99%.
© (2018) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Nouha Dammak, Nouha Dammak, Yassine BenAyed, Yassine BenAyed, "Hierarchical vs non-hierarchical audio indexation and classification for video genres", Proc. SPIE 10696, Tenth International Conference on Machine Vision (ICMV 2017), 1069621 (13 April 2018); doi: 10.1117/12.2309852; https://doi.org/10.1117/12.2309852
PROCEEDINGS
8 PAGES


SHARE
RELATED CONTENT

Breaking down the problem of blind video quality evaluation
Proceedings of SPIE (February 02 2014)
New perspective on visual information retrieval
Proceedings of SPIE (December 17 2003)
Hierarchical video summarization for medical data
Proceedings of SPIE (December 18 2001)
Video genre classification using multimodal features
Proceedings of SPIE (December 17 2003)

Back to Top