Hierarchical vs non-hierarchical audio indexation and classification for video genres

Nouha Dammak; Yassine BenAyed

doi:10.1117/12.2309852

13 April 2018 Hierarchical vs non-hierarchical audio indexation and classification for video genres

Nouha Dammak, Yassine BenAyed

Proceedings Volume 10696, Tenth International Conference on Machine Vision (ICMV 2017); 1069621 (2018) https://doi.org/10.1117/12.2309852
Event: Tenth International Conference on Machine Vision, 2017, Vienna, Austria

Abstract

In this paper, Support Vector Machines (SVMs) are used for segmenting and indexing video genres based on only audio features extracted at block level, which has a prominent asset by capturing local temporal information. The main contribution of our study is to show the wide effect on the classification accuracies while using an hierarchical categorization structure based on Mel Frequency Cepstral Coefficients (MFCC) audio descriptor. In fact, the classification consists in three common video genres: sports videos, music clips and news scenes. The sub-classification may divide each genre into several multi-speaker and multi-dialect sub-genres. The validation of this approach was carried out on over 360 minutes of video span yielding a classification accuracy of over 99%.

Citation Download Citation

Nouha Dammak and Yassine BenAyed "Hierarchical vs non-hierarchical audio indexation and classification for video genres", Proc. SPIE 10696, Tenth International Conference on Machine Vision (ICMV 2017), 1069621 (13 April 2018); https://doi.org/10.1117/12.2309852

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
8 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Video

Feature extraction

Databases

Visualization

Information visualization

Fourier transforms

Classification systems

RELATED CONTENT

Techniques for designing a classifier for multimedia indexing
Proceedings of SPIE (January 01 2001)

Incorporating audio cues into dialog and action scene extraction
Proceedings of SPIE (January 10 2003)

Hierarchical video summarization for medical data
Proceedings of SPIE (December 19 2001)

Content-based video retrieval and summarization using MPEG-7
Proceedings of SPIE (December 15 2003)

Virage image search engine an open framework for image...
Proceedings of SPIE (March 13 1996)

Model-based classification of visual information for content-based retrieval
Proceedings of SPIE (December 17 1998)

Delaunay triangulation for image object indexing a novel method...
Proceedings of SPIE (December 17 1998)

Subscribe to Digital Library

Receive Erratum Email Alert

Keywords/Phrases

Search In:

Publication Years