13 March 1996 Audio characterization for video indexing
Author Affiliations +
The major problem facing video databases is that of content characterization of video clips once the cut boundaries have been determined. The current efforts in this direction are focussed exclusively on the use of pictorial information, thereby neglecting an important supplementary source of content information, i.e. the embedded audio or sound track. The current research in audio processing can be readily applied to create many different video indices for use in Video On Demand (VOD), educational video indexing, sports video characterization, etc. MPEG is an emerging video and audio compression standard with rapidly increasing popularity in multimedia industry. Compressed bit stream processing has gained good recognition among the researchers. We have also demonstrated feature extraction in MPEG compressed video which implements a majority of scene change detection schemes on compressed video. In this paper, we examine the potential of audio information for content characterization by demonstrating the extraction of widely used features in audio processing directly from compressed data stream and their application to video clip classification.
© (1996) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Nilesh V. Patel, Ishwar K. Sethi, "Audio characterization for video indexing", Proc. SPIE 2670, Storage and Retrieval for Still Image and Video Databases IV, (13 March 1996); doi: 10.1117/12.234776; https://doi.org/10.1117/12.234776

Back to Top