18 December 2003 Performance of MPEG-7 spectral basis representations for retrieval of home video abstract
Author Affiliations +
In this paper, we present a classification and retrieval technique targeted for retrieval of home video abstract using dimension-reduced, decorrelated spectral features of audio content. The feature extraction based on MPEG-7 descriptors consists of three main stages: Normalized Audio Spectrum Envelope (NASE), basis decomposition algorithm and basis projection, obtained by multiplying the NASE with a set of extracted basis functions. A classifier based on continuous hidden Markov models is applied. For retrieval with accurate performance the system consists of a two-level hierarchy method using speech recognition and sound classification. For the measure of the performance we compare the classification results of MPEG-7 standardized features vs. Mel-scale Frequency Cepstrum Coefficients (MFCC). Results show that the MFCC features yield better performance compared to MPEG-7 features.
© (2003) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Hyoung-Gook Kim, Hyoung-Gook Kim, Thomas Sikora, Thomas Sikora, "Performance of MPEG-7 spectral basis representations for retrieval of home video abstract", Proc. SPIE 5307, Storage and Retrieval Methods and Applications for Multimedia 2004, (18 December 2003); doi: 10.1117/12.526073; https://doi.org/10.1117/12.526073


A unified framework for PCA, LDA, and LPP
Proceedings of SPIE (June 01 2012)
Rapid video shot detective based on the dichotomy
Proceedings of SPIE (October 30 2009)
PNRS: personalized news retrieval system
Proceedings of SPIE (August 23 1999)
Sparse representation in speech signal processing
Proceedings of SPIE (November 12 2003)

Back to Top