1 April 1998 Indexing and retrieval of the MPEG compressed video
Author Affiliations +
Abstract
To keep pace with the increased popularity of digital video as an archival medium, the development of techniques for fast and efficient analysis of video streams is essential. In particular, solutions to the problems of storing, indexing, browsing, and retrieving video data from large multimedia databases are necessary to allow access to these collections. Given that video is often stored efficiently in a compressed format, the costly overhead of decompression can be reduced by analyzing the compressed representation directly. In earlier work, we presented compressed domain parsing techniques which identified shots, subshots, and scenes. In this article, we present efficient key frame selection, feature extraction, indexing, and retrieval techniques that are directly applicable to MPEG compressed video. We develop a frame type independent representation which normalizes spatial and temporal features including frame type, frame size, macroblock encoding, and motion compensation vectors. Features for indexing are derived directly from this representation and mapped to a low-dimensional space where they can be accessed using standard database techniques. Spatial information is used as primary index into the database and temporal information is used to rank retrieved clips and enhance the robustness of the system. The techniques presented enable efficient indexing, querying, and retrieval of compressed video as demonstrated by our system which typically takes a fraction of a second to retrieve similar video scenes from a database, with over 95% recall.
Vikrant Kobla, David Scott Doermann, "Indexing and retrieval of the MPEG compressed video," Journal of Electronic Imaging 7(2), (1 April 1998). https://doi.org/10.1117/1.482645
JOURNAL ARTICLE
14 PAGES


SHARE
RELATED CONTENT

Fast video segmentation using encoding cost data
Proceedings of SPIE (December 17 1998)
Flow estimation for motion analysis on compressed domain
Proceedings of SPIE (December 18 2003)
Mosaics from MPEG-2 video
Proceedings of SPIE (July 01 2003)
SenseTK: a multimodal, multimedia sensor networking toolkit
Proceedings of SPIE (January 29 2007)

Back to Top