10 January 2003 Video retrieval using speech and image information
Author Affiliations +
Abstract
Video contains multiple types of audio and visual information, which are difficult to extract, combine or trade-off in general video information retrieval. This paper provides an evaluation on the effects of different types of information used for video retrieval from a video collection. A number of different sources of information are present in most typical broadcast video collections and can be exploited for information retrieval. We will discuss the contributions of automatically recognized speech transcripts, image similarity matching, face detection and video OCR in the contexts of experiments performed as part of 2001 TREC Video Retrieval Track evaluation performed by the National Institute of Standards and Technology. For the queries used in this evaluation, image matching and video OCR proved to be the deciding aspects of video information retrieval.
© (2003) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Alexander G. Hauptmann, Rong Jin, Tobun D. Ng, "Video retrieval using speech and image information", Proc. SPIE 5021, Storage and Retrieval for Media Databases 2003, (10 January 2003); doi: 10.1117/12.479747; https://doi.org/10.1117/12.479747
PROCEEDINGS
12 PAGES


SHARE
RELATED CONTENT

Fast face recognition by using an inverted index
Proceedings of SPIE (February 27 2015)
Developing an integrated video analysis system
Proceedings of SPIE (December 29 2000)
Content-based analysis of news video
Proceedings of SPIE (September 25 2001)
Computing image histograms from compressed data
Proceedings of SPIE (September 30 1996)
Improving multimedia retrieval with a video OCR
Proceedings of SPIE (January 28 2008)

Back to Top