First- and third-party ground truth for key frame extraction from consumer video clips
12 February 2007
Extracting key frames (KFs) from video is of great interest in many applications, such as video summarization, video organization, video compression, and prints from video. KF extraction is not a new problem, but the current literature has focused mainly on sports or news video. In the consumer video space, the biggest challenges for KF selection are the unconstrained content and the lack of any preimposed structure. In this study, we collect ground truth key frames from video clips taken by digital cameras (as opposed to camcorders) using both first- and third-party judges. The goals of this study are: (1) to create a reference database of video clips reasonably representative of the consumer video space; (2) to identify associated key frames against which automated algorithms can be compared and judged for effectiveness; and (3) to uncover the criteria used by both first- and third-party human judges so that these criteria can influence algorithm design. The findings from these ground truths are discussed.
© (2007) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Kathleen Costello and Jiebo Luo "First- and third-party ground truth for key frame extraction from consumer video clips", Proc. SPIE 6492, Human Vision and Electronic Imaging XII, 64921N (12 February 2007); doi: 10.1117/12.707534; https://doi.org/10.1117/12.707534

