29 December 2000 Tracking faces of arbitrary views for video annotation
Author Affiliations +
We proposed an omni-face tracking system for video annotation in this paper, which is designed to find faces from arbitrary views in complex scenes. The face detector first locates potential faces in the input by performing skin-tone detection. The subsequent processing consists of two largely independent components, the frontal face module and the side- view face module, responsible for finding frontal-view and side-view faces, respectively. The frontal face module uses a region-based approach wherein regions of skin-tone pixels are analyzed for gross oval shape and the presence of facial features. In contrast, the side-view face module follows an edge-based approach to look for curves similar to a side-view profile. To extract the trajectories of faces, the temporal continuity between consecutive frames within the video shots is considered to speed up the tracking process. The main contribution of this work is being able to find faces irrespective of their poses, whereas contemporary systems deal with frontal-view faces only. Information regarding to human faces is encoded in XML format for semantic video content representation. The effectiveness of human face for video annotation is demonstrated in a TV program classification system that categories the input video clip into predefined types. It is shown that the classification accuracy is improved saliently by the employment of face information.
© (2000) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Gang Wei, Gang Wei, Ishwar K. Sethi, Ishwar K. Sethi, Nevenka Dimitrova, Nevenka Dimitrova, } "Tracking faces of arbitrary views for video annotation", Proc. SPIE 4310, Visual Communications and Image Processing 2001, (29 December 2000); doi: 10.1117/12.411835; https://doi.org/10.1117/12.411835


Real-time gender classification
Proceedings of SPIE (September 24 2003)
The role of classifiers in multimedia content management
Proceedings of SPIE (January 09 2003)
PNRS: personalized news retrieval system
Proceedings of SPIE (August 23 1999)
Video indexing and retrieval using MPEG-7
Proceedings of SPIE (December 09 2002)
SenseTK: a multimodal, multimedia sensor networking toolkit
Proceedings of SPIE (January 28 2007)

Back to Top