Semantic segmentation of videophone image sequences
1 November 1992
Peter J. L. van Beek, Marcel J. T. Reinders, Bulent Sankur, Jan C. A. van der Lubbe
Proceedings Volume 1818, Visual Communications and Image Processing '92; (1992) https://doi.org/10.1117/12.131389
Event: Applications in Optical Science and Engineering, 1992, Boston, MA, United States
Abstract
A system for segmenting head-and-shoulder scenes into semantic regions, intended for use in a model-based coding scheme for video telephony, is described. The system is conceptually divided into three levels of processing and uses successive semantic regions of interest to locate the speaker, the face, and the eyes automatically. Once the low-level segmentation modules have produced candidate regions, higher-level modules take measurements on these regions and compare them with expected values to extract the specific region being sought. Fuzzy membership functions are used to tolerate deviations from the expected values. The system is able to locate the facial region and the eye regions satisfactorily.
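The comparison of region measurements against expected values through fuzzy membership functions can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the trapezoidal shape of the membership function, the two features (`aspect_ratio`, `relative_area`), the numeric ranges in `FACE_MODEL`, the use of the minimum as the fuzzy conjunction, and the acceptance threshold.

```python
from dataclasses import dataclass


def trapezoid(x, a, b, c, d):
    """Trapezoidal membership: 0 outside (a, d), 1 on [b, c], linear ramps between."""
    if x <= a or x >= d:
        return 0.0
    if x < b:
        return (x - a) / (b - a)
    if x <= c:
        return 1.0
    return (d - x) / (d - c)


@dataclass
class Region:
    label: str
    aspect_ratio: float   # bounding-box height / width (hypothetical feature)
    relative_area: float  # region area / enclosing region-of-interest area (hypothetical)


# Hypothetical expected-value ranges (a, b, c, d) for a face region.
FACE_MODEL = {
    "aspect_ratio": (1.0, 1.2, 1.6, 2.0),
    "relative_area": (0.05, 0.15, 0.40, 0.60),
}


def face_score(region):
    """Combine per-feature memberships with min, one common fuzzy AND."""
    return min(
        trapezoid(getattr(region, name), *params)
        for name, params in FACE_MODEL.items()
    )


def best_candidate(regions, threshold=0.5):
    """Return the highest-scoring region, or None if no region reaches the threshold."""
    best = max(regions, key=face_score, default=None)
    if best is None or face_score(best) < threshold:
        return None
    return best


# Candidate regions as a low-level segmentation stage might deliver them (made-up values).
candidates = [
    Region("A", aspect_ratio=0.6, relative_area=0.30),
    Region("B", aspect_ratio=1.4, relative_area=0.25),
    Region("C", aspect_ratio=1.1, relative_area=0.10),
]

chosen = best_candidate(candidates)
print(chosen.label if chosen else "no face region found")
```

A measurement that deviates moderately from the expected range lowers the score gradually instead of rejecting the region outright, which is the tolerance that fuzzy membership provides over hard thresholds.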
© (1992) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Peter J. L. van Beek, Marcel J. T. Reinders, Bulent Sankur, and Jan C. A. van der Lubbe "Semantic segmentation of videophone image sequences", Proc. SPIE 1818, Visual Communications and Image Processing '92, (1 November 1992); https://doi.org/10.1117/12.131389
KEYWORDS
Image segmentation
Eye
3D modeling
Image processing
Mouth
Visual communications
Digital filtering