Paper
30 October 2009 A multilevel region-of-interest based rate control scheme for video communication
QiLui Zhou, Jiaying Liu, Zongming Guo
Author Affiliations +
Proceedings Volume 7498, MIPPR 2009: Remote Sensing and GIS Data Processing and Other Applications; 74984W (2009) https://doi.org/10.1117/12.833686
Event: Sixth International Symposium on Multispectral Image Processing and Pattern Recognition, 2009, Yichang, China
Abstract
The ROI based video coding is widely applied in video communication. In this paper, we propose a multilevel ROI model, which includes the eye-mouth core region (CR), the face profile region (PR), the edge region (ER) and the background region (BR), to classify the subjective importance level of regions for the scene. Taking account of the proposed model, we first segment the current frame into four regions through skin color detection and feature location. Then, we improve the rate control algorithm in JVT-G012 proposal. We consider two factors, including subjective factor by our multi-level ROI model and objective factor by direct difference from reference frame, to model the complexity weight of each macroblock (MB).We allocate resources both at the frame layer and the basic unit layer, and adjust QP at MB layer. Finally, we restrict the QP of MB with three strategies to maintain the spatial and temporal smoothness. The experimental results illustrate that PSNR of ROI (CR plus PR) area using proposed method is in average over 0.5dB higher than JM8.6, while there are only slight changes in the PSNR of whole frame between two methods. Subjective quality based on our method also achieves much better performance.
© (2009) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
QiLui Zhou, Jiaying Liu, and Zongming Guo "A multilevel region-of-interest based rate control scheme for video communication", Proc. SPIE 7498, MIPPR 2009: Remote Sensing and GIS Data Processing and Other Applications, 74984W (30 October 2009); https://doi.org/10.1117/12.833686
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Chromium

Video

Eye models

Eye

Video coding

Mouth

RELATED CONTENT

Facial image synthesis by hierarchical wire frame model
Proceedings of SPIE (November 01 1992)
Color-based lip localization method
Proceedings of SPIE (April 28 2010)
Face detection based on a new nonlinear color space
Proceedings of SPIE (September 25 2003)

Back to Top