27 September 2016 Scene-aware joint global and local homographic video coding
Author Affiliations +
Perspective motion is commonly represented in video content that is captured and compressed for various applications including cloud gaming, vehicle and aerial monitoring, etc. Existing approaches based on an eight-parameter homography motion model cannot deal with this efficiently, either due to low prediction accuracy or excessive bit rate overhead. In this paper, we consider the camera motion model and scene structure in such video content and propose a joint global and local homography motion coding approach for video with perspective motion. The camera motion is estimated by a computer vision approach, and camera intrinsic and extrinsic parameters are globally coded at the frame level. The scene is modeled as piece-wise planes, and three plane parameters are coded at the block level. Fast gradient-based approaches are employed to search for the plane parameters for each block region. In this way, improved prediction accuracy and low bit costs are achieved. Experimental results based on the HEVC test model show that up to 9.1% bit rate savings can be achieved (with equal PSNR quality) on test video content with perspective motion. Test sequences for the example applications showed a bit rate savings ranging from 3.7 to 9.1%.
Conference Presentation
© (2016) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Xiulian Peng, Xiulian Peng, Jizheng Xu, Jizheng Xu, Gary J. Sullivan, Gary J. Sullivan, "Scene-aware joint global and local homographic video coding", Proc. SPIE 9971, Applications of Digital Image Processing XXXIX, 99711J (27 September 2016); doi: 10.1117/12.2240141; https://doi.org/10.1117/12.2240141


Visual pattern-based motion estimation for video coding
Proceedings of SPIE (March 21 1996)
Region-based H.263+ coding for real-time video communication
Proceedings of SPIE (January 03 2002)
Transform-domain Wyner-Ziv codec for video
Proceedings of SPIE (January 17 2004)
Wyner-Ziv residual coding for wireless multi-view system
Proceedings of SPIE (January 28 2007)
Complexity analysis of sprites in MPEG-4
Proceedings of SPIE (March 28 2001)

Back to Top