Paper
29 January 2007 Using machine learning for fast intra MB coding in H.264
Hari Kalva, Lakis Christodoulou
Author Affiliations +
Proceedings Volume 6508, Visual Communications and Image Processing 2007; 65082U (2007) https://doi.org/10.1117/12.706024
Event: Electronic Imaging 2007, 2007, San Jose, CA, United States
Abstract
H.264 is a highly efficient and complex video codec. The complexity of the codec makes it difficult to use all its features in resource constrained mobile devices. This paper presents a machine learning approach to reducing the complexity of Intra encoding in H.264. Determining the macro block coding mode requires substantial computational resources in H.264 video encoding. The goal of this work to reduce MB mode computation from a search operation, as is done in the encoders today, to a computation. We have developed a methodology based on machine learning that computes the MB coding mode instead of searching for the best match thus reducing the complexity of Intra 16x16 coding by 17 times and Intra 4x4 MB coding by 12.5 times. The proposed approach uses simple mean value metrics at the block level to characterize the coding complexity of a macro block. A generic J4.8 classifier is used to build the decision trees to quickly determine the mode. We present a methodology for Intra MB coding. The results show that intra MB mode can be determined with over 90% accuracy. The proposed can also be used for determining MB prediction modes with an accuracy varying between 70% and 80%.
© (2007) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Hari Kalva and Lakis Christodoulou "Using machine learning for fast intra MB coding in H.264", Proc. SPIE 6508, Visual Communications and Image Processing 2007, 65082U (29 January 2007); https://doi.org/10.1117/12.706024
Lens.org Logo
CITATIONS
Cited by 5 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Computer programming

Machine learning

Video

Video coding

Video compression

Detection and tracking algorithms

Video processing

Back to Top