29 January 2007 Using machine learning for fast intra MB coding in H.264
H.264 is a highly efficient but complex video codec, and this complexity makes it difficult to use all of its features on resource-constrained mobile devices. This paper presents a machine learning approach to reducing the complexity of Intra encoding in H.264. Determining the macroblock (MB) coding mode requires substantial computational resources in H.264 video encoding. The goal of this work is to reduce MB mode determination from a search operation, as is done in today's encoders, to a computation. We have developed a methodology based on machine learning that computes the MB coding mode instead of searching for the best match, reducing the complexity of Intra 16x16 coding by a factor of 17 and of Intra 4x4 MB coding by a factor of 12.5. The proposed approach uses simple mean value metrics at the block level to characterize the coding complexity of a macroblock. A generic J4.8 classifier is used to build decision trees that quickly determine the mode. The results show that the Intra MB mode can be determined with over 90% accuracy. The proposed approach can also be used to determine MB prediction modes, with accuracy varying between 70% and 80%.
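The idea of computing the MB mode from block-level mean values rather than searching for it can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names `block_means` and `choose_mb_mode`, the use of 4x4 sub-block means as features, the variance-of-means test, and the threshold value. The trained decision trees would use their own features and split points.

```python
from statistics import mean, pvariance

MB_SIZE = 16   # a macroblock covers 16x16 luma samples
SUB_SIZE = 4   # sub-block size used for the mean features (assumption)


def block_means(mb):
    """Return the 16 mean values of the 4x4 sub-blocks of a 16x16 macroblock.

    `mb` is a list of 16 rows, each a list of 16 luma samples.
    """
    means = []
    for by in range(0, MB_SIZE, SUB_SIZE):
        for bx in range(0, MB_SIZE, SUB_SIZE):
            samples = [mb[y][x]
                       for y in range(by, by + SUB_SIZE)
                       for x in range(bx, bx + SUB_SIZE)]
            means.append(mean(samples))
    return means


def choose_mb_mode(mb, spread_threshold=25.0):
    """Hypothetical single-node decision tree (threshold is illustrative only).

    A macroblock whose sub-block means barely differ is treated as smooth
    and assigned Intra 16x16; otherwise it is assigned Intra 4x4.
    """
    spread = pvariance(block_means(mb))
    return "Intra16x16" if spread <= spread_threshold else "Intra4x4"


# Usage: a flat macroblock versus one whose 4x4 sub-blocks alternate 0 and 255.
flat_mb = [[128] * MB_SIZE for _ in range(MB_SIZE)]
checker_mb = [[0 if ((x // SUB_SIZE) + (y // SUB_SIZE)) % 2 == 0 else 255
               for x in range(MB_SIZE)]
              for y in range(MB_SIZE)]
print(choose_mb_mode(flat_mb))
print(choose_mb_mode(checker_mb))
```

The point of the sketch is the cost structure: the mode follows from a fixed number of arithmetic operations and comparisons per macroblock, with no candidate modes being encoded and compared.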
© (2007) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Hari Kalva and Lakis Christodoulou "Using machine learning for fast intra MB coding in H.264", Proc. SPIE 6508, Visual Communications and Image Processing 2007, 65082U (29 January 2007); doi: 10.1117/12.706024; https://doi.org/10.1117/12.706024


