31 July 2006 Overcomplete MCTF for improved spatial scalability in 3D wavelet video compression
Author Affiliations +
Proceedings Volume 5960, Visual Communications and Image Processing 2005; 59600N (2006) https://doi.org/10.1117/12.631556
Event: Visual Communications and Image Processing 2005, 2005, Beijing, China
As of its open-loop structure and good decorrelation capability, motion-compensated temporal filtering (MCTF) provides a robust basis for highly-efficient scalable video coding. Combining MCTF with spatial wavelet decomposition and embedded quantization results in a 3D wavelet video compression system, providing temporal, spatial, and SNR scalability. Recent results indicate that the overall coding performance of these systems can be maximized if temporal filtering is performed in spatial domain (t+2D approach). However, as compared to non-scalable video coding, the performance of t+2D systems may not be satisfactory if spatial scalability needs to be provided. One important reason for this fact is the problem of spatial scalability of motion information. In this paper we present a conceptually new approach for t+2D-based video compression with spatially scalable motion information. We call our approach overcomplete MCTF since multiple spatial-domain temporal filtering operations are needed to generate the lower spatial scales of the temporal subbands. Specifically, the encoder performs MCTF-based generation of reference sequences for the coarser spatial scales. We find that the newly generated reference sequences are of satisfactory quality. Compared to the conventional t+2D system, our approach allows for optimization of the reconstruction quality at lower spatial scales while having reduced impact on the reconstruction quality at high spatial scales/bitrates.
© (2006) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Thomas Rusert, Jens-Rainer Ohm, "Overcomplete MCTF for improved spatial scalability in 3D wavelet video compression", Proc. SPIE 5960, Visual Communications and Image Processing 2005, 59600N (31 July 2006); doi: 10.1117/12.631556; https://doi.org/10.1117/12.631556

Back to Top