KEYWORDS: Scalable video coding, Video, Video coding, Wavelets, Computer programming, Image compression, JPEG2000, Spatial resolution, Video compression, Signal to noise ratio
This paper develops a new intraframe scalable coding framework based on a subband/wavelet coding approach for MPEG-4 AVC/H.264 scalable video coding (SVC). It attempts to join the subband filter banks with the traditional macroblock and DCT based video coding system. We demonstrate that the current H.264 coding system can be efficiently integrated with the traditional subband filter banks for providing the improved efficiency for intraframe scalable coding. More importantly, unlike the classical wavelet coding, the proposed framework still allows the downsampling filter to be flexibly designed to generate the ideal low resolution video for target applications. Under a wavelet critical sampling setting, the proposed system can perform similar to a conventional single-layer (non-scalable) coder without any performance overhead. Thus, by just reuse of the existing H.264 coding tools to work together with the
added subband filter banks, we can provide the H.264 standard with an alternative intraframe coding scheme that is primarily based on the transform coding approach with the additional spatial scalability and other attractive benefits and useful for both scalable and conventional single-layer intraframe coding applications. The proposed algorithm has been thoroughly evaluated against the current SVC test model JSVM, Motion-JPEG2000, and H.264 high-profile intra coding through extensive coding experiments for both scalable coding and single-layer coding. The simulation results show the proposed algorithm consistently outperforms JSVM and is competitive to Motion-JPEG2000 in PSNR performance. As such, the efficient and highly scalable wavelet image/video compression, as demonstrated by JPEG2000, can be additionally accommodated by the slightly modified MPEG-4 AVC/H.264 standard with low extra implementation costs. Image and video coding applications, traditionally serviced by separate coders, can be efficiently provided by an integrated coding system.
KEYWORDS: Video, Spatial resolution, Chemical species, Signal to noise ratio, Computer programming, Video coding, Data modeling, Video compression, Scalable video coding, Temporal resolution
Recently a methodology for representation and adaptation of arbitrary scalable bit-streams in a fully content non-specific manner has been proposed on the basis of a universal model for all scalable bit-streams called Scalable Structured Meta-formats (SSM). According to this model, elementary scalable bit-streams are naturally organized in a symmetric multi-dimensional logical structure. The model parameters for a specific bit-stream along with information guiding decision-making among possible adaptation choices are represented in a binary or XML descriptor to accompany the bit-stream flowing downstream. The capabilities and preferences of receiving terminals flow upstream and are also specified in binary or XML form to represent constraints that guide adaptation. By interpreting the descriptor and the constraint specifications, a universal adaptation engine sitting on a network node can adapt the content appropriately to suit the specified needs and preferences of recipients, without knowledge of the specifics of the content, its encoding and/or encryption. In this framework, different adaptation infrastructures are no longer needed for different types of scalable media. In this work, we show how this framework can be used to adapt fully scalable video bit-streams, specifically ones obtained by the fully scalable MC-EZBC video coding system. MC-EZBC uses a 3-D subband/wavelet transform that exploits correlation by filtering along motion trajectories, to obtain a 3-dimensional scalable bit-stream combining temporal, spatial and SNR scalability in a compact bit-stream. Several adaptation use cases are presented to demonstrate the flexibility and advantages of a fully scalable video bit-stream when used in conjunction with a network adaptation engine for transmission.
Former research on perceptual image coding was mainly developed in the traditional sequential coding framework, where the codestream is neither rate nor resolution scalable. In this paper, our earlier embedded subband/wavelet image coding algorithm EZBC is further developed for highly scalable image coding applications. Special attention is given to perceptual image coding under varying viewing/display conditions --- a common situation in typical scalable coding application environments. Unlike the conventional perceptual image coding approach, all the perceptually coded images (individually targeted at particular viewing conditions) are decoded from a single compressed bitstream file. The experimental results show the bitrate savings by the proposed algorithm are significant, particularly for coding of high-definition (HD) images.
KEYWORDS: Video, Video coding, Composites, Motion models, Motion analysis, 3D image processing, Video compression, Error analysis, Distortion, 3D video compression
Three-dimensional subband coding with motion compensation (MC- 3DSBC) has been demonstrated [1-4] to be an efficient technique for video coding applications. With half-pixel- accurate motion compensation, images need to be interpolated for motion-compensated (MC) temporal filtering. The resulting analysis/synthesis system is not invertible. In this paper, we propose a new three-dimensional analysis/synthesis system which guarantees perfect reconstruction and has a nonrecursive coding structure. We replaced the analysis/synthesis system of [1] by the new scheme. The resulting coding system does not have distortion from the analysis/synthesis system and allocate bits among classes of 3-D subbands optimally in the sense of rate-distortion function. The experimental results show that the proposed video coding system improves [1] by PSNR .3 - 2.0 dB and TM5 MPEG [10] by PSNR 2.1 - 3.0 dB over a range of bit rates.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.