JPEG XR (formerly Microsoft Windows Media Photo and HD Photo) is the latest image coding standard. By integrating
various advanced technologies such as integer hierarchical lapped transform, context adaptive Huffman coding, and high
dynamic range coding, it achieves competitive performance to JPEG-2000, but with lower computational complexity
and memory requirement. In this paper, the GPU implementation of the JPEG XR codec using NVIDIA CUDA
(Compute Unified Device Architecture) technology is investigated. Design considerations to speed up the algorithm are
discussed, by taking full advantage of the properties of the CUDA framework and JPEG XR. Experimental results are
presented to demonstrate the performance of the GPU implementation.