26 September 2017
Deep linear autoencoder and patch clustering-based unified one-dimensional coding of image and video
Abstract
This paper proposes a unified one-dimensional (1-D) coding framework for image and video that relies on a deep neural network and image patch clustering. First, an improved K-means clustering algorithm for image patches is employed to obtain compact inputs for the deep artificial neural network. Second, to best reconstruct the original image patches, the deep linear autoencoder (DLA), a linear version of the classical deep nonlinear autoencoder, is introduced to obtain a 1-D representation of image blocks. With a 1-D representation, the DLA is capable of attaining zero reconstruction error, which is impossible for classical nonlinear dimensionality reduction methods. Third, a unified 1-D coding infrastructure for image, intraframe, interframe, multiview video, three-dimensional (3-D) video, and multiview 3-D video is built by incorporating the different categories of video into the inputs of the patch clustering algorithm. Finally, simulation experiments show that, for low-bitrate transmission, the proposed methods simultaneously achieve a higher compression ratio and a higher peak signal-to-noise ratio than state-of-the-art methods.
© 2017 SPIE and IS&T 1017-9909/2017/$25.00
Honggui Li "Deep linear autoencoder and patch clustering-based unified one-dimensional coding of image and video," Journal of Electronic Imaging 26(5), 053016 (26 September 2017). https://doi.org/10.1117/1.JEI.26.5.053016
Received: 23 May 2017; Accepted: 6 September 2017; Published: 26 September 2017
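The first two steps of the abstract (patch clustering, then a 1-D linear code per group of patches) can be illustrated with a rough sketch. Everything below is an assumption for illustration, not the paper's method: the abstract does not specify the improved K-means variant or how the DLA is trained, so the sketch uses plain Lloyd-style K-means and the closed-form least-squares solution for a single linear encoder/decoder pair with a 1-D bottleneck (projection onto the first principal direction). The function names `extract_patches`, `kmeans`, and `linear_code_1d` are hypothetical. In this simplified setting the reconstruction error is zero only when the patches of a cluster lie on a single line; how the paper attains zero error is not stated in the abstract.

```python
import numpy as np


def extract_patches(image, size):
    """Split a 2-D image into non-overlapping size x size patches (one per row)."""
    h, w = image.shape
    h, w = h - h % size, w - w % size          # drop any incomplete border
    blocks = image[:h, :w].reshape(h // size, size, w // size, size)
    return blocks.transpose(0, 2, 1, 3).reshape(-1, size * size)


def kmeans(x, k, iters=20, seed=0):
    """Plain Lloyd K-means (assumption: stands in for the paper's improved variant)."""
    rng = np.random.default_rng(seed)
    centers = x[rng.choice(len(x), size=k, replace=False)].astype(float)
    labels = np.zeros(len(x), dtype=int)
    for _ in range(iters):
        dist = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        for j in range(k):
            members = x[labels == j]
            if len(members):                   # keep the old centre if a cluster empties
                centers[j] = members.mean(axis=0)
    return labels, centers


def linear_code_1d(x):
    """Least-squares 1-D linear code: one scalar per row, plus its reconstruction."""
    mean = x.mean(axis=0)
    centred = x - mean
    _, _, vt = np.linalg.svd(centred, full_matrices=False)
    direction = vt[0]                          # first principal direction
    codes = centred @ direction                # encoder: 1 number per patch
    recon = mean + np.outer(codes, direction)  # decoder: back to patch space
    return codes, recon


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    image = rng.random((32, 32))               # synthetic stand-in for a real image
    patches = extract_patches(image, 4)
    labels, _ = kmeans(patches, k=4)
    for j in range(4):
        members = patches[labels == j]
        if len(members) == 0:
            continue
        _, recon = linear_code_1d(members)
        mse = float(((members - recon) ** 2).mean())
        print(f"cluster {j}: {len(members)} patches, 1-D code MSE = {mse:.5f}")
```

On random data such as the synthetic image above, the per-cluster error is generally nonzero; clustering matters because it groups similar patches, which makes each group easier to describe with a single scalar per patch.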
KEYWORDS
Image compression

Video coding

Simulation of CCA and DLA aggregates

Video

Reconstruction algorithms

3D image processing

Chromium
