19 January 2009 Hardware-friendly mixed content compression algorithm
Author Affiliations +
The mixed content compression (MCC) algorithm developed in this research provides a hardware efficient solution for compression of scanned compound document images. MCC allows for an easy implementation in imaging pipeline hardware by using only an 8 row buffer of pixels. MCC uses the JPEG encoder to effectively compress the background and picture content of a document image. The remaining text and line graphics in the image, which require high spatial resolution, but can tolerate low color resolution, are compressed using a JBIG1 encoder and color quantization. To separate the text and graphics from the image, MCC uses a simple mean square error (MSE) block classification algorithm to allow a hardware efficient implementation. Results show that for our comprehensive training suite, the compression ratio average achieved by MCC was 60:1, but JPEG only achieved 35:1. In particular, MCC compression ratios become very high on average (82:1 versus 44:1) for mono text documents, which are very common documents being copied and scanned with all-in-ones. In addition, MCC has an edge sharpening side-effect that is very desirable for the target application.
© (2009) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Maribel Figuera, Maribel Figuera, Peter Majewicz, Peter Majewicz, Charles A. Bouman, Charles A. Bouman, "Hardware-friendly mixed content compression algorithm", Proc. SPIE 7241, Color Imaging XIV: Displaying, Processing, Hardcopy, and Applications, 724114 (19 January 2009); doi: 10.1117/12.805965; https://doi.org/10.1117/12.805965


Embedding digital data on paper in iconic text
Proceedings of SPIE (April 02 1997)
Rate-distortion-based segmentation for MRC compression
Proceedings of SPIE (December 27 2001)
A reduced color approach to high quality cartoon coding
Proceedings of SPIE (September 16 2005)
Watermarking in JPEG bitstream
Proceedings of SPIE (March 20 2005)

Back to Top