29 January 2007 Pixel and semantic capabilities from an image-object based document representation
Author Affiliations +
This paper reports on novel and traditional pixel and semantic operations using a recently standardized document representation called JPM. The JPM representation uses compressed pixel arrays for all visible elements on a page. Separate data containers called boxes provide the layout and additional semantic information. JPM and related image-based document representation standards were designed to obtain the most rate efficient document compression. The authors, however, use this representation directly for operations other than compression typically performed either on pixel arrays or semantic forms. This paper describes the image representation used in the JPM standard and presents techniques to (1) perform traditional raster-based document analysis on the compressed data, (2) transmit semantically meaningful portions of compressed data between devices, (3) create multiple views from one compressed data stream, and (4) edit high resolution document images with only low resolution proxy images.
© (2007) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Michael Gormish, Michael Gormish, Kathrin Berkner, Kathrin Berkner, Martin Boliek, Martin Boliek, Guotong Feng, Guotong Feng, Edward L. Schwartz, Edward L. Schwartz, } "Pixel and semantic capabilities from an image-object based document representation", Proc. SPIE 6500, Document Recognition and Retrieval XIV, 65000I (29 January 2007); doi: 10.1117/12.704502; https://doi.org/10.1117/12.704502


Back to Top