Document image blocks characterized using projection profiles is proposed. We have collected statistical information on scanned pages of technical articles as a by-product of digitized document analysis. Specifically, in our hierarchical block segmentation and labeling approach (syntactic), 65 training and test pages from two publications were used. Additional information on compression and profiles was also used. Pixel-level information is required as input whether the analyzing tool is an expert system or something else. The issues covered are: (1) Profile characteristics of document objects like text, line drawings, tables, and half-tones, and the variation of these profiles with block size, type size, and direction of scan. (2) Speckle noise: sizes and distribution. (3) For the hierarchical (syntactic) approach, number of tree nodes at each level along with their areas, and comparison with node areas derived from transition-cut trees. (4) CCITT-Group 4 compression statistics on document sub-blocks and whole pages. (5) Size of postscript files and postscript commands used in printing these page files. We believe that these results would allow predicting some characteristics of a printed page digitized at any specified sampling rate.