This paper presents a very simple document recognition method called "segment coding". The method entails partitioning of a document into square segments of a given size and encode each segment according to the ratio of the numbers of black and white pixels inside the segment. The segment code is used as document feature for recognition. We show an experimental prototype which performs extremely well. We also show some analytical results including system parameter optimization and system performance versus database size. It is shown that the system is fast, flexible, extremely accurate and can accommodate a huge number of documents without significantly degrading recognition accuracy.
T. Y. Wang,
C. C. Lee,
"Segment Coding and Automated Document Recognition", Proc. SPIE 1199, Visual Communications and Image Processing IV, (1 November 1989); doi: 10.1117/12.970148; https://doi.org/10.1117/12.970148