1 November 1991 Rotation and scale invariant pattern recognition using a multistaged neural network
Author Affiliations +
This paper presents a pattern recognition system that self-organizes to recognize objects by shape. The images are processed using a log-polar transformation that maps rotations and magnifications into representative translations. The systems then uses a multistaged hierarchical neural network that exhibits insensitivity to translations in representation space, which corresponds to rotations and scalings in the image space. The network's three layers perform the functionally disjoint tasks of preprocessing (dynamic thresholding), invariance (position normalization), and recognition (identification of the shape). The Preprocessing stage uses a single layer of elements to dynamically threshold the grey level input image into a binary image. The Invariance stage is a multilayered neural network implementation of a modified Walsh-Hadamard transform that generates a representation of the object that is invariant with respect to the object's position, which maps back to an invariance to rotational orientation and/or size. The Recognition stage is a modified version of Fukushima's Neocognitron that identifies the normalized representation by shape. The resulting network can successfully recognize objects that have been rotated, scaled, or a combination of both. The network uses a small number of fairly simple elements, a subset of which self-organize to produce the recognition performance.
© (1991) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jay I. Minnix, Jay I. Minnix, Eugene S. McVey, Eugene S. McVey, Rafael M. Inigo, Rafael M. Inigo, } "Rotation and scale invariant pattern recognition using a multistaged neural network", Proc. SPIE 1606, Visual Communications and Image Processing '91: Image Processing, (1 November 1991); doi: 10.1117/12.50321; https://doi.org/10.1117/12.50321

Back to Top