12 February 2007 Unsupervised learning of a steerable basis for invariant image representations
Author Affiliations +
There are two aspects to unsupervised learning of invariant representations of images: First, we can reduce the dimensionality of the representation by finding an optimal trade-off between temporal stability and informativeness. We show that the answer to this optimization problem is generally not unique so that there is still considerable freedom in choosing a suitable basis. Which of the many optimal representations should be selected? Here, we focus on this second aspect, and seek to find representations that are invariant under geometrical transformations occuring in sequences of natural images. We utilize ideas of 'steerability' and Lie groups, which have been developed in the context of filter design. In particular, we show how an anti-symmetric version of canonical correlation analysis can be used to learn a full-rank image basis which is steerable with respect to rotations. We provide a geometric interpretation of this algorithm by showing that it finds the two-dimensional eigensubspaces of the average bivector. For data which exhibits a variety of transformations, we develop a bivector clustering algorithm, which we use to learn a basis of generalized quadrature pairs (i.e. 'complex cells') from sequences of natural images.
© (2007) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Matthias Bethge, Matthias Bethge, Sebastian Gerwinn, Sebastian Gerwinn, Jakob H. Macke, Jakob H. Macke, "Unsupervised learning of a steerable basis for invariant image representations", Proc. SPIE 6492, Human Vision and Electronic Imaging XII, 64920C (12 February 2007); doi: 10.1117/12.711119; https://doi.org/10.1117/12.711119

Back to Top