One approach to computer object recognition and modeling the brain's ventral stream involves unsupervised learning of representations that are invariant to common transformations. However, applications of these ideas have usually been limited to 2D affine transformations, e.g., translation and scaling, since they are easiest to solve via convolution. In accord with a recent theory of transformation-invariance, we propose a model that, while capturing other common convolutional networks as special cases, can also be used with arbitrary identity-preserving transformations. The model's wiring can be learned from videos of transforming objects, or from any other grouping of images into sets by their depicted object. Through a series of successively more complex empirical tests, we study the invariance/discriminability properties of this model with respect to different transformations. First, we empirically confirm theoretical predictions for the case of 2D affine transformations. Next, we apply the model to non-affine transformations: as expected, it performs well on face verification tasks requiring invariance to the relatively smooth transformations of 3D rotation-in-depth and changes in illumination direction. Surprisingly, it can also tolerate clutter "transformations" which map an image of a face on one background to an image of the same face on a different background. Motivated by these empirical findings, we tested the same model on face verification benchmark tasks from the computer vision literature: Labeled Faces in the Wild, PubFig, and a new dataset we gathered. The model achieves strong performance in these highly unconstrained cases as well.
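The abstract does not spell out the computation, so the following is only a minimal sketch of one way a transformation-invariant representation could be built, under stated assumptions: each stored template is kept together with its transformed versions (its orbit), and the signature of an image pools the dot products between the image and every element of each orbit. The 1D cyclic shifts, the max pooling, and the names `cyclic_shifts`, `signature`, and `template_orbits` are hypothetical choices for illustration, not the authors' implementation.

```python
import random


def cyclic_shifts(vec):
    """Return the full orbit of vec under cyclic translation (assumed toy group)."""
    return [vec[k:] + vec[:k] for k in range(len(vec))]


def dot(a, b):
    """Plain dot product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def signature(image, template_orbits, pool=max):
    """One pooled value per template: pool the dot products of the
    image with every transformed version of that template."""
    return [pool(dot(image, t) for t in orbit) for orbit in template_orbits]


rng = random.Random(0)
n = 8

# Hypothetical random templates; in the abstract's setting the orbits would be
# gathered from videos of transforming objects rather than generated.
templates = [[rng.gauss(0, 1) for _ in range(n)] for _ in range(3)]
template_orbits = [cyclic_shifts(t) for t in templates]

image = [rng.gauss(0, 1) for _ in range(n)]
shifted = image[3:] + image[:3]  # the same "object", translated

sig_a = signature(image, template_orbits)
sig_b = signature(shifted, template_orbits)

# The two signatures agree up to floating-point rounding.
print(all(abs(a - b) < 1e-9 for a, b in zip(sig_a, sig_b)))
```

Because each orbit here contains every cyclic shift, translating the image only permutes the set of dot products being pooled, so the pooled value is unchanged. For transformations that do not form such a complete group (rotation in depth, illumination, background clutter), a construction like this would give at best approximate invariance.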
Author Information
Qianli Liao (Massachusetts Institute of Technology)
Joel Leibo (Google DeepMind)
Tomaso Poggio (Massachusetts Institute of Technology)
More from the Same Authors
- 2022: System identification of neural systems: If we got it right, would we know?
  Yena Han · Tomaso Poggio · Brian Cheung
- 2023 Poster: Norm-based Generalization Bounds for Sparse Neural Networks
  Tomer Galanti · Mengjia Xu · Liane Galanti · Tomaso Poggio
- 2021: AI X Neuroscience
  Tomaso Poggio
- 2018: Tomaso Poggio (MIT): Dynamical System Theory for Deep Learning
  Tomaso Poggio
- 2017 Symposium: Kinds of intelligence: types, tests and meeting the needs of society
  José Hernández-Orallo · Zoubin Ghahramani · Tomaso Poggio · Adrian Weller · Matthew Crosby
- 2016 Workshop: Learning, Inference and Control of Multi-Agent Systems
  Thore Graepel · Marc Lanctot · Joel Leibo · Guy Lever · Janusz Marecki · Frans Oliehoek · Karl Tuyls · Vicky Holgate
- 2011 Poster: Why The Brain Separates Face Recognition From Object Recognition
  Joel Leibo · Jim Mutch · Tomaso Poggio