Timezone: »
We present a method for learning active vision skills, for moving the camera to observe a robot's sensors from informative points of view, without external rewards or labels. We do this by jointly training a visual predictor network, which predicts future returns of the sensors using pixels, and a camera control agent, which we reward using the negative error of the predictor. The agent thus moves the camera to points of view that are most predictive for a target sensor, which we select using a conditioning input to the agent. We show that despite this noisy learned reward function, the learned policies are competent, and precisely frame the sensor to a specific location in the view, which we call an emergent fovea. We find that replacing the conventional camera with a foveal camera further increases the policies' precision.
Author Information
Matthew Grimes (DeepMind)
Joseph Modayil (DeepMind)
Piotr Mirowski (DeepMind)
Dushyant Rao (DeepMind)
Raia Hadsell (DeepMind)
More from the Same Authors
-
2021 : Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies »
Dushyant Rao · Fereshteh Sadeghi · Leonard Hasenclever · Markus Wulfmeier · Martina Zambelli · Giulia Vezzani · Dhruva Tirumala · Yusuf Aytar · Josh Merel · Nicolas Heess · Raia Hadsell -
2022 : Adapting the Function Approximation Architecture in Online Reinforcement Learning »
John Martin · Joseph Modayil · Fatima Davelouis · Michael Bowling -
2022 : Co-writing screenplays and theatre scripts alongside language models using Dramatron »
Piotr Mirowski · Kory Mathewson · Jaylen Pittman · Richard EVANS -
2022 : Adapting the Function Approximation Architecture in Online Reinforcement Learning »
Fatima Davelouis · John Martin · Joseph Modayil · Michael Bowling -
2022 : PORTAGING (live AI Performance) »
Kory Mathewson · Piotr Mirowski · Hannah Johnston · Tom White · Jason Baldridge -
2022 : Generative Collage and its Sticky Questions on Human-AI Co-Creativity »
Piotr Mirowski -
2022 : Learning to Look by Self-Prediction »
Matthew Grimes -
2019 Poster: Continual Unsupervised Representation Learning »
Dushyant Rao · Francesco Visin · Andrei A Rusu · Razvan Pascanu · Yee Whye Teh · Raia Hadsell -
2018 Poster: Learning to Navigate in Cities Without a Map »
Piotr Mirowski · Matt Grimes · Mateusz Malinowski · Karl Moritz Hermann · Keith Anderson · Denis Teplyashin · Karen Simonyan · koray kavukcuoglu · Andrew Zisserman · Raia Hadsell -
2017 Poster: Natural Value Approximators: Learning when to Trust Past Estimates »
Zhongwen Xu · Joseph Modayil · Hado van Hasselt · Andre Barreto · David Silver · Tom Schaul -
2017 Spotlight: Natural Value Approximators: Learning when to Trust Past Estimates »
Zhongwen Xu · Joseph Modayil · Hado van Hasselt · Andre Barreto · David Silver · Tom Schaul