Timezone: »
Learning object-centric representations of multi-object scenes is a promising approach towards machine intelligence, facilitating high-level reasoning and control from visual sensory data. However, current approaches for \textit{unsupervised object-centric scene representation} are incapable of aggregating information from multiple observations of a scene. As a result, these ``single-view'' methods form their representations of a 3D scene based only on a single 2D observation (view). Naturally, this leads to several inaccuracies, with these methods falling victim to single-view spatial ambiguities. To address this, we propose \textit{The Multi-View and Multi-Object Network (MulMON)}---a method for learning accurate, object-centric representations of multi-object scenes by leveraging multiple views. In order to sidestep the main technical difficulty of the \textit{multi-object-multi-view} scenario---maintaining object correspondences across views---MulMON iteratively updates the latent object representations for a scene over multiple views. To ensure that these iterative updates do indeed aggregate spatial information to form a complete 3D scene understanding, MulMON is asked to predict the appearance of the scene from novel viewpoints during training. Through experiments we show that MulMON better-resolves spatial ambiguities than single-view methods---learning more accurate and disentangled object representations---and also achieves new functionality in predicting object segmentations for novel viewpoints.
Author Information
Nanbo Li (University of Edinburgh)
Cian Eastwood (University of Edinburgh)
Robert Fisher (University of Edinburgh)
Related Events (a corresponding poster, oral, or spotlight)
-
2020 Poster: Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views »
Wed. Dec 9th 05:00 -- 07:00 PM Room Poster Session 3 #949
More from the Same Authors
-
2023 : Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations »
Cian Eastwood · Julius von Kügelgen · Linus Ericsson · Diane Bouchacourt · Pascal Vincent · Mark Ibrahim · Bernhard Schölkopf -
2023 : Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations »
Cian Eastwood · Julius von Kügelgen · Linus Ericsson · Diane Bouchacourt · Pascal Vincent · Bernhard Schölkopf · Mark Ibrahim -
2023 Workshop: Causal Representation Learning »
Sara Magliacane · Atalanti Mastakouri · Yuki Asano · Claudia Shi · Cian Eastwood · Sébastien Lachapelle · Bernhard Schölkopf · Caroline Uhler -
2023 Poster: Spuriosity Didn’t Kill the Classifier: Using Invariant Predictions to Harness Spurious Features »
Cian Eastwood · Shashank Singh · Andrei L Nicolicioiu · Marin Vlastelica Pogančić · Julius von Kügelgen · Bernhard Schölkopf -
2022 Poster: Probable Domain Generalization via Quantile Risk Minimization »
Cian Eastwood · Alexander Robey · Shashank Singh · Julius von Kügelgen · Hamed Hassani · George J. Pappas · Bernhard Schölkopf -
2021 : Spotlight Talk 8 »
Cian Eastwood -
2021 Poster: Object-Centric Representation Learning with Generative Spatial-Temporal Factorization »
Nanbo Li · Muhammad Ahmed Raza · Wenbin Hu · Zhaole Sun · Robert Fisher -
2017 : Poster session + Coffee break »
Mikael Kågebäck · Igor Melnyk · Amir-Hossein Karimi · Gino Brunner · Ershad Banijamali · Chris Donahue · Jake Zhao · Giambattista Parascandolo · Valentin Thomas · Abhishek Kumar · Chris Burgess · Amanda Nilsson · Maria Larsson · Cian Eastwood · Momchil Peychev