Timezone: »
Given an image dataset, we are often interested in finding data generative factors that encode semantic content independently from pose variables such as rotation and translation. However, current disentanglement approaches do not impose any specific structure on the learned latent representations. We propose a method for explicitly disentangling image rotation and translation from other unstructured latent factors in a variational autoencoder (VAE) framework. By formulating the generative model as a function of the spatial coordinate, we make the reconstruction error differentiable with respect to latent translation and rotation parameters. This formulation allows us to train a neural network to perform approximate inference on these latent variables while explicitly constraining them to only represent rotation and translation. We demonstrate that this framework, termed spatial-VAE, effectively learns latent representations that disentangle image rotation and translation from content and improves reconstruction over standard VAEs on several benchmark datasets, including applications to modeling continuous 2-D views of proteins from single particle electron microscopy and galaxies in astronomical images.
Author Information
Tristan Bepler (MIT)
Ellen Zhong (Massachusetts Institute of Technology)
Kotaro Kelley (New York Structural Biology Center)
Edward Brignole (Massachusetts Institute of Technology)
Bonnie Berger (MIT)
More from the Same Authors
-
2021 : Adapting protein language models for rapid DTI prediction »
Samuel Sledzieski · Rohit Singh · Lenore J Cowen · Bonnie Berger -
2022 : Membrane and microtubule rapid instance segmentation with dimensionless instance segmentation by learning graph representations of point clouds »
Robert Kiewisz · Tristan Bepler -
2022 : Contrasting drugs from decoys »
Samuel Sledzieski · Rohit Singh · Lenore J Cowen · Bonnie Berger -
2023 Poster: PoET: A generative model of protein families as sequences-of-sequences »
Timothy Truong Jr · Tristan Bepler -
2022 Poster: Unsupervised Object Representation Learning using Translation and Rotation Group Equivariant VAE »
Alireza Nasiri · Tristan Bepler -
2021 Workshop: Machine Learning in Structural Biology »
Ellen Zhong · Raphael Townshend · Stephan Eismann · Namrata Anand · Roshan Rao · John Ingraham · Wouter Boomsma · Sergey Ovchinnikov · Bonnie Berger -
2020 : Exploring generative atomic models in cryo-EM reconstruction »
Ellen Zhong · Adam Lerer · · Bonnie Berger -
2020 : Contributed Talks Intro »
Ellen Zhong -
2020 : Morning Poster Session »
Ellen Zhong -
2020 : Andrea Thorn Intro »
Ellen Zhong -
2020 Workshop: Machine Learning for Structural Biology »
Raphael Townshend · Stephan Eismann · Ron Dror · Ellen Zhong · Namrata Anand · John Ingraham · Wouter Boomsma · Sergey Ovchinnikov · Roshan Rao · Per Greisen · Rachel Kolodny · Bonnie Berger -
2020 Poster: Learning Mutational Semantics »
Brian Hie · Ellen Zhong · Bryan Bryson · Bonnie Berger