Timezone: »
A fundamental task in data exploration is to extract simplified low dimensional representations that capture intrinsic geometry in data, especially for faithfully visualizing data in two or three dimensions. Common approaches to this task use kernel methods for manifold learning. However, these methods typically only provide an embedding of fixed input data and cannot extend to new data points. Autoencoders have also recently become popular for representation learning. But while they naturally compute feature extractors that are both extendable to new data and invertible (i.e., reconstructing original features from latent representation), they have limited capabilities to follow global intrinsic geometry compared to kernel-based manifold learning. We present a new method for integrating both approaches by incorporating a geometric regularization term in the bottleneck of the autoencoder. Our regularization, based on the diffusion potential distances from the recently-proposed PHATE visualization method, encourages the learned latent representation to follow intrinsic data geometry, similar to manifold learning algorithms, while still enabling faithful extension to new data and reconstruction of data in the original feature space from latent coordinates. We compare our approach with leading kernel methods and autoencoder models for manifold learning to provide qualitative and quantitative evidence of our advantages in preserving intrinsic structure, out of sample extension, and reconstruction.
Author Information
Andres F Duque (Utah State University)
Sacha Morin (Université de Montréal)
Guy Wolf (Université de Motréal; Mila)
Kevin Moon (Utah State University)
More from the Same Authors
-
2021 : Multimodal Single-Cell Data Integration + Q&A »
Daniel Burkhardt · Smita Krishnaswamy · Malte Luecken · Debora Marks · Angela Pisco · Bastian Rieck · Jian Tang · Alexander Tong · Fabian Theis · Guy Wolf -
2020 : Closing Remarks »
Frederic Chazal · Smita Krishnaswamy · Roland Kwitt · Karthikeyan Natesan Ramamurthy · Bastian Rieck · Yuhei Umeda · Guy Wolf -
2020 Workshop: Topological Data Analysis and Beyond »
Bastian Rieck · Frederic Chazal · Smita Krishnaswamy · Roland Kwitt · Karthikeyan Natesan Ramamurthy · Yuhei Umeda · Guy Wolf -
2020 : Opening Remarks »
Frederic Chazal · Smita Krishnaswamy · Roland Kwitt · Karthikeyan Natesan Ramamurthy · Bastian Rieck · Yuhei Umeda · Guy Wolf -
2020 Poster: Scattering GCN: Overcoming Oversmoothness in Graph Convolutional Networks »
Yimeng Min · Frederik Wenkel · Guy Wolf -
2020 Poster: Uncovering the Topology of Time-Varying fMRI Data using Cubical Persistence »
Bastian Rieck · Tristan Yates · Christian Bock · Karsten Borgwardt · Guy Wolf · Nicholas Turk-Browne · Smita Krishnaswamy -
2020 Spotlight: Uncovering the Topology of Time-Varying fMRI Data using Cubical Persistence »
Bastian Rieck · Tristan Yates · Christian Bock · Karsten Borgwardt · Guy Wolf · Nicholas Turk-Browne · Smita Krishnaswamy -
2018 Poster: Geometry Based Data Generation »
Ofir Lindenbaum · Jay Stanley · Guy Wolf · Smita Krishnaswamy -
2018 Spotlight: Geometry Based Data Generation »
Ofir Lindenbaum · Jay Stanley · Guy Wolf · Smita Krishnaswamy -
2014 Poster: Multivariate f-divergence Estimation With Confidence »
Kevin Moon · Alfred Hero