Recent successes in human action recognition with deep learning methods mostly adopt the supervised learning paradigm, which requires a significant amount of manually labeled data to achieve good performance. However, label collection is an expensive and time-consuming process. In this work, we propose an unsupervised learning framework that exploits unlabeled data to learn video representations. Unlike previous work on video representation learning, our unsupervised learning task is to predict 3D motion in multiple target views using the video representation from a source view. By learning to extrapolate cross-view motions, the representation can capture view-invariant motion dynamics that are discriminative for the action. In addition, we propose a view-adversarial training method to enhance the learning of view-invariant features. We demonstrate the effectiveness of the learned representations for action recognition on multiple datasets.
Author Information
Junnan Li (National University of Singapore)
Yongkang Wong (National University of Singapore)
Qi Zhao (University of Minnesota)
Mohan Kankanhalli (National University of Singapore)
More from the Same Authors
- 2021 Spotlight: Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
  Junnan Li · Ramprasaath Selvaraju · Akhilesh Gotmare · Shafiq Joty · Caiming Xiong · Steven Chu Hong Hoi
- 2022 Poster: Don't Pour Cereal into Coffee: Differentiable Temporal Logic for Temporal Action Segmentation
  Ziwei Xu · Yogesh Rawat · Yongkang Wong · Mohan Kankanhalli · Mubarak Shah
- 2021 Poster: Learning to Predict Trustworthiness with Steep Slope Loss
  Yan Luo · Yongkang Wong · Mohan Kankanhalli · Qi Zhao
- 2021 Poster: Unsupervised Motion Representation Learning with Capsule Autoencoders
  Ziwei Xu · Xudong Shen · Yongkang Wong · Mohan Kankanhalli
- 2021 Poster: Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
  Junnan Li · Ramprasaath Selvaraju · Akhilesh Gotmare · Shafiq Joty · Caiming Xiong · Steven Chu Hong Hoi
- 2019 Poster: Embedding Symbolic Knowledge into Deep Networks
  Yaqi Xie · Ziwei Xu · Kuldeep S Meel · Mohan Kankanhalli · Harold Soh