248 Results
Poster | Wed 0:30 | Align before Fuse: Vision and Language Representation Learning with Momentum Distillation | Junnan Li · Ramprasaath Selvaraju · Akhilesh Gotmare · Shafiq Joty · Caiming Xiong · Steven Chu Hong Hoi
Poster | Tue 8:30 | Grounding inductive biases in natural images: invariance stems from variations in data | Diane Bouchacourt · Mark Ibrahim · Ari Morcos
Poster | Tue 8:30 | Do Vision Transformers See Like Convolutional Neural Networks? | Maithra Raghu · Thomas Unterthiner · Simon Kornblith · Chiyuan Zhang · Alexey Dosovitskiy
Poster | Tue 8:30 | Detecting Moments and Highlights in Videos via Natural Language Queries | Jie Lei · Tamara L Berg · Mohit Bansal
Poster | Thu 0:30 | Twins: Revisiting the Design of Spatial Attention in Vision Transformers | Xiangxiang Chu · Zhi Tian · Yuqing Wang · Bo Zhang · Haibing Ren · Xiaolin Wei · Huaxia Xia · Chunhua Shen
Poster | Thu 8:30 | Glance-and-Gaze Vision Transformer | Qihang Yu · Yingda Xia · Yutong Bai · Yongyi Lu · Alan Yuille · Wei Shen
Poster | Wed 0:30 | Mining the Benefits of Two-stage and One-stage HOI Detection | Aixi Zhang · Yue Liao · Si Liu · Miao Lu · Yongliang Wang · Chen Gao · Xiaobo Li
Poster | Thu 0:30 | Discerning Decision-Making Process of Deep Neural Networks with Hierarchical Voting Transformation | Ying Sun · Hengshu Zhu · Chuan Qin · Fuzhen Zhuang · Qing He · Hui Xiong
Poster | Fri 8:30 | ResT: An Efficient Transformer for Visual Recognition | Qinglong Zhang · Yu-Bin Yang
Poster | Tue 16:30 | Direct Multi-view Multi-person 3D Pose Estimation | Tao Wang · Jianfeng Zhang · Yujun Cai · Shuicheng Yan · Jiashi Feng
Poster | Tue 8:30 | SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers | Enze Xie · Wenhai Wang · Zhiding Yu · Anima Anandkumar · Jose M. Alvarez · Ping Luo
Workshop | Mon 7:20 | Summarization in Quantized Transformer Spaces | Mirella Lapata