firstbacksecondback
73 Results
Poster
|
Tue 9:00 |
CAESAR: An Embodied Simulator for Generating Multimodal Referring Expression Datasets Md Mofijul Islam · Reza Mirzaiee · Alexi Gladstone · Haley Green · Tariq Iqbal |
|
Poster
|
Wed 9:00 |
VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation Kaizhi Zheng · Xiaotong Chen · Odest Chadwicke Jenkins · Xin Wang |
|
Poster
|
Wed 9:00 |
MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification Peirong Zhang · Jiajia Jiang · Yuliang Liu · Lianwen Jin |
|
Poster
|
Thu 14:00 |
CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks Tejas Srinivasan · Ting-Yun Chang · Leticia Pinto Alva · Georgios Chochlakis · Mohammad Rostami · Jesse Thomason |
|
Poster
|
Tue 9:00 |
ActionSense: A Multimodal Dataset and Recording Framework for Human Activities Using Wearable Sensors in a Kitchen Environment Joseph DelPreto · Chao Liu · Yiyue Luo · Michael Foshey · Yunzhu Li · Antonio Torralba · Wojciech Matusik · Daniela Rus |
|
Competition
|
Wed 5:00 |
Multimodal Single-Cell Integration Across Time and Individuals Daniel Burkhardt · Smita Krishnaswamy · Robrecht Cannoodt · Malte Luecken · Jonathan Bloom · Fabian Theis · Christopher Lance · Angela Pisco |
|
Poster
|
Multi-Lingual Acquisition on Multimodal Pre-training for Cross-modal Retrieval Liang Zhang · Anwen Hu · Qin Jin |
||
Poster
|
Thu 9:00 |
Transferring Pre-trained Multimodal Representations with Cross-modal Similarity Matching Byoungjip Kim · Sungik Choi · Dasol Hwang · Moontae Lee · Honglak Lee |
|
Poster
|
Wed 14:00 |
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP Andreas Fürst · Elisabeth Rumetshofer · Johannes Lehner · Viet T. Tran · Fei Tang · Hubert Ramsauer · David Kreil · Michael Kopp · Günter Klambauer · Angela Bitto · Sepp Hochreiter |
|
Poster
|
Wed 9:00 |
MACK: Multimodal Aligned Conceptual Knowledge for Unpaired Image-text Matching Yan Huang · Yuming Wang · Yunan Zeng · Liang Wang |
|
Poster
|
Thu 9:00 |
Can Push-forward Generative Models Fit Multimodal Distributions? Antoine Salmona · Valentin De Bortoli · Julie Delon · Agnes Desolneux |
|
Poster
|
Wed 9:00 |
Robustness Analysis of Video-Language Models Against Visual and Language Perturbations Madeline Chantry · Shruti Vyas · Hamid Palangi · Yogesh Rawat · Vibhav Vineet |