firstbacksecondback
344 Results
Affinity Event
|
MIMIC: Multimodal Islamophobic Meme Identification and Classification S M Jishanul Islam · Sahid Hossain Mustakim · Sadia Ahmmed · Md. Faiyaz Abdullah Sayeedi · Swapnil Khandoker · Syed Tasdid Azam Dhrubo · Nahid Hossain |
||
Expo Demonstration
|
Tue 15:00 |
Large Multimodal Model running on a mobile device Ron Tindall |
|
Poster
|
Wed 11:00 |
Unified Lexical Representation for Interpretable Visual-Language Alignment Yifan Li · Yikai Wang · Yanwei Fu · Dongyu Ru · Zheng Zhang · Tong He |
|
Poster
|
Fri 16:30 |
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception Xiaotong Li · Fan Zhang · Haiwen Diao · Yueze Wang · Xinlong Wang · LINGYU DUAN |
|
Poster
|
Wed 11:00 |
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions Ziyao Zeng · Yangchao Wu · Hyoungseob Park · Daniel Wang · Fengyu Yang · Stefano Soatto · DONG LAO · Byung-Woo Hong · Alex Wong |
|
Poster
|
Wed 16:30 |
Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs Mustafa Shukor · Matthieu Cord |
|
Poster
|
Thu 16:30 |
MAN TruckScenes: A multimodal dataset for autonomous trucking in diverse conditions Felix Fent · Fabian Kuttenreich · Florian Ruch · Farija Rizwin · Stefan Juergens · Lorenz Lechermann · Christian Nissler · Andrea Perl · Ulrich Voll · Min Yan · Markus Lienkamp |
|
Poster
|
Thu 11:00 |
Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models Nitzan Bitton Guetta · Aviv Slobodkin · Aviya Maimon · Eliya Habba · Royi Rassin · Yonatan Bitton · Idan Szpektor · Amir Globerson · Yuval Elovici |
|
Poster
|
Wed 16:30 |
ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model Yiming Sun · Fan Yu · Shaoxiang Chen · Yu Zhang · Junwei Huang · Yang Li · Chenhui Li · Changbo Wang |
|
Poster
|
Wed 16:30 |
Contrasting with Symile: Simple Model-Agnostic Representation Learning for Unlimited Modalities Adriel Saporta · Aahlad Manas Puli · Mark Goldstein · Rajesh Ranganath |
|
Poster
|
Wed 11:00 |
HAWK: Learning to Understand Open-World Video Anomalies Jiaqi Tang · Hao LU · RUIZHENG WU · Xiaogang Xu · Ke Ma · Cheng Fang · Bin Guo · Jiangbo Lu · Qifeng Chen · Yingcong Chen |
|
Poster
|
Thu 16:30 |
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models Byung-Kwan Lee · Chae Won Kim · Beomchan Park · Yong Man Ro |