firstbacksecondback
80 Results
Poster
|
Wed 16:30 |
Unified Generative and Discriminative Training for Multi-modal Large Language Models Wei Chow · Juncheng Li · Qifan Yu · Kaihang Pan · Hao Fei · Zhiqi Ge · Shuaiyang · Siliang Tang · Hanwang Zhang · QIANRU SUN |
|
Workshop
|
OmniPredict: GPT-4o Enhanced Multi-modal Pedestrian Crossing Intention Prediction Je-Seok Ham · Jia Huang · Peng Jiang · Jinyoung Moon · Yongjin Kwon · Srikanth Saripalli · Changick Kim |
||
Poster
|
Thu 16:30 |
Hallo3D: Multi-Modal Hallucination Detection and Mitigation for Consistent 3D Content Generation Hongbo Wang · Jie Cao · Jin Liu · Xiaoqiang Zhou · Huaibo Huang · Ran He |
|
Poster
|
Thu 11:00 |
Multi-modal Transfer Learning between Biological Foundation Models Juan Jose Garau-Luis · Patrick Bordes · Liam Gonzalez · Maša Roller · Bernardo de Almeida · Christopher Blum · Lorenz Hexemer · Stefan Laurent · Maren Lang · Thomas Pierrot · Guillaume Richard |
|
Poster
|
Wed 11:00 |
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning Hao Shao · Shengju Qian · Han Xiao · Guanglu Song · ZHUOFAN ZONG · Letian Wang · Yu Liu · Hongsheng Li |
|
Poster
|
Wed 16:30 |
What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration Libo Qin · Qiguang Chen · Hao Fei · Zhi Chen · Min Li · Wanxiang Che |
|
Poster
|
Wed 16:30 |
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning Chong Ma · Hanqi Jiang · Wenting Chen · Yiwei Li · Zihao Wu · Xiaowei Yu · Zhengliang Liu · Lei Guo · Dajiang Zhu · Tuo Zhang · Dinggang Shen · Tianming Liu · Xiang Li |
|
Workshop
|
Taskverse: A Benchmark Generation Engine for Multi-modal Language Model Jieyu Zhang · Weikai Huang · Zixian Ma · Oscar Michel · Dong He · Tanmay Gupta · Wei-Chiu Ma · Ali Farhadi · Aniruddha Kembhavi · Ranjay Krishna |
||
Poster
|
MMSite: A Multi-modal Framework for the Identification of Active Sites in Proteins Song Ouyang · Huiyu Cai · Yong Luo · Kehua Su · Lefei Zhang · Bo Du |
||
Workshop
|
Zer0-Jack: A memory-efficient gradient-based jailbreaking method for black box Multi-modal Large Language Models Tiejin Chen · Kaishen Wang · Hua Wei |
||
Poster
|
Thu 16:30 |
CountGD: Multi-Modal Open-World Counting Niki Amini-Naieni · Tengda Han · Andrew Zisserman |
|
Workshop
|
Sat 9:30 |
PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning Weihua Hu · Yiwen Yuan · Zecheng Zhang · Akihiro Nitta · Kaidi Cao · Vid Kocijan · Jinu Sunil · Jure Leskovec · Matthias Fey |