Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

14 Results

<<   <   Page 1 of 2   >   >>
Workshop
GUI-WORLD: A GUI-oriented Video Dataset for Multimodal LLM-based Agents
Dongping Chen · Yue Huang · Siyuan Wu · Jingyu Tang · Huichi Zhou · Qihui Zhang · Zhigang He · Yilin Bai · Gao Chujie · Liuyi Chen · Yiqiang Li · Chenlong Wang · Yue Yu · Tianshuo Zhou · Zhen Li · Yi Gui · Yao Wan · Pan Zhou · Jianfeng Gao · Lichao Sun
Poster
Wed 16:30 Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Hao Fei · Shengqiong Wu · Hanwang Zhang · Tat-Seng Chua · Shuicheng Yan
Poster
Thu 16:30 GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang · Aoxue Li · Zhenguo Li · Xihui Liu
Workshop
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning
Xiaotian Han · Yiren Jian · Xuefeng Hu · Haogeng Liu · Yiqi Wang · Qihang Fan · Yuang Ai · Huaibo Huang · Ran He · Zhenheng Yang · Quanzeng You
Workshop
MobileFlow: A Multimodal LLM For Mobile GUI Agent
Songqin Nong · Jiali Zhu · Rui Wu · Jiongchao Jin · Shuo Shan · Xiutian Huang · Wenhao Xu
Workshop
RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents
Tomoyuki Kagaya · Thong Yuan · Yuxuan Lou · Panasonic Karlekar Jayashree · Panasonic Sugiri Pranata · Akira Kinose · Koki Oguri · Felix Wick · Yang You
Workshop
Sat 10:30 What do MLLMs hear? Examining the interaction between LLM and audio encoder components in Multimodal Large Language Models
Enis Çoban · Michael Mandel · Johanna Devaney
Workshop
Dissecting Adversarial Robustness of Multimodal LM Agents
Chen Wu · Rishi Shah · Jing Yu Koh · Ruslan Salakhutdinov · Daniel Fried · Aditi Raghunathan
Workshop
Dissecting Adversarial Robustness of Multimodal LM Agents
Chen Wu · Rishi Shah · Jing Yu Koh · Ruslan Salakhutdinov · Daniel Fried · Aditi Raghunathan
Poster
Fri 16:30 CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Jiachen Li · Xinyao Wang · Sijie Zhu · Chia-Wen Kuo · Lu XU · Fan Chen · Jitesh Jain · Humphrey Shi · Longyin Wen
Workshop
MaCBench: A multimodal chemistry and materials science benchmark
Nawaf Alampara · Indrajeet Mandal · Pranav Khetarpal · Hargun Grover · Mara Schilling-Wilhelmi · N M Anoop Krishnan · Kevin Maik Jablonka
Workshop
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
Jianing Yang · Xuweiyi Chen · Nikhil Madaan · Madhavan Iyengar · Shengyi Qian · David Fouhey · Joyce Chai