14 Results
Workshop
|
GUI-WORLD: A GUI-oriented Video Dataset for Multimodal LLM-based Agents Dongping Chen · Yue Huang · Siyuan Wu · Jingyu Tang · Huichi Zhou · Qihui Zhang · Zhigang He · Yilin Bai · Gao Chujie · Liuyi Chen · Yiqiang Li · Chenlong Wang · Yue Yu · Tianshuo Zhou · Zhen Li · Yi Gui · Yao Wan · Pan Zhou · Jianfeng Gao · Lichao Sun |
||
Poster
|
Wed 16:30 |
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing Hao Fei · Shengqiong Wu · Hanwang Zhang · Tat-Seng Chua · Shuicheng Yan |
|
Poster
|
Thu 16:30 |
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing Zhenyu Wang · Aoxue Li · Zhenguo Li · Xihui Liu |
|
Workshop
|
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Xiaotian Han · Yiren Jian · Xuefeng Hu · Haogeng Liu · Yiqi Wang · Qihang Fan · Yuang Ai · Huaibo Huang · Ran He · Zhenheng Yang · Quanzeng You |
||
Workshop
|
MobileFlow: A Multimodal LLM For Mobile GUI Agent Songqin Nong · Jiali Zhu · Rui Wu · Jiongchao Jin · Shuo Shan · Xiutian Huang · Wenhao Xu |
||
Workshop
|
RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents Tomoyuki Kagaya · Thong Yuan · Yuxuan Lou · Panasonic Karlekar Jayashree · Panasonic Sugiri Pranata · Akira Kinose · Koki Oguri · Felix Wick · Yang You |
||
Workshop
|
Sat 10:30 |
What do MLLMs hear? Examining the interaction between LLM and audio encoder components in Multimodal Large Language Models Enis Çoban · Michael Mandel · Johanna Devaney |
|
Workshop
|
Dissecting Adversarial Robustness of Multimodal LM Agents Chen Wu · Rishi Shah · Jing Yu Koh · Ruslan Salakhutdinov · Daniel Fried · Aditi Raghunathan |
||
Workshop
|
Dissecting Adversarial Robustness of Multimodal LM Agents Chen Wu · Rishi Shah · Jing Yu Koh · Ruslan Salakhutdinov · Daniel Fried · Aditi Raghunathan |
||
Poster
|
Fri 16:30 |
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts Jiachen Li · Xinyao Wang · Sijie Zhu · Chia-Wen Kuo · Lu XU · Fan Chen · Jitesh Jain · Humphrey Shi · Longyin Wen |
|
Workshop
|
MaCBench: A multimodal chemistry and materials science benchmark Nawaf Alampara · Indrajeet Mandal · Pranav Khetarpal · Hargun Grover · Mara Schilling-Wilhelmi · N M Anoop Krishnan · Kevin Maik Jablonka |
||
Workshop
|
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination Jianing Yang · Xuweiyi Chen · Nikhil Madaan · Madhavan Iyengar · Shengyi Qian · David Fouhey · Joyce Chai |