15 Results
| Type | Time | Title | Authors |
|---|---|---|---|
| Workshop | | Decomposing Complex Visual Comprehension into Atomic Visual Skills for Vision Language Models | Hyunsik Chae · Seungwoo Yoon · Chloe Yewon Chun · Gyehun Go · Yongin Cho · Gyeongmin Lee · Ernest Ryu |
| Poster | Thu 16:30 | GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning | Yanbin Wei · Shuai Fu · Weisen Jiang · Zejian Zhang · Zhixiong Zeng · Qi Wu · James Kwok · Yu Zhang |
| Workshop | | Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model | Wenqi Zhang · Zhenglin Cheng · Yuanyu He · Mengna Wang · Yongliang Shen · Zeqi Tan · Guiyang Hou · Mingqian He · Yanna Ma · Weiming Lu · Yueting Zhuang |