3 Results
Workshop | Sat 12:00 | MMMT-IF: A Challenging Multimodal Multi-Turn Instruction Following Benchmark | Elliot Epstein · Kaisheng Yao · Jing Li · Xinyi Bai · Hamid Palangi
Workshop | | Trust but Verify: Reliable VLM evaluation in-the-wild with program synthesis | Viraj Uday Prabhu · Senthil Purushwalkam · Jieyu Zhang · An Yan · Caiming Xiong · Ran Xu
Workshop | | VerMCTS: Synthesizing Multi-Step Programs using a Verifier, a Large Language Model, and Tree Search | David Brandfonbrener · Simon Henniger · Sibi Raja · Tarun Prasad · Chloe Loughridge · Federico Cassano · Sabrina Hu · Jianang Yang · William Byrd · Robert Zinkov · Nada Amin