Trust but Verify: Reliable VLM evaluation in-the-wild with program synthesis

Viraj Uday Prabhu · Senthil Purushwalkam · Jieyu Zhang · An Yan · Caiming Xiong · Ran Xu

Abstract
