Skip to yearly menu bar Skip to main content


Aha Moment Revisited: Are Vision Language Models Truly Capable of Self-verification in Inference Scaling?

Mingyuan Wu ⋅ Meitang Li ⋅ Jingcheng Yang ⋅ Jize Jiang ⋅ Kaizhuo Yan ⋅ Zhaoheng Li ⋅ Hanchao Yu ⋅ Minjia Zhang ⋅ Klara Nahrstedt

Abstract

Chat is not available.