Skip to yearly menu bar Skip to main content


The Measure of All Measures: Quantifying LLM Benchmark Quality

Jihan Yao ⋅ Peter Jin ⋅ Ke Bao ⋅ Qiaolin Yu ⋅ Khushi Bhardwaj ⋅ Chang Su ⋅ Jialei Wang ⋅ YIKAI ZHU ⋅ Sugam Devare ⋅ Damon Mosk-Aoyama ⋅ Zhen Dong ⋅ Venkat Krishna Srinivasan ⋅ Yineng Zhang ⋅ Oleksii Kuchaiev ⋅ Jiantao Jiao ⋅ Banghua Zhu

Abstract

Video

Chat is not available.