Skip to yearly menu bar Skip to main content


Poster

MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures

Jinjie Ni ⋅ Fuzhao Xue ⋅ Xiang Yue ⋅ Yuntian Deng ⋅ Mahir Shah ⋅ Kabir Jain ⋅ Graham Neubig ⋅ Yang You
2024 Poster

Abstract

Video

Chat is not available.