Poster

MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures

Jinjie Ni · Fuzhao Xue · Xiang Yue · Yuntian Deng · Mahir Shah · Kabir Jain · Graham Neubig · Yang You
2024 Poster
