Skip to yearly menu bar Skip to main content


DHP Benchmark: Measuring Discernment Ability of LLM-as-a-Judge

Jiayi Yuan ⋅ Yicheng Wang ⋅ Yu-Neng Chuang ⋅ Zhuoer Wang ⋅ Mark Cusick ⋅ Param Kulkarni ⋅ Zhengping Ji ⋅ Xia Hu

Abstract

Chat is not available.