DHP Benchmark: Measuring Discernment Ability of LLM-as-a-Judge

Jiayi Yuan · Yicheng Wang · Yu-Neng Chuang · Zhuoer Wang · Mark Cusick · Param Kulkarni · Zhengping Ji · Xia Hu
