Skip to yearly menu bar Skip to main content


Toward Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models

Kefan Song · Jin Yao · Shangtong Zhang

Abstract

Video

Chat is not available.