Skip to yearly menu bar Skip to main content


Breaking the Mirror: Examining Self-Preference in LLM Evaluators through Activation-Based Representations

Dani Roytburg · Matthew Bozoukov · Hongyu Fu · Matthew Nguyen · Jou Barzdukas · Narmeen Oozeer

Abstract

Chat is not available.