Skip to yearly menu bar Skip to main content


Breaking the Mirror: Examining Self-Preference in LLM Evaluators through Activation-Based Representations

Dani Roytburg ⋅ Matthew Bozoukov ⋅ Hongyu Fu ⋅ Matthew Nguyen ⋅ Jou Barzdukas ⋅ Narmeen Oozeer

Abstract

Chat is not available.