Skip to yearly menu bar Skip to main content


Localizing Lying in Llama: Experiments in Prompting, Probing, and Patching

James Campbell ⋅ Phillip Guo ⋅ Richard Ren

Abstract

Chat is not available.