Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Safe Generative AI

Language Models Can Articulate Their Implicit Goals

Jan Betley ⋅ Xuchan Bao ⋅ Martín Soto ⋅ Anna Sztyber-Betley ⋅ James Chua ⋅ Owain Evans

Abstract

Chat is not available.