Skip to yearly menu bar Skip to main content


Poster

Quantifying Elicitation of Latent Capabilities in Language Models

Elizabeth Donoway ⋅ Hailey Joren ⋅ Arushi Somani ⋅ Henry Sleight ⋅ Julian Michael ⋅ Michael Deweese ⋅ John Schulman ⋅ Ethan Perez ⋅ Fabien Roger ⋅ Jan Leike
2025 Poster

Abstract

Video

Chat is not available.