Poster in Workshop: Safe Generative AI

Extracting Unlearned Information from LLMs with Activation Steering

Atakan Seyitoğlu ⋅ Aleksei Kuvshinov ⋅ Leo Schwinn ⋅ Stephan Günnemann

Abstract
