Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Safe Generative AI

Extracting Unlearned Information from LLMs with Activation Steering

Atakan Seyitoğlu · Aleksei Kuvshinov · Leo Schwinn · Stephan Günnemann

Abstract

Chat is not available.