Poster in Workshop: Safe Generative AI

Applying Sparse Autoencoders to Unlearn Knowledge in Language Models

Eoin Farrell ⋅ Yeu-Tong Lau ⋅ Arthur Conmy

Abstract
