Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Safe Generative AI

Applying Sparse Autoencoders to Unlearn Knowledge in Language Models

Eoin Farrell · Yeu-Tong Lau · Arthur Conmy

Abstract

Chat is not available.