

Poster

Improving Sparse Decomposition of Language Model Activations with Gated Sparse Autoencoders

Senthooran Rajamanoharan · Arthur Conmy · Lewis Smith · Tom Lieberum · Vikrant Varma · Janos Kramar · Rohin Shah · Neel Nanda
2024 Poster

Abstract

