Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Reliable ML from Unreliable Data
Sat, Dec 6, 2025 • 4:00 PM – 5:00 PM PST

Rethinking Sparse Autoencoders: Select-and-Project for Fairness and Control from Encoder Features Alone

Antonio Barbalau ⋅ Cristian D Paduraru ⋅ Teodor Poncu ⋅ Alexandru Tifrea ⋅ Elena Burceanu

Abstract

Chat is not available.