Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Reliable ML from Unreliable Data

Rethinking Sparse Autoencoders: Select-and-Project for Fairness and Control from Encoder Features Alone

Antonio Barbalau ⋅ Cristian D Paduraru ⋅ Teodor Poncu ⋅ Alexandru Tifrea ⋅ Elena Burceanu
2025 Poster
in
Workshop: Reliable ML from Unreliable Data

Abstract

Chat is not available.