Skip to yearly menu bar Skip to main content


Resa: Transparent Reasoning Models via SAEs

Shangshang Wang ⋅ Julian Asilis ⋅ Ömer Faruk Akgül ⋅ Enes Bilgin ⋅ Ollie Liu ⋅ Deqing Fu ⋅ Willie Neiswanger

Abstract

Chat is not available.