Skip to yearly menu bar Skip to main content


Spotlight Poster Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM PST

Proxy-SPEX: Sample-Efficient Interpretability via Sparse Feature Interactions in LLMs

Landon Butler ⋅ Abhineet Agarwal ⋅ Justin Kang ⋅ Yigit Efe Erginbas ⋅ Bin Yu ⋅ Kannan Ramchandran

Abstract

Video

Chat is not available.