Toggle Poster Visibility
Mexico City Oral
Thu Dec 04 03:30 PM -- 03:50 PM (PST) @ Don Alberto 1 None
A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders
Mexico City Oral
Thu Dec 04 03:50 PM -- 04:10 PM (PST) @ Don Alberto 1 None
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
Mexico City Oral
Thu Dec 04 04:10 PM -- 04:30 PM (PST) @ Don Alberto 1 None
Superposition Yields Robust Neural Scaling
Successful Page Load