Skip to yearly menu bar Skip to main content


Spotlight Poster

What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains

Chanakya Ekbote ⋅ Ashok Vardhan Makkuva ⋅ Marco Bondaschi ⋅ Nived Rajaraman ⋅ Michael Gastpar ⋅ Jason Lee ⋅ Paul Liang
2025 Spotlight Poster

Abstract

Video

Chat is not available.