firstbacksecondback
5 Results
Poster
|
Wed 16:30 |
Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization Boshi Wang · Xiang Yue · Yu Su · Huan Sun |
|
Poster
|
Wed 16:30 |
Deep Learning Through A Telescoping Lens: A Simple Model Provides Empirical Insights On Grokking, Gradient Boosting & Beyond Alan Jeffares · Alicia Curth · Mihaela van der Schaar |
|
Workshop
|
Emergence in non-neural models: grokking modular arithmetic via average gradient outer product Neil Mallinar · Daniel Beaglehole · Libin Zhu · Adityanarayanan Radhakrishnan · Parthe Pandit · Misha Belkin |
||
Workshop
|
A Hessian View of Grokking in Mathematical Reasoning Zhenshuo Zhang · Jerry Liu · Christopher Ré · Hongyang Zhang |
||
Workshop
|
Delays in generalization match delayed changes in representational geometry Xingyu Zheng · Kyle Daruwalla · Ari Benjamin · David Klindt |