Search All 2024 Events
 

5 Results

Poster
Wed 16:30 Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization
Boshi Wang · Xiang Yue · Yu Su · Huan Sun
Poster
Wed 16:30 Deep Learning Through A Telescoping Lens: A Simple Model Provides Empirical Insights On Grokking, Gradient Boosting & Beyond
Alan Jeffares · Alicia Curth · Mihaela van der Schaar
Workshop
Emergence in non-neural models: grokking modular arithmetic via average gradient outer product
Neil Mallinar · Daniel Beaglehole · Libin Zhu · Adityanarayanan Radhakrishnan · Parthe Pandit · Misha Belkin
Workshop
A Hessian View of Grokking in Mathematical Reasoning
Zhenshuo Zhang · Jerry Liu · Christopher Ré · Hongyang Zhang
Workshop
Delays in generalization match delayed changes in representational geometry
Xingyu Zheng · Kyle Daruwalla · Ari Benjamin · David Klindt