Skip to yearly menu bar Skip to main content


Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Sean McLeish ⋅ Leon Li ⋅ John Kirchenbauer ⋅ Dayal Singh Kalra ⋅ Brian Bartoldson ⋅ Bhavya Kailkhura ⋅ Avi Schwarzschild ⋅ Jonas Geiping ⋅ Micah Goldblum ⋅ Tom Goldstein

Abstract

Chat is not available.