Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult

Yuqing Wang · Zhenghao Xu · Tuo Zhao · Molei Tao

Abstract
