Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Optimization for ML Workshop

Understanding Critical Batch Sizes: Scheduling and Batch-Size Invariance in Data-constrained Pre-training

Hanlin Zhang · Depen Morwani · Nikhil Vyas · Jingfeng Wu · Difan Zou · Udaya Ghai · Dean Foster · Sham Kakade

Abstract

Chat is not available.