Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Optimization for ML Workshop

u-$\mu$P: The Unit-Scaled Maximal Update Parametrization

Charles Blake ⋅ Constantin Eichenberg ⋅ Josef Dean ⋅ Lukas Balles ⋅ Luke Prince ⋅ Björn Deiseroth ⋅ Andres Felipe Cruz-Salinas ⋅ Carlo Luschi ⋅ Samuel Weinbach ⋅ Douglas Orr

Abstract

Chat is not available.