Skip to yearly menu bar Skip to main content


parameter averaging laws for multitask language models

Woojin Chung ⋅ Hyowon Cho ⋅ James Thorne ⋅ Se-Young Yun

Abstract

Chat is not available.