Skip to yearly menu bar Skip to main content


parameter averaging laws for multitask language models

Woojin Chung · Hyowon Cho · James Thorne · Se-Young Yun

Abstract

Chat is not available.