Skip to yearly menu bar Skip to main content


Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models

Luke Zettlemoyer

Abstract

Video

Chat is not available.