

Computational Bottlenecks of Training Small-scale Large Language Models

Saleh Ashkboos ⋅ Iman Mirzadeh ⋅ Keivan Alizadeh-Vahid ⋅ Mohammad Hossein Sekhavat ⋅ Moin Nabi ⋅ Mehrdad Farajtabar ⋅ Fartash Faghri
Keywords: Efficient Training

Abstract
