Skip to yearly menu bar Skip to main content


Computational Bottlenecks of Training Small-scale Large Language Models

Saleh Ashkboos · Iman Mirzadeh · Keivan Alizadeh-Vahid · Mohammad Hossein Sekhavat · Moin Nabi · Mehrdad Farajtabar · Fartash Faghri
Keywords: Efficient Training

Abstract

Video

Chat is not available.