Revisiting SMoE Language Models by Evaluating Inefficiencies with Task Specific Expert Pruning

Soumajyoti Sarkar ⋅ Leonard Lausen ⋅ Volkan Cevher ⋅ Thomas Brox ⋅ Sheng Zha ⋅ George Karypis
