CTR-BERT: Cost-effective knowledge distillation for billion-parameter teacher models

Aashiq Muhamed ⋅ Iman Keivanloo ⋅ Sujan Perera ⋅ James Mracek ⋅ Yi Xu ⋅ Qingjun Cui ⋅ Santosh Rajagopalan ⋅ Belinda Zeng ⋅ Trishul Chilimbi
