Skip to yearly menu bar Skip to main content


TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Makoto Shing ⋅ Kou Misaki ⋅ Han Bao ⋅ Sho Yokoi ⋅ Takuya Akiba
[ Poster

Abstract

Chat is not available.