TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Makoto Shing · Kou Misaki · Han Bao · Sho Yokoi · Takuya Akiba
Poster

Abstract