CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios

Luning Wang ⋅ Shiyao Li ⋅ Xuefei Ning ⋅ Zhihang Yuan ⋅ Shengen Yan ⋅ Guohao Dai ⋅ Yu Wang
Poster
