Skip to yearly menu bar Skip to main content


CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios

Luning Wang · Shiyao Li · Xuefei Ning · Zhihang Yuan · Shengen Yan · Guohao Dai · Yu Wang
[ Poster

Abstract

Video

Chat is not available.