Skip to yearly menu bar Skip to main content


Towards Dynamic KV-Cache Compression: Fine-Grained Evaluation of Key and Value Ranks in LLMs

Jian Chen ⋅ Zhuoran Wang ⋅ Jiayu Qin ⋅ Ming Li ⋅ Meng Wang ⋅ Changyou Chen ⋅ Yin Chen ⋅ Qizhen Weng ⋅ Yirui Liu

Abstract

Chat is not available.