Skip to yearly menu bar Skip to main content


Poster

R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

Zefan Cai ⋅ Wen Xiao ⋅ Hanshi Sun ⋅ cheng Luo ⋅ Yikai Zhang ⋅ Ke Wan ⋅ Yucheng Li ⋅ Yeyang Zhou ⋅ Li-Wen Chang ⋅ Jiuxiang Gu ⋅ Zhen Dong ⋅ Animashree Anandkumar ⋅ Abedelkadir Asi ⋅ Junjie Hu
2025 Poster

Abstract

Video

Chat is not available.