

Poster Fri, Dec 5, 2025 • 11:00 AM – 2:00 PM PST

R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

Zefan Cai ⋅ Wen Xiao ⋅ Hanshi Sun ⋅ Cheng Luo ⋅ Yikai Zhang ⋅ Ke Wan ⋅ Yucheng Li ⋅ Yeyang Zhou ⋅ Li-Wen Chang ⋅ Jiuxiang Gu ⋅ Zhen Dong ⋅ Animashree Anandkumar ⋅ Abedelkadir Asi ⋅ Junjie Hu

