Skip to yearly menu bar Skip to main content


Poster

ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference

Xiang Liu ⋅ Zhenheng Tang ⋅ Peijie Dong ⋅ Zeyu Li ⋅ Liuyue ⋅ Bo Li ⋅ Xuming Hu ⋅ Xiaowen Chu
2025 Poster

Abstract

Video

Chat is not available.