Skip to yearly menu bar Skip to main content


Poster

Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time

Zichang Liu ⋅ Aditya Desai ⋅ Fangshuo Liao ⋅ Weitao Wang ⋅ Victor Xie ⋅ Zhaozhuo Xu ⋅ Anastasios Kyrillidis ⋅ Anshumali Shrivastava
2023 Poster

Abstract

Video

Chat is not available.