Skip to yearly menu bar Skip to main content


Poster Fri, Dec 5, 2025 • 11:00 AM – 2:00 PM PST

Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads

Zhoutong Wu ⋅ Yuan Zhang ⋅ Yiming Dong ⋅ Chenheng Zhang ⋅ Cong Fang ⋅ Kun Yuan ⋅ Zhouchen Lin

Abstract

Video

Chat is not available.