

XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference

Joao Monteiro ⋅ Etienne Marcotte ⋅ Pierre-Andre Noel ⋅ Valentina Zantedeschi ⋅ David Vazquez ⋅ Nicolas Chapados ⋅ Christopher Pal ⋅ Perouz Taslakian

Abstract

