XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference

Joao Monteiro · Etienne Marcotte · Pierre-Andre Noel · Valentina Zantedeschi · David Vazquez · Nicolas Chapados · Christopher Pal · Perouz Taslakian
