Skip to yearly menu bar Skip to main content


Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection

Yun Zhu ⋅ Jia-Chen Gu ⋅ Caitlin Sikora ⋅ Ho Ko ⋅ Yinxiao Liu ⋅ Chu-Cheng Lin ⋅ Lei Shu ⋅ Liangchen Luo ⋅ Lei Meng ⋅ Bang Liu ⋅ Jindong Chen

Abstract

Video

Chat is not available.