Skip to yearly menu bar Skip to main content


Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection

Yun Zhu · Jia-Chen Gu · Caitlin Sikora · Ho Ko · Yinxiao Liu · Chu-Cheng Lin · Lei Shu · Liangchen Luo · Lei Meng · Bang Liu · Jindong Chen

Abstract

Video

Chat is not available.