Skip to yearly menu bar Skip to main content


Poster

Cascade Speculative Drafting for Even Faster LLM Inference

Ziyi Chen · Xiaocong Yang · Jiacheng Lin · Chenkai Sun · Kevin Chang · Jie Huang
2024 Poster

Abstract

Video

Chat is not available.