Skip to yearly menu bar Skip to main content


Poster

Cascade Speculative Drafting for Even Faster LLM Inference

Ziyi Chen ⋅ Xiaocong Yang ⋅ Jiacheng Lin ⋅ Chenkai Sun ⋅ Kevin Chang ⋅ Jie Huang
2024 Poster

Abstract

Video

Chat is not available.