


CAS-Spec: Cascade Adaptive Self-Speculative Decoding for On-the-Fly Lossless Inference Acceleration of LLMs

Zhiyuan Ning ⋅ Jiawei Shao ⋅ Ruge Xu ⋅ Xinfei Guo ⋅ Jun Zhang ⋅ Chi Zhang ⋅ Xuelong Li
2025 Poster

Abstract
