Skip to yearly menu bar Skip to main content


AdaEDL: Early Draft Stopping for Speculative Decoding of Large Language Models via an Entropy-based Lower Bound on Token Acceptance Probability

Sudhanshu Agrawal ⋅ Wonseok Jeon ⋅ Mingu Lee

Abstract

Video

Chat is not available.