

AdaEDL: Early Draft Stopping for Speculative Decoding of Large Language Models via an Entropy-based Lower Bound on Token Acceptance Probability

Sudhanshu Agrawal · Wonseok Jeon · Mingu Lee

Abstract
