Skip to yearly menu bar Skip to main content


SpecTr++: Improved transport plans for speculative decoding of large language models

Kwangjun Ahn · Ahmad Beirami · Ziteng Sun · Ananda Theertha Suresh

Abstract

Chat is not available.