Skip to yearly menu bar Skip to main content


Hold That Exit: Near Optimal Early-Exit Inference via Recall

Yuanyuan Yang · Ruimin Zhang · Jamie Morgenstern · Haifeng Xu

Abstract

Chat is not available.