Fast and Accurate Language Model Decoding via Parallel Token Processing

Zhepei Wei ⋅ Wei-Lin Chen ⋅ Xinyu Zhu ⋅ Yu Meng

