Fast and Accurate Language Model Decoding via Parallel Token Processing
Zhepei Wei ⋅ Wei-Lin Chen ⋅ Xinyu Zhu ⋅ Yu Meng
2024 Oral in Workshop: Adaptive Foundation Models: Evolving AI for Personalized and Efficient Learning