firstbacksecondback
51 Results
Expo Demonstration
|
Tue 15:00 |
Deploying Cached Conditional Mixture-of-Experts LLMs on Mobile Devices with Memory Constraints Ron Tindall |
|
Workshop
|
Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts Ashwinee Panda · Vatsal Baherwani · Zain Sarwar · Benjamin Thérien · Stephen Rawls · Sambit Sahu · Supriyo Chakraborty · Tom Goldstein |
||
Workshop
|
Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts Ashwinee Panda · Vatsal Baherwani · Zain Sarwar · Benjamin Thérien · Stephen Rawls · Sambit Sahu · Supriyo Chakraborty · Tom Goldstein |
||
Workshop
|
Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts Ashwinee Panda · Vatsal Baherwani · Zain Sarwar · Benjamin Therien · Sambit Sahu · Stephen Rawls · Supriyo Chakraborty · Tom Goldstein |
||
Poster
|
Fri 16:30 |
Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion Filip Szatkowski · Bartosz Wójcik · Mikołaj Piórczyński · Simone Scardapane |
|
Workshop
|
Understanding Compute-Parameter Trade-offs in Sparse Mixture-of-Expert Language Models Harshay Shah · Vimal Thilak · Dan Busbridge · Alaaeldin El-Nouby · Joshua Susskind · Samira Abnar |
||
Poster
|
Fri 16:30 |
Parameter Efficient Adaptation for Image Restoration with Heterogeneous Mixture-of-Experts Hang Guo · Tao Dai · Yuanchao Bai · Bin Chen · Xudong Ren · Zexuan Zhu · Shu-Tao Xia |
|
Poster
|
Fri 16:30 |
MoEUT: Mixture-of-Experts Universal Transformers Robert Csordas · Kazuki Irie · Jürgen Schmidhuber · Christopher Potts · Christopher D Manning |
|
Poster
|
Wed 11:00 |
Multi-Head Mixture-of-Experts Xun Wu · Shaohan Huang · Wenhui Wang · Shuming Ma · Li Dong · Furu Wei |
|
Poster
|
Thu 11:00 |
SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention Robert Csordas · Piotr Piękos · Kazuki Irie · Jürgen Schmidhuber |
|
Workshop
|
Tabby: Tabular Adaptation for Language Models Sonia Cromp · Satya Sai Srinath Namburi · Catherine Cao · Mohammed Alkhudhayri · Samuel Guo · Nicholas Roberts · Frederic Sala |
||
Poster
|
Thu 11:00 |
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion Xing Han · Huy Nguyen · Carl Harris · Nhat Ho · Suchi Saria |