firstbacksecondback
27 Results
Workshop
|
Sat 11:54 |
OLMoE: Open Mixture-of-Experts Language Models Niklas Muennighoff · Luca Soldaini · Dirk Groeneveld · Kyle Lo · Jacob Morrison · Sewon Min · Weijia Shi · Evan Walsh · Oyvind Tafjord · Nathan Lambert · Yuling Gu · Shane Arora · Akshita Bhagia · Dustin Schwenk · David Wadden · Alexander Wettig · Binyuan Hui · Tim Dettmers · Douwe Kiela · Noah Smith · Pang Wei Koh · Amanpreet Singh · Hannaneh Hajishirzi |
|
Workshop
|
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial? Wenzhe Li · Yong Lin · Mengzhou Xia · Chi Jin |
||
Poster
|
Wed 11:00 |
Multi-Head Mixture-of-Experts Xun Wu · Shaohan Huang · Wenhui Wang · Shuming Ma · Li Dong · Furu Wei |
|
Workshop
|
Tabby: Tabular Adaptation for Language Models Sonia Cromp · Satya Sai Srinath Namburi · Catherine Cao · Mohammed Alkhudhayri · Samuel Guo · Nicholas Roberts · Frederic Sala |
||
Workshop
|
Understanding Compute-Parameter Trade-offs in Sparse Mixture-of-Expert Language Models Harshay Shah · Vimal Thilak · Dan Busbridge · Alaaeldin El-Nouby · Joshua Susskind · Samira Abnar |
||
Poster
|
Thu 11:00 |
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion Xing Han · Huy Nguyen · Carl Harris · Nhat Ho · Suchi Saria |
|
Poster
|
Fri 16:30 |
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts Jiachen Li · Xinyao Wang · Sijie Zhu · Chia-Wen Kuo · Lu XU · Fan Chen · Jitesh Jain · Humphrey Shi · Longyin Wen |
|
Workshop
|
Gradient-free variational learning with conditional mixture networks Conor Heins · Hao Wu · Dimitrije Markovic · Alexander Tschantz · Jeff Beck · Christopher L Buckley |
||
Workshop
|
SciDFM: A Large Language Model with Mixture-of-Experts for Science Liangtai Sun · Danyu Luo · Da Ma · Zihan Zhao · BaocaiChen · Zhennan Shen · Su Zhu · Lu Chen · Xin Chen · Kai Yu |
||
Poster
|
Fri 11:00 |
Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts Sukwon Yun · Inyoung Choi · Jie Peng · Yangfan Wu · Jingxuan Bao · Qiyiwen Zhang · Jiayi Xin · Qi Long · Tianlong Chen |
|
Poster
|
EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models Shangquan Sun · Wenqi Ren · Zikun Liu · Hyunhee Park · Rui Wang · Xiaochun Cao |
||
Poster
|
Thu 11:00 |
MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models Leyang Shen · Gongwei Chen · Rui Shao · Weili Guan · Liqiang Nie |