

Search All 2024 Events
 

27 Results

Workshop
Sat 11:54 OLMoE: Open Mixture-of-Experts Language Models
Niklas Muennighoff · Luca Soldaini · Dirk Groeneveld · Kyle Lo · Jacob Morrison · Sewon Min · Weijia Shi · Evan Walsh · Oyvind Tafjord · Nathan Lambert · Yuling Gu · Shane Arora · Akshita Bhagia · Dustin Schwenk · David Wadden · Alexander Wettig · Binyuan Hui · Tim Dettmers · Douwe Kiela · Noah Smith · Pang Wei Koh · Amanpreet Singh · Hannaneh Hajishirzi
Workshop
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?
Wenzhe Li · Yong Lin · Mengzhou Xia · Chi Jin
Poster
Wed 11:00 Multi-Head Mixture-of-Experts
Xun Wu · Shaohan Huang · Wenhui Wang · Shuming Ma · Li Dong · Furu Wei
Workshop
Tabby: Tabular Adaptation for Language Models
Sonia Cromp · Satya Sai Srinath Namburi · Catherine Cao · Mohammed Alkhudhayri · Samuel Guo · Nicholas Roberts · Frederic Sala
Workshop
Understanding Compute-Parameter Trade-offs in Sparse Mixture-of-Expert Language Models
Harshay Shah · Vimal Thilak · Dan Busbridge · Alaaeldin El-Nouby · Joshua Susskind · Samira Abnar
Poster
Thu 11:00 FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion
Xing Han · Huy Nguyen · Carl Harris · Nhat Ho · Suchi Saria
Poster
Fri 16:30 CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Jiachen Li · Xinyao Wang · Sijie Zhu · Chia-Wen Kuo · Lu Xu · Fan Chen · Jitesh Jain · Humphrey Shi · Longyin Wen
Workshop
Gradient-free variational learning with conditional mixture networks
Conor Heins · Hao Wu · Dimitrije Markovic · Alexander Tschantz · Jeff Beck · Christopher L Buckley
Workshop
SciDFM: A Large Language Model with Mixture-of-Experts for Science
Liangtai Sun · Danyu Luo · Da Ma · Zihan Zhao · Baocai Chen · Zhennan Shen · Su Zhu · Lu Chen · Xin Chen · Kai Yu
Poster
Fri 11:00 Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts
Sukwon Yun · Inyoung Choi · Jie Peng · Yangfan Wu · Jingxuan Bao · Qiyiwen Zhang · Jiayi Xin · Qi Long · Tianlong Chen
Poster
EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models
Shangquan Sun · Wenqi Ren · Zikun Liu · Hyunhee Park · Rui Wang · Xiaochun Cao
Poster
Thu 11:00 MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models
Leyang Shen · Gongwei Chen · Rui Shao · Weili Guan · Liqiang Nie