Search All 2024 Events
 

51 Results

Page 2 of 5
Poster
Fri 11:00 Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts
Sukwon Yun · Inyoung Choi · Jie Peng · Yangfan Wu · Jingxuan Bao · Qiyiwen Zhang · Jiayi Xin · Qi Long · Tianlong Chen
Poster
Fri 16:30 CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Jiachen Li · Xinyao Wang · Sijie Zhu · Chia-Wen Kuo · Lu Xu · Fan Chen · Jitesh Jain · Humphrey Shi · Longyin Wen
Workshop
Sat 11:54 OLMoE: Open Mixture-of-Experts Language Models
Niklas Muennighoff · Luca Soldaini · Dirk Groeneveld · Kyle Lo · Jacob Morrison · Sewon Min · Weijia Shi · Evan Walsh · Oyvind Tafjord · Nathan Lambert · Yuling Gu · Shane Arora · Akshita Bhagia · Dustin Schwenk · David Wadden · Alexander Wettig · Binyuan Hui · Tim Dettmers · Douwe Kiela · Noah Smith · Pang Wei Koh · Amanpreet Singh · Hannaneh Hajishirzi
Workshop
Multi-View Mixture-of-Experts for Predicting Molecular Properties Using SMILES, SELFIES, and Graph-Based Representations
Eduardo Soares · Indra Priyadarsini S · Emilio Vital Brazil · Victor Yukio Shirasuna · Seiji Takeda
Workshop
SciDFM: A Large Language Model with Mixture-of-Experts for Science
Liangtai Sun · Danyu Luo · Da Ma · Zihan Zhao · Baocai Chen · Zhennan Shen · Su Zhu · Lu Chen · Xin Chen · Kai Yu
Workshop
Gradient-free variational learning with conditional mixture networks
Conor Heins · Hao Wu · Dimitrije Markovic · Alexander Tschantz · Jeff Beck · Christopher L Buckley
Poster
Fri 11:00 MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts
Rachel S.Y. Teo · Tan Nguyen
Workshop
Buffer Overflow in Mixture of Experts
Jamie Hayes · I Shumailov · Itay Yona
Workshop
StructMoE: Structured Mixture of Experts Using Low Rank Experts
Zain Sarwar · Ashwinee Panda · Benjamin Thérien · Stephen Rawls · Anirban Das · Kartik Balasubramaniam · Berkcan Kapusuzoglu · Shixiong Zhang · Sambit Sahu · Milind Naphade · Supriyo Chakraborty
Poster
Wed 11:00 Sigmoid Gating is More Sample Efficient than Softmax Gating in Mixture of Experts
Huy Nguyen · Nhat Ho · Alessandro Rinaldo
Workshop
A scalable Bayesian continual learning framework for online and sequential decision making
Hanwen Xing · Christopher Yau