firstbacksecondback
51 Results
Poster
|
Fri 11:00 |
Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts Sukwon Yun · Inyoung Choi · Jie Peng · Yangfan Wu · Jingxuan Bao · Qiyiwen Zhang · Jiayi Xin · Qi Long · Tianlong Chen |
|
Poster
|
Fri 16:30 |
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts Jiachen Li · Xinyao Wang · Sijie Zhu · Chia-Wen Kuo · Lu XU · Fan Chen · Jitesh Jain · Humphrey Shi · Longyin Wen |
|
Workshop
|
Sat 11:54 |
OLMoE: Open Mixture-of-Experts Language Models Niklas Muennighoff · Luca Soldaini · Dirk Groeneveld · Kyle Lo · Jacob Morrison · Sewon Min · Weijia Shi · Evan Walsh · Oyvind Tafjord · Nathan Lambert · Yuling Gu · Shane Arora · Akshita Bhagia · Dustin Schwenk · David Wadden · Alexander Wettig · Binyuan Hui · Tim Dettmers · Douwe Kiela · Noah Smith · Pang Wei Koh · Amanpreet Singh · Hannaneh Hajishirzi |
|
Workshop
|
Multi-View Mixture-of-Experts for Predicting Molecular Properties Using SMILES, SELFIES, and Graph-Based Representations Eduardo Soares · Indra Priyadarsini S · Emilio Vital Brazil · Victor Yukio Shirasuna · Seiji Takeda |
||
Workshop
|
Multi-View Mixture-of-Experts for Predicting Molecular Properties Using SMILES, SELFIES, and Graph-Based Representations Eduardo Soares · Indra Priyadarsini S · Emilio Vital Brazil · Victor Yukio Shirasuna · Seiji Takeda |
||
Workshop
|
SciDFM: A Large Language Model with Mixture-of-Experts for Science Liangtai Sun · Danyu Luo · Da Ma · Zihan Zhao · BaocaiChen · Zhennan Shen · Su Zhu · Lu Chen · Xin Chen · Kai Yu |
||
Workshop
|
Gradient-free variational learning with conditional mixture networks Conor Heins · Hao Wu · Dimitrije Markovic · Alexander Tschantz · Jeff Beck · Christopher L Buckley |
||
Poster
|
Fri 11:00 |
MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts Rachel S.Y. Teo · Tan Nguyen |
|
Workshop
|
Buffer Overflow in Mixture of Experts Jamie Hayes · I Shumailov · Itay Yona |
||
Workshop
|
StructMoE : Structured Mixture of Experts Using Low Rank Experts Zain Sarwar · Ashwinee Panda · Benjamin Thérien · Stephen Rawls · Anirban Das · Kartik Balasubramaniam · Berkcan Kapusuzoglu · Shixiong Zhang · Sambit Sahu · MILIND NAPHADE · Supriyo Chakraborty |
||
Poster
|
Wed 11:00 |
Sigmoid Gating is More Sample Efficient than Softmax Gating in Mixture of Experts Huy Nguyen · Nhat Ho · Alessandro Rinaldo |
|
Workshop
|
A scalable Bayesian continual learning framework for online and sequential decision making Hanwen Xing · Christopher Yau |