firstbacksecondback
222 Results
Poster
|
Thu 16:30 |
Rethinking Memory and Communication Costs for Efficient Data Parallel Training of Large Language Models Hanxiao Zhang · Lin JU · Chan Wu · Jinjing Huang · Youshao Xiao · Zhenglei Zhou · Zhiming fan · Zhaoxin Huan · Siyuan Li · Fanzhuang Meng · Lei Liang · Xiaolu Zhang · Jun Zhou |
|
Poster
|
Thu 16:30 |
Slight Corruption in Pre-training Data Makes Better Diffusion Models Hao Chen · Yujin Han · Diganta Misra · Xiang Li · Kai Hu · Difan Zou · Masashi Sugiyama · Jindong Wang · Bhiksha Raj |
|
Poster
|
Wed 16:30 |
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias Shan Chen · Jack Gallifant · Mingye Gao · Pedro Moreira · Nikolaj Munch · Ajay Muthukkumar · Arvind Rajan · Jaya Kolluri · Amelia Fiske · Janna Hastings · Hugo Aerts · Brian Anthony · Leo Anthony Celi · William La Cava · Danielle Bitterman |
|
Workshop
|
Fine-tuning Foundation Models for Molecular Dynamics: A Data-Efficient Approach with Random Features Pietro Novelli · Luigi Bonati · Pedro J. Buigues · Giacomo Meanti · Lorenzo Rosasco · Michele Parrinello · Massimiliano Pontil |
||
Poster
|
Wed 11:00 |
DAPE: Data-Adaptive Positional Encoding for Length Extrapolation Chuanyang Zheng · Yihang Gao · Han Shi · Minbin Huang · Jingyao Li · Jing Xiong · Xiaozhe Ren · Michael Ng · Xin Jiang · Zhenguo Li · Yu Li |
|
Workshop
|
Optimizing the IFMIF-DONES Particle Accelerator with Differentiable Deep Learning Surrogate Models Galo Gallardo · Guillermo Rodriguez Llorente · Lucas Magariños · Rodrigo Morant Navascués · Nikita Kkhvatkin Petrovsky · Roberto Gómez-Espinosa Martín |