513 Results
Workshop
|
Sat 13:42 |
Bio-xLSTM: Generative modeling, representation and in-context learning of biological and chemical sequences Niklas Schmidinger · Lisa Schneckenreiter · Philipp Seidl · Johannes Schimunek · Pieter-Jan Hoedt · Johannes Brandstetter · Andreas Mayr · Sohvi Luukkonen · Sepp Hochreiter · Günter Klambauer |
|
Poster
|
Wed 11:00 |
RedPajama: an Open Dataset for Training Large Language Models Maurice Weber · Dan Fu · Quentin Anthony · Yonatan Oren · Shane Adams · Anton Alexandrov · Xiaozhong Lyu · Huu Nguyen · Xiaozhe Yao · Virginia Adams · Ben Athiwaratkun · Rahul Chalamala · Kezhen Chen · Max Ryabinin · Tri Dao · Percy Liang · Christopher Ré · Irina Rish · Ce Zhang |
|
Poster
|
Empowering and Assessing the Utility of Large Language Models in Crop Science Hang Zhang · Jiawei SUN · Renqi Chen · Wei Liu · Zhonghang Yuan · Xinzhe Zheng · Zhefan Wang · Zhiyuan Yang · Hang Yan · Han-Sen Zhong · Xiqing Wang · Wanli Ouyang · Fan Yang · Nanqing Dong |
||
Workshop
|
TRIAGE: Ethical Benchmarking of AI Models Through Mass Casualty Simulations Nathalie Kirch · Konstantin Hebenstreit · Matthias Samwald |
||
Workshop
|
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models Peng Xia · Siwei Han · Shi Qiu · Yiyang Zhou · Zhaoyang Wang · Wenhao Zheng · Zhaorun Chen · Chenhang Cui · Mingyu Ding · Linjie Li · Lijuan Wang · Huaxiu Yao |
||
Poster
|
Thu 11:00 |
JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images Zhecan Wang · Junzhang Liu · Chia-Wei Tang · Hani Alomari · Anushka Sivakumar · Rui Sun · Wenhao Li · Md. Atabuzzaman · Hammad Ayyubi · Haoxuan You · Alvi Md Ishmam · Kai-Wei Chang · Shih-Fu Chang · Christopher Thomas |
|
Workshop
|
PertEval-scFM: Benchmarking Single-Cell Foundation Models for Perturbation Effect Prediction Aaron Wenteler · Martina Occhetta · Nikhil Branson · Magdalena Huebner · William Dee · Victor Curean · William Connell · Siu Chung · Yasha Ektefaie · Amaya Gallagher-Syed · César Córdova |
||
Workshop
|
CRAB: Cross-platform agent benchmark for multi-modal embodied language model agents Tianqi Xu · Linyao Chen · Dai-Jie Wu · Yanjun Chen · Zecheng Zhang · Xiang Yao · Zhiqiang Xie · Yongchao Chen · Shilong Liu · Bochen Qian · Philip Torr · Bernard Ghanem · Guohao Li |
||
Workshop
|
Sat 11:54 |
OLMoE: Open Mixture-of-Experts Language Models Niklas Muennighoff · Luca Soldaini · Dirk Groeneveld · Kyle Lo · Jacob Morrison · Sewon Min · Weijia Shi · Evan Walsh · Oyvind Tafjord · Nathan Lambert · Yuling Gu · Shane Arora · Akshita Bhagia · Dustin Schwenk · David Wadden · Alexander Wettig · Binyuan Hui · Tim Dettmers · Douwe Kiela · Noah Smith · Pang Wei Koh · Amanpreet Singh · Hannaneh Hajishirzi |
|
Poster
|
Wed 16:30 |
4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on RDBs Minjie Wang · Quan Gan · David Wipf · Zheng Zhang · Christos Faloutsos · Weinan Zhang · Muhan Zhang · Zhenkun Cai · Jiahang Li · Zunyao Mao · Yakun Song · Jianheng Tang · Yanlin Zhang · Guang Yang · Chuan Lei · Xiao Qin · Ning Li · Han Zhang · Yanbo Wang · Zizhao Zhang |
|
Poster
|
Fri 11:00 |
ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs Irene Huang · Wei Lin · Muhammad Jehanzeb Mirza · Jacob Hansen · Sivan Doveh · Victor Butoi · Roei Herzig · Assaf Arbelle · Hilde Kuehne · Trevor Darrell · Chuang Gan · Aude Oliva · Rogerio Feris · Leonid Karlinsky |
|
Poster
|
Fri 11:00 |
Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models Yilun Jin · Zheng Li · Chenwei Zhang · Tianyu Cao · Yifan Gao · Pratik Jayarao · Mao Li · Xin Liu · Ritesh Sarkhel · Xianfeng Tang · Haodong Wang · Zhengyang Wang · Wenju Xu · Jingfeng Yang · Qingyu Yin · Xian Li · Priyanka Nigam · Yi Xu · Kai Chen · Qiang Yang · Meng Jiang · Bing Yin |