firstbacksecondback
16 Results
Workshop
|
Transformer-Based Large Language Models Are Not General Learners: A Universal Circuit Perspective Yang Chen · Yitao Liang · Zhouchen Lin |
||
Poster
|
Wed 8:45 |
Blockwise Parallel Transformers for Large Context Models Hao Liu · Pieter Abbeel |
|
Workshop
|
Large-scale Graph Representation Learning of Dynamic Brain Connectome with Transformers Byung-Hoon Kim · Jungwon Choi · EungGu Yun · Kyungsang Kim · Xiang Li · Juho Lee |
||
Workshop
|
READ: Recurrent Adaptation of Large Transformers Sid Wang · John Nguyen · Ke Li · Carole-Jean Wu |
||
Workshop
|
Fri 13:50 |
Capturing Formulation Design of Battery Electrolytes with Chemical Large Language Model Eduardo Soares · Vidushi Sharma · Emilio Vital Brazil · Renato Cerqueira · Young-Hye Na |
|
Poster
|
Wed 8:45 |
H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training Yuzhong Wang · Xu Han · Weilin Zhao · Guoyang Zeng · Zhiyuan Liu · Maosong Sun |
|
Workshop
|
Teaching Arithmetic to Small Transformers Nayoung Lee · Kartik Sreenivasan · Jason Lee · Kangwook Lee · Dimitris Papailiopoulos |
||
Poster
|
Wed 15:00 |
The geometry of hidden representations of large transformer models Lucrezia Valeriani · Diego Doimo · Francesca Cuturello · Alessandro Laio · Alessio Ansuini · Alberto Cazzaniga |
|
Workshop
|
Beyond Chemical Language: A Multimodal Approach to Enhance Molecular Property Prediction Eduardo Soares · Emilio Vital Brazil · Karen Fiorella Gutierrez · Renato Cerqueira · Daniel Sanders · Kristin Schmidt · Dmitry Zubarev |
||
Poster
|
Wed 15:00 |
SGFormer: Simplifying and Empowering Transformers for Large-Graph Representations Qitian Wu · Wentao Zhao · Chenxiao Yang · Hengrui Zhang · Fan Nie · Haitian Jiang · Yatao Bian · Junchi Yan |
|
Workshop
|
Sat 13:55 |
Federated Learning for Speech Recognition: Revisiting Current Trends Towards Large-Scale ASR Shams Azam · Martin Pelikan · Vitaly Feldman · Kunal Talwar · Jan Silovsky · Tatiana Likhomanenko |
|
Workshop
|
Paper 30: Transforming Healthcare Education: Harnessing Large Language Models for Frontline Health Worker Capacity Building using Retrieval-Augmented Generation Yasmina Al Ghadban · Huiqi Yvonne Lu · Uday Adavi · Ankita · Sridevi Gara · Neelanjana Das · Bhaskar Kumar · Renu Johns · Praveen Devarsetty · Jane Hirst · Uday Adavi |