Search All 2024 Events
 

7 Results

Workshop
Benign Overfitting in Single-Head Attention
Roey Magen · Shuning Shang · Zhiwei Xu · Spencer Frei · Wei Hu · Gal Vardi
Poster
Fri 16:30 How Transformers Utilize Multi-Head Attention in In-Context Learning? A Case Study on Sparse Linear Regression
Xingwu Chen · Lei Zhao · Difan Zou
Poster
Thu 16:30 Dual Encoder GAN Inversion for High-Fidelity 3D Head Reconstruction from Single Images
Bahri Batuhan Bilecen · Ahmet Gökmen · Aysegul Dundar
Poster
Wed 16:30 DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion
Yilong Chen · Linhao Zhang · Junyuan Shang · Zhenyu Zhang · Tingwen Liu · Shuohuan Wang · Yu Sun
Workshop
Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
Tianyu Guo · Druv Pai · Yu Bai · Jiantao Jiao · Michael Jordan · Song Mei
Workshop
Sat 15:30 Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
Tianyu Guo · Druv Pai · Yu Bai · Jiantao Jiao · Michael Jordan · Song Mei
Poster
Wed 11:00 Single Image Reflection Separation via Dual-Stream Interactive Transformers
Qiming Hu · Hainuo Wang · Xiaojie Guo