Workshop
|
|
Benign Overfitting in Single-Head Attention
Roey Magen · Shuning Shang · Zhiwei Xu · Spencer Frei · Wei Hu · Gal Vardi
|
|
Poster
|
Fri 16:30
|
How Transformers Utilize Multi-Head Attention in In-Context Learning? A Case Study on Sparse Linear Regression
Xingwu Chen · Lei Zhao · Difan Zou
|
|
Poster
|
Thu 16:30
|
Dual Encoder GAN Inversion for High-Fidelity 3D Head Reconstruction from Single Images
Bahri Batuhan Bilecen · Ahmet Gökmen · Aysegul Dundar
|
|
Poster
|
Wed 16:30
|
DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion
Yilong Chen · Linhao Zhang · Junyuan Shang · Zhenyu Zhang · Tingwen Liu · Shuohuan Wang · YU SUN
|
|
Workshop
|
|
Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
Tianyu Guo · Druv Pai · Yu Bai · Jiantao Jiao · Michael Jordan · Song Mei
|
|
Workshop
|
Sat 15:30
|
Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
Tianyu Guo · Druv Pai · Yu Bai · Jiantao Jiao · Michael Jordan · Song Mei
|
|
Poster
|
Wed 11:00
|
Single Image Reflection Separation via Dual-Stream Interactive Transformers
Qiming Hu · Hainuo Wang · Xiaojie Guo
|
|