28 Results
Type | Time | Title | Authors
Poster | Tue 9:00 | Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP | Thao Nguyen · Gabriel Ilharco · Mitchell Wortsman · Sewoong Oh · Ludwig Schmidt
Workshop | | Benchmarking Robustness under Distribution Shift of Multimodal Image-Text Models | Jielin Qiu · Yi Zhu · Xingjian Shi · Zhiqiang Tang · DING ZHAO · Bo Li · Mu Li
Poster | Thu 14:00 | Patching open-vocabulary models by interpolating weights | Gabriel Ilharco · Mitchell Wortsman · Samir Yitzhak Gadre · Shuran Song · Hannaneh Hajishirzi · Simon Kornblith · Ali Farhadi · Ludwig Schmidt
Poster | Tue 14:00 | Fine-Grained Semantically Aligned Vision-Language Pre-Training | Juncheng Li · XIN HE · Longhui Wei · Long Qian · Linchao Zhu · Lingxi Xie · Yueting Zhuang · Qi Tian · Siliang Tang
Poster | | A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval | Hao Li · Jingkuan Song · Lianli Gao · Pengpeng Zeng · Haonan Zhang · Gongfu Li
Poster | Wed 14:00 | LAION-5B: An open large-scale dataset for training next generation image-text models | Christoph Schuhmann · Romain Beaumont · Richard Vencu · Cade Gordon · Ross Wightman · Mehdi Cherti · Theo Coombes · Aarush Katta · Clayton Mullis · Mitchell Wortsman · Patrick Schramowski · Srivatsa Kundurthy · Katherine Crowson · Ludwig Schmidt · Robert Kaczmarczyk · Jenia Jitsev
Poster | Wed 9:00 | MACK: Multimodal Aligned Conceptual Knowledge for Unpaired Image-text Matching | Yan Huang · Yuming Wang · Yunan Zeng · Liang Wang
Poster | Thu 9:00 | CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers | Ming Ding · Wendi Zheng · Wenyi Hong · Jie Tang
Workshop | | Personalizing Text-to-Image Generation via Aesthetic Gradients | Victor Gallego
Workshop | Fri 12:00 | Paper Spotlight: Personalizing Text-to-Image Generation via Aesthetic Gradients | Victor Gallego
Affinity Workshop | | Learning by Injection: Attention Embedded Recurrent Neural Network for Amharic Text-image Recognition | Tariku Adane Gelaw · Birhanu Hailu Belay · WELEKIROS GEBRESLASIE
Workshop | | Making Text-to-Image Diffusion Models Zero-Shot Image-to-Image Editors by Inferring "Random Seeds" | Chen Henry Wu · Fernando D De la Torre