firstbacksecondback
7 Results
Poster
|
Thu 9:00 |
Mutual Information Divergence: A Unified Metric for Multimodal Generative Models Jin-Hwa Kim · Yunji Kim · Jiyoung Lee · Kang Min Yoo · Sang-Woo Lee |
|
Workshop
|
Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning Olivia Wiles · Isabela Albuquerque · Sven Gowal |
||
Poster
|
Wed 14:00 |
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning Yujia Xie · Luowei Zhou · Xiyang Dai · Lu Yuan · Nguyen Bach · Ce Liu · Michael Zeng |
|
Poster
|
Wed 14:00 |
Learning Distinct and Representative Modes for Image Captioning Qi Chen · Chaorui Deng · Qi Wu |
|
Poster
|
Tue 14:00 |
GLIPv2: Unifying Localization and Vision-Language Understanding Haotian Zhang · Pengchuan Zhang · Xiaowei Hu · Yen-Chun Chen · Liunian Li · Xiyang Dai · Lijuan Wang · Lu Yuan · Jenq-Neng Hwang · Jianfeng Gao |
|
Poster
|
Thu 14:00 |
Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment Daniel Vera Nieto · Luigi Celona · Clara Fernandez Labrador |
|
Poster
|
Thu 14:00 |
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone Zi-Yi Dou · Aishwarya Kamath · Zhe Gan · Pengchuan Zhang · Jianfeng Wang · Linjie Li · Zicheng Liu · Ce Liu · Yann LeCun · Nanyun Peng · Jianfeng Gao · Lijuan Wang |