Skip to yearly menu bar Skip to main content


Poster

PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining

Yuting Gao ⋅ Jinfeng Liu ⋅ Zihan Xu ⋅ Jun Zhang ⋅ Ke Li ⋅ Rongrong Ji ⋅ Chunhua Shen

Abstract

Video

Chat is not available.