firstbacksecondback
800 Results
Poster
|
Wed 15:00 |
LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition Haoxuan Qu · Xiaofei Hui · Yujun Cai · Jun Liu |
|
Poster
|
Thu 15:00 |
PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance Peiqing Yang · Shangchen Zhou · Qingyi Tao · Chen Change Loy |
|
Workshop
|
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View Raphael Schumann · Wanrong Zhu · Weixi Feng · Tsu-Jui Fu · Stefan Riezler · William Yang Wang |
||
Workshop
|
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View Raphael Schumann · Wanrong Zhu · Weixi Feng · Tsu-Jui Fu · Stefan Riezler · William Yang Wang |
||
Poster
|
Tue 8:45 |
Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation Yun Xing · Jian Kang · Aoran Xiao · Jiahao Nie · Ling Shao · Shijian Lu |
|
Poster
|
Thu 15:00 |
Perception Test: A Diagnostic Benchmark for Multimodal Video Models Viorica Patraucean · Lucas Smaira · Ankush Gupta · Adria Recasens · Larisa Markeeva · Dylan Banarse · Skanda Koppula · joseph heyward · Mateusz Malinowski · Yi Yang · Carl Doersch · Tatiana Matejovicova · Yury Sulsky · Antoine Miech · Alexandre Fréchette · Hanna Klimczak · Raphael Koster · Junlin Zhang · Stephanie Winkler · Yusuf Aytar · Simon Osindero · Dima Damen · Andrew Zisserman · Joao Carreira |
|
Workshop
|
Sat 9:45 |
StatTexNet: Evaluating the Importance of Statistical Parameters for Pyramid-Based Texture and Peripheral Vision Models Christian Koevesdi · Vasha DuTell · Anne Harrington · Mark Hamilton · Bill Freeman · Ruth Rosenholtz |
|
Poster
|
Thu 8:45 |
Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models Yichao Cao · Qingfei Tang · Xiu Su · Song Chen · Shan You · Xiaobo Lu · Chang Xu |
|
Poster
|
Tue 8:45 |
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models Sivan Doveh · Assaf Arbelle · Sivan Harary · Roei Herzig · Donghyun Kim · Paola Cascante-Bonilla · Amit Alfassy · Rameswar Panda · Raja Giryes · Rogerio Feris · Shimon Ullman · Leonid Karlinsky |
|
Poster
|
Thu 8:45 |
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models Youquan Liu · Lingdong Kong · Jun CEN · Runnan Chen · Wenwei Zhang · Liang Pan · Kai Chen · Ziwei Liu |
|
Workshop
|
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding Haoxiang Wang · Pavan Kumar Anasosalu Vasu · Fartash Faghri · Raviteja Vemulapalli · Mehrdad Farajtabar · Sachin Mehta · Mohammad Rastegari · Oncel Tuzel · Hadi Pouransari |
||
Workshop
|
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding Haoxiang Wang · Pavan Kumar Anasosalu Vasu · Fartash Faghri · Raviteja Vemulapalli · Mehrdad Farajtabar · Sachin Mehta · Mohammad Rastegari · Oncel Tuzel · Hadi Pouransari |