Skip to yearly menu bar Skip to main content


Poster
in
Affinity Event: LatinX in AI

Oracle-RLAIF: An Improved Fine-Tuning Framework for Multi-modal Video Models through Reinforcement Learning from Ranking Feedback

Derek Shi ⋅ Ruben Glatt ⋅ Christine Klymko ⋅ Shubham Mohole ⋅ Hongjun Choi ⋅ Shashank Kushwaha ⋅ Wesam Sakla ⋅ Felipe Leno da Silva

Abstract

Chat is not available.