Skip to yearly menu bar Skip to main content


Poster
in
Affinity Workshop: LatinX in AI

Oracle-RLAIF: An Improved Fine-Tuning Framework for Multi-modal Video Models through Reinforcement Learning from Ranking Feedback

Derek Shi · Ruben Glatt · Christine Klymko · Shubham Mohole · Hongjun Choi · Shashank Kushwaha · Wesam Sakla · Felipe Leno da Silva

Abstract

Log in and register to view live content