Accepted papers presentations
in
Affinity Event: LatinX in AI

Oracle-RLAIF: An Improved Fine-Tuning Framework for Multi-modal Video Models through Reinforcement Learning from Oracle Ranking Feedback

Derek Shi

2025 Accepted papers presentations
in
Affinity Event: LatinX in AI

Abstract

Full-paper Derek Shi - Oracle-RLAIF: An Improved Fine-Tuning Framework for Multi-modal Video Models through Reinforcement Learning from Oracle Ranking Feedback

Video

Chat is not available.