

Poster

Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay

Yifan Sun ⋅ Jingyan Shen ⋅ Yibin Wang ⋅ Tianyu Chen ⋅ Zhendong Wang ⋅ Mingyuan Zhou ⋅ Huan Zhang
2025 Poster

Abstract
