RLVR vs. Distillation: Understanding Accuracy and Capability in LLM Mathematical Reasoning

Minwu Kim · Anubhav Shrestha · Safal Shrestha · Aadim Nepal · Keith Ross

Abstract
