Skip to yearly menu bar Skip to main content


DELTA: How Does RL Unlock and Transfer New Algorithms in LLMs?

Yiyou Sun ⋅ Yuhan Cao ⋅ Pohao Huang ⋅ Haoyue Bai ⋅ Hanna Hajishirzi ⋅ Nouha Dziri ⋅ Dawn Song

Abstract

Chat is not available.