Skip to yearly menu bar Skip to main content


Eureka: Human-Level Reward Design via Coding Large Language Models

Jason Ma ⋅ William Liang ⋅ Guanzhi Wang ⋅ De-An Huang ⋅ Osbert Bastani ⋅ Dinesh Jayaraman ⋅ Yuke Zhu ⋅ Linxi Fan ⋅ Animashree Anandkumar

Abstract

Video

Chat is not available.