

Poster

Reward learning from human preferences and demonstrations in Atari

Borja Ibarz ⋅ Jan Leike ⋅ Tobias Pohlen ⋅ Geoffrey Irving ⋅ Shane Legg ⋅ Dario Amodei
2018 Poster

Abstract
