Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Deep Reinforcement Learning

The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models

Alexander Pan ⋅ Kush Bhatia ⋅ Jacob Steinhardt

Abstract

Video

Chat is not available.