Talk
in
Workshop: Offline Reinforcement Learning

Advances in (High-Confidence) Off-Policy Evaluation

Philip Thomas

Abstract: