firstbacksecondback
30 Results
Workshop
|
The Emphatic Approach to Average-Reward Policy Evaluation Jiamin He · Yi Wan · Rupam Mahmood |
||
Workshop
|
Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data Sunil Madhow · Dan Qiao · Yu-Xiang Wang |
||
Poster
|
Tue 14:00 |
Temporally-Consistent Survival Analysis Lucas Maystre · Daniel Russo |
|
Poster
|
Thu 9:00 |
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning Rujie Zhong · Duohan Zhang · Lukas Schäfer · Stefano Albrecht · Josiah Hanna |
|
Workshop
|
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration Changnan Xiao · Haosen Shi · Jiajun Fan · Shihong Deng · Haiyan Yin |
||
Poster
|
A Non-asymptotic Analysis of Non-parametric Temporal-Difference Learning Eloïse Berthier · Ziad Kobeissi · Francis Bach |