firstbacksecondback
6 Results
Workshop
|
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration Changnan Xiao · Haosen Shi · Jiajun Fan · Shihong Deng · Haiyan Yin |
||
Workshop
|
In-Context Policy Iteration Ethan Brooks · Logan Walls · Richard L Lewis · Satinder Singh |
||
Poster
|
On the role of overparameterization in off-policy Temporal Difference learning with linear function approximation Valentin Thomas |
||
Poster
|
Tue 14:00 |
Reinforcement Learning with Logarithmic Regret and Policy Switches Grigoris Velegkas · Zhuoran Yang · Amin Karbasi |
|
Poster
|
Tue 14:00 |
Confident Approximate Policy Iteration for Efficient Local Planning in qπ-realizable MDPs Gellért Weisz · András György · Tadashi Kozuno · Csaba Szepesvari |
|
Poster
|
Tue 9:00 |
Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design Andrew Wagenmaker · Kevin Jamieson |