

Poster

State-free Reinforcement Learning

Mingyu Chen · Aldo Pacchiano · Xuezhou Zhang

East Exhibit Hall A-C #4805
Wed 11 Dec 11 a.m. PST — 2 p.m. PST

Abstract: In this work, we study the \textit{state-free RL} problem, where the algorithm has no information about the states before interacting with the environment. Specifically, denoting the reachable state set by $\mathcal{S}^\Pi := \{ s \mid \max_{\pi\in \Pi} q^{P, \pi}(s) > 0 \}$, we design an algorithm that requires no information on the state space $\mathcal{S}$ while achieving a regret that is completely independent of $\mathcal{S}$ and depends only on $\mathcal{S}^\Pi$. We view this as a concrete first step towards \textit{parameter-free RL}, with the goal of designing RL algorithms that require no hyper-parameter tuning.
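To make the definition of the reachable state set concrete, here is a minimal sketch (not the authors' algorithm) of how $\mathcal{S}^\Pi$ can be computed for a small tabular MDP. It relies on the observation that a state has positive occupancy $q^{P,\pi}(s)$ under some policy exactly when it can be reached from the initial distribution through transitions with positive probability, so a breadth-first search suffices. The toy MDP, function names, and parameters below are illustrative assumptions.

```python
# Illustrative sketch of the reachable state set
# S^Pi = { s : max_pi q^{P,pi}(s) > 0 } for a tabular MDP.
# Assumption: when maximizing over all policies, positive occupancy is
# equivalent to graph reachability over positive-probability transitions.

from collections import deque
import numpy as np

def reachable_states(P, mu0):
    """P: transition tensor of shape (S, A, S); mu0: initial distribution over S.
    Returns the set of states s with max_pi q^{P,pi}(s) > 0."""
    S, A, _ = P.shape
    frontier = deque(s for s in range(S) if mu0[s] > 0)
    reachable = set(frontier)
    while frontier:
        s = frontier.popleft()
        for a in range(A):
            for s_next in range(S):
                if P[s, a, s_next] > 0 and s_next not in reachable:
                    reachable.add(s_next)
                    frontier.append(s_next)
    return reachable

# Toy 4-state, 2-action MDP in which state 3 can never be entered.
P = np.zeros((4, 2, 4))
P[0, 0, 1] = 1.0   # action 0 moves 0 -> 1
P[0, 1, 2] = 1.0   # action 1 moves 0 -> 2
P[1, :, 1] = 1.0   # states 1 and 2 are absorbing
P[2, :, 2] = 1.0
P[3, :, 3] = 1.0   # state 3 only loops to itself
mu0 = np.array([1.0, 0.0, 0.0, 0.0])

print(reachable_states(P, mu0))  # {0, 1, 2}: the set the regret depends on
```

In this example the regret of a state-free algorithm would scale with the three reachable states rather than with the full four-state space, even though the algorithm is never told the state space in advance.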
