NeurIPS Poster The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure

Poster

The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure

Tyler Sam · Yudong Chen · Christina Yu

West Ballroom A-D #6906

[ Abstract ]

[ Paper] [ OpenReview]

Thu 12 Dec 4:30 p.m. PST — 7:30 p.m. PST

Abstract: Many reinforcement learning (RL) algorithms are too costly to use in practice due to the large sizes

S, A

$S,A$ of the problem's state and action space. To resolve this issue, we study transfer RL with latent low rank structure. We consider the problem of transferring a latent low rank representation when the source and target MDPs have transition kernels with Tucker rank

(S, d, A)

$(S, d, A)$ ,

(S, S, d), (d, S, A)

$(S ,S , d), (d, S , A )$ , or

(d, d, d)

$(d , d , d )$ . In each setting, we introduce the transfer-ability coefficient

α

$\alpha$ that measures the difficulty of representational transfer. Our algorithm learns latent representations in each source MDP and then exploits the linear structure to remove the dependence on

S, A

$S , A$ , or

S A

$SA$ in the target MDP regret bound. We complement our positive results with information theoretic lower bounds that show our algorithms (excluding the (

d, d, d

$d, d, d$ ) setting) are minimax-optimal with respect to

α

$\alpha$ .

Chat is not available.