NeurIPS Contrastive Abstraction for Reinforcement Learning

Poster
in
Workshop: Generalization in Planning (GenPlan '23)

Contrastive Abstraction for Reinforcement Learning

Vihang Patil · Markus Hofmarcher · Elisabeth Rumetshofer · Sepp Hochreiter

Keywords: [ Reinforcement Learning ] [ Contrastive ] [ Representation Learning ] [ Planning ] [ abstraction ]

[ Abstract ] [ Project Page ]

[ OpenReview]

Abstract:

Learning agents with reinforcement learning is difficult when dealing with long trajectories that involve a large number of states. To address these learning problems effectively, the number of states can be reduced by abstract representations that cluster states. In principle, deep reinforcement learning can find abstract states, but end-to-end learning is unstable. We propose contrastive abstraction learning to find abstract states, where we assume that successive states in a trajectory belong to the same abstract state. Such abstract states may be basic locations, achieved subgoals, inventory, or health conditions. Contrastive abstraction learning first constructs clusters of state representations by contrastive learning and then applies modern Hopfield networks to determine the abstract states. The first phase of contrastive abstraction learning is self-supervised learning, where contrastive learning forces states with sequential proximity to have similar representations. The second phase uses modern Hopfield networks to map similar state representations to the same fixed point, i.e.\ to an abstract state. The level of abstraction can be adjusted by determining the number of fixed points of the modern Hopfield network. Furthermore, contrastive abstraction learning does not require rewards and facilitates efficient reinforcement learning for wide range of downstream tasks. Our experiments demonstrate the effectiveness of contrastive abstraction learning for reinforcement learning.

Chat is not available.

Poster in Workshop: Generalization in Planning (GenPlan '23)

Contrastive Abstraction for Reinforcement Learning

Vihang Patil · Markus Hofmarcher · Elisabeth Rumetshofer · Sepp Hochreiter

Poster
in
Workshop: Generalization in Planning (GenPlan '23)