Poster
State Chrono Representation for Enhancing Generalization in Reinforcement Learning
Jianda Chen · Wen zheng terence Ng · Zichen Chen · Sinno Pan · Tianwei Zhang
West Ballroom A-D #6410
In the context of reinforcement learning with image-based inputs, it is crucial to establish a robust and generalizable state representation. Recent advancements in metric learning, such as deep bisimulation metric approaches, have shown promising results in learning structured low-dimensional representation space from pixel observations, where the distance between states is measured based on task-relevant features. However, these approaches face challenges in demanding generalization tasks and scenarios with non-informative rewards. This is because they fail to capture sufficient long-term information in the learned representations. To address these challenges, we propose a novel State Chrono Representation (SCR) approach. SCR augments state metric-based representations by incorporating extensive temporal information into the update step of bisimulation metric learning. It learns state distances within a temporal framework that considers both future dynamics and cumulative rewards over current and long-term future states. Our learning strategy effectively incorporates future behavioral information into the representation space without introducing a significant number of additional parameters for modeling dynamics. Extensive experiments conducted in DeepMind Control and MetaWorld environments demonstrate that SCR achieves SOTA performance in demanding generalization tasks.
Live content is unavailable. Log in and register to view live content