Timezone: »
Symbolic-Model-Based Reinforcement Learning
Pierre-alexandre Kamienny · Sylvain Lamprier
Event URL: https://openreview.net/forum?id=yeF6cyYU7W »
We investigate using symbolic regression (SR) to model dynamics with mathematical expressions in model-based reinforcement learning (MBRL). While the primary promise of MBRL is to enable sample-efficient learning, most popular MBRL algorithms rely, in order to learn their approximate world model, on black-box over-parametrized neural networks, which are known to be data-hungry and are prone to overfitting in low-data regime. In this paper, we leverage the fact that a large collection of environments considered in RL is governed by physical laws that compose elementary operators e.g $\sin{},\sqrt{\phantom{x}}, \exp{}, \frac{\text{d}}{\text{dt}}$, and we propose to search a world model in the space of interpretable mathematical expressions with SR. We show empirically on simple domains that MBRL can benefit from the extrapolation capabilities and sample efficiency of SR compared to neural models.
We investigate using symbolic regression (SR) to model dynamics with mathematical expressions in model-based reinforcement learning (MBRL). While the primary promise of MBRL is to enable sample-efficient learning, most popular MBRL algorithms rely, in order to learn their approximate world model, on black-box over-parametrized neural networks, which are known to be data-hungry and are prone to overfitting in low-data regime. In this paper, we leverage the fact that a large collection of environments considered in RL is governed by physical laws that compose elementary operators e.g $\sin{},\sqrt{\phantom{x}}, \exp{}, \frac{\text{d}}{\text{dt}}$, and we propose to search a world model in the space of interpretable mathematical expressions with SR. We show empirically on simple domains that MBRL can benefit from the extrapolation capabilities and sample efficiency of SR compared to neural models.
Author Information
Pierre-alexandre Kamienny (Meta)
Sylvain Lamprier (LIP6-UPMC)
More from the Same Authors
-
2022 : Symbolic-Model-Based Reinforcement Learning »
Pierre-alexandre Kamienny · Sylvain Lamprier -
2022 : Privileged Deep Symbolic Regression »
Luca Biggio · Tommaso Bendinelli · Pierre-alexandre Kamienny -
2022 Poster: End-to-end Symbolic Regression with Transformers »
Pierre-alexandre Kamienny · Stéphane d'Ascoli · Guillaume Lample · Francois Charton -
2021 Poster: To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs »
Thomas Scialom · Paul-Alexis Dray · Jacopo Staiano · Sylvain Lamprier · Benjamin Piwowarski -
2020 Poster: ColdGANs: Taming Language GANs with Cautious Sampling Strategies »
Thomas Scialom · Paul-Alexis Dray · Sylvain Lamprier · Benjamin Piwowarski · Jacopo Staiano