Skip to yearly menu bar Skip to main content

Workshop: Goal-Conditioned Reinforcement Learning

Is feedback all you need? Leveraging natural language feedback in goal-conditioned RL

Sabrina McCallum · Max Taylor-Davies · Stefano Albrecht · Alessandro Suglia

Keywords: [ learning from feedback ] [ goal-conditioned reinforcement learning ] [ offline reinforcement learning ] [ decision transformer ]


Despite numerous successes, reinforcement learning is still far from replicating the power and flexibility of behaviour learning in humans. One way to help bridge this gap may be to provide learning agents with richer, more humanlike feedback signals in the form of natural language. We adapt the decision transformer architecture to train agents on the BabyAI environment suite using two different types of generated language feedback, and compare the effect of using language feedback in place of return-to-go and goal description conditioning.

Chat is not available.