Timezone: »
The ability to compose learned skills to solve new tasks is an important property for lifelong-learning agents. In this work we formalise the logical composition of tasks as a Boolean algebra. This allows us to formulate new tasks in terms of the negation, disjunction and conjunction of a set of base tasks. We then show that by learning goal-oriented value functions and restricting the transition dynamics of the tasks, an agent can solve these new tasks with no further learning. We prove that by composing these value functions in specific ways, we immediately recover the optimal policies for all tasks expressible under the Boolean algebra. We verify our approach in two domains---including a high-dimensional video game environment requiring function approximation---where an agent first learns a set of base skills, and then composes them to solve a super-exponential number of new tasks.
Author Information
Geraud Nangue Tasse (University of the Witwatersrand)
Steven James (University of the Witwatersrand)
Benjamin Rosman (University of the Witwatersrand)
More from the Same Authors
-
2021 : Generalisation in Lifelong Reinforcement Learning through Logical Composition »
Geraud Nangue Tasse · Steven James · Benjamin Rosman -
2021 : The challenge of redundancy on multi agent value factorisation »
Siddarth Singh · Benjamin Rosman -
2022 : Skill Machines: Temporal Logic Composition in Reinforcement Learning »
Geraud Nangue Tasse · Devon Jarvis · Steven James · Benjamin Rosman -
2023 Poster: Dynamics Generalisation via Adaptive Policies »
Michael Beukman · Devon Jarvis · Richard Klein · Steven James · Benjamin Rosman -
2022 Workshop: Broadening Research Collaborations »
Sara Hooker · Rosanne Liu · Pablo Samuel Castro · FatemehSadat Mireshghallah · Sunipa Dev · Benjamin Rosman · João Madeira Araújo · Savannah Thais · Sara Hooker · Sunny Sanyal · Tejumade Afonja · Swapneel Mehta · Tyler Zhu -
2018 Poster: Zero-Shot Transfer with Deictic Object-Oriented Representation in Reinforcement Learning »
Ofir Marom · Benjamin Rosman