Developmental machine learning studies how artificial agents can model the way children learn open-ended repertoires of skills. Such agents need to create and represent goals, select which ones to pursue, and learn to achieve them. Recent approaches have considered goal spaces that were either fixed and hand-defined or learned using generative models of states. This limited agents to sampling goals within the distribution of known effects. We argue that the ability to imagine out-of-distribution goals is key to enabling creative discoveries and open-ended learning. Children do so by leveraging the compositionality of language as a tool to imagine descriptions of outcomes they have never experienced before, targeting them as goals during play. We introduce IMAGINE, an intrinsically motivated deep reinforcement learning architecture that models this ability. Such imaginative agents, like children, benefit from the guidance of a social peer who provides language descriptions. To take advantage of goal imagination, agents must be able to leverage these descriptions to interpret their imagined out-of-distribution goals. This generalization is made possible by modularity: a decomposition between a learned goal-achievement reward function and a policy, both relying on deep sets, gated attention and object-centered representations. We introduce the Playground environment and study how this form of goal imagination improves generalization and exploration over agents lacking this capacity. In addition, we identify the properties of goal imagination that enable these results and study the impacts of modularity and social interactions.
Author Information
Cédric Colas (INRIA)
Tristan Karch (Inria)
Nicolas Lair (Inserm Robot Cognition Lab)
Jean-Michel Dussoux (Cloud Temple)
Clément Moulin-Frier (Inria)
Peter F Dominey (INSERM/CNRS)
Pierre-Yves Oudeyer (INRIA)
More from the Same Authors
-
2022 : Using Confounded Data in Offline RL »
Maxime Gasse · Damien GRASSET · Guillaume Gaudron · Pierre-Yves Oudeyer -
2023 Workshop: Intrinsically Motivated Open-ended Learning (IMOL) Workshop »
Cédric Colas · Laetitia Teodorescu · Nadia Ady · Cansu Sancaktar · Junyi Chu -
2022 Workshop: LaReL: Language and Reinforcement Learning »
Laetitia Teodorescu · Laura Ruis · Tristan Karch · Cédric Colas · Paul Barde · Jelena Luketina · Athul Jacob · Pratyusha Sharma · Edward Grefenstette · Jacob Andreas · Marc-Alexandre Côté -
2022 Poster: EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL »
Thomas Carta · Pierre-Yves Oudeyer · Olivier Sigaud · Sylvain Lamprier -
2021 : Grounding an Ecological Theory of Artificial Intelligence in Human Evolution »
Eleni Nisioti · Clément Moulin-Frier -
2021 : Sculpting (human-like) AI systems by sculpting their (social) environments »
Pierre-Yves Oudeyer -
2021 Poster: Grounding Spatio-Temporal Language with Transformers »
Tristan Karch · Laetitia Teodorescu · Katja Hofmann · Clément Moulin-Frier · Pierre-Yves Oudeyer -
2020 : Panel discussion »
Pierre-Yves Oudeyer · Marc Bellemare · Peter Stone · Matt Botvinick · Susan Murphy · Anusha Nagabandi · Ashley Edwards · Karen Liu · Pieter Abbeel -
2020 : Invited talk: Pierre-Yves Oudeyer "Machines that invent their own problems: Towards open-ended learning of skills" »
Pierre-Yves Oudeyer -
2020 Poster: Hierarchically Organized Latent Modules for Exploratory Search in Morphogenetic Systems »
Mayalen Etcheverry · Clément Moulin-Frier · Pierre-Yves Oudeyer -
2020 Oral: Hierarchically Organized Latent Modules for Exploratory Search in Morphogenetic Systems »
Mayalen Etcheverry · Clément Moulin-Frier · Pierre-Yves Oudeyer -
2019 : Poster session »
Candace Ross · Yassine Mrabet · Sanjay Subramanian · Geoffrey Cideron · Jesse Mu · Suvrat Bhooshan · Eda Okur Kavil · Jean-Benoit Delbrouck · Yen-Ling Kuo · Nicolas Lair · Gabriel Ilharco · T.S. Jayram · Alba María Herrera Palacio · Chihiro Fujiyama · Olivier Tieleman · Anna Potapenko · Guan-Lin Chao · Thomas Sutter · Olga Kovaleva · Farley Lai · Xin Wang · Vasu Sharma · Catalina Cangea · Nikhil Krishnaswamy · Yuta Tsuboi · Alexander Kuhnle · Khanh Nguyen · Dian Yu · Homagni Saha · Jiannan Xiang · Vijay Venkataraman · Ankita Kalra · Ning Xie · Derek Doran · Travis Goodwin · Asim Kadav · Shabnam Daghaghi · Jason Baldridge · Jialin Wu · Jingxiang Lin · Unnat Jain -
2018 : Poster Session 1 + Coffee »
Tom Van de Wiele · Rui Zhao · J. Fernando Hernandez-Garcia · Fabio Pardo · Xian Yeow Lee · Xiaolin Andy Li · Marcin Andrychowicz · Jie Tang · Suraj Nair · Juhyeon Lee · Cédric Colas · S. M. Ali Eslami · Yen-Chen Wu · Stephen McAleer · Ryan Julian · Yang Xue · Matthia Sabatelli · Pranav Shyam · Alexandros Kalousis · Giovanni Montana · Emanuele Pesce · Felix Leibfried · Zhanpeng He · Chunxiao Liu · Yanjun Li · Yoshihide Sawada · Alexander Pashevich · Tejas Kulkarni · Keiran Paster · Luca Rigazio · Quan Vuong · Hyunggon Park · Minhae Kwon · Rivindu Weerasekera · Shamane Siriwardhanaa · Rui Wang · Ozsel Kilinc · Keith Ross · Yizhou Wang · Simon Schmitt · Thomas Anthony · Evan Cater · Forest Agostinelli · Tegg Sung · Shirou Maruyama · Alexander Shmakov · Devin Schwab · Mohammad Firouzi · Glen Berseth · Denis Osipychev · Jesse Farebrother · Jianlan Luo · William Agnew · Peter Vrancx · Jonathan Heek · Catalin Ionescu · Haiyan Yin · Megumi Miyashita · Nathan Jay · Noga H. Rotman · Sam Leroux · Shaileshh Bojja Venkatakrishnan · Henri Schmidt · Jack Terwilliger · Ishan Durugkar · Jonathan Sauder · David Kas · Arash Tavakoli · Alain-Sam Cohen · Philip Bontrager · Adam Lerer · Thomas Paine · Ahmed Khalifa · Ruben Rodriguez · Avi Singh · Yiming Zhang -
2016 Demonstration: Autonomous exploration, active learning and human guidance with open-source Poppy humanoid robot platform and Explauto library »
Sébastien Forestier · Yoan Mollard · Pierre-Yves Oudeyer -
2012 Poster: Exploration in Model-based Reinforcement Learning by Empirically Estimating Learning Progress »
Manuel Lopes · Tobias Lang · Marc Toussaint · Pierre-Yves Oudeyer