Intrinsically Motivated Open-ended Learning (IMOL) Workshop
Cédric Colas · Laetitia Teodorescu · Nadia Ady · Cansu Sancaktar · Junyi Chu
Room 260 - 262
How do humans develop broad and flexible repertoires of knowledge and skills? How can we design autonomous lifelong learning machines with the same abilities? The field of IMOL explores these questions by integrating research on the motivational forces, learning architectures, and developmental and environmental constraints that support the acquisition of open-ended repertoires of skills and knowledge.
At this full-day, in-person NeurIPS workshop, we will gather speakers from a wide diversity of scientific traditions, showcase ongoing research via contributed talks and poster sessions, and provide networking opportunities for research and mentorship discussions.
Schedule
Sat 6:15 a.m. - 6:25 a.m. | Opening Remarks (Introduction) | Cédric Colas
Sat 6:25 a.m. - 7:05 a.m. | Invited Talk | Georg Martius - Intrinsic Motivations for Efficient Exploration in Reinforcement Learning
I will summarize research in the area of intrinsic motivation in the context of learning and exploration, and touch upon open-ended learning in the IMOL community. I will then present our recent work on combining different intrinsic motivation signals, such as learning progress, causal influence, and information gain, with reinforcement learning. A particularly exciting direction is to employ model-based reinforcement learning to let robots learn, through free play, how to interact effectively, driven by information gain and other generic drives. We find that this leads to strong zero-shot generalization to new tasks.
Sat 7:05 a.m. - 7:45 a.m. | Invited Talk | Doina Precup - Towards a General Blueprint for Continual Reinforcement Learning
Intelligent agents must be able to learn by interacting with their environment and to adapt to changes. Continual reinforcement learning provides a natural way to model this process. In this talk, I will discuss an approach for tackling this problem by constructing abstractions, such as intents, options, affordances, and partial models, that allow an agent to generalize its knowledge quickly to new circumstances.
Sat 7:45 a.m. - 8:00 a.m. | Contributed Talk | Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning
Sat 8:00 a.m. - 9:00 a.m. | Break and Posters
Sat 9:00 a.m. - 9:15 a.m. | Contributed Talk | What can AI Learn from Human Exploration?
Sat 9:15 a.m. - 9:55 a.m. | Invited Talk | Michael Tomasello - Agency and Cognitive Development
Modern theories explain children's cognitive development mainly in terms of Bayesian learning (with some innate priors in infancy). But learning cannot be the whole story, or else children could learn anything at any age - which they cannot. They cannot because their capacities to experience and cognitively represent the world are structured by the human species' evolved psychological architecture - inherited from ancient animal ancestors - and this architecture changes in significant ways over the first years of life. The main organizing principle is agency, including shared agency. The developmental proposal is that young infants (below 9 months) are goal-directed agents who cognitively represent and learn about actualities; toddlers are intentional agents who executively represent and learn also about causal, intentional, and logical possibilities; and preschoolers (over 3 years) are metacognitive agents who metacognitively represent and learn also about normative necessities. This agency-based model of cognitive development recognizes the important role of learning, but at the same time places it in the context of the overall agentive organization of children at particular developmental periods.
Sat 9:55 a.m. - 11:30 a.m. | Lunch & Mentoring Session
Sat 11:30 a.m. - 11:45 a.m. | Contributed Talk | Voyager: An Open-Ended Embodied Agent with Large Language Models
Sat 11:45 a.m. - 12:25 p.m. | Invited Talk | Yannick Schroecker - Human-Timescale Adaptation in an Open-Ended Task Space
Foundation models, when trained at scale, have shown impressive capabilities to adapt to new tasks from a few examples provided in context; however, there remains a gap between the abilities of these models and the requirements for acting successfully in embodied domains. To close this gap with reinforcement learning, our agents have to be trained at scale as well. In this talk, I will present recipes towards this end and dive into the details of how we trained AdA, utilizing a vast open-ended task space, to achieve human-timescale adaptation in a 3D embodied domain. The trained agent displays on-the-fly hypothesis-driven exploration and efficient exploitation of acquired knowledge, and can successfully be prompted with first-person demonstrations.
Sat 12:25 p.m. - 1:05 p.m. | Invited Talk | Daniel Polani - Information and its Flow: From Dynamics to Agency and Back
In the last few years, various forms of information flow have been found to be useful quantities for characterizing the decision-making of agents, whether natural or artificial. We here especially consider one particular type of information flow, empowerment, which can be used as an intrinsic motivation derived from the dynamical properties of the external perception-action loop. The present talk will discuss empowerment in the context of its evolutionary motivation and questions of agency, as well as some insightful new links to dynamical systems theory.
Sat 1:05 p.m. - 2:05 p.m. | Break and Posters
Sat 2:05 p.m. - 2:45 p.m. | Invited Talk | Dani Bassett - Agents of Curiosity: Testing Network Theories in Human and Non-Human Inquirers
What is curiosity? Across disciplines, some scholars offer a range of definitions while others eschew definitions altogether. Is the field of curiosity studies simply too young? Should we, as has been argued in neuroscience, press forward in definition-less characterization? At this juncture in the field's history, we turn to an examination of curiosity styles, and ask: How has curiosity been practiced over the last two millennia and how is it practiced today? We exercise a recent historico-philosophical account to catalogue common styles of curiosity and test for their presence as humans browse Wikipedia. Next we consider leading theories from psychology and mechanics that could explain curiosity styles, and formalize those theories in the mathematics of network science. Such a formalization allows theories of curiosity to be explicitly tested in human behavioral data and for their relative mental affordances to be investigated. Moreover, the formalization allows us to train artificial agents to build in human-like curiosity styles through reinforcement learning. Finally, with styles and theories in hand, we expand out to a study of several million users of Wikipedia to understand how curiosity styles might or might not differ around the world and track factors of social inequality. Collectively, our findings support the notion that curiosity is practiced, differently across people, as unique styles of network building, thereby providing a connective counterpoint to the common acquisitional account of curiosity in humans.
Sat 2:45 p.m. - 3:25 p.m. | Invited Talk | Natalia Vélez - Studying Large-Scale Collaborations in Open-Ended Games
Humans have developed technological repertoires that have enabled us to survive in virtually every habitat on Earth. However, it can be difficult to trace how these technologies came to be: folk histories of technological achievement often highlight a few brilliant individuals, while losing sight of the rest of the community's contributions. In this talk, I will present work analyzing player behavior in One Hour One Life, a multiplayer online game where players can build technologically complex communities over many generations (N = 22,011 players, 2,700 communities, 428,255 lives lived, 127,768,267 social interactions detected). This dataset provides a unique opportunity to test how community dynamics shape technological development in an open-ended world: Players can form communities that endure for many generations, and they can combine thousands of unique materials to build vast technological repertoires. At a macroscopic level, we find that community characteristics, such as population size, interconnectedness, and specialization, predict the size and stability of a community's technological repertoire. Zooming in, we find that individual players contribute their own expertise to technological development: participants consistently perform similar jobs in the different communities they are placed in, and acquire expertise in these jobs through social interaction. Our work tests theories of cultural evolution and economic complexity at scale and provides a methodological basis to study the interplay between individual expertise and community structures.
Sat 3:25 p.m. - 3:30 p.m. | Closing Remarks
Poster | Reinforcement Learning of Diverse Skills using Mixture of Deep Experts
Agents that can acquire diverse skills to solve the same task have an advantage over other agents if, for example, unexpected environmental changes occur. However, Reinforcement Learning (RL) policies mainly rely on Gaussian parameterization, preventing them from learning multi-modal, diverse skills. In this work, we propose a novel RL approach for training policies that exhibit diverse behavior. To this end, we propose a highly non-linear Mixture of Experts (MoE) as the policy representation, where each expert formalizes a skill as a contextual motion primitive. The context defines the task, which can be, for instance, the goal-reaching position of the agent, or changing physical parameters like friction. Given a context, our trained policy first selects an expert out of the repertoire of skills and subsequently adapts the parameters of the contextual motion primitive. To incentivize our policy to learn diverse skills, we leverage a maximum entropy objective combined with a per-expert context distribution that we optimize alongside each expert. The per-expert context distribution allows each expert to focus on a context sub-space and boosts learning speed. However, these distributions need to be able to represent multi-modality and hard discontinuities in the environment's context probability space. We meet these requirements by leveraging energy-based models to represent the per-expert context distributions and show how we can efficiently train them using the standard policy gradient objective.
Onur Celik · Aleksandar Taranovic · Gerhard Neumann
Poster | XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
We present XLand-MiniGrid, a suite of tools and grid-world environments for meta-reinforcement learning research inspired by the diversity and depth of XLand and the simplicity and minimalism of MiniGrid. XLand-MiniGrid is written in JAX, designed to be highly scalable, and can potentially run on GPU or TPU accelerators, democratizing large-scale experimentation with limited resources. To demonstrate the generality of our library, we have implemented some well-known single-task environments as well as new meta-learning environments capable of generating $10^8$ distinct tasks. We have empirically shown that the proposed environments can scale up to $2^{13}$ parallel instances on the GPU, reaching tens of millions of steps per second.
Alexander Nikulin · Vladislav Kurenkov · Ilya Zisman · Viacheslav Sinii · Artem Agarkov · Sergey Kolesnikov
Poster | Progressively Efficient Communication
The ability to rapidly acquire knowledge from humans is a fundamental skill for AI assistants. Traditional frameworks like imitation and reinforcement learning employ fixed, low-level communication protocols, making them inefficient for teaching complex tasks. In contrast, humans are capable of communicating nuanced ideas with progressive efficiency by establishing shared vocabularies with others and expanding those vocabularies with increasingly abstract words. Mimicking this phenomenon in human communication, we introduce a novel learning framework named Communication-Efficient Interactive Learning (CEIL). By equipping a learning agent with a rich, dynamic language and an intrinsic motivation to communicate with minimal effort, CEIL leads to the emergence of a human-like pattern where the learner and the teacher communicate more efficiently over time by exchanging increasingly abstract intentions. CEIL demonstrates impressive learning efficiency on a 2D MineCraft domain featuring long-horizon decision-making tasks. In particular, it performs robustly with teachers modeled after human pragmatic communication behavior.
Khanh Nguyen · Ruijie Zheng · Hal Daumé III · Furong Huang · Karthik Narasimhan
Poster | Towards a General Framework for Continual Learning with Pre-training
In this work, we present a general framework for continual learning of sequentially arrived tasks with the use of pre-training, which has emerged as a promising direction for artificial intelligence systems to accommodate real-world dynamics. From a theoretical perspective, we decompose its objective into three hierarchical components, including within-task prediction, task-identity inference, and task-adaptive prediction. Then we propose an innovative approach to explicitly optimize these components with parameter-efficient fine-tuning (PEFT) techniques and representation statistics. We empirically demonstrate the superiority and generality of our approach in downstream continual learning, and further explore the applicability of PEFT techniques in upstream continual learning. We also discuss the biological basis of the proposed framework with recent advances in neuroscience.
Liyuan Wang · Jingyi Xie · Xingxing Zhang · Hang Su · Jun Zhu
Poster | Intrinsically Motivated Social Play in Virtual Infants
Infants explore their complex physical and social environment in an organized way. To gain insight into what intrinsic motivations may help structure this exploration, we create a virtual infant agent and place it in a developmentally-inspired 3D environment with no external rewards. The environment has a virtual caregiver agent with the capability to interact contingently with the infant agent in ways that resemble play. We test intrinsic reward functions that are similar to motivations that have been proposed to drive exploration in humans: surprise, uncertainty, novelty, and learning progress. The reward functions that are proxies for novelty and uncertainty are the most successful in generating diverse experiences and activating the environment contingencies. We also find that learning a world model in the presence of an attentive caregiver helps the infant agent learn how to predict scenarios with challenging social and physical dynamics. Our findings provide insight into how curiosity-like intrinsic rewards and contingent social interaction lead to social behavior and the creation of a robust predictive world model.
Chris Doyle · Sarah Shader · Michelle Lau · Megumi Sano · Dan Yamins · Nick Haber
Poster | Why Open-Ended Agency Should be Formalized on Hierarchical Empowerment-Gain Maximization
We argue that reward-maximization is insufficient as an objective for open-ended agency due to the complexity of the control problems. Instead, we argue that the intrinsic motivation metric of hierarchical empowerment might be particularly powerful for generating goals for life-long agents.
Thomas Ringstrom
Poster | Voyager: An Open-Ended Embodied Agent with Large Language Models
We introduce Voyager, the first LLM-powered embodied lifelong learning agent in an open-ended world that continuously explores, acquires diverse skills, and makes novel discoveries without human intervention in Minecraft. Voyager consists of three key components: 1) an automatic curriculum that maximizes exploration, 2) an ever-growing skill library of executable code for storing and retrieving complex behaviors, and 3) a new iterative prompting mechanism that incorporates environment feedback, execution errors, and self-verification for program improvement. Voyager interacts with GPT-4 via blackbox queries, which bypasses the need for model parameter fine-tuning. The skills developed by Voyager are temporally extended, interpretable, and compositional, which compounds the agent's capability rapidly and alleviates catastrophic forgetting. Empirically, Voyager demonstrates strong in-context lifelong learning capabilities. It outperforms prior SOTA by obtaining 3.1x more unique items, unlocking tech tree milestones up to 15.3x faster, and traveling 2.3x longer distances. Voyager is able to utilize the learned skill library in a new Minecraft world to solve novel tasks from scratch, while other techniques struggle to generalize.
Guanzhi Wang · Yuqi Xie · Yunfan Jiang · Ajay Mandlekar · Chaowei Xiao · Yuke Zhu · Linxi Fan · Animashree Anandkumar
Poster | High-fidelity social learning via shared episodic memories improves collaborative foraging
Social learning, a cornerstone of cultural evolution, allows individuals to acquire knowledge by observing and imitating others. Central to its efficacy is episodic memory, which records specific behavioral sequences to facilitate learning. This study examines their interrelation in the context of collaborative foraging. Specifically, we examine how variations in the frequency and fidelity of social learning impact collaborative foraging, and how the length of behavioral sequences preserved in agents' episodic memory modulates these factors. To this end, we deploy Sequential Episodic Control agents capable of sharing behavioral sequences stored in their episodic memories with one another. Our findings indicate that high-frequency, high-fidelity social learning promotes more distributed and efficient resource collection, a benefit that remains consistent regardless of the length of the shared episodic memories. In contrast, low-fidelity social learning shows no advantages over non-social learning in terms of resource acquisition. In addition, storing and disseminating longer episodic memories enhances performance up to a certain threshold, beyond which increased memory capacity does not yield further benefits. Our findings emphasize the crucial role of high-fidelity social learning in collaborative foraging, and illuminate the intricate relationship between episodic memory capacity and the quality and frequency of social learning. This work aims to highlight the potential of neuro-computational models like episodic control algorithms in understanding social learning and offers a new perspective for investigating the cognitive mechanisms underlying open-ended cultural evolution.
Ismael T. Freire · Paul Verschure
Poster | Imprinting in autonomous artificial agents using deep reinforcement learning
Imprinting is a common survival strategy in which an animal learns a lasting preference for its parents and siblings early in life. To date, however, the origins and computational foundations of imprinting have not been formally established. What learning mechanisms generate imprinting behavior in newborn animals? Here, we used deep reinforcement learning and intrinsic motivation (curiosity), two learning mechanisms deeply rooted in psychology and neuroscience, to build autonomous artificial agents that imprint. When we raised our artificial agents together in the same environment, akin to the early social experiences of newborn animals, the agents spontaneously developed imprinting behavior. Our results provide a pixels-to-actions computational model of animal imprinting. We show that domain-general learning mechanisms, deep reinforcement learning and intrinsic motivation, are sufficient for embodied agents to rapidly learn core social behaviors from unsupervised natural experience.
Donsuk Lee · Samantha Wood · Justin Wood
Poster | Neurobehavior of exploring AI agents
We study intrinsically motivated exploration by artificially intelligent (AI) agents in animal-inspired settings. We construct virtual environments that are 3D, vision-based, physics-simulated, and based on two established animal assays: labyrinth exploration, and novel object interaction. We assess Plan2Explore (P2E), a leading model-based, intrinsically motivated deep reinforcement learning agent, in these environments. We characterize and compare the behavior of the AI agents to animal behavior, using measures devised for animal neuroethology. P2E exhibits some similarities to animal behavior, but is dramatically less efficient than mice at labyrinth exploration. We further characterize the neural dynamics associated with world modeling in the novel-object assay. We identify latent neural population activity axes linearly associated with representing object color and proximity. These results identify areas of improvement for existing AI agents, and make strides toward understanding the learned neural dynamics that guide their behavior.
Isaac Kauvar · Chris Doyle · Nick Haber
Poster | Reconciling Spatial and Temporal Abstractions for Goal Representation
Goal representation affects the performance of Hierarchical Reinforcement Learning (HRL) algorithms by decomposing complex problems into easier subtasks. Recent studies show that representations that preserve temporally abstract environment dynamics are successful in solving difficult problems with theoretical guarantees for optimality. These methods, however, cannot scale to tasks where environment dynamics increase in complexity. On the other hand, other efforts have tried to use spatial abstraction to mitigate these issues, but their limitations include poor scalability to high-dimensional environments and dependency on prior knowledge. In this work, we propose a novel three-layer HRL algorithm that introduces, at different levels of the hierarchy, both a spatial and a temporal goal abstraction. We provide a theoretical study of the regret bounds of the learned policies. We evaluate the approach on complex continuous control tasks, demonstrating the effectiveness of the spatial and temporal abstractions learned by this approach.
Mehdi Zadem · Sergio Mover · Sao Mai Nguyen
Poster | What can AI Learn from Human Exploration? Intrinsically-Motivated Humans and Agents in Open-World Exploration
What drives exploration? Understanding intrinsic motivation is a long-standing question in both cognitive science and artificial intelligence (AI); numerous exploration objectives have been proposed and tested in human experiments and used to train reinforcement learning (RL) agents. However, experiments in the former are often conducted in simplistic environments that do not capture the complexity of real-world exploration. On the other hand, experiments in the latter use more complex environments, yet the trained RL agents fail to come close to human exploration efficiency. To study this gap, we propose a framework for directly comparing human and agent exploration in an open-ended environment, Crafter. We study how well commonly-proposed information-theoretic objectives for intrinsic motivation relate to actual human and agent behaviour, finding that human exploration consistently shows a significant positive correlation with Entropy, Information Gain, and Empowerment. Surprisingly, we find that intrinsically-motivated RL agent exploration does not show the same significant correlation consistently, despite being designed to optimize objectives that approximate Entropy or Information Gain. In a preliminary analysis of verbalizations, we find that children's verbalizations of goals correlate strongly and positively with Empowerment, suggesting that goal-setting may be an important aspect of efficient exploration.
Yuqing Du · Eliza Kosoy · Alyssa L Dayan · Maria Rufova · Pieter Abbeel · Alison Gopnik
Poster | Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity
Intrinsic reward functions are widely used to improve exploration in reinforcement learning. We first examine the conditions and causes of catastrophic forgetting of the intrinsic reward function, and propose a new method, FarCuriosity, inspired by how humans and non-human animals learn. The method depends on fragmentation and recall: an agent fragments an environment based on surprisal signals, and uses different local curiosity modules (prediction-based intrinsic reward functions) for each division so that modules are not trained on the entire environment. At a fragmentation event, the agent stores the current module in long-term memory (LTM) and either initializes a new module or recalls a previously stored module based on its match with the current state. With fragmentation and recall, FarCuriosity achieves less forgetting and better overall performance in games with varied and heterogeneous environments in the Atari benchmark suite of tasks. Thus, this work highlights the problem of catastrophic forgetting in prediction-based curiosity methods and proposes a first solution.
Jaedong Hwang · Zhang-Wei Hong · Eric Chen · Akhilan Boopathy · Pulkit Agrawal · Ila Fiete
Poster | FOCUS: Object-Centric World Models for Robotic Manipulation
Understanding the world in terms of objects and the possible interactions with them is an important cognitive ability, especially in robotic manipulation. However, learning a structured world model that allows controlling the agent accurately remains a challenge. To address this, we propose FOCUS, a model-based agent that learns an object-centric world model. The learned representation makes it possible to provide the agent with an object-centric exploration mechanism, which encourages the agent to interact with objects and discover useful interactions. We apply FOCUS in several robotic manipulation settings, where we show how our method fosters interactions such as reaching, moving, and rotating the objects in the environment. We further show how this ability to autonomously interact with objects can be used to quickly solve a given task using reinforcement learning with sparse rewards.
Stefano Ferraro · Pietro Mazzaglia · Tim Verbelen · Bart Dhoedt
Poster | DeepThought: an architecture for autonomous self-motivated systems
The ability of large language models (LLMs) to engage in credible dialogues with humans, taking into account the training data and the context of the conversation, has raised discussions about their ability to exhibit intrinsic motivations, agency, or even some degree of consciousness. We argue that the internal architecture of LLMs and their finite and volatile state cannot support any of these properties. By combining insights from complementary learning systems and global neuronal workspace theories, we propose to integrate LLMs and other deep learning systems into a new architecture that is able to exhibit properties akin to agency and self-motivation, and even, more speculatively, some features of consciousness.
Arlindo L Oliveira · Tiago Domingos · Mario Figueiredo · Pedro Lima
Poster | Regularity as Intrinsic Reward for Free Play
We propose regularity as a novel reward signal for intrinsically-motivated reinforcement learning. Taking inspiration from child development, we postulate that striving for structure and order helps guide exploration towards a subspace of tasks that are not favored by naive uncertainty-based intrinsic rewards. Our generalized formulation of Regularity as Intrinsic Reward (RaIR) allows us to operationalize it within model-based reinforcement learning. In a synthetic environment, we showcase the plethora of structured patterns that can emerge from pursuing this regularity objective. We also demonstrate the strength of our method in a multi-object robotic manipulation environment. We incorporate RaIR into free play and use it to complement the model's epistemic uncertainty as an intrinsic reward. Doing so, we witness the autonomous construction of towers and other regular structures during free play, which leads to a substantial improvement in zero-shot downstream task performance on assembly tasks.
Cansu Sancaktar · Justus Piater · Georg Martius
Poster | Children prioritize purely exploratory actions in observe-vs.-bet tasks
In reinforcement learning, agents often need to make decisions between selecting actions that are familiar and have previously yielded positive results (exploitation), and seeking new information that could allow them to uncover more effective actions (exploration). Understanding how humans learn their sophisticated exploratory strategies over the course of their development remains an open question for both computer and cognitive science. Existing studies typically use classic bandit or gridworld tasks that confound the rewarding with the informative characteristics of an outcome. In this study, we adopt an observe-vs.-bet task that separates "pure exploration" from "pure exploitation" by giving participants the option to either observe an instance of an outcome and receive no reward, or to bet on one action that is eventually rewarding, but offers no immediate feedback. We collected data from 33 five-to-seven-year-old children who completed the task at one of three different bias levels. We compared how children performed with both approximate solutions to the partially-observable Markov decision process and meta-reinforcement learning models that were meta-trained on the same decision-making task across different probability levels. We found that the children observe significantly more than the two classes of algorithms and qualitatively more than adults in similar tasks. We then quantified how children's policies differ between the different efficacy levels by fitting probabilistic programming models and by calculating the likelihood of the children's actions under the task-driven model. The fitted parameters of the behavioral model, as well as the direction of the deviation from neural network policies, demonstrate that the primary way children adapt their behavior is by changing the amount of time that they bet on the most-recently-observed arm while maintaining a consistent frequency of observations across bias levels. This suggests both that children model the causal structure of the environment and a "hedging behavior" that would be impossible to detect in standard bandit tasks. The results shed light on how children reason about reward and information, providing an important developmental benchmark that can help shape our understanding of human behavior, which we hope to investigate further using recently-developed neural network reinforcement learning models of reasoning about information and reward.
Eunice Yiu · Kai Sandbrink · Alison Gopnik
-
|
Skill-Based Reinforcement Learning with Intrinsic Reward Matching
(
Poster
)
>
link
While unsupervised skill discovery has shown promise in autonomously acquiring behavioral primitives, there is still a large methodological disconnect between task-agnostic skill pretraining and downstream, task-aware finetuning. We present Intrinsic Reward Matching (IRM), which unifies these two phases of learning via the $\textit{skill discriminator}$, a pretraining model component often discarded during finetuning. Conventional approaches finetune pretrained agents directly at the policy level, often relying on expensive environment rollouts to empirically determine the optimal skill. However, often the most concise yet complete description of a task is the reward function itself, and skill learning methods learn an $\textit{intrinsic}$ reward function via the discriminator that corresponds to the skill policy. We propose to leverage the skill discriminator to $\textit{match}$ the intrinsic and downstream task rewards and determine the optimal skill for an unseen task without environment samples on a Fetch tabletop manipulation task suite.
|
Ademi Adeniji · Amber Xie · Pieter Abbeel 🔗 |
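As an illustrative sketch of the rollout-free selection step this entry describes (not the authors' exact matching procedure), one could score each pretrained skill by how well its intrinsic reward matches the downstream task reward over a batch of states; `intrinsic_reward`, `task_reward`, and the state set are hypothetical stand-ins:

```python
def select_skill(states, intrinsic_reward, task_reward, skills):
    """Reward-matching sketch: pick the skill whose discriminator-based
    intrinsic reward best matches the downstream task reward over a set
    of states, without any environment rollouts.
    intrinsic_reward(state, skill) and task_reward(state) are callables."""
    def mismatch(skill):
        return sum((intrinsic_reward(s, skill) - task_reward(s)) ** 2
                   for s in states) / len(states)
    return min(skills, key=mismatch)
```

The key design point is that skill selection becomes a comparison of reward functions rather than an empirical search over policy rollouts.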
-
|
From Child's Play to AI: Insights into Automated Causal Curriculum Learning
(
Poster
)
>
link
We study how reinforcement learning algorithms and children develop their causal curriculum to achieve a challenging goal that is not solvable at first. Adopting the Procgen environments, which comprise various tasks as challenging goals, we found that 5- to 7-year-old children actively used their current level progress to determine their next step in the curriculum and improved at solving the goal during this process. This suggests that children treat their level progress as an intrinsic reward and are motivated to master easier levels in order to do better at more difficult ones, even without explicit reward. To evaluate RL agents, we exposed them to the same demanding Procgen environments as children and employed several curriculum learning methodologies. Our results demonstrate that RL agents that emulate children by incorporating level progress as an intrinsic reward signal exhibit greater stability and are more likely to converge during training than RL agents reliant solely on extrinsic reward signals for game-solving. Curriculum learning may also offer a significant reduction in the number of frames needed to solve a target environment. Taken together, our human-inspired findings suggest a potential path forward for addressing catastrophic forgetting and domain shift during curriculum learning in RL agents. |
Annya Dahmani · Eunice Yiu · Tabitha Lee · Nan Rosemary Ke · Oliver Kroemer · Alison Gopnik 🔗 |
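A minimal sketch of "level progress as intrinsic reward," as described above, could look like the following; the function names and the mixing weight `beta` are illustrative assumptions, not the paper's implementation:

```python
def progress_bonus(prev_best, new_score, scale=1.0):
    """Hypothetical intrinsic reward: positive only when the agent beats
    its previous best score on the current curriculum level."""
    return scale * max(0.0, new_score - prev_best)

def combined_reward(extrinsic, prev_best, new_score, beta=0.5):
    """Total reward mixing the extrinsic game reward with a
    level-progress bonus, weighted by beta."""
    return extrinsic + beta * progress_bonus(prev_best, new_score)
```

The bonus vanishes once a level is mastered, nudging the agent toward levels where it can still improve.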
-
|
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data
(
Poster
)
>
link
Robotic systems that rely primarily on self-supervised learning have the potential to decrease the amount of human annotation and engineering effort required to learn control strategies. In the same way that prior robotic systems have leveraged self-supervised techniques from computer vision (CV) and natural language processing (NLP), our work builds on prior work showing that reinforcement learning (RL) itself can be cast as a self-supervised problem: learning to reach any goal without human-specified rewards or labels. Despite the seeming appeal, little (if any) prior work has demonstrated how self-supervised RL methods can be practically deployed on robotic systems. By first studying a challenging simulated version of this task, we discover design decisions about architectures and hyperparameters that increase the success rate by $2 \times$. These findings lay the groundwork for our main result: we demonstrate that a self-supervised RL algorithm based on contrastive learning can solve real-world, image-based robotic manipulation tasks, with tasks being specified by a single goal image provided after training.
|
Chongyi Zheng · Benjamin Eysenbach · Homer Walke · Patrick Yin · Kuan Fang · Russ Salakhutdinov · Sergey Levine 🔗 |
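A minimal sketch of the contrastive objective that goal-conditioned contrastive RL methods build on (an InfoNCE-style loss where a state-action embedding should score highest against the goal from its own trajectory) is shown below; it is a generic illustration, not this paper's architecture:

```python
import math

def info_nce_loss(sa_embeds, goal_embeds):
    """InfoNCE sketch: sa_embeds[i] and goal_embeds[i] are a positive
    pair (same trajectory); all other goals serve as negatives.
    Embeddings are lists of equal-length float vectors."""
    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))
    loss = 0.0
    for i, sa in enumerate(sa_embeds):
        logits = [dot(sa, g) for g in goal_embeds]
        m = max(logits)  # log-sum-exp with max subtraction for stability
        log_z = m + math.log(sum(math.exp(l - m) for l in logits))
        loss += log_z - logits[i]  # negative log-softmax of the positive pair
    return loss / len(sa_embeds)
```

The learned critic then doubles as a goal-reaching value estimate, which is what lets a single goal image specify the task at deployment time.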
-
|
Enhancing Understanding in Generative Agents through Active Inquiring
(
Poster
)
>
link
As artificial intelligence advances, Large Language Models (LLMs) have evolved beyond being just tools, becoming human-like agents that can converse, reflect, plan, and set goals. However, these models still struggle with open-ended question answering and often fail to understand unfamiliar scenarios quickly. To address this, we ask: how do humans manage strange situations so effectively? We believe it’s largely due to our natural instinct for curiosity and a built-in desire to predict the future and seek explanations when those predictions don’t align with reality. Unlike humans, LLMs typically accept information passively without an inherent desire to question or doubt, which could be why they struggle to understand new situations. Focusing on this, our study explores the possibility of equipping LLM-agents with human-like curiosity. Can these models move from being passive processors to active seekers of understanding, reflecting human behaviors? And can this adaptation benefit them as it does humans? To explore this, we introduce an innovative experimental framework where generative agents navigate through strange and unfamiliar situations, and their understanding is then assessed through interview questions about those situations. Initial results show notable improvements when models are equipped with traits of surprise and inquiry compared to those without. This research is a step towards creating more human-like agents and highlights the potential benefits of integrating human-like traits in models. |
Jiaxin Ge · Kaiya Zhao · Manuel Cortes · Jovana Kondic · Shuying Luo · Michelangelo Naim · Andrew Ahn · Guangyu Robert Yang 🔗 |
-
|
Codeplay: Autotelic Learning through Collaborative Self-Play in Programming Environments
(
Poster
)
>
link
Autotelic learning is the training setup where agents learn by setting their own goals and trying to achieve them. However, creatively generating freeform goals is challenging for autotelic agents. We present Codeplay, an algorithm casting autotelic learning as a game between a Setter agent and a Solver agent, where the Setter generates programming puzzles of appropriate difficulty and novelty for the Solver, and the Solver learns to achieve them. Early experiments with the Setter demonstrate that one can effectively control the tradeoff between the difficulty of a puzzle and its novelty by tuning the reward of the Setter, a code language model finetuned with deep reinforcement learning. |
Laetitia Teodorescu · Cédric Colas · Matthew Bowers · Thomas Carta · Pierre-Yves Oudeyer 🔗 |
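The difficulty/novelty tradeoff in the Setter's reward could be sketched as below. This is a hypothetical shaping, not the paper's actual reward: it favors puzzles of intermediate difficulty (the Solver succeeds sometimes, but not always) and adds a tunable novelty bonus:

```python
def setter_reward(solver_success_rate, novelty, alpha=1.0, beta=1.0):
    """Hypothetical Setter reward: the difficulty term p * (1 - p) peaks
    at a 50% Solver success rate and vanishes for trivial or impossible
    puzzles; alpha and beta tune the difficulty/novelty tradeoff."""
    difficulty_term = solver_success_rate * (1.0 - solver_success_rate)
    return alpha * difficulty_term + beta * novelty
```

Raising `beta` relative to `alpha` pushes the Setter toward novel puzzles even at the cost of less calibrated difficulty.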
-
|
Learning Diverse Skills for Local Navigation under Multi-constraint Optimality
(
Poster
)
>
link
Despite many successful applications of data-driven control in robotics, extracting meaningful diverse behaviors remains a challenge. Typically, task performance needs to be compromised in order to achieve diversity. In many scenarios, task requirements are specified as a multitude of reward terms, each requiring a different trade-off. In this work, we take a constrained optimization viewpoint on the quality-diversity trade-off and show that we can obtain diverse policies while imposing constraints on their value functions which are defined through distinct rewards. In line with previous work, further control of the diversity level can be achieved through an attract-repel reward term motivated by the Van der Waals force. We demonstrate the effectiveness of our method on a local navigation task where a quadruped robot needs to reach the target within a finite horizon. Finally, our trained policies transfer well to the real 12-DoF quadruped robot, Solo12, and exhibit diverse agile behaviors with successful obstacle traversal. |
Jin Cheng · Marin Vlastelica Pogančić · Pavel Kolev · Chenhao Li · Georg Martius 🔗 |
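The attract-repel term mentioned above can be illustrated with a Lennard-Jones-style function of the behavioral distance between two policies; the exact functional form here is an assumption for illustration, not the paper's formula:

```python
def attract_repel(distance, r0=1.0, eps=1e-6):
    """Van-der-Waals-style diversity term (illustrative): strongly repel
    policies whose behaviors are closer than a preferred distance r0,
    weakly attract those farther away. The reward peaks at distance == r0
    (Lennard-Jones 6-12 shape); eps avoids division by zero."""
    x = r0 / (distance + eps)
    return 2 * x ** 6 - x ** 12
```

Tuning `r0` then sets the target spacing between skills in behavior space, which is one way to control the diversity level.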
-
|
Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning
(
Poster
)
>
link
Both surprise-minimizing and surprise-maximizing (curiosity) objectives for unsupervised reinforcement learning (RL) have been shown to be effective in different environments, depending on the environment's level of natural entropy. However, neither method can perform well across all entropy regimes. In an effort to find a single surprise-based method that will encourage emergent behaviors in any environment, we propose an agent that can adapt its objective depending on the entropy conditions it faces, by framing the choice as a multi-armed bandit problem. We devise a novel intrinsic feedback signal for the bandit which captures the ability of the agent to control the entropy in its environment. We demonstrate that such agents can learn to control entropy and exhibit emergent behaviors in both high- and low-entropy regimes. |
Adriana Hugessen · Roger Creus Castanyer · Glen Berseth 🔗 |
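The bandit framing above can be sketched as an epsilon-greedy choice between the two surprise objectives, updated with the entropy-control feedback signal; the update rule and parameter names are illustrative assumptions, not the paper's exact algorithm:

```python
import random

def choose_objective(q_values, epsilon, rng):
    """Epsilon-greedy bandit over the two arms:
    arm 0 = minimize surprise, arm 1 = maximize surprise."""
    if rng.random() < epsilon:
        return rng.randrange(2)
    return max(range(2), key=lambda a: q_values[a])

def update_bandit(q_values, arm, feedback, lr=0.1):
    """feedback stands in for the entropy-control signal: how much the
    chosen objective let the agent move the environment's entropy away
    from its uncontrolled baseline."""
    q_values[arm] += lr * (feedback - q_values[arm])
```

In a high-entropy environment the minimizing arm tends to earn more control feedback, and in a low-entropy one the maximizing arm does, so the bandit adapts the objective to the regime it faces.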
-
|
Modeling habituation in infants and adults using rational curiosity over perceptual embeddings
(
Poster
)
>
link
From birth, human infants engage in intrinsically motivated, open-ended learning, mainly by deciding what to attend to and for how long. Yet, existing formal models of the drivers of looking are very limited in scope. To address this, we present a new version of the Rational Action, Noisy Choice for Habituation (RANCH) model. This version of RANCH is a stimulus-computable, rational learning model that decides how long to look at sequences of stimuli based on expected information gain (EIG). The model captures key patterns of looking time documented in the literature: habituation and dishabituation. We evaluate RANCH quantitatively using large datasets from adult and infant looking time experiments. We argue that looking time in our experiments is well described by RANCH, and that RANCH is a general, interpretable and modifiable framework for the rational analysis of intrinsically motivated learning by looking. |
Gal Raz · Anjie Cao · Rebecca Saxe · Michael C Frank 🔗 |
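A simplified stand-in for an EIG-driven looking rule (not RANCH itself) can be written for a single Bernoulli stimulus feature under a Beta belief: keep looking while one more look is expected to reduce predictive uncertainty by more than the cost of looking. As looks accumulate, EIG shrinks and looking stops, which is the habituation pattern:

```python
import math

def entropy(p):
    """Binary entropy in nats."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log(p) - (1 - p) * math.log(1 - p)

def eig_next_look(a, b):
    """Expected reduction in predictive entropy from one more look at a
    Bernoulli-feature stimulus under a Beta(a, b) belief (a simplified
    stand-in for an EIG computation)."""
    p = a / (a + b)
    h_now = entropy(p)
    h_after = (p * entropy((a + 1) / (a + b + 1))
               + (1 - p) * entropy(a / (a + b + 1)))
    return h_now - h_after

def looking_time(a, b, cost, max_looks=1000):
    """Number of looks before EIG drops below the cost of looking; the
    belief is advanced by its expected pseudo-observation each look."""
    looks = 0
    while looks < max_looks and eig_next_look(a, b) > cost:
        p = a / (a + b)
        a, b = a + p, b + (1.0 - p)
        looks += 1
    return looks
```

A familiar stimulus (high prior counts) yields short looks; resetting to a vague belief after a novel stimulus yields long looks again, i.e. dishabituation.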
-
|
Generating Human-Like Goals by Synthesizing Reward-Producing Programs
(
Poster
)
>
link
Humans show a remarkable capacity to generate novel goals, for learning and play alike, and modeling this human capacity would be a valuable step toward more generally-capable artificial agents. We describe a computational model for generating novel human-like goals represented in a domain-specific language (DSL). We learn a ‘human-likeness’ fitness function over expressions in this DSL from a small (<100 games) human dataset collected in an online experiment. We then use a Quality-Diversity (QD) approach to generate a variety of human-like games with different characteristics and high fitness. We demonstrate that our method can generate synthetic games that are syntactically coherent under the DSL and semantically sensible with respect to environmental objects and their affordances, yet distinct from human games in the training set. We discuss key components of our model and its current shortcomings, in the hope that this work helps inspire progress toward self-directed agents with human-like goals. |
Guy Davidson · Graham Todd · Todd Gureckis · Julian Togelius · Brenden Lake 🔗 |
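The core loop of a Quality-Diversity approach like the one described above can be sketched as a MAP-Elites-style archive update: each behavioral-descriptor cell keeps only its fittest candidate, so the archive stays both diverse and high-fitness. This is a generic QD sketch, not this paper's implementation:

```python
def qd_insert(archive, descriptor, candidate, fitness):
    """MAP-Elites-style step: archive maps a discretized behavioral
    descriptor to (candidate, fitness); a candidate replaces the cell's
    occupant only if it is fitter."""
    cell = archive.get(descriptor)
    if cell is None or fitness > cell[1]:
        archive[descriptor] = (candidate, fitness)
```

Here a candidate would be a DSL expression for a game, the descriptor its game characteristics, and the fitness the learned human-likeness score.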
-
|
Generative Intrinsic Optimization: Intrinsic Control with Model Learning
(
Poster
)
>
link
A future sequence represents the outcome of executing actions in the environment. When driven by the information-theoretic concept of mutual information, an agent seeks maximally informative consequences. Explicit outcomes may vary across states, returns, or trajectories, serving different purposes such as credit assignment or imitation learning. However, how to combine intrinsic motivation with reward maximization is often neglected. In this work, we propose a variational approach to jointly learn the quantity needed to estimate the mutual information and the dynamics model, providing a general framework for incorporating different forms of outcomes of interest. Integrated into a policy iteration scheme, our approach guarantees convergence to the optimal policy. While we mainly focus on theoretical analysis, our approach opens the possibility of leveraging intrinsic control with model learning to enhance sample efficiency and incorporate uncertainty about the environment into decision-making. |
Jianfei Ma 🔗 |
-
|
Learning Interpretable Libraries by Compressing and Documenting Code
(
Poster
)
>
link
While large language models (LLMs) now excel at code generation, a key aspect of software development is the art of refactoring: consolidating code into libraries of reusable and readable programs. In this paper, we introduce LILO, a neurosymbolic framework that iteratively synthesizes, compresses, and documents code to build libraries tailored to particular problem domains. LILO combines LLM-guided program synthesis with recent algorithmic advances in automated refactoring from Stitch: a symbolic compression system that efficiently identifies optimal lambda abstractions across large code corpora. To make these abstractions interpretable, we introduce an auto-documentation (AutoDoc) procedure that infers natural language names and docstrings based on contextual examples of usage. In addition to improving human readability, we find that AutoDoc boosts performance by helping LILO's synthesizer to interpret and deploy learned abstractions. We evaluate LILO on three inductive program synthesis benchmarks for string editing, scene reasoning, and graphics composition. Compared to existing neural and symbolic methods—including the state-of-the-art library learning algorithm DreamCoder—LILO solves more complex tasks and learns richer libraries that are grounded in linguistic knowledge. |
Gabriel Grand · Catherine Wong · Matthew Bowers · Theo X. Olausson · Muxin Liu · Josh Tenenbaum · Jacob Andreas 🔗 |