Timezone: »
Distributional reinforcement learning (RL) – in which agents learn about all the possible long-term consequences of their actions, and not just the expected value – is of great recent interest. One of the most important affordances of a distributional view is facilitating a modern, measured, approach to risk when outcomes are not completely certain. By contrast, psychological and neuroscientific investigations into decision making under risk have utilized a variety of more venerable theoretical models such as prospect theory that lack axiomatically desirable properties such as coherence. Here, we consider a particularly relevant risk measure for modeling human and animal planning, called conditional value-at-risk (CVaR), which quantifies worst-case outcomes (e.g., vehicle accidents or predation). We first adopt a conventional distributional approach to CVaR in a sequential setting and reanalyze the choices of human decision-makers in the well-known two-step task, revealing substantial risk aversion that had been lurking under stickiness and perseveration. We then consider a further critical property of risk sensitivity, namely time consistency, showing alternatives to this form of CVaR that enjoy this desirable characteristic. We use simulations to examine settings in which the various forms differ in ways that have implications for human and animal planning and behavior.
Author Information
Christopher Gagne (Max Planck Institute, Biological Cybernetics)
Peter Dayan (Max Planck Institute for Biological Cybernetics)
Related Events (a corresponding poster, oral, or spotlight)
-
2021 Spotlight: Two steps to risk sensitivity »
Dates n/a. Room
More from the Same Authors
-
2021 : Catastrophe, Compounding & Consistency in Choice »
Christopher Gagne · Peter Dayan -
2022 : A (dis-)information theory of revealed and unrevealed preferences »
Nitay Alon · Lion Schulz · Peter Dayan · Jeffrey S Rosenschein -
2022 : A (dis-)information theory of revealed and unrevealed preferences »
Nitay Alon · Lion Schulz · Peter Dayan · Jeffrey S Rosenschein -
2020 : Panel Discussions »
Grace Lindsay · George Konidaris · Shakir Mohamed · Kimberly Stachenfeld · Peter Dayan · Yael Niv · Doina Precup · Catherine Hartley · Ishita Dasgupta -
2020 Poster: A Local Temporal Difference Code for Distributional Reinforcement Learning »
Pablo Tano · Peter Dayan · Alexandre Pouget -
2019 Poster: Disentangled behavioural representations »
Amir Dezfouli · Hassan Ashtiani · Omar Ghattas · Richard Nock · Peter Dayan · Cheng Soon Ong