Timezone: »
Combining a multi-armed bandit task and Bayesian computational modeling, we find that humans systematically under-estimate reward availability in the environment. This apparent pessimism turns out to be an optimism bias in disguise, and one that compensates for other idiosyncrasies in human learning and decision-making under uncertainty, such as a default tendency to assume non-stationarity in environmental statistics as well as the adoption of a simplistic decision policy. In particular, reward rate underestimation discourages the decision-maker from switching away from a “good” option, thus achieving near-optimal behavior (which never switches away after a win). Furthermore, we demonstrate that the Bayesian model that best predicts human behavior is equivalent to a particular form of Q-learning often used in the brain sciences, thus providing statistical, normative grounding to phenomenological models of human and animal behavior.
Author Information
Angela Yu (UC San Diego)
More from the Same Authors
-
2021 : Panel I: Human decisions »
Jennifer Trueblood · Alex Peysakhovich · Angela Yu · Ori Plonsky · Tal Yarkoni · Daniel Bjorkegren -
2019 : Panel Discussion led by Grace Lindsay »
Grace Lindsay · Blake Richards · Doina Precup · Jacqueline Gottlieb · Jeff Clune · Jane Wang · Richard Sutton · Angela Yu · Ida Momennejad -
2018 Poster: Why so gloomy? A Bayesian explanation of human pessimism bias in the multi-armed bandit task »
Dalin Guo · Angela Yu -
2018 Poster: Demystifying excessively volatile human learning: A Bayesian persistent prior and a neural approximation »
Chaitanya Ryali · Gautam Reddy · Angela Yu -
2018 Poster: Beauty-in-averageness and its contextual modulations: A Bayesian statistical account »
Chaitanya Ryali · Angela Yu -
2017 : Computational modeling of human face processing »
Angela Yu -
2017 : Workshop overview »
Michael Mozer · Angela Yu · Brenden Lake -
2017 Workshop: Cognitively Informed Artificial Intelligence: Insights From Natural Intelligence »
Michael Mozer · Brenden Lake · Angela Yu -
2013 Poster: Forgetful Bayes and myopic planning: Human learning and decision-making in a bandit setting »
Shunan Zhang · Angela Yu -
2013 Poster: Context-sensitive active sensing in humans »
Sheeraz Ahmad · He Huang · Angela Yu -
2012 Poster: Strategic Impatience in Go/NoGo versus Forced-Choice Decision-Making »
Pradeep Shenoy · Angela Yu -
2012 Oral: Strategic Impatience in Go/NoGo versus Forced-Choice Decision-Making »
Pradeep Shenoy · Angela Yu -
2010 Oral: A rational decision making framework for inhibitory control »
Pradeep Shenoy · Rajesh PN Rao · Angela Yu -
2010 Poster: A rational decision making framework for inhibitory control »
Pradeep Shenoy · Rajesh PN Rao · Angela Yu -
2008 Poster: Sequential effects: Superstition or rational behavior? »
Angela Yu · Jonathan D Cohen -
2008 Spotlight: Sequential effects: Superstition or rational behavior? »
Angela Yu · Jonathan D Cohen -
2007 Spotlight: Sequential Hypothesis Testing under Stochastic Deadlines »
Peter Frazier · Angela Yu -
2007 Poster: Sequential Hypothesis Testing under Stochastic Deadlines »
Peter Frazier · Angela Yu -
2006 Poster: Optimal Change-Detection and Spiking Neurons »
Angela Yu