Pigeon and human performance in a multi-armed bandit task in response to changes in variable interval schedules |
| |
Authors: | Racey Deborah Young Michael E Garlick Dennis Pham Jennifer Ngoc-Minh Blaisdell Aaron P |
| |
Institution: | (1) Western Carolina University, Cullowhee, NC, USA;(2) Southern Illinois University at Carbondale, Carbondale, IL, USA;(3) University of California at Los Angeles, Los Angeles, CA, USA; |
| |
Abstract: | The tension between exploitation of the best options and exploration of alternatives is a ubiquitous problem that all organisms
face. To examine this trade-off across species, pigeons and people were trained on an eight-armed bandit task in which the
options were rewarded on a variable interval (VI) schedule. At regular intervals, each option’s VI changed, thus encouraging
dynamic increases in exploration in response to these anticipated changes. Both species showed sensitivity to the payoffs
that was often well modeled by Luce’s (1963) decision rule. For pigeons, exploration of alternative options was driven by experienced changes in the payoff schedules,
not the beginning of a new session, even though each session signaled a new schedule. In contrast, people quickly learned
to explore in response to signaled changes in the payoffs. |
| |
Keywords: | |
本文献已被 PubMed SpringerLink 等数据库收录! |
|