Choice-confirmation bias and gradual perseveration in human reinforcement learning.

IF 1.6 4区 医学 Q3 BEHAVIORAL SCIENCES Behavioral neuroscience Pub Date : 2023-02-01 Epub Date: 2022-11-17 DOI:10.1037/bne0000541
Stefano Palminteri
{"title":"Choice-confirmation bias and gradual perseveration in human reinforcement learning.","authors":"Stefano Palminteri","doi":"10.1037/bne0000541","DOIUrl":null,"url":null,"abstract":"<p><p>Do we preferentially learn from outcomes that confirm our choices? In recent years, we investigated this question in a series of studies implementing increasingly complex behavioral protocols. The learning rates fitted in experiments featuring partial or complete feedback, as well as free and forced choices, were systematically found to be consistent with a choice-confirmation bias. One of the prominent behavioral consequences of the confirmatory learning rate pattern is choice hysteresis: that is, the tendency of repeating previous choices, despite contradictory evidence. However, choice-confirmatory pattern of learning rates may spuriously arise from not taking into consideration an explicit choice (gradual) perseveration term in the model. In the present study, we reanalyze data from four published papers (nine experiments; 363 subjects; 126,192 trials), originally included in the studies demonstrating or criticizing the choice-confirmation bias in human participants. We fitted two models: one featured valence-specific updates (i.e., different learning rates for confirmatory and disconfirmatory outcomes) and one additionally including gradual perseveration. Our analysis confirms that the inclusion of the gradual perseveration process in the model significantly reduces the estimated choice-confirmation bias. However, in all considered experiments, the choice-confirmation bias remains present at the meta-analytical level, and significantly different from zero in most experiments. Our results demonstrate that the choice-confirmation bias resists the inclusion of a gradual perseveration term, thus proving to be a robust feature of human reinforcement learning. We conclude by pointing to additional computational processes that may play an important role in estimating and interpreting the computational biases under scrutiny. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":8739,"journal":{"name":"Behavioral neuroscience","volume":"137 1","pages":"78-88"},"PeriodicalIF":1.6000,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavioral neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1037/bne0000541","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/11/17 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Do we preferentially learn from outcomes that confirm our choices? In recent years, we investigated this question in a series of studies implementing increasingly complex behavioral protocols. The learning rates fitted in experiments featuring partial or complete feedback, as well as free and forced choices, were systematically found to be consistent with a choice-confirmation bias. One of the prominent behavioral consequences of the confirmatory learning rate pattern is choice hysteresis: that is, the tendency of repeating previous choices, despite contradictory evidence. However, choice-confirmatory pattern of learning rates may spuriously arise from not taking into consideration an explicit choice (gradual) perseveration term in the model. In the present study, we reanalyze data from four published papers (nine experiments; 363 subjects; 126,192 trials), originally included in the studies demonstrating or criticizing the choice-confirmation bias in human participants. We fitted two models: one featured valence-specific updates (i.e., different learning rates for confirmatory and disconfirmatory outcomes) and one additionally including gradual perseveration. Our analysis confirms that the inclusion of the gradual perseveration process in the model significantly reduces the estimated choice-confirmation bias. However, in all considered experiments, the choice-confirmation bias remains present at the meta-analytical level, and significantly different from zero in most experiments. Our results demonstrate that the choice-confirmation bias resists the inclusion of a gradual perseveration term, thus proving to be a robust feature of human reinforcement learning. We conclude by pointing to additional computational processes that may play an important role in estimating and interpreting the computational biases under scrutiny. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
人类强化学习中的选择确认偏差和逐步坚持。
我们是否优先从确认我们选择的结果中学习?近年来,我们在一系列实施越来越复杂的行为协议的研究中调查了这个问题。系统地发现,以部分或完全反馈以及自由和强迫选择为特征的实验中的学习率与选择确认偏差一致。验证性学习率模式的一个突出行为后果是选择滞后:即,尽管有相互矛盾的证据,但仍有重复先前选择的倾向。然而,学习率的选择确认模式可能是由于在模型中没有考虑明确的选择(渐进)持续术语而产生的。在本研究中,我们重新分析了四篇已发表论文(9个实验;363名受试者;126192项试验)的数据,这些论文最初包括在证明或批评人类参与者的选择确认偏见的研究中。我们拟合了两个模型:一个模型以效价特异性更新为特征(即确认和非确认结果的不同学习率),另一个模型还包括逐渐坚持。我们的分析证实,在模型中加入逐步坚持过程显著降低了估计的选择确认偏差。然而,在所有考虑的实验中,选择确认偏差仍然存在于荟萃分析水平,并且在大多数实验中与零显著不同。我们的研究结果表明,选择确认偏差抵制了逐渐坚持项的加入,因此被证明是人类强化学习的一个强大特征。最后,我们指出了额外的计算过程,这些过程可能在估计和解释仔细审查的计算偏差方面发挥重要作用。(PsycInfo数据库记录(c)2023 APA,保留所有权利)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Behavioral neuroscience
Behavioral neuroscience 医学-行为科学
CiteScore
3.40
自引率
0.00%
发文量
51
审稿时长
6-12 weeks
期刊介绍: Behavioral Neuroscience publishes original research articles as well as reviews in the broad field of the neural bases of behavior.
期刊最新文献
Methylphenidate differentially affects the social ultrasonic vocalizations of wild-type and prodromal Parkinsonian rats. Slight and hidden hearing loss in young rats is associated with impaired recognition memory and reduced myelination in the corpus callosum. Renewal of conditioned fear in male and female rats. N-tert-butoxycarbonyl-methylenedioxymethamphetamine, an methylenedioxymethamphetamine derivative, exhibits rewarding and reinforcing effects by increasing dopamine levels. Sex differences in behavior and glutamic acid decarboxylase in Long Evans rats after prolonged social isolation beginning in adolescence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1