Decomposition of Reinforcement Learning Deficits in Disordered Gambling via Drift Diffusion Modeling and Functional Magnetic Resonance Imaging.

Computational psychiatry (Cambridge, Mass.) Pub Date : 2024-03-20 eCollection Date: 2024-01-01 DOI:10.5334/cpsy.104
Antonius Wiehler, Jan Peters
{"title":"Decomposition of Reinforcement Learning Deficits in Disordered Gambling via Drift Diffusion Modeling and Functional Magnetic Resonance Imaging.","authors":"Antonius Wiehler, Jan Peters","doi":"10.5334/cpsy.104","DOIUrl":null,"url":null,"abstract":"<p><p>Gambling disorder is associated with deficits in reward-based learning, but the underlying computational mechanisms are still poorly understood. Here, we examined this issue using a stationary reinforcement learning task in combination with computational modeling and functional resonance imaging (fMRI) in individuals that regular participate in gambling (n = 23, seven fulfilled one to three DSM 5 criteria for gambling disorder, sixteen fulfilled four or more) and matched controls (n = 23). As predicted, the gambling group exhibited substantially reduced accuracy, whereas overall response times (RTs) were not reliably different between groups. We then used comprehensive modeling using reinforcement learning drift diffusion models (RLDDMs) in combination with hierarchical Bayesian parameter estimation to shed light on the computational underpinnings of this performance deficit. In both groups, an RLDDM in which both non-decision time and decision threshold (boundary separation) changed over the course of the experiment accounted for the data best. The model showed good parameter and model recovery, and posterior predictive checks revealed that, in both groups, the model accurately reproduced the evolution of accuracies and RTs over time. Modeling revealed that, compared to controls, the learning impairment in the gambling group was linked to a more rapid reduction in decision thresholds over time, and a reduced impact of value-differences on the drift rate. The gambling group also showed shorter non-decision times. FMRI analyses replicated effects of prediction error coding in the ventral striatum and value coding in the ventro-medial prefrontal cortex, but there was no credible evidence for group differences in these effects. Taken together, our findings show that reinforcement learning impairments in disordered gambling are linked to both maladaptive decision threshold adjustments and a reduced consideration of option values in the choice process.</p>","PeriodicalId":72664,"journal":{"name":"Computational psychiatry (Cambridge, Mass.)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11104325/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational psychiatry (Cambridge, Mass.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5334/cpsy.104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Gambling disorder is associated with deficits in reward-based learning, but the underlying computational mechanisms are still poorly understood. Here, we examined this issue using a stationary reinforcement learning task in combination with computational modeling and functional resonance imaging (fMRI) in individuals that regular participate in gambling (n = 23, seven fulfilled one to three DSM 5 criteria for gambling disorder, sixteen fulfilled four or more) and matched controls (n = 23). As predicted, the gambling group exhibited substantially reduced accuracy, whereas overall response times (RTs) were not reliably different between groups. We then used comprehensive modeling using reinforcement learning drift diffusion models (RLDDMs) in combination with hierarchical Bayesian parameter estimation to shed light on the computational underpinnings of this performance deficit. In both groups, an RLDDM in which both non-decision time and decision threshold (boundary separation) changed over the course of the experiment accounted for the data best. The model showed good parameter and model recovery, and posterior predictive checks revealed that, in both groups, the model accurately reproduced the evolution of accuracies and RTs over time. Modeling revealed that, compared to controls, the learning impairment in the gambling group was linked to a more rapid reduction in decision thresholds over time, and a reduced impact of value-differences on the drift rate. The gambling group also showed shorter non-decision times. FMRI analyses replicated effects of prediction error coding in the ventral striatum and value coding in the ventro-medial prefrontal cortex, but there was no credible evidence for group differences in these effects. Taken together, our findings show that reinforcement learning impairments in disordered gambling are linked to both maladaptive decision threshold adjustments and a reduced consideration of option values in the choice process.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过漂移扩散建模和功能磁共振成像分解赌博障碍中的强化学习缺陷。
赌博障碍与基于奖赏的学习缺陷有关,但人们对其背后的计算机制仍知之甚少。在这里,我们使用静态强化学习任务,结合计算建模和功能共振成像(fMRI),对经常参与赌博的人(n = 23,7 人符合 1 到 3 项 DSM 5 赌博障碍标准,16 人符合 4 项或更多标准)和匹配的对照组(n = 23)进行了研究。正如预测的那样,赌博组的准确性大大降低,而总体反应时间(RTs)在组间并无可靠差异。随后,我们利用强化学习漂移扩散模型(RLDDM)结合分层贝叶斯参数估计法进行了综合建模,以揭示这种成绩缺陷的计算基础。在两组实验中,非决策时间和决策阈值(边界分离)在实验过程中均发生变化的 RLDDM 对数据的解释最为准确。该模型显示出良好的参数和模型恢复能力,后验预测检查显示,在两组中,该模型都准确地再现了准确率和实时时间随时间的变化。建模结果表明,与对照组相比,赌博组的学习障碍与决策阈值随时间的推移下降更快以及价值差异对漂移率的影响减小有关。赌博组的非决策时间也更短。核磁共振成像分析复制了腹侧纹状体的预测错误编码和腹内侧前额叶皮层的价值编码效应,但没有可信的证据表明这些效应存在群体差异。综上所述,我们的研究结果表明,无序赌博中的强化学习障碍与适应不良的决策阈值调整和选择过程中对选项价值的考虑减少有关。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.30
自引率
0.00%
发文量
0
审稿时长
17 weeks
期刊最新文献
(Mal)adaptive Mentalizing in the Cognitive Hierarchy, and Its Link to Paranoia. Decision-Making, Pro-variance Biases and Mood-Related Traits. Enhancing Within-Person Estimation of Neurocognition and the Prediction of Externalizing Behaviors in Adolescents. Updating Prospective Self-Efficacy Beliefs About Cardiac Interoception in Anorexia Nervosa: An Experimental and Computational Study. Temporal Dynamics of Uncertainty Cause Anxiety and Avoidance.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1