Model based control can give rise to devaluation insensitive choice

Neil Garrett , Sean Allan , Nathaniel D. Daw
{"title":"Model based control can give rise to devaluation insensitive choice","authors":"Neil Garrett ,&nbsp;Sean Allan ,&nbsp;Nathaniel D. Daw","doi":"10.1016/j.addicn.2023.100070","DOIUrl":null,"url":null,"abstract":"<div><p>Influential recent work aims to ground psychiatric dysfunction in the brain's basic computational mechanisms. For instance, the compulsive symptoms that feature prominently in drug abuse and addiction have been argued to arise from over reliance on a habitual “model-free” system in contrast to a more laborious “model-based” system. Support for this account comes in part from failures to appropriately change behavior in light of new events. Notably, instrumental responding can, in some circumstances, persist despite reinforcer devaluation, perhaps reflecting control by model-free mechanisms that are driven by past reinforcement rather than knowledge of the (now devalued) outcome. However, another line of theory posits a different mechanism – latent causal inference – that can modulate behavioral change. It concerns how animals identify different contingencies that apply in different circumstances, by covertly clustering experiences into distinct groups. Here we combine both lines of theory to investigate the consequences of latent cause inference on instrumental sensitivity to reinforcer devaluation. We show that instrumental insensitivity to reinforcer devaluation can arise in this theory even using only model-based planning, and does not require or imply any habitual, model-free component. These ersatz habits (like laboratory ones) emerge after overtraining, interact with contextual cues, and show preserved sensitivity to reinforcer devaluation on a separate consumption test, a standard control. Together, this work highlights the need for caution in using reinforcer devaluation procedures to rule in (or out) the contribution of different learning mechanisms and offers a new perspective on the neurocomputational substrates of drug abuse.</p></div>","PeriodicalId":72067,"journal":{"name":"Addiction neuroscience","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Addiction neuroscience","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S277239252300010X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Influential recent work aims to ground psychiatric dysfunction in the brain's basic computational mechanisms. For instance, the compulsive symptoms that feature prominently in drug abuse and addiction have been argued to arise from over reliance on a habitual “model-free” system in contrast to a more laborious “model-based” system. Support for this account comes in part from failures to appropriately change behavior in light of new events. Notably, instrumental responding can, in some circumstances, persist despite reinforcer devaluation, perhaps reflecting control by model-free mechanisms that are driven by past reinforcement rather than knowledge of the (now devalued) outcome. However, another line of theory posits a different mechanism – latent causal inference – that can modulate behavioral change. It concerns how animals identify different contingencies that apply in different circumstances, by covertly clustering experiences into distinct groups. Here we combine both lines of theory to investigate the consequences of latent cause inference on instrumental sensitivity to reinforcer devaluation. We show that instrumental insensitivity to reinforcer devaluation can arise in this theory even using only model-based planning, and does not require or imply any habitual, model-free component. These ersatz habits (like laboratory ones) emerge after overtraining, interact with contextual cues, and show preserved sensitivity to reinforcer devaluation on a separate consumption test, a standard control. Together, this work highlights the need for caution in using reinforcer devaluation procedures to rule in (or out) the contribution of different learning mechanisms and offers a new perspective on the neurocomputational substrates of drug abuse.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于模型的控制可能导致对贬值不敏感的选择
最近有影响力的工作旨在将精神功能障碍建立在大脑的基本计算机制中。例如,药物滥用和成瘾中突出的强迫症状被认为是由于过度依赖习惯性的“无模型”系统,而不是更费力的“基于模型”系统。对该帐户的支持部分来自于未能根据新事件适当更改行为。值得注意的是,在某些情况下,尽管强化货币贬值,但工具性反应可能会持续存在,这可能反映了由过去强化而非对(现在贬值)结果的了解驱动的无模型机制的控制。然而,另一条理论线提出了一种不同的机制——潜在的因果推断——可以调节行为变化。它关注的是动物如何通过将经验秘密地聚集到不同的群体中,来识别适用于不同环境的不同突发事件。在这里,我们将两种理论结合起来,研究潜在原因推断对工具对强化贬值敏感性的影响。我们表明,即使只使用基于模型的规划,也可能在该理论中产生对强化货币贬值的工具不敏感,并且不需要或暗示任何习惯的、无模型的组件。这些伪习惯(如实验室习惯)在过度训练后出现,与上下文线索相互作用,并在单独的消费测试(标准对照)中显示出对强化贬值的敏感性。总之,这项工作强调了在使用强化物贬值程序来排除(或排除)不同学习机制的贡献时需要谨慎,并为药物滥用的神经计算基础提供了一个新的视角。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Addiction neuroscience
Addiction neuroscience Neuroscience (General)
CiteScore
1.30
自引率
0.00%
发文量
0
审稿时长
118 days
期刊最新文献
Opioid drug seeking after early-life adversity: a role for delta opioid receptors Contents Editorial Board Corrigendum to “Xylazine is an agonist at kappa opioid receptors and exhibits sex-specific responses to opioid antagonism” [Addiction Neuroscience, Volume 11, June 2024, 100155] Neurokinin-1 receptors in the nucleus accumbens shell influence sensitivity to social defeat stress and stress-induced alcohol consumption in male mice
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1