多维奖励环境中注意力对学习的贡献。

IF 4.4 2区 医学 Q1 NEUROSCIENCES Journal of Neuroscience Pub Date : 2025-02-12 DOI:10.1523/JNEUROSCI.2300-23.2024
Michael Chong Wang, Alireza Soltani
{"title":"多维奖励环境中注意力对学习的贡献。","authors":"Michael Chong Wang, Alireza Soltani","doi":"10.1523/JNEUROSCI.2300-23.2024","DOIUrl":null,"url":null,"abstract":"<p><p>Real-world choice options have many features or attributes, whereas the reward outcome from those options only depends on a few features or attributes. It has been shown that humans learn and combine feature-based with more complex conjunction-based learning to tackle challenges of learning in naturalistic reward environments. However, it remains unclear how different learning strategies interact to determine what features or conjunctions should be attended to and control choice behavior, and how subsequent attentional modulations influence future learning and choice. To address these questions, we examined the behavior of male and female human participants during a three-dimensional learning task in which reward outcomes for different stimuli could be predicted based on a combination of an informative feature and conjunction. Using multiple approaches, we found that both choice behavior and reward probabilities estimated by participants were most accurately described by attention-modulated models that learned the predictive values of both the informative feature and the informative conjunction. Specifically, in the reinforcement learning model that best fit choice data, attention was controlled by the difference in the integrated feature and conjunction values. The resulting attention weights modulated learning by increasing the learning rate on attended features and conjunctions. Critically, modulating decision-making by attention weights did not improve the fit of data, providing little evidence for direct attentional effects on choice. These results suggest that in multidimensional environments, humans direct their attention not only to selectively process reward-predictive attributes but also to find parsimonious representations of the reward contingencies for more efficient learning.</p>","PeriodicalId":50114,"journal":{"name":"Journal of Neuroscience","volume":" ","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823339/pdf/","citationCount":"0","resultStr":"{\"title\":\"Contributions of Attention to Learning in Multidimensional Reward Environments.\",\"authors\":\"Michael Chong Wang, Alireza Soltani\",\"doi\":\"10.1523/JNEUROSCI.2300-23.2024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Real-world choice options have many features or attributes, whereas the reward outcome from those options only depends on a few features or attributes. It has been shown that humans learn and combine feature-based with more complex conjunction-based learning to tackle challenges of learning in naturalistic reward environments. However, it remains unclear how different learning strategies interact to determine what features or conjunctions should be attended to and control choice behavior, and how subsequent attentional modulations influence future learning and choice. To address these questions, we examined the behavior of male and female human participants during a three-dimensional learning task in which reward outcomes for different stimuli could be predicted based on a combination of an informative feature and conjunction. Using multiple approaches, we found that both choice behavior and reward probabilities estimated by participants were most accurately described by attention-modulated models that learned the predictive values of both the informative feature and the informative conjunction. Specifically, in the reinforcement learning model that best fit choice data, attention was controlled by the difference in the integrated feature and conjunction values. The resulting attention weights modulated learning by increasing the learning rate on attended features and conjunctions. Critically, modulating decision-making by attention weights did not improve the fit of data, providing little evidence for direct attentional effects on choice. These results suggest that in multidimensional environments, humans direct their attention not only to selectively process reward-predictive attributes but also to find parsimonious representations of the reward contingencies for more efficient learning.</p>\",\"PeriodicalId\":50114,\"journal\":{\"name\":\"Journal of Neuroscience\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-02-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823339/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Neuroscience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1523/JNEUROSCI.2300-23.2024\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1523/JNEUROSCI.2300-23.2024","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

现实世界的选择选项具有许多特征或属性,而这些选项的奖励结果仅取决于少数特征或属性。研究表明,人类通过学习并将基于特征的学习与更复杂的基于连接的学习结合起来,来应对在自然奖励环境中学习的挑战。然而,目前尚不清楚不同的学习策略如何相互作用,以决定应该注意哪些特征或连词并控制选择行为,以及随后的注意调节如何影响未来的学习和选择。为了解决这些问题,我们研究了男性和女性参与者在三维学习任务中的行为,其中不同刺激的奖励结果可以基于信息特征和连接的组合来预测。使用多种方法,我们发现参与者估计的选择行为和奖励概率都是由学习了信息特征和信息连接的预测值的注意力调节模型最准确地描述的。具体来说,在最适合选择数据的强化学习模型中,注意力被综合特征值和连接值的差异所控制。由此产生的注意权重通过提高被关注特征和连词的学习率来调节学习。关键的是,通过注意权重调节决策并没有改善数据的拟合,这为直接注意对选择的影响提供了很少的证据。这些结果表明,在多维环境中,人类不仅会引导他们的注意力选择性地处理奖励预测属性,而且还会找到奖励偶然事件的精简表示,以获得更有效的学习。从尝试奇异的食谱到结交新的社会群体,现实生活中的行为结果取决于许多因素,但我们如何根据收到的反馈了解这些因素的预测价值呢?研究表明,人类通过专注于最能预测结果的单个特征来简化这个问题,但在必要时可以扩展他们的学习策略,包括特征的组合。在这里,我们研究了多维奖励环境中注意和学习之间的相互作用,该环境要求学习个体特征及其连词。使用多种方法,我们发现特征和连词的学习以合作的方式控制注意力,随后的注意调节主要影响未来的学习而不是决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Contributions of Attention to Learning in Multidimensional Reward Environments.

Real-world choice options have many features or attributes, whereas the reward outcome from those options only depends on a few features or attributes. It has been shown that humans learn and combine feature-based with more complex conjunction-based learning to tackle challenges of learning in naturalistic reward environments. However, it remains unclear how different learning strategies interact to determine what features or conjunctions should be attended to and control choice behavior, and how subsequent attentional modulations influence future learning and choice. To address these questions, we examined the behavior of male and female human participants during a three-dimensional learning task in which reward outcomes for different stimuli could be predicted based on a combination of an informative feature and conjunction. Using multiple approaches, we found that both choice behavior and reward probabilities estimated by participants were most accurately described by attention-modulated models that learned the predictive values of both the informative feature and the informative conjunction. Specifically, in the reinforcement learning model that best fit choice data, attention was controlled by the difference in the integrated feature and conjunction values. The resulting attention weights modulated learning by increasing the learning rate on attended features and conjunctions. Critically, modulating decision-making by attention weights did not improve the fit of data, providing little evidence for direct attentional effects on choice. These results suggest that in multidimensional environments, humans direct their attention not only to selectively process reward-predictive attributes but also to find parsimonious representations of the reward contingencies for more efficient learning.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Neuroscience
Journal of Neuroscience 医学-神经科学
CiteScore
9.30
自引率
3.80%
发文量
1164
审稿时长
12 months
期刊介绍: JNeurosci (ISSN 0270-6474) is an official journal of the Society for Neuroscience. It is published weekly by the Society, fifty weeks a year, one volume a year. JNeurosci publishes papers on a broad range of topics of general interest to those working on the nervous system. Authors now have an Open Choice option for their published articles
期刊最新文献
Characterizing Brain-Cardiovascular Aging Using Multiorgan Imaging and Machine Learning Distinct patterns of PV and SST GABAergic neuronal activity in the basal forebrain during olfactory-guided behavior in mice. Neurofibromin deficiency alters the patterning and prioritization of motor behaviors in a state-dependent manner. Rab27b promotes lysosomal function and alpha-synuclein clearance in neurons. Sleep modulates neural timescales and spatiotemporal integration in the human cortex.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1