沉思vs.直觉:强化学习视角

IF 2.3 Q3 MANAGEMENT EURO Journal on Decision Processes Pub Date : 2017-11-01 DOI:10.1007/s40070-017-0068-x
In-Koo Cho , Anna Rubinchik
{"title":"沉思vs.直觉:强化学习视角","authors":"In-Koo Cho ,&nbsp;Anna Rubinchik","doi":"10.1007/s40070-017-0068-x","DOIUrl":null,"url":null,"abstract":"<div><p>In a search for a positive model of decision-making with observable primitives, we rely on the burgeoning literature in cognitive neuroscience to construct a three-element machine (agent). Its control unit initiates either impulsive or cognitive elements to solve a problem in a stationary Markov environment, the element chosen depends on whether the problem is mundane or novel, memory of past successes, and the strength of inhibition. Our predictions are based on a stationary asymptotic distribution of the memory, which, depending on the parameters, can generate different “characters”, e.g., an <em>uptight dimwit</em>, who could succeed more often with less inhibition, as well as a <em>laid-back wise-guy</em>, who could gain more with a stronger inhibition of impulsive (intuitive) responses. As one would expect, stronger inhibition and lower cognitive costs increase the frequency of decisions made by the cognitive element. More surprisingly, increasing the “carrot” and reducing the “stick” (being in a more supportive environment) enhance contemplative decisions (made by the cognitive unit) for an alert agent, i.e., the one who identifies novel problems frequently enough.</p></div>","PeriodicalId":44104,"journal":{"name":"EURO Journal on Decision Processes","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s40070-017-0068-x","citationCount":"2","resultStr":"{\"title\":\"Contemplation vs. intuition: a reinforcement learning perspective\",\"authors\":\"In-Koo Cho ,&nbsp;Anna Rubinchik\",\"doi\":\"10.1007/s40070-017-0068-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In a search for a positive model of decision-making with observable primitives, we rely on the burgeoning literature in cognitive neuroscience to construct a three-element machine (agent). Its control unit initiates either impulsive or cognitive elements to solve a problem in a stationary Markov environment, the element chosen depends on whether the problem is mundane or novel, memory of past successes, and the strength of inhibition. Our predictions are based on a stationary asymptotic distribution of the memory, which, depending on the parameters, can generate different “characters”, e.g., an <em>uptight dimwit</em>, who could succeed more often with less inhibition, as well as a <em>laid-back wise-guy</em>, who could gain more with a stronger inhibition of impulsive (intuitive) responses. As one would expect, stronger inhibition and lower cognitive costs increase the frequency of decisions made by the cognitive element. More surprisingly, increasing the “carrot” and reducing the “stick” (being in a more supportive environment) enhance contemplative decisions (made by the cognitive unit) for an alert agent, i.e., the one who identifies novel problems frequently enough.</p></div>\",\"PeriodicalId\":44104,\"journal\":{\"name\":\"EURO Journal on Decision Processes\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1007/s40070-017-0068-x\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EURO Journal on Decision Processes\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2193943821000753\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MANAGEMENT\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EURO Journal on Decision Processes","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2193943821000753","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MANAGEMENT","Score":null,"Total":0}
引用次数: 2

摘要

为了寻找具有可观察原语的积极决策模型,我们依靠认知神经科学中新兴的文献来构建一个三要素机器(agent)。它的控制单元启动冲动或认知元素来解决固定马尔可夫环境中的问题,所选择的元素取决于问题是平凡的还是新奇的,过去成功的记忆,以及抑制的强度。我们的预测是基于记忆的平稳渐近分布,根据参数的不同,它可以产生不同的“特征”,例如,一个紧张的笨蛋,他可以通过更少的抑制更经常地成功,以及一个悠闲的聪明人,他可以通过更强的抑制冲动(直觉)反应获得更多。正如人们所预料的那样,更强的抑制和更低的认知成本增加了认知因素做出决策的频率。更令人惊讶的是,增加“胡萝卜”和减少“大棒”(在一个更支持性的环境中)可以增强警觉代理(即经常发现新问题的代理)的深思熟虑决策(由认知单元做出)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Contemplation vs. intuition: a reinforcement learning perspective

In a search for a positive model of decision-making with observable primitives, we rely on the burgeoning literature in cognitive neuroscience to construct a three-element machine (agent). Its control unit initiates either impulsive or cognitive elements to solve a problem in a stationary Markov environment, the element chosen depends on whether the problem is mundane or novel, memory of past successes, and the strength of inhibition. Our predictions are based on a stationary asymptotic distribution of the memory, which, depending on the parameters, can generate different “characters”, e.g., an uptight dimwit, who could succeed more often with less inhibition, as well as a laid-back wise-guy, who could gain more with a stronger inhibition of impulsive (intuitive) responses. As one would expect, stronger inhibition and lower cognitive costs increase the frequency of decisions made by the cognitive element. More surprisingly, increasing the “carrot” and reducing the “stick” (being in a more supportive environment) enhance contemplative decisions (made by the cognitive unit) for an alert agent, i.e., the one who identifies novel problems frequently enough.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.70
自引率
10.00%
发文量
15
期刊最新文献
Editorial: Feature Issue on Fair and Explainable Decision Support Systems Editorial: Feature issue on fair and explainable decision support systems Corrigendum to “Multi-objective optimization in real-time operation of rainwater harvesting systems” [EURO Journal on Decision Processes Volume 11 (2023) 100039] Multiobjective combinatorial optimization with interactive evolutionary algorithms: The case of facility location problems Performance assessment of waste sorting: Component-based approach to incorporate quality into data envelopment analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1