沉思vs.直觉:强化学习视角

IF 3 Q3 MANAGEMENT EURO Journal on Decision Processes Pub Date : 2017-11-01 Epub Date: 2017-08-19 DOI:10.1007/s40070-017-0068-x

In-Koo Cho , Anna Rubinchik

{"title":"沉思vs.直觉:强化学习视角","authors":"In-Koo Cho , Anna Rubinchik","doi":"10.1007/s40070-017-0068-x","DOIUrl":null,"url":null,"abstract":"<div><p>In a search for a positive model of decision-making with observable primitives, we rely on the burgeoning literature in cognitive neuroscience to construct a three-element machine (agent). Its control unit initiates either impulsive or cognitive elements to solve a problem in a stationary Markov environment, the element chosen depends on whether the problem is mundane or novel, memory of past successes, and the strength of inhibition. Our predictions are based on a stationary asymptotic distribution of the memory, which, depending on the parameters, can generate different “characters”, e.g., an <em>uptight dimwit</em>, who could succeed more often with less inhibition, as well as a <em>laid-back wise-guy</em>, who could gain more with a stronger inhibition of impulsive (intuitive) responses. As one would expect, stronger inhibition and lower cognitive costs increase the frequency of decisions made by the cognitive element. More surprisingly, increasing the “carrot” and reducing the “stick” (being in a more supportive environment) enhance contemplative decisions (made by the cognitive unit) for an alert agent, i.e., the one who identifies novel problems frequently enough.</p></div>","PeriodicalId":44104,"journal":{"name":"EURO Journal on Decision Processes","volume":"5 1","pages":"Pages 141-167"},"PeriodicalIF":3.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s40070-017-0068-x","citationCount":"2","resultStr":"{\"title\":\"Contemplation vs. intuition: a reinforcement learning perspective\",\"authors\":\"In-Koo Cho , Anna Rubinchik\",\"doi\":\"10.1007/s40070-017-0068-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In a search for a positive model of decision-making with observable primitives, we rely on the burgeoning literature in cognitive neuroscience to construct a three-element machine (agent). Its control unit initiates either impulsive or cognitive elements to solve a problem in a stationary Markov environment, the element chosen depends on whether the problem is mundane or novel, memory of past successes, and the strength of inhibition. Our predictions are based on a stationary asymptotic distribution of the memory, which, depending on the parameters, can generate different “characters”, e.g., an <em>uptight dimwit</em>, who could succeed more often with less inhibition, as well as a <em>laid-back wise-guy</em>, who could gain more with a stronger inhibition of impulsive (intuitive) responses. As one would expect, stronger inhibition and lower cognitive costs increase the frequency of decisions made by the cognitive element. More surprisingly, increasing the “carrot” and reducing the “stick” (being in a more supportive environment) enhance contemplative decisions (made by the cognitive unit) for an alert agent, i.e., the one who identifies novel problems frequently enough.</p></div>\",\"PeriodicalId\":44104,\"journal\":{\"name\":\"EURO Journal on Decision Processes\",\"volume\":\"5 1\",\"pages\":\"Pages 141-167\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1007/s40070-017-0068-x\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EURO Journal on Decision Processes\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2193943821000753\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2017/8/19 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"MANAGEMENT\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EURO Journal on Decision Processes","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2193943821000753","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2017/8/19 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MANAGEMENT","Score":null,"Total":0}

引用次数: 2

摘要

为了寻找具有可观察原语的积极决策模型，我们依靠认知神经科学中新兴的文献来构建一个三要素机器(agent)。它的控制单元启动冲动或认知元素来解决固定马尔可夫环境中的问题，所选择的元素取决于问题是平凡的还是新奇的，过去成功的记忆，以及抑制的强度。我们的预测是基于记忆的平稳渐近分布，根据参数的不同，它可以产生不同的“特征”，例如，一个紧张的笨蛋，他可以通过更少的抑制更经常地成功，以及一个悠闲的聪明人，他可以通过更强的抑制冲动(直觉)反应获得更多。正如人们所预料的那样，更强的抑制和更低的认知成本增加了认知因素做出决策的频率。更令人惊讶的是，增加“胡萝卜”和减少“大棒”(在一个更支持性的环境中)可以增强警觉代理(即经常发现新问题的代理)的深思熟虑决策(由认知单元做出)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Contemplation vs. intuition: a reinforcement learning perspective

In a search for a positive model of decision-making with observable primitives, we rely on the burgeoning literature in cognitive neuroscience to construct a three-element machine (agent). Its control unit initiates either impulsive or cognitive elements to solve a problem in a stationary Markov environment, the element chosen depends on whether the problem is mundane or novel, memory of past successes, and the strength of inhibition. Our predictions are based on a stationary asymptotic distribution of the memory, which, depending on the parameters, can generate different “characters”, e.g., an uptight dimwit, who could succeed more often with less inhibition, as well as a laid-back wise-guy, who could gain more with a stronger inhibition of impulsive (intuitive) responses. As one would expect, stronger inhibition and lower cognitive costs increase the frequency of decisions made by the cognitive element. More surprisingly, increasing the “carrot” and reducing the “stick” (being in a more supportive environment) enhance contemplative decisions (made by the cognitive unit) for an alert agent, i.e., the one who identifies novel problems frequently enough.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

EURO Journal on Decision Processes MANAGEMENT-

CiteScore

2.70

自引率

10.00%

发文量