Reinforcement learning and instance-based learning approaches to modeling human decision making in a prognostic foraging task

Suhas E. Chelian, Jaehyon Paik, P. Pirolli, C. Lebiere, Rajan Bhattacharyya
Published in: 2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)
Publication date: 2015-12-07
DOI: 10.1109/DEVLRN.2015.7346127 (https://doi.org/10.1109/DEVLRN.2015.7346127)
Citations: 5

Abstract

Procedural memory and episodic memory are known to be distinct, and both underlie the performance of many tasks. Reinforcement learning (RL) and instance-based learning (IBL) are common approaches to modeling procedural and episodic memory, respectively. In this work, we apply a neural model using RL dynamics and an ACT-R model using IBL productions to modeling human decision making in a prognostic foraging task. The task was derived from a geospatial intelligence domain in which agents must choose among information sources to predict an adversary's actions more accurately. Results from both models are compared to human data and suggest that information gain is an important component in modeling decision-making behavior with either memory system; relative to the episodic memory approach, the procedural memory approach has a small but significant advantage in fitting human data. Finally, we discuss the interactions of multiple memory systems in complex decision-making tasks.
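To make the contrast in the abstract concrete, the following is a minimal sketch of the three ideas it names: information gain as a signal for choosing an information source, an RL-style value update, and an IBL-style blended value. It is not the authors' neural model or their ACT-R model. All function names, the parameter values (alpha, decay, temperature), and the choice of Shannon entropy reduction as the information-gain measure are assumptions made for illustration only.

```python
import math


def entropy(p):
    """Shannon entropy (in bits) of a discrete probability distribution."""
    return -sum(x * math.log2(x) for x in p if x > 0)


def information_gain(prior, posterior):
    """Reduction in uncertainty after consulting a source.

    Assumed here as a stand-in for the payoff of an information source;
    the paper may define information gain differently.
    """
    return entropy(prior) - entropy(posterior)


def rl_update(value, reward, alpha=0.1):
    """Delta-rule update: move a cached value toward the observed reward.

    This is the procedural-memory flavour: one running estimate per option.
    """
    return value + alpha * (reward - value)


def ibl_blended_value(instances, now, decay=0.5, temperature=0.25):
    """Blend stored outcomes, weighting each by its retrieval probability.

    This is the episodic-memory flavour: every past instance is kept.
    instances: list of (timestamp, outcome) pairs for one option, with
    every timestamp strictly earlier than `now`.
    Activation is a power-law decay of age (ACT-R base-level learning with
    a single presentation per instance); noise and similarity are omitted.
    """
    activations = [-decay * math.log(now - t) for t, _ in instances]
    weights = [math.exp(a / temperature) for a in activations]
    total = sum(weights)
    return sum(w / total * outcome
               for w, (_, outcome) in zip(weights, instances))


if __name__ == "__main__":
    # Hypothetical beliefs over four adversary actions, before and after
    # consulting one information source.
    prior = [0.25, 0.25, 0.25, 0.25]
    posterior = [0.7, 0.1, 0.1, 0.1]
    gain = information_gain(prior, posterior)

    # RL: fold the gain into a single cached value for that source.
    print(rl_update(value=0.0, reward=gain))

    # IBL: store the gain as a new instance and blend over all instances.
    history = [(1, 0.2), (5, 0.4), (9, gain)]
    print(ibl_blended_value(history, now=10))
```

The design difference is in what each learner keeps: the RL update discards individual experiences and retains only a running estimate, while the IBL estimate is recomputed from stored instances, so recent instances dominate through the decay term.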
Other recent papers from this venue:
The sequential organization of movement is critical to the development of reaching: A neural dynamics account
Incremental grounded language learning in robot-robot interactions - Examples from spatial language
A learning model for essentialist concepts
Biological and simulated neuronal networks show similar competence on a visual tracking task
A Deep Learning Neural Network for Number Cognition: A bi-cultural study with the iCub