A quantitative measure of regret in decision-making for human-robot collaborative search tasks

Zhanrui Liao, Longsheng Jiang, Yue Wang
{"title":"A quantitative measure of regret in decision-making for human-robot collaborative search tasks","authors":"Zhanrui Liao, Longsheng Jiang, Yue Wang","doi":"10.23919/ACC.2017.7963169","DOIUrl":null,"url":null,"abstract":"Human-robot collaborations (HRC) can be used for object detection in domain search tasks, which integrate human and computer vision to improve accuracy and efficiency. The Bayesian sequential decision-making (BSD) method has been used for task allocation of a robot in search tasks. In this paper, we first provide an explanation to reveal the nature of the BSD approach: it makes decisions based on the expected value criterion, which is proved to be very different from human decision-making behaviors. On the other hand, it has been shown that joint performance of a team will improve if all members share the same decision-making logic. In HRC, since forcing a human to act like a robot is not desired, we propose to modify the BSD approach such that the robot imitates human logic. In particular, regret theory qualitatively models human's rational decision-making behaviors under uncertainty. We propose a holistic framework to measure regret quantitatively, an individual-based parametric model that fits the measurements, and the integration of regret into the BSD method. Furthermore, we design a human-in-the-loop experiment based on the framework to collect enough data points to further elicit requisite functions of regret theory. Our preliminary results match all the properties in regret theory, while the parametric elicited model shows a good fit to the experimental data.","PeriodicalId":422926,"journal":{"name":"2017 American Control Conference (ACC)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 American Control Conference (ACC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ACC.2017.7963169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

Human-robot collaborations (HRC) can be used for object detection in domain search tasks, which integrate human and computer vision to improve accuracy and efficiency. The Bayesian sequential decision-making (BSD) method has been used for task allocation of a robot in search tasks. In this paper, we first provide an explanation to reveal the nature of the BSD approach: it makes decisions based on the expected value criterion, which is proved to be very different from human decision-making behaviors. On the other hand, it has been shown that joint performance of a team will improve if all members share the same decision-making logic. In HRC, since forcing a human to act like a robot is not desired, we propose to modify the BSD approach such that the robot imitates human logic. In particular, regret theory qualitatively models human's rational decision-making behaviors under uncertainty. We propose a holistic framework to measure regret quantitatively, an individual-based parametric model that fits the measurements, and the integration of regret into the BSD method. Furthermore, we design a human-in-the-loop experiment based on the framework to collect enough data points to further elicit requisite functions of regret theory. Our preliminary results match all the properties in regret theory, while the parametric elicited model shows a good fit to the experimental data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
人机协同搜索任务决策后悔的定量度量
人机协作(human -robot collaborative, HRC)可以用于领域搜索任务中的目标检测,它将人视觉与计算机视觉相结合,提高了检测的准确性和效率。将贝叶斯序列决策(BSD)方法用于机器人搜索任务的任务分配。在本文中,我们首先提供了一个解释来揭示BSD方法的本质:它基于期望值标准进行决策,这被证明与人类的决策行为有很大的不同。另一方面,研究表明,如果所有成员共享相同的决策逻辑,团队的联合绩效将得到提高。在HRC中,由于不希望强迫人类像机器人一样行动,我们建议修改BSD方法,使机器人模仿人类的逻辑。特别是后悔理论定性地模拟了人类在不确定性下的理性决策行为。我们提出了一个定量测量后悔的整体框架,一个适合测量的基于个体的参数模型,并将后悔整合到BSD方法中。在此基础上,我们设计了一个人在循环实验,以收集足够的数据点,进一步引出后悔理论的必要功能。我们的初步结果符合后悔理论的所有性质,而参数化模型与实验数据拟合得很好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Plenary and semi-plenary sessions Spatial Iterative Learning Control: Systems with input saturation Distributed Second Order Sliding Modes for Optimal Load Frequency Control Adaptive optimal observer design via approximate dynamic programming Nonlinear adaptive stabilization of a class of planar slow-fast systems at a non-hyperbolic point
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1