Solving Stochastic Orienteering Problems With Chance Constraints Using Monte Carlo Tree Search

IF 6.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2024-10-09 DOI:10.1109/TASE.2024.3472453
Stefano Carpin
{"title":"Solving Stochastic Orienteering Problems With Chance Constraints Using Monte Carlo Tree Search","authors":"Stefano Carpin","doi":"10.1109/TASE.2024.3472453","DOIUrl":null,"url":null,"abstract":"We present a new Monte Carlo Tree Search (MCTS) algorithm to solve the stochastic orienteering problem with chance constraints, i.e., a version of the problem where travel costs are random, and one is assigned a bound on the tolerable probability of exceeding the budget. The algorithm we present is online and anytime, i.e., it alternates planning and execution, and the quality of the solution it produces increases as the allowed computational time increases. Differently from most former MCTS algorithms, for each action available in a state the algorithm maintains estimates of both its value and the probability that its execution will eventually result in a violation of the chance constraint. Then, at action selection time, our proposed solution prunes away trajectories that are estimated to violate the failure probability. Extensive simulation results show that this approach can quickly produce high-quality solutions and is competitive with the optimal but time-consuming solution. Note to Practitioners—In many practical scenarios one is faced with multiobjective sequential decision making problems that can be solved through constrained optimization. If some of the parameters are known with uncertainty, the event “violating one of the constraints” becomes a random variable whose probability should be bound. As an application of this general problem formulation, in this paper we consider stochastic orienteering, a problem that finds applications when a robot is tasked with performing multiple tasks of varying utility while being subject to a bound on the traveled distance. Many problems in logistics, precision agriculture, and environmental monitoring, just to name a few, can be cast as instances of this optimization problem.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"7855-7869"},"PeriodicalIF":6.4000,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10711219/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

We present a new Monte Carlo Tree Search (MCTS) algorithm to solve the stochastic orienteering problem with chance constraints, i.e., a version of the problem where travel costs are random, and one is assigned a bound on the tolerable probability of exceeding the budget. The algorithm we present is online and anytime, i.e., it alternates planning and execution, and the quality of the solution it produces increases as the allowed computational time increases. Differently from most former MCTS algorithms, for each action available in a state the algorithm maintains estimates of both its value and the probability that its execution will eventually result in a violation of the chance constraint. Then, at action selection time, our proposed solution prunes away trajectories that are estimated to violate the failure probability. Extensive simulation results show that this approach can quickly produce high-quality solutions and is competitive with the optimal but time-consuming solution. Note to Practitioners—In many practical scenarios one is faced with multiobjective sequential decision making problems that can be solved through constrained optimization. If some of the parameters are known with uncertainty, the event “violating one of the constraints” becomes a random variable whose probability should be bound. As an application of this general problem formulation, in this paper we consider stochastic orienteering, a problem that finds applications when a robot is tasked with performing multiple tasks of varying utility while being subject to a bound on the traveled distance. Many problems in logistics, precision agriculture, and environmental monitoring, just to name a few, can be cast as instances of this optimization problem.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用蒙特卡洛树搜索解决带有偶然性约束的随机定向问题
我们提出了一种新的蒙特卡罗树搜索(MCTS)算法来解决具有机会约束的随机定向问题,即,问题的一个版本,其中旅行费用是随机的,并且给定了超出预算的可容忍概率的界限。我们提出的算法是在线和随时随地的,也就是说,它交替规划和执行,它产生的解决方案的质量随着允许的计算时间的增加而增加。与大多数以前的MCTS算法不同,对于一个状态中可用的每个动作,算法保持其值和其执行最终导致违反机会约束的概率的估计。然后,在动作选择时,我们提出的解决方案去除估计违反失效概率的轨迹。大量的仿真结果表明,该方法可以快速生成高质量的解,并与耗时的最优解相竞争。从业人员注意:在许多实际情况下,人们面临着多目标序列决策问题,这些问题可以通过约束优化来解决。如果某些参数是不确定的,那么“违反约束之一”的事件就变成了一个随机变量,其概率应该被限定。作为这一一般问题公式的应用,本文考虑了随机定向运动问题,当机器人被指派执行不同效用的多个任务,同时受到行进距离的限制时,该问题得到了应用。物流、精准农业和环境监测中的许多问题,仅举几例,都可以作为这种优化问题的实例。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统
CiteScore
12.50
自引率
14.30%
发文量
404
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.
期刊最新文献
Automation 5.0: The Step to Systems Intelligence for a Sustainable Future Dual-Layer Bumpless Transfer Control for Markovian Jump Systems and Its Applications: A Weighted Coefficient Mechanism Continuous-time/event-triggered decentralized output feedback prescribed-time control applied to a 2-DOF helicopter High-Precision Tracking Control of Multi-Axis Electro-Optical Systems Based on Accurate Identification of Dynamic Parameters Distributed Adaptive Secondary Control of DC Microgrids with Uncertainties: A Real-time Parameter Estimation Approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1