Solving Stochastic Orienteering Problems With Chance Constraints Using Monte Carlo Tree Search

IF 6.4 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2024-10-09 DOI:10.1109/TASE.2024.3472453

Stefano Carpin

{"title":"Solving Stochastic Orienteering Problems With Chance Constraints Using Monte Carlo Tree Search","authors":"Stefano Carpin","doi":"10.1109/TASE.2024.3472453","DOIUrl":null,"url":null,"abstract":"We present a new Monte Carlo Tree Search (MCTS) algorithm to solve the stochastic orienteering problem with chance constraints, i.e., a version of the problem where travel costs are random, and one is assigned a bound on the tolerable probability of exceeding the budget. The algorithm we present is online and anytime, i.e., it alternates planning and execution, and the quality of the solution it produces increases as the allowed computational time increases. Differently from most former MCTS algorithms, for each action available in a state the algorithm maintains estimates of both its value and the probability that its execution will eventually result in a violation of the chance constraint. Then, at action selection time, our proposed solution prunes away trajectories that are estimated to violate the failure probability. Extensive simulation results show that this approach can quickly produce high-quality solutions and is competitive with the optimal but time-consuming solution. Note to Practitioners—In many practical scenarios one is faced with multiobjective sequential decision making problems that can be solved through constrained optimization. If some of the parameters are known with uncertainty, the event “violating one of the constraints” becomes a random variable whose probability should be bound. As an application of this general problem formulation, in this paper we consider stochastic orienteering, a problem that finds applications when a robot is tasked with performing multiple tasks of varying utility while being subject to a bound on the traveled distance. Many problems in logistics, precision agriculture, and environmental monitoring, just to name a few, can be cast as instances of this optimization problem.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"7855-7869"},"PeriodicalIF":6.4000,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10711219/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

We present a new Monte Carlo Tree Search (MCTS) algorithm to solve the stochastic orienteering problem with chance constraints, i.e., a version of the problem where travel costs are random, and one is assigned a bound on the tolerable probability of exceeding the budget. The algorithm we present is online and anytime, i.e., it alternates planning and execution, and the quality of the solution it produces increases as the allowed computational time increases. Differently from most former MCTS algorithms, for each action available in a state the algorithm maintains estimates of both its value and the probability that its execution will eventually result in a violation of the chance constraint. Then, at action selection time, our proposed solution prunes away trajectories that are estimated to violate the failure probability. Extensive simulation results show that this approach can quickly produce high-quality solutions and is competitive with the optimal but time-consuming solution. Note to Practitioners—In many practical scenarios one is faced with multiobjective sequential decision making problems that can be solved through constrained optimization. If some of the parameters are known with uncertainty, the event “violating one of the constraints” becomes a random variable whose probability should be bound. As an application of this general problem formulation, in this paper we consider stochastic orienteering, a problem that finds applications when a robot is tasked with performing multiple tasks of varying utility while being subject to a bound on the traveled distance. Many problems in logistics, precision agriculture, and environmental monitoring, just to name a few, can be cast as instances of this optimization problem.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用蒙特卡洛树搜索解决带有偶然性约束的随机定向问题

我们提出了一种新的蒙特卡罗树搜索（MCTS）算法来解决具有机会约束的随机定向问题，即，问题的一个版本，其中旅行费用是随机的，并且给定了超出预算的可容忍概率的界限。我们提出的算法是在线和随时随地的，也就是说，它交替规划和执行，它产生的解决方案的质量随着允许的计算时间的增加而增加。与大多数以前的MCTS算法不同，对于一个状态中可用的每个动作，算法保持其值和其执行最终导致违反机会约束的概率的估计。然后，在动作选择时，我们提出的解决方案去除估计违反失效概率的轨迹。大量的仿真结果表明，该方法可以快速生成高质量的解，并与耗时的最优解相竞争。从业人员注意：在许多实际情况下，人们面临着多目标序列决策问题，这些问题可以通过约束优化来解决。如果某些参数是不确定的，那么“违反约束之一”的事件就变成了一个随机变量，其概率应该被限定。作为这一一般问题公式的应用，本文考虑了随机定向运动问题，当机器人被指派执行不同效用的多个任务，同时受到行进距离的限制时，该问题得到了应用。物流、精准农业和环境监测中的许多问题，仅举几例，都可以作为这种优化问题的实例。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统

CiteScore

12.50

自引率

14.30%

发文量

404

审稿时长

3.0 months

期刊介绍： The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.