首页 > 最新文献

International Conference on Automated Planning and Scheduling最新文献

英文 中文
Loopless Top-K Planning 无循环Top-K规划
Pub Date : 2022-06-13 DOI: 10.1609/icaps.v32i1.19823
Julian von Tschammer, Robert Mattmüller, David Speck
In top-k planning, the objective is to determine a set of k cheapest plans that provide several good alternatives to choose from. Such a solution set often contains plans that visit at least one state more than once. Depending on the application, plans with such loops are of little importance because they are dominated by a loopless representative and can prevent more meaningful plans from being found.In this paper, we motivate and introduce loopless top-k planning. We show how to enhance the state-of-the-art symbolic top-k planner, symK, to obtain an efficient, sound and complete algorithm for loopless top-k planning. An empirical evaluation shows that our proposed approach has a higher k-coverage than a generate-and-test approach that uses an ordinary top-k planner, which we show to be incomplete in the presence of zero-cost loops.
在top-k计划中,目标是确定一组k个最便宜的计划,这些计划提供了几个不错的选择。这样的解决方案集通常包含多次访问至少一个州的计划。根据应用程序的不同,具有这种循环的计划并不重要,因为它们由无循环的代表所控制,并且可能会阻止发现更有意义的计划。在本文中,我们激励并引入了无环的top-k规划。我们展示了如何增强最先进的符号top-k规划器,symK,以获得一个高效,健全和完整的无环路top-k规划算法。经验评估表明,我们提出的方法比使用普通top-k规划器的生成和测试方法具有更高的k-覆盖率,我们表明,在零成本循环存在时,生成和测试方法是不完整的。
{"title":"Loopless Top-K Planning","authors":"Julian von Tschammer, Robert Mattmüller, David Speck","doi":"10.1609/icaps.v32i1.19823","DOIUrl":"https://doi.org/10.1609/icaps.v32i1.19823","url":null,"abstract":"In top-k planning, the objective is to determine a set of k cheapest plans that provide several good alternatives to choose from. Such a solution set often contains plans that visit at least one state more than once. Depending on the application, plans with such loops are of little importance because they are dominated by a loopless representative and can prevent more meaningful plans from being found.\u0000\u0000In this paper, we motivate and introduce loopless top-k planning. We show how to enhance the state-of-the-art symbolic top-k planner, symK, to obtain an efficient, sound and complete algorithm for loopless top-k planning. An empirical evaluation shows that our proposed approach has a higher k-coverage than a generate-and-test approach that uses an ordinary top-k planner, which we show to be incomplete in the presence of zero-cost loops.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"170 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115962160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Multi-Agent Tree Search with Dynamic Reward Shaping 动态奖励形成的多智能体树搜索
Pub Date : 2022-06-13 DOI: 10.1609/icaps.v32i1.19854
Alvaro Velasquez, Brett Bissey, Lior Barak, Daniel Melcer, Andre Beckus, Ismail R. Alkhouri, George K. Atia
Sparse rewards and their representation in multi-agent domains remains a challenge for the development of multi-agent planning systems. While techniques from formal methods can be adopted to represent the underlying planning objectives, their use in facilitating and accelerating learning has witnessed limited attention in multi-agent settings. Reward shaping methods that leverage such formal representations in single-agent settings are typically static in the sense that the artificial rewards remain the same throughout the entire learning process. In contrast, we investigate the use of such formal objective representations to define novel reward shaping functions that capture the learned experience of the agents. More specifically, we leverage the automaton representation of the underlying team objectives in mixed cooperative-competitive domains such that each automaton transition is assigned an expected value proportional to the frequency with which it was observed in successful trajectories of past behavior. This form of dynamic reward shaping is proposed within a multi-agent tree search architecture wherein agents can simultaneously reason about the future behavior of other agents as well as their own future behavior.
稀疏奖励及其在多智能体领域中的表示仍然是多智能体规划系统发展的一个挑战。虽然可以采用形式化方法中的技术来表示潜在的规划目标,但它们在促进和加速学习方面的使用在多智能体设置中受到了有限的关注。在单智能体设置中利用这种形式表示的奖励塑造方法通常是静态的,因为人工奖励在整个学习过程中保持不变。相反,我们研究了使用这种形式的客观表征来定义新的奖励塑造函数,以捕获代理的学习经验。更具体地说,我们利用混合合作-竞争领域中潜在团队目标的自动机表示,这样每个自动机的转换被分配一个期望值,与它在过去行为的成功轨迹中观察到的频率成正比。这种形式的动态奖励形成是在多智能体树搜索架构中提出的,其中智能体可以同时推断其他智能体的未来行为以及自己的未来行为。
{"title":"Multi-Agent Tree Search with Dynamic Reward Shaping","authors":"Alvaro Velasquez, Brett Bissey, Lior Barak, Daniel Melcer, Andre Beckus, Ismail R. Alkhouri, George K. Atia","doi":"10.1609/icaps.v32i1.19854","DOIUrl":"https://doi.org/10.1609/icaps.v32i1.19854","url":null,"abstract":"Sparse rewards and their representation in multi-agent domains remains a challenge for the development of multi-agent planning systems. While techniques from formal methods can be adopted to represent the underlying planning objectives, their use in facilitating and accelerating learning has witnessed limited attention in multi-agent settings. Reward shaping methods that leverage such formal representations in single-agent settings are typically static in the sense that the artificial rewards remain the same throughout the entire learning process. In contrast, we investigate the use of such formal objective representations to define novel reward shaping functions that capture the learned experience of the agents. More specifically, we leverage the automaton representation of the underlying team objectives in mixed cooperative-competitive domains such that each automaton transition is assigned an expected value proportional to the frequency with which it was observed in successful trajectories of past behavior. This form of dynamic reward shaping is proposed within a multi-agent tree search architecture wherein agents can simultaneously reason about the future behavior of other agents as well as their own future behavior.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122415854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Biased Exploration for Satisficing Heuristic Search 满足启发式搜索的偏差探索
Pub Date : 2022-06-13 DOI: 10.1609/icaps.v32i1.19804
Ryo Kuroiwa, J. Christopher Beck
Satisficing heuristic search such as greedy best-first search (GBFS) suffers from local minima, regions where heuristic values are inaccurate and a good node has a worse heuristic value than other nodes. Search algorithms that incorporate exploration mechanisms in GBFS empirically reduce the search effort to solve difficult problems. Although some of these methods entirely ignore the guidance of a heuristic during their exploration phase, intuitively, a good heuristic should have some bound on its inaccuracy, and exploration mechanisms should exploit this bound. In this paper, we theoretically analyze what a good node is for satisficing heuristic search algorithms and show that the heuristic value of a good node has an upper bound if a heuristic satisfies a certain property. Then, we propose biased exploration mechanisms which select lower heuristic values with higher probabilities. In the experiments using synthetic graph search problems and classical planning benchmarks, we show that the biased exploration mechanisms can be useful. In particular, one of our methods, Softmin-Type(h), significantly outperforms other GBFS variants in classical planning and improves the performance of Type-LAMA, a state-of-the-art classical planner.
满足启发式搜索(如贪婪最优优先搜索(GBFS))存在局部最小值、启发式值不准确的区域以及良好节点的启发式值比其他节点差的问题。在GBFS中结合探索机制的搜索算法从经验上减少了解决难题的搜索工作量。虽然其中一些方法在探索阶段完全忽略了启发式的指导,但从直觉上讲,一个好的启发式应该对其不准确性有一定的限制,而探索机制应该利用这个限制。本文从理论上分析了满足启发式搜索算法的好节点是什么,并证明了当一个启发式算法满足一定的性质时,一个好节点的启发式值有上界。然后,我们提出了有偏差的探索机制,该机制选择具有较高概率的较低启发式值。在使用合成图搜索问题和经典规划基准的实验中,我们证明了有偏差的探索机制是有用的。特别是,我们的方法之一,Softmin-Type(h),在经典规划中显著优于其他GBFS变体,并提高了最先进的经典规划器Type-LAMA的性能。
{"title":"Biased Exploration for Satisficing Heuristic Search","authors":"Ryo Kuroiwa, J. Christopher Beck","doi":"10.1609/icaps.v32i1.19804","DOIUrl":"https://doi.org/10.1609/icaps.v32i1.19804","url":null,"abstract":"Satisficing heuristic search such as greedy best-first search (GBFS) suffers from local minima, regions where heuristic values are inaccurate and a good node has a worse heuristic value than other nodes. Search algorithms that incorporate exploration mechanisms in GBFS empirically reduce the search effort to solve difficult problems. Although some of these methods entirely ignore the guidance of a heuristic during their exploration phase, intuitively, a good heuristic should have some bound on its inaccuracy, and exploration mechanisms should exploit this bound. In this paper, we theoretically analyze what a good node is for satisficing heuristic search algorithms and show that the heuristic value of a good node has an upper bound if a heuristic satisfies a certain property. Then, we propose biased exploration mechanisms which select lower heuristic values with higher probabilities. In the experiments using synthetic graph search problems and classical planning benchmarks, we show that the biased exploration mechanisms can be useful. In particular, one of our methods, Softmin-Type(h), significantly outperforms other GBFS variants in classical planning and improves the performance of Type-LAMA, a state-of-the-art classical planner.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123816134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Joint Pricing and Matching for City-Scale Ride-Pooling 城市规模拼车的联合定价与匹配
Pub Date : 2022-06-13 DOI: 10.1609/icaps.v32i1.19836
Sanket Shah, Meghna Lowalekar, Pradeep Varakantham
Central to efficient ride-pooling are two challenges: (1) how to `price' customers' requests for rides, and (2) if the customer agrees to that price, how to best `match' these requests to drivers. While both of them are interdependent, each challenge's individual complexity has meant that, historically, they have been decoupled and studied individually.This paper creates a framework for batched pricing and matching in which pricing is seen as a meta-level optimisation over different possible matching decisions. Our key contributions are in developing a variant of the revenue-maximizing auction corresponding to the meta-level optimization problem, and then providing a scalable mechanism for computing posted prices. We test our algorithm on real-world data at city-scale and show that our algorithm reliably matches demand to supply across a range of parameters.
高效拼车的核心是两个挑战:(1)如何为客户的乘车请求“定价”;(2)如果客户同意这个价格,如何最好地将这些请求与司机“匹配”起来。虽然两者都是相互依赖的,但每个挑战的个体复杂性意味着,从历史上看,它们已经被解耦并单独研究。本文创建了一个批量定价和匹配的框架,其中定价被视为不同可能匹配决策的元级优化。我们的主要贡献是开发了一种与元级优化问题相对应的收益最大化拍卖的变体,然后提供了一种计算公布价格的可扩展机制。我们在城市规模的真实世界数据上测试了我们的算法,并表明我们的算法可靠地匹配了一系列参数的供需。
{"title":"Joint Pricing and Matching for City-Scale Ride-Pooling","authors":"Sanket Shah, Meghna Lowalekar, Pradeep Varakantham","doi":"10.1609/icaps.v32i1.19836","DOIUrl":"https://doi.org/10.1609/icaps.v32i1.19836","url":null,"abstract":"Central to efficient ride-pooling are two challenges: (1) how to `price' customers' requests for rides, and (2) if the customer agrees to that price, how to best `match' these requests to drivers. While both of them are interdependent, each challenge's individual complexity has meant that, historically, they have been decoupled and studied individually.\u0000\u0000This paper creates a framework for batched pricing and matching in which pricing is seen as a meta-level optimisation over different possible matching decisions. Our key contributions are in developing a variant of the revenue-maximizing auction corresponding to the meta-level optimization problem, and then providing a scalable mechanism for computing posted prices. We test our algorithm on real-world data at city-scale and show that our algorithm reliably matches demand to supply across a range of parameters.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127745836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Talking Trucks: Decentralized Collaborative Multi-Agent Order Scheduling for Self-Organizing Logistics 会说话的卡车:自组织物流的分散协作多智能体订单调度
Pub Date : 2022-06-13 DOI: 10.1609/icaps.v32i1.19834
Geert L. J. Pingen, C. R. V. Ommeren, C. J. V. Leeuwen, Ruben Fransen, Tijmen Elfrink, Yorick C. de Vries, Janarthanan Karunakaran, Emir Demirovic, N. Yorke-Smith
Logistics planning is a complex optimization problem involving multiple decision makers. Automated scheduling systems offer support to human planners; however state-of-the-art approaches often employ a centralized control paradigm. While these approaches have shown great value, their application is hindered in dynamic settings with no central authority. Motivated by real-world scenarios, we present a decentralized approach to collaborative multi-agent scheduling by casting the problem as a Distributed Constraint Optimization Problem (DCOP). Our model-based heuristic approach uses message passing with a novel pruning technique to allow agents to cooperate on mutual agreement, leading to a near-optimal solution while offering low computational costs and flexibility in case of disruptions. Performance is evaluated in three real-world field trials with a logistics carrier and compared against a centralized model-free Deep Q-Network (DQN)-based Reinforcement Learning (RL) approach, a Mixed-Integer Linear Programming (MILP)-based solver, and both human and heuristic baselines. The results demonstrate that it is feasible to have virtual agents make autonomous decisions using our DCOP method, leading to an efficient distributed solution. To facilitate further research in Self-Organizing Logistics (SOL), we provide a novel real-life dataset.
物流规划是一个涉及多个决策者的复杂优化问题。自动调度系统为人类计划者提供支持;然而,最先进的方法通常采用集中控制范式。虽然这些方法显示出巨大的价值,但它们的应用在没有中央权威的动态环境中受到阻碍。受现实场景的启发,我们提出了一种分散的多智能体协作调度方法,将该问题转换为分布式约束优化问题(DCOP)。我们基于模型的启发式方法使用消息传递和一种新颖的修剪技术,允许代理在相互协议的基础上进行合作,从而在提供低计算成本和灵活性的情况下获得接近最优的解决方案。在三次现实世界的现场试验中,对物流承运人的性能进行了评估,并与基于集中式无模型深度q -网络(DQN)的强化学习(RL)方法、基于混合整数线性规划(MILP)的求解器以及人类和启发式基线进行了比较。结果表明,使用我们的DCOP方法让虚拟代理自主决策是可行的,从而产生了一个高效的分布式解决方案。为了促进自组织物流(SOL)的进一步研究,我们提供了一个新的现实生活数据集。
{"title":"Talking Trucks: Decentralized Collaborative Multi-Agent Order Scheduling for Self-Organizing Logistics","authors":"Geert L. J. Pingen, C. R. V. Ommeren, C. J. V. Leeuwen, Ruben Fransen, Tijmen Elfrink, Yorick C. de Vries, Janarthanan Karunakaran, Emir Demirovic, N. Yorke-Smith","doi":"10.1609/icaps.v32i1.19834","DOIUrl":"https://doi.org/10.1609/icaps.v32i1.19834","url":null,"abstract":"Logistics planning is a complex optimization problem involving multiple decision makers. Automated scheduling systems offer support to human planners; however state-of-the-art approaches often employ a centralized control paradigm. While these approaches have shown great value, their application is hindered in dynamic settings with no central authority. Motivated by real-world scenarios, we present a decentralized approach to collaborative multi-agent scheduling by casting the problem as a Distributed Constraint Optimization Problem (DCOP). Our model-based heuristic approach uses message passing with a novel pruning technique to allow agents to cooperate on mutual agreement, leading to a near-optimal solution while offering low computational costs and flexibility in case of disruptions. Performance is evaluated in three real-world field trials with a logistics carrier and compared against a centralized model-free Deep Q-Network (DQN)-based Reinforcement Learning (RL) approach, a Mixed-Integer Linear Programming (MILP)-based solver, and both human and heuristic baselines. The results demonstrate that it is feasible to have virtual agents make autonomous decisions using our DCOP method, leading to an efficient distributed solution. To facilitate further research in Self-Organizing Logistics (SOL), we provide a novel real-life dataset.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133539902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Complexity of Heuristic Synthesis for Satisficing Classical Planning: Potential Heuristics and Beyond 论满足经典规划的启发式综合的复杂性:潜在启发式及超越
Pub Date : 2022-06-13 DOI: 10.1609/icaps.v32i1.19793
M. Helmert, Silvan Sievers, Alexander Rovner, Augusto B. Corrêa
Potential functions are a general class of heuristics for classical planning. For satisficing planning, previous work suggested the use of descending and dead-end avoiding (DDA) potential heuristics, which solve planning tasks by backtrack-free search. In this work we study the complexity of devising DDA potential heuristics for classical planning tasks. We show that verifying or synthesizing DDA potential heuristics is PSPACE-complete, but suitable modifications of the DDA properties reduce the complexity of these problems to the first and second level of the polynomial hierarchy. We also discuss the implications of our results for other forms of heuristic synthesis in classical planning.
势函数是经典规划中的一类启发式方法。为了满足规划,以前的工作建议使用下降和避免死角(DDA)潜在启发式,通过无回溯搜索来解决规划任务。在这项工作中,我们研究了设计经典规划任务的DDA潜在启发式的复杂性。我们证明验证或综合DDA潜在启发式是pspace完备的,但适当修改DDA性质将这些问题的复杂性降低到多项式层次的第一和第二级。我们还讨论了我们的结果对经典规划中其他形式的启发式综合的影响。
{"title":"On the Complexity of Heuristic Synthesis for Satisficing Classical Planning: Potential Heuristics and Beyond","authors":"M. Helmert, Silvan Sievers, Alexander Rovner, Augusto B. Corrêa","doi":"10.1609/icaps.v32i1.19793","DOIUrl":"https://doi.org/10.1609/icaps.v32i1.19793","url":null,"abstract":"Potential functions are a general class of heuristics for classical planning. For satisficing planning, previous work suggested the use of descending and dead-end avoiding (DDA) potential heuristics, which solve planning tasks by backtrack-free search. In this work we study the complexity of devising DDA potential heuristics for classical planning tasks. We show that verifying or synthesizing DDA potential heuristics is PSPACE-complete, but suitable modifications of the DDA properties reduce the complexity of these problems to the first and second level of the polynomial hierarchy. We also discuss the implications of our results for other forms of heuristic synthesis in classical planning.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130992840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Stochastic Resource Optimization over Heterogeneous Graph Neural Networks for Failure-Predictive Maintenance Scheduling 基于异构图神经网络的故障预测维修调度随机资源优化
Pub Date : 2022-06-13 DOI: 10.1609/icaps.v32i1.19839
Zheyuan Wang, M. Gombolay
Resource optimization for predictive maintenance is a challenging computational problem that requires inferring and reasoning over stochastic failure models and dynamically allocating repair resources. Predictive maintenance scheduling is typically performed with a combination of ad hoc, hand-crafted heuristics with manual scheduling corrections by human domain experts, which is a labor-intensive process that is hard to scale. In this paper, we develop an innovative heterogeneous graph neural network to automatically learn an end-to-end resource scheduling policy. Our approach is fully graph-based with the addition of state summary and decision value nodes that provides a computationally lightweight and nonparametric means to perform dynamic scheduling. We augment our policy optimization procedure to enable robust learning in highly stochastic environments for which typical actor-critic reinforcement learning methods are ill-suited. In consultation with aerospace industry partners, we develop a virtual predictive-maintenance environment for a heterogeneous fleet of aircraft, called AirME. Our approach sets a new state-of-the-art by outperforming conventional, hand-crafted heuristics and baseline learning methods across problem sizes and various objective functions.
预测性维修的资源优化是一个具有挑战性的计算问题,需要对随机故障模型进行推断和推理,并动态分配维修资源。预测性维护计划通常是由人工领域专家结合使用特别的、手工制作的启发式方法和手动调度更正来执行的,这是一个难以扩展的劳动密集型过程。在本文中,我们开发了一种创新的异构图神经网络来自动学习端到端的资源调度策略。我们的方法是完全基于图的,添加了状态汇总和决策值节点,提供了一种计算轻量级和非参数化的方法来执行动态调度。我们增强了我们的策略优化过程,以在高度随机的环境中实现鲁棒学习,而典型的行为者批评强化学习方法不适合这种环境。在与航空航天业合作伙伴协商后,我们为异构机队开发了一个虚拟的预测性维护环境,称为AirME。我们的方法通过超越传统的、手工制作的启发式和基线学习方法,在问题大小和各种目标函数上设置了一个新的最先进的方法。
{"title":"Stochastic Resource Optimization over Heterogeneous Graph Neural Networks for Failure-Predictive Maintenance Scheduling","authors":"Zheyuan Wang, M. Gombolay","doi":"10.1609/icaps.v32i1.19839","DOIUrl":"https://doi.org/10.1609/icaps.v32i1.19839","url":null,"abstract":"Resource optimization for predictive maintenance is a challenging computational problem that requires inferring and reasoning over stochastic failure models and dynamically allocating repair resources. Predictive maintenance scheduling is typically performed with a combination of ad hoc, hand-crafted heuristics with manual scheduling corrections by human domain experts, which is a labor-intensive process that is hard to scale. In this paper, we develop an innovative heterogeneous graph neural network to automatically learn an end-to-end resource scheduling policy. Our approach is fully graph-based with the addition of state summary and decision value nodes that provides a computationally lightweight and nonparametric means to perform dynamic scheduling. We augment our policy optimization procedure to enable robust learning in highly stochastic environments for which typical actor-critic reinforcement learning methods are ill-suited. In consultation with aerospace industry partners, we develop a virtual predictive-maintenance environment for a heterogeneous fleet of aircraft, called AirME. Our approach sets a new state-of-the-art by outperforming conventional, hand-crafted heuristics and baseline learning methods across problem sizes and various objective functions.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128383292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Cost Partitioning Heuristics for Stochastic Shortest Path Problems 随机最短路径问题的代价划分启发式算法
Pub Date : 2022-06-13 DOI: 10.1609/icaps.v32i1.19802
Thorsten Klößner, F. Pommerening, Thomas Keller, G. Röger
In classical planning, cost partitioning is a powerful method which allows to combine multiple admissible heuristics while retaining an admissible bound. In this paper, we extend the theory of cost partitioning to probabilistic planning by generalizing from deterministic transition systems to stochastic shortest path problems (SSPs). We show that fundamental results related to cost partitioning still hold in our extended theory. We also investigate how to optimally partition costs for a large class of abstraction heuristics for SSPs. Lastly, we analyze occupation measure heuristics for SSPs as well as the theory of approximate linear programming for reward-oriented Markov decision processes. All of these fit our framework and can be seen as cost-partitioned heuristics.
在经典规划中,成本划分是一种强大的方法,它可以在保留可接受边界的情况下组合多个可接受的启发式方法。本文通过将确定性转移系统推广到随机最短路径问题,将成本分配理论推广到概率规划中。我们证明了与成本分配相关的基本结果在我们的扩展理论中仍然成立。我们还研究了如何最优地划分ssp抽象启发式的开销。最后,我们分析了面向奖励的马尔可夫决策过程的职业度量启发式以及近似线性规划理论。所有这些都符合我们的框架,可以看作是成本划分的启发式。
{"title":"Cost Partitioning Heuristics for Stochastic Shortest Path Problems","authors":"Thorsten Klößner, F. Pommerening, Thomas Keller, G. Röger","doi":"10.1609/icaps.v32i1.19802","DOIUrl":"https://doi.org/10.1609/icaps.v32i1.19802","url":null,"abstract":"In classical planning, cost partitioning is a powerful method which allows to combine multiple admissible heuristics while retaining an admissible bound. In this paper, we extend the theory of cost partitioning to probabilistic planning by generalizing from deterministic transition systems to stochastic shortest path problems (SSPs). We show that fundamental results related to cost partitioning still hold in our extended theory. We also investigate how to optimally partition costs for a large class of abstraction heuristics for SSPs. Lastly, we analyze occupation measure heuristics for SSPs as well as the theory of approximate linear programming for reward-oriented Markov decision processes. All of these fit our framework and can be seen as cost-partitioned heuristics.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114976372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
OFFICERS: Operational Framework for Intelligent Crime-and-Emergency Response Scheduling 官员:智能犯罪和应急响应调度的操作框架
Pub Date : 2022-06-13 DOI: 10.1609/icaps.v32i1.19830
Jonathan Chase, Siong Thye Goh, T. Phong, H. Lau
In the quest to achieve better response times in dense urban environments, law enforcement agencies are seeking AI-driven planning systems to inform their patrol strategies. In this paper, we present a framework, OFFICERS, for deployment planning that learns from historical data to generate deployment schedules on a daily basis. We accurately predict incidents using ST-ResNet, a deep learning technique that captures wide-ranging spatio-temporal dependencies, and solve a large-scale optimization problem to schedule deployment, significantly improving its scalability through a simulated annealing solver. Methodologically, our approach outperforms our previous works where prediction was done using Generative Adversarial Networks, and optimization was performed with the CPLEX solver. Furthermore, we show that our proposed framework is designed to be readily transferable between use cases, handling a wide range of both criminal and non-criminal incidents, with the use of deep learning and a general-purpose efficient solver, reducing dependence on context-specific details. We demonstrate the value of our approach on a police patrol case study, and discuss both the ethical considerations, and operational requirements, for deployment of a lightweight and responsive planning system.
为了在密集的城市环境中实现更好的响应时间,执法机构正在寻求人工智能驱动的规划系统,以告知他们的巡逻策略。在本文中,我们提出了一个框架,军官,用于部署计划,它从历史数据中学习,每天生成部署计划。我们使用ST-ResNet准确预测事件,ST-ResNet是一种深度学习技术,可捕获广泛的时空依赖性,并解决大规模优化问题以调度部署,通过模拟退火求解器显着提高其可扩展性。在方法上,我们的方法优于我们以前使用生成对抗网络进行预测的工作,并使用CPLEX求解器进行优化。此外,我们表明,我们提出的框架被设计成易于在用例之间转移,处理广泛的刑事和非刑事事件,使用深度学习和通用高效求解器,减少对特定于上下文的细节的依赖。我们通过一个警察巡逻案例研究,展示了我们的方法的价值,并讨论了部署一个轻量级和反应迅速的规划系统的道德考虑和操作要求。
{"title":"OFFICERS: Operational Framework for Intelligent Crime-and-Emergency Response Scheduling","authors":"Jonathan Chase, Siong Thye Goh, T. Phong, H. Lau","doi":"10.1609/icaps.v32i1.19830","DOIUrl":"https://doi.org/10.1609/icaps.v32i1.19830","url":null,"abstract":"In the quest to achieve better response times in dense urban environments, law enforcement agencies are seeking AI-driven planning systems to inform their patrol strategies. In this paper, we present a framework, OFFICERS, for deployment planning that learns from historical data to generate deployment schedules on a daily basis. We accurately predict incidents using ST-ResNet, a deep learning technique that captures wide-ranging spatio-temporal dependencies, and solve a large-scale optimization problem to schedule deployment, significantly improving its scalability through a simulated annealing solver. Methodologically, our approach outperforms our previous works where prediction was done using Generative Adversarial Networks, and optimization was performed with the CPLEX solver. Furthermore, we show that our proposed framework is designed to be readily transferable between use cases, handling a wide range of both criminal and non-criminal incidents, with the use of deep learning and a general-purpose efficient solver, reducing dependence on context-specific details. We demonstrate the value of our approach on a police patrol case study, and discuss both the ethical considerations, and operational requirements, for deployment of a lightweight and responsive planning system.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129731468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Who Needs These Operators Anyway: Top Quality Planning with Operator Subset Criteria 谁需要这些算子:算子子集标准的高质量规划
Pub Date : 2022-06-13 DOI: 10.1609/icaps.v32i1.19800
Michael Katz, Shirin Sohrabi
Top-quality planning in general and quotient top-quality planning in particular deal with producing multiple high-quality plans while allowing for their efficient generation, skipping equivalent ones. Prior work has explored one equivalence relation, considering two plans to be equivalent if their operator multi-sets are equal. This allowed omitting plans that are reorderings of previously found ones. However, the resulting sets of plans were still large, in some domains even infinite.In this paper, we consider a different relation: two plans are related if one's operator multiset is a subset of the other's. We propose novel reformulations that forbid plans that are related to the given ones. While the new relation is not transitive and thus not an equivalence relation, we can define a new subset top-quality planning problem, with finite size solution sets. We formally prove that these solutions can be obtained by exploiting the proposed reformulations. Our empirical evaluation shows that solutions to the new problem can be found for more tasks than unordered top-quality planning solutions. Further, the results shows that the solution sizes significantly decrease, making the new approach more practical, particularly in domains with redundant operators.
一般来说,高质量规划,特别是商数高质量规划,处理的是产生多个高质量计划,同时允许它们高效地生成,跳过等效的计划。先前的工作探讨了一个等价关系,认为如果两个计划的算子多集相等,则它们是等价的。这允许省略先前发现的重新排序的计划。然而,最终的计划集仍然很大,在某些领域甚至是无限的。本文考虑一种不同的关系:当一个算子的多集是另一个算子的子集时,两个计划是相关的。我们提出新的改革方案,禁止与现有方案相关的方案。由于新的关系不是传递关系,因此不是等价关系,因此我们可以定义一个新的子集高质量规划问题,具有有限大小的解集。我们正式证明了这些解可以通过利用所提出的重新表述得到。我们的实证评估表明,与无序的高质量规划方案相比,新问题的解决方案可以找到更多的任务。此外,结果表明,解的大小显着减小,使新方法更加实用,特别是在具有冗余算子的领域。
{"title":"Who Needs These Operators Anyway: Top Quality Planning with Operator Subset Criteria","authors":"Michael Katz, Shirin Sohrabi","doi":"10.1609/icaps.v32i1.19800","DOIUrl":"https://doi.org/10.1609/icaps.v32i1.19800","url":null,"abstract":"Top-quality planning in general and quotient top-quality planning in particular deal with producing multiple high-quality plans while allowing for their efficient generation, skipping equivalent ones. Prior work has explored one equivalence relation, considering two plans to be equivalent if their operator multi-sets are equal. This allowed omitting plans that are reorderings of previously found ones. However, the resulting sets of plans were still large, in some domains even infinite.\u0000In this paper, we consider a different relation: two plans are related if one's operator multiset is a subset of the other's. We propose novel reformulations that forbid plans that are related to the given ones. While the new relation is not transitive and thus not an equivalence relation, we can define a new subset top-quality planning problem, with finite size solution sets. We formally prove that these solutions can be obtained by exploiting the proposed reformulations. Our empirical evaluation shows that solutions to the new problem can be found for more tasks than unordered top-quality planning solutions. Further, the results shows that the solution sizes significantly decrease, making the new approach more practical, particularly in domains with redundant operators.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131161197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
International Conference on Automated Planning and Scheduling
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1