首页 > 最新文献

International Conference on Automated Planning and Scheduling最新文献

英文 中文
Fixing Plans for PDDL+ Problems: Theoretical and Practical Implications 修正PDDL+问题的计划:理论和实践意义
Pub Date : 2023-07-01 DOI: 10.1609/icaps.v33i1.27210
Francesco Percassi, Enrico Scala, M. Vallati
The plan, execution, and replan framework has proven to be extremely valuable in complex real-world applications, where the dynamics of the environment cannot be fully encoded in the domain model. However, this comes at the cost of regenerating plans from scratch, which can be expensive when expressive formalisms like PDDL+ are used. Given the complexity of generating PDDL+ plans, it would be ideal to reuse as much as possible of an existing plan, rather than generating a new one from scratch every time. To support more effective exploitation of the plan, execution, and replan framework in PDDL+, in this paper, we introduce the problem of discretized PDDL+ plan fixing, which allows one to fix existing plans according to some defined constraints. We demonstrate the theoretical implications of the introduced notion and introduce reformulations to address the problem using domain-independent planning engines. Our results show that such reformulations can outperform replanning from scratch and unlock planning engines to solve more problems with fine-grained discretizations.
计划、执行和重新计划框架已被证明在复杂的实际应用程序中非常有价值,在这些应用程序中,环境的动态不能完全编码到域模型中。然而,这是以从头开始重新生成计划为代价的,当使用像PDDL+这样的表达形式时,这可能是昂贵的。考虑到生成PDDL+计划的复杂性,理想的做法是尽可能多地重用现有计划,而不是每次都从头生成一个新计划。为了支持更有效地利用PDDL+中的计划、执行和重新计划框架,本文引入了离散化PDDL+计划修复问题,该问题允许人们根据一些定义的约束来修复现有的计划。我们展示了引入的概念的理论含义,并引入了使用领域独立规划引擎来解决问题的重新表述。我们的研究结果表明,这种重新规划可以胜过从头开始的重新规划,并解锁规划引擎,以解决更多具有细粒度离散化的问题。
{"title":"Fixing Plans for PDDL+ Problems: Theoretical and Practical Implications","authors":"Francesco Percassi, Enrico Scala, M. Vallati","doi":"10.1609/icaps.v33i1.27210","DOIUrl":"https://doi.org/10.1609/icaps.v33i1.27210","url":null,"abstract":"The plan, execution, and replan framework has proven to be extremely valuable in complex real-world applications, where the dynamics of the environment cannot be fully encoded in the domain model. However, this comes at the cost of regenerating plans from scratch, which can be expensive when expressive formalisms like PDDL+ are used. Given the complexity of generating PDDL+ plans, it would be ideal to reuse as much as possible of an existing plan, rather than generating a new one from scratch every time. To support more effective exploitation of the plan, execution, and replan framework in PDDL+, in this paper, we introduce the problem of discretized PDDL+ plan fixing, which allows one to fix existing plans according to some defined constraints. We demonstrate the theoretical implications of the introduced notion and introduce reformulations to address the problem using domain-independent planning engines. Our results show that such reformulations can outperform replanning from scratch and unlock planning engines to solve more problems with fine-grained discretizations.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126625821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Landmark Progression in Heuristic Search 启发式搜索的里程碑式进展
Pub Date : 2023-07-01 DOI: 10.1609/icaps.v33i1.27180
Clemens Büchner, Thomas Keller, Salomé Eriksson, M. Helmert
The computation of high-quality landmarks and orderings for heuristicstate-space search is often prohibitively expensive to be performed inevery generated state. Computing information only for the initialstate and progressing it from every state to its successors is asuccessful alternative, exploited for example in classical planning bythe LAMA planner. We propose a general framework for using landmarksin any kind of best-first search. Its core component, the progressionfunction, uses orderings and search history to determine whichlandmarks must still be achieved. We show that the progressionfunction that is used in LAMA infers invalid information in thepresence of reasonable orderings. We define a sound progressionfunction that allows to exploit reasonable orderings in cost-optimalplanning and show empirically that our new progression function isbeneficial both in satisficing and optimal planning.
启发式状态空间搜索的高质量地标和排序的计算通常非常昂贵,无法在每个生成的状态中执行。仅计算初始状态的信息并将其从每个状态推进到后续状态是一种成功的替代方案,例如在经典规划中被LAMA规划器利用。我们提出了在任何类型的最佳优先搜索中使用地标的一般框架。它的核心组件,进度功能,使用排序和搜索历史来确定哪些地标必须达到。我们证明了在LAMA中使用的递进函数在存在合理排序的情况下推断无效信息。我们定义了一个合理的进度函数,允许在成本最优规划中开发合理的排序,并通过经验证明我们的新进度函数在满足和最优规划中都是有益的。
{"title":"Landmark Progression in Heuristic Search","authors":"Clemens Büchner, Thomas Keller, Salomé Eriksson, M. Helmert","doi":"10.1609/icaps.v33i1.27180","DOIUrl":"https://doi.org/10.1609/icaps.v33i1.27180","url":null,"abstract":"The computation of high-quality landmarks and orderings for heuristic\u0000state-space search is often prohibitively expensive to be performed in\u0000every generated state. Computing information only for the initial\u0000state and progressing it from every state to its successors is a\u0000successful alternative, exploited for example in classical planning by\u0000the LAMA planner. We propose a general framework for using landmarks\u0000in any kind of best-first search. Its core component, the progression\u0000function, uses orderings and search history to determine which\u0000landmarks must still be achieved. We show that the progression\u0000function that is used in LAMA infers invalid information in the\u0000presence of reasonable orderings. We define a sound progression\u0000function that allows to exploit reasonable orderings in cost-optimal\u0000planning and show empirically that our new progression function is\u0000beneficial both in satisficing and optimal planning.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128975246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model Checking for Adversarial Multi-Agent Reinforcement Learning with Reactive Defense Methods 基于反应防御的对抗多智能体强化学习模型检验
Pub Date : 2023-07-01 DOI: 10.1609/icaps.v33i1.27191
Dennis Gross, C. Schmidl, N. Jansen, G. Pérez
Cooperative multi-agent reinforcement learning (CMARL) enables agents to achieve a common objective. However, the safety (a.k.a. robustness) of the CMARL agents operating in critical environments is not guaranteed. In particular, agents are susceptible to adversarial noise in their observations that can mislead their decision-making.So-called denoisers aim to remove adversarial noise from observations, yet, they are often error-prone.A key challenge for any rigorous safety verification technique in CMARL settings is the large number of states and transitions, which generally prohibits the construction of a (monolithic) model of the whole system.In this paper, we present a verification method for CMARL agents in settings with or without adversarial attacks or denoisers.Our method relies on a tight integration of CMARL and a verification technique referred to as model checking.We showcase the applicability of our method on various benchmarks from different domains.Our experiments show that our method is indeed suited to verify CMARL agents and that it scales better than a naive approach to model checking.
协作式多智能体强化学习(CMARL)使智能体能够实现一个共同的目标。然而,在关键环境中运行的CMARL代理的安全性(即鲁棒性)并不能得到保证。特别是,代理人在他们的观察中容易受到对抗性噪音的影响,这可能会误导他们的决策。所谓的去噪器旨在从观测中去除对抗性噪声,然而,它们往往容易出错。在CMARL设置中,任何严格的安全验证技术的一个关键挑战是大量的状态和转换,这通常禁止构建整个系统的(单片)模型。在本文中,我们提出了一种CMARL代理在有或没有对抗性攻击或去噪设置下的验证方法。我们的方法依赖于CMARL和一种被称为模型检查的验证技术的紧密集成。我们展示了我们的方法在不同领域的各种基准测试中的适用性。我们的实验表明,我们的方法确实适合于验证CMARL代理,并且它比简单的模型检查方法具有更好的扩展性。
{"title":"Model Checking for Adversarial Multi-Agent Reinforcement Learning with Reactive Defense Methods","authors":"Dennis Gross, C. Schmidl, N. Jansen, G. Pérez","doi":"10.1609/icaps.v33i1.27191","DOIUrl":"https://doi.org/10.1609/icaps.v33i1.27191","url":null,"abstract":"Cooperative multi-agent reinforcement learning (CMARL) enables agents to achieve a common objective. \u0000However, the safety (a.k.a. robustness) of the CMARL agents operating in critical environments is not guaranteed. \u0000In particular, agents are susceptible to adversarial noise in their observations that can mislead their decision-making.\u0000So-called denoisers aim to remove adversarial noise from observations, yet, they are often error-prone.\u0000A key challenge for any rigorous safety verification technique in CMARL settings is the large number of states and transitions, which generally prohibits the construction of a (monolithic) model of the whole system.\u0000In this paper, we present a verification method for CMARL agents in settings with or without adversarial attacks or denoisers.\u0000Our method relies on a tight integration of CMARL and a verification technique referred to as model checking.\u0000We showcase the applicability of our method on various benchmarks from different domains.\u0000Our experiments show that our method is indeed suited to verify CMARL agents and that it scales better than a naive approach to model checking.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130147427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Efficient Hybrid Genetic Algorithm for the Quadratic Traveling Salesman Problem 二次型旅行商问题的一种高效混合遗传算法
Pub Date : 2023-07-01 DOI: 10.1609/icaps.v33i1.27212
Quang Anh Pham, H. Lau, Minh Hoàng Hà, Lam Vu
The traveling salesman problem (TSP) is the most well-known problem in combinatorial optimization which has been studied for many decades. This paper focuses on dealing with one of the most difficult TSP variants named the quadratic traveling salesman problem (QTSP) that has numerous planning applications in robotics and bioinformatics. The goal of QTSP is similar to TSP which finds a cycle visiting all nodes exactly once with minimum total costs. However, the costs in QTSP are associated with three vertices traversed in succession (instead of two like in TSP). This leads to a quadratic objective function that is much harder to solve. To efficiently solve the problem, we propose a hybrid genetic algorithm including a local search procedure for intensification and a new mutation operator for diversification. The local search is composed of a restricted double-bridge move (a variant of 4-Opt); and we show the neighborhood can be evaluated in O(n^2), the same complexity as for the classical TSP. The mutation phase is inspired by a ruin-and-recreate scheme. Experimental results conducted on benchmark instances show that our method significantly outperforms state-of-the-art algorithms in terms of solution quality. Out of 800 considered instances, it finds 437 new best-known solutions.
旅行商问题(TSP)是组合优化中最著名的问题,已经被研究了几十年。本文重点研究了在机器人和生物信息学中有许多规划应用的二次旅行推销员问题(quadratic traveling salesman problem, QTSP)。QTSP的目标类似于TSP,它找到一个以最小总成本访问所有节点的周期。然而,QTSP中的成本与连续遍历的三个顶点相关(而不是像TSP中的两个)。这导致二次目标函数更难求解。为了有效地解决这一问题,我们提出了一种混合遗传算法,其中包括局部搜索过程的强化和新的变异算子的多样化。局部搜索由受限双桥移动(4-Opt的一种变体)组成;我们证明了邻域可以在O(n^2)内求值,与经典TSP的复杂度相同。突变阶段的灵感来自于“破坏-重建”方案。在基准实例上进行的实验结果表明,我们的方法在解决质量方面明显优于最先进的算法。在考虑的800个实例中,它发现了437个新的最知名的解决方案。
{"title":"An Efficient Hybrid Genetic Algorithm for the Quadratic Traveling Salesman Problem","authors":"Quang Anh Pham, H. Lau, Minh Hoàng Hà, Lam Vu","doi":"10.1609/icaps.v33i1.27212","DOIUrl":"https://doi.org/10.1609/icaps.v33i1.27212","url":null,"abstract":"The traveling salesman problem (TSP) is the most well-known problem in combinatorial optimization which has been studied for many decades. This paper focuses on dealing with one of the most difficult TSP variants named the quadratic traveling salesman problem (QTSP) that has numerous planning applications in robotics and bioinformatics. The goal of QTSP is similar to TSP which finds a cycle visiting all nodes exactly once with minimum total costs. However, the costs in QTSP are associated with three vertices traversed in succession (instead of two like in TSP). This leads to a quadratic objective function that is much harder to solve. To efficiently solve the problem, we propose a hybrid genetic algorithm including a local search procedure for intensification and a new mutation operator for diversification. The local search is composed of a restricted double-bridge move (a variant of 4-Opt); and we show the neighborhood can be evaluated in O(n^2), the same complexity as for the classical TSP. The mutation phase is inspired by a ruin-and-recreate scheme. Experimental results conducted on benchmark instances show that our method significantly outperforms state-of-the-art algorithms in terms of solution quality. Out of 800 considered instances, it finds 437 new best-known solutions.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"109 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121997000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Planning with Multi-Agent Belief Using Justified Perspectives 使用合理视角的多主体信念规划
Pub Date : 2023-07-01 DOI: 10.1609/icaps.v33i1.27193
Guanghua Hu, Tim Miller, N. Lipovetzky
Epistemic planning plays an important role in multi-agent and human-agent interaction domains. Most existing works solve multi-agent epistemic planning problems by either pre-compiling them into classical planning problems; or, using explicit actions and their effects to encode Kripke-based semantics. A recent approach called Planning with Perspectives (PWP) delegates epistemic reasoning in planning to external functions using F-STRIPS, keeping the search within the planning algorithm and lazily evaluating epistemic formulae.Although PWP is expressive and efficient, it models S5 epistemic logic and does not support belief, including false belief. In this paper, we extend the PWP model to handle multi-agent belief by following the intuition that agents believe something they have seen until they see otherwise. We call this justified perspectives. We formalise this notion of multi-agent belief based on the definition of knowledge in PWP. Using experiments on existing epistemic and doxastic planning benchmarks, we show that our belief planner can solve benchmarks more efficiently than the state-of-the-art baseline, and can model some problems that are infeasible to model using propositional-based approaches.
认知规划在多智能体和人-智能体交互领域中起着重要的作用。现有的大多数研究都是通过将多智能体认知规划问题预编译成经典规划问题来解决的;或者,使用显式操作及其效果来编码基于kripke的语义。最近,一种名为“透视规划”(PWP)的方法将规划中的认知推理委托给使用f - strip的外部功能,将搜索保留在规划算法内,并延迟评估认知公式。虽然PWP具有表现力和效率,但它模拟了S5认知逻辑,不支持信念,包括错误的信念。在本文中,我们扩展了PWP模型,通过遵循智能体相信他们所看到的东西直到他们看到其他东西的直觉来处理多智能体信念。我们称之为合理的视角。基于知识的定义,我们形式化了多智能体信念的概念。通过对现有认知和随机规划基准的实验,我们表明我们的信念规划器可以比最先进的基线更有效地解决基准问题,并且可以建模一些使用基于命题的方法无法建模的问题。
{"title":"Planning with Multi-Agent Belief Using Justified Perspectives","authors":"Guanghua Hu, Tim Miller, N. Lipovetzky","doi":"10.1609/icaps.v33i1.27193","DOIUrl":"https://doi.org/10.1609/icaps.v33i1.27193","url":null,"abstract":"Epistemic planning plays an important role in multi-agent and human-agent interaction domains. \u0000Most existing works solve multi-agent epistemic planning problems by either pre-compiling them into classical planning problems; or, using explicit actions and their effects to encode Kripke-based semantics. \u0000A recent approach called Planning with Perspectives (PWP) delegates epistemic reasoning in planning to external functions using F-STRIPS, keeping the search within the planning algorithm and lazily evaluating epistemic formulae.\u0000\u0000Although PWP is expressive and efficient, it models S5 epistemic logic and does not support belief, including false belief. \u0000In this paper, we extend the PWP model to handle multi-agent belief by following the intuition that agents believe something they have seen until they see otherwise. We call this justified perspectives. We formalise this notion of multi-agent belief based on the definition of knowledge in PWP. Using experiments on existing epistemic and doxastic planning benchmarks, we show that our belief planner can solve benchmarks more efficiently than the state-of-the-art baseline, and can model some problems that are infeasible to model using propositional-based approaches.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"56 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126140089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Planning over Integers: Compilations and Undecidability 整数规划:编译和不可判定性
Pub Date : 2023-07-01 DOI: 10.1609/icaps.v33i1.27189
Daniel Gnad, M. Helmert, P. Jonsson, Alexander Shleyfman
Restricted Tasks (RT) are a special case of numeric planning characterized by numeric conditions that involve one numeric variable per formula and numeric effects that allow only the addition of constants. Despite this, RTs form an expressive class whose planning problem is undecidable. The restricted nature of RTs often makes problem modeling awkward and unnecessarily complicated. We show that this can be alleviated by compiling mathematical operations that are not natively supported into RTs using macro-like action sequences. With that, we can encode many features found in general numeric planning such as constant multiplication, addition of linear formulas, and integer division and residue. We demonstrate how our compilations can be used to capture challenging mathematical problems such as the (in)famous Collatz conjecture. Our approach additionally gives a simple undecidability proof for RTs, and the proof shows that the number of variables needed to construct an undecidable class of RTs issurprisingly low: two numeric and one propositional variable.
受限任务(Restricted Tasks, RT)是数字规划的一种特殊情况,其特征是每个公式包含一个数字变量的数字条件和只允许添加常量的数字效果。尽管如此,RTs形成了一个表达性的类,其规划问题是不可确定的。RTs的有限性通常会使问题建模变得笨拙和不必要的复杂。我们表明,这可以通过使用类似宏的动作序列编译RTs中不支持的数学操作来缓解。有了它,我们可以编码在一般数值规划中发现的许多特征,如常数乘法,线性公式的加法,整数除法和剩余。我们将演示如何使用我们的编译来捕获具有挑战性的数学问题,例如(in)著名的Collatz猜想。我们的方法还为RTs提供了一个简单的不可判定性证明,并且证明了构建不可判定类RTs所需的变量数量惊人地低:两个数值变量和一个命题变量。
{"title":"Planning over Integers: Compilations and Undecidability","authors":"Daniel Gnad, M. Helmert, P. Jonsson, Alexander Shleyfman","doi":"10.1609/icaps.v33i1.27189","DOIUrl":"https://doi.org/10.1609/icaps.v33i1.27189","url":null,"abstract":"Restricted Tasks (RT) are a special case of numeric planning characterized by numeric conditions that involve one numeric variable per formula and numeric effects that allow only the addition of constants. Despite this, RTs form an expressive class whose planning problem is undecidable. The restricted nature of RTs often makes problem modeling awkward and unnecessarily complicated. We show that this can be alleviated by compiling mathematical operations that are not natively supported into RTs using macro-like action sequences. With that, we can encode many features found in general numeric planning such as constant multiplication, addition of linear formulas, and integer division and residue. We demonstrate how our compilations can be used to capture challenging mathematical problems such as the (in)famous Collatz conjecture. Our approach additionally gives a simple undecidability proof for RTs, and the proof shows that the number of variables needed to construct an undecidable class of RTs is\u0000surprisingly low: two numeric and one propositional variable.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128513576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Theory of Merge-and-Shrink for Stochastic Shortest Path Problems 随机最短路径问题的合并收缩理论
Pub Date : 2023-07-01 DOI: 10.1609/icaps.v33i1.27196
Thorsten Klößner, Á. Torralba, Marcel Steinmetz, Silvan Sievers
The merge-and-shrink framework is a powerful tool to construct state space abstractions based on factored representations. One of its core applications in classical planning is the construction of admissible abstraction heuristics. In this paper, we develop a compositional theory of merge-and-shrink in the context of probabilistic planning, focusing on stochastic shortest path problems (SSPs). As the basis for this development, we contribute a novel factored state space model for SSPs. We show how general transformations, including abstractions, can be formulated on this model to derive admissible and/or perfect heuristics. To formalize the merge-and-shrink framework for SSPs, we transfer the fundamental merge-and-shrink transformations from the classical setting: shrinking, merging, and label reduction. We analyze the formal properties of these transformations in detail and show how the conditions under which shrinking and label reduction lead to perfect heuristics can be extended to the SSP setting.
合并和收缩框架是一个强大的工具,可以基于因子表示构建状态空间抽象。它在经典规划中的核心应用之一是构建可容许抽象启发式。本文在概率规划的背景下,针对随机最短路径问题(ssp),提出了一种合并收缩的组合理论。作为这一发展的基础,我们为ssp提供了一个新的因子状态空间模型。我们展示了一般的转换,包括抽象,如何在这个模型上公式化,以派生出可接受的和/或完美的启发式。为了形式化ssp的合并和收缩框架,我们从经典设置中转移了基本的合并和收缩转换:收缩、合并和标签缩减。我们详细分析了这些变换的形式性质,并展示了如何将收缩和标签约简导致完美启发式的条件扩展到SSP设置。
{"title":"A Theory of Merge-and-Shrink for Stochastic Shortest Path Problems","authors":"Thorsten Klößner, Á. Torralba, Marcel Steinmetz, Silvan Sievers","doi":"10.1609/icaps.v33i1.27196","DOIUrl":"https://doi.org/10.1609/icaps.v33i1.27196","url":null,"abstract":"The merge-and-shrink framework is a powerful tool to construct state space abstractions based on factored representations. One of its core applications in classical planning is the construction of admissible abstraction heuristics. In this paper, we develop a compositional theory of merge-and-shrink in the context of probabilistic planning, focusing on stochastic shortest path problems (SSPs). As the basis for this development, we contribute a novel factored state space model for SSPs. We show how general transformations, including abstractions, can be formulated on this model to derive admissible and/or perfect heuristics. To formalize the merge-and-shrink framework for SSPs, we transfer the fundamental merge-and-shrink transformations from the classical setting: shrinking, merging, and label reduction. We analyze the formal properties of these transformations in detail and show how the conditions under which shrinking and label reduction lead to perfect heuristics can be extended to the SSP setting.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132323375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Imitation Improvement Learning for Large-Scale Capacitated Vehicle Routing Problems 大规模车辆路径问题的模仿改进学习
Pub Date : 2023-07-01 DOI: 10.1609/icaps.v33i1.27236
Viet The Bui, Tien Mai
Recent works using deep reinforcement learning (RL) to solve routing problems such as the capacitated vehicle routing problem (CVRP) have focused on improvement learning-based methods, which involve improving a given solution until it becomes near-optimal. Although adequate solutions can be achieved for small problem instances, their efficiency degrades for large-scale ones. In this work, we propose a new improvement learning-based framework based on imitation learning where classical heuristics serve as experts to encourage the policy model to mimic and produce similar and better solutions. Moreover, to improve scalability, we propose Clockwise Clustering, a novel augmented framework for decomposing large-scale CVRP into subproblems by clustering sequentially nodes in clockwise order, and then learning to solve them simultaneously. Our approaches enhance state-of-the-art CVRP solvers while attaining competitive solution quality on several well-known datasets, including real-world instances with sizes up to 30,000 nodes. Our best methods are able to achieve new state-of-the-art solutions for several large instances and generalize to a wide range of CVRP variants and solvers. We also contribute new datasets and results to test the generalizability of our deep RL algorithms.
最近使用深度强化学习(RL)来解决路线问题(如有能力车辆路线问题(CVRP))的工作主要集中在改进基于学习的方法上,其中包括改进给定的解决方案,直到它接近最优。尽管对于小问题实例可以获得适当的解决方案,但对于大规模问题实例,它们的效率会降低。在这项工作中,我们提出了一个基于模仿学习的新的改进学习框架,其中经典启发式作为专家来鼓励政策模型模仿并产生类似和更好的解决方案。此外,为了提高可扩展性,我们提出了顺时针聚类,这是一种新的增强框架,通过顺时针顺序聚类节点,将大规模CVRP分解为子问题,然后同时学习解决它们。我们的方法增强了最先进的CVRP求解器,同时在几个知名数据集上获得具有竞争力的解决方案质量,包括规模高达30,000个节点的现实世界实例。我们最好的方法能够为几个大型实例实现新的最先进的解决方案,并推广到广泛的CVRP变体和求解器。我们还提供了新的数据集和结果来测试我们的深度强化学习算法的泛化性。
{"title":"Imitation Improvement Learning for Large-Scale Capacitated Vehicle Routing Problems","authors":"Viet The Bui, Tien Mai","doi":"10.1609/icaps.v33i1.27236","DOIUrl":"https://doi.org/10.1609/icaps.v33i1.27236","url":null,"abstract":"Recent works using deep reinforcement learning (RL) to solve routing problems such as the capacitated vehicle routing problem (CVRP) have focused on improvement learning-based methods, which involve improving a given solution until it becomes near-optimal. Although adequate solutions can be achieved for small problem instances, their efficiency degrades for large-scale ones. In this work, we propose a new improvement learning-based framework based on imitation learning where classical heuristics serve as experts to encourage the policy model to mimic and produce similar and better solutions. Moreover, to improve scalability, we propose Clockwise Clustering, a novel augmented framework for decomposing large-scale CVRP into subproblems by clustering sequentially nodes in clockwise order, and then learning to solve them simultaneously. Our approaches enhance state-of-the-art CVRP solvers while attaining competitive solution quality on several well-known datasets, including real-world instances with sizes up to 30,000 nodes. Our best methods are able to achieve new state-of-the-art solutions for several large instances and generalize to a wide range of CVRP variants and solvers. We also contribute new datasets and results to test the generalizability of our deep RL algorithms.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123821522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combining Heuristic Search and Linear Programming to Compute Realistic Financial Plans 结合启发式搜索和线性规划计算现实理财计划
Pub Date : 2023-07-01 DOI: 10.1609/icaps.v33i1.27233
Alberto Pozanco, Kassiani Papasotiriou, D. Borrajo, M. Veloso
Defining financial goals and formulating actionable plans to achieve them are essential components for ensuring financial health. This task is computationally challenging, given the abundance of factors that can influence one’s financial situation. In this paper, we present the Personal Finance Planner (PFP), which can generate personalized financial plans that consider a person’s context and the likelihood of taking financially related actions to help them achieve their goals. PFP solves the problem in two stages. First, it uses heuristic search to find a high-level sequence of actions that increase the income and reduce spending to help users achieve their financial goals. Next, it uses integer linear programming to determine the best low-level actions to implement the high-level plan. Results show that PFP is able to scale on generating realistic financial plans for complex tasks involving many low level actions and long planning horizons.
确定财务目标并制定可行的计划以实现这些目标,是确保财务健康的重要组成部分。考虑到影响个人财务状况的因素很多,这项任务在计算上具有挑战性。在本文中,我们介绍了个人理财规划师(PFP),它可以生成个性化的财务计划,考虑到一个人的背景和采取财务相关行动帮助他们实现目标的可能性。PFP分两个阶段解决了这个问题。首先,它使用启发式搜索来找到增加收入和减少支出的高级操作序列,以帮助用户实现其财务目标。接下来,它使用整数线性规划来确定实现高级计划的最佳低级操作。结果表明,PFP能够为涉及许多低水平行动和长期规划视野的复杂任务生成现实的财务计划。
{"title":"Combining Heuristic Search and Linear Programming to Compute Realistic Financial Plans","authors":"Alberto Pozanco, Kassiani Papasotiriou, D. Borrajo, M. Veloso","doi":"10.1609/icaps.v33i1.27233","DOIUrl":"https://doi.org/10.1609/icaps.v33i1.27233","url":null,"abstract":"Defining financial goals and formulating actionable plans to achieve them are essential components for ensuring financial health. This task is computationally challenging, given the abundance of factors that can influence one’s financial situation. In this paper, we present the Personal Finance Planner (PFP), which can generate personalized financial plans that consider a person’s context and the likelihood of taking financially related actions to help them achieve their goals. PFP solves the problem in two stages. First, it uses heuristic search to find a high-level sequence of actions that increase the income and reduce spending to help users achieve their financial goals. Next, it uses integer linear programming to determine the best low-level actions to implement the high-level plan. Results show that PFP is able to scale on generating realistic financial plans for complex tasks involving many low level actions and long planning horizons.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"202 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121796555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lifted Stackelberg Planning 取消了Stackelberg计划
Pub Date : 2023-07-01 DOI: 10.1609/icaps.v33i1.27215
Philipp Sauer, Marcel Steinmetz, R. Künnemann, Jörg Hoffmann
In Stackelberg planning, a leader and a follower each choose a plan in the same planning task, the leader's objective being to maximize plan cost for the follower. This formulation naturally captures, among others, security-related scenarios where the leader defends an infrastructure against subsequent attacks by the follower. Indeed, Stackelberg planning has been applied to the analysis of email infrastructure security. At web scale, however, the planning tasks involved easily contain tens of thousands of objects, so that grounding becomes the bottleneck. Here we introduce a lifted form of Stackelberg planning to address this. We devise leader-follower search algorithms working at the level of the PDDL-style input model to the extent possible. Our experiments show that, in Stackelberg tasks with many objects, including in particular models of web infrastructure security, our lifted algorithms outperform grounded Stackelberg planning.
在Stackelberg规划中,领导者和追随者在同一个规划任务中各自选择一个计划,领导者的目标是使追随者的计划成本最大化。这个公式自然地捕获了与安全相关的场景,其中领导者保护基础设施免受后续追随者的攻击。事实上,Stackelberg规划已被应用于电子邮件基础设施安全性的分析。然而,在网络规模下,所涉及的规划任务很容易包含成千上万的对象,因此接地成为瓶颈。在这里,我们引入一种改进的Stackelberg计划来解决这个问题。我们设计了尽可能在pddl风格输入模型级别上工作的领导-追随者搜索算法。我们的实验表明,在具有许多对象的Stackelberg任务中,包括web基础设施安全的特定模型,我们的提升算法优于基础Stackelberg规划。
{"title":"Lifted Stackelberg Planning","authors":"Philipp Sauer, Marcel Steinmetz, R. Künnemann, Jörg Hoffmann","doi":"10.1609/icaps.v33i1.27215","DOIUrl":"https://doi.org/10.1609/icaps.v33i1.27215","url":null,"abstract":"In Stackelberg planning, a leader and a follower each choose a plan in the same planning task, the leader's objective being to maximize plan cost for the follower. This formulation naturally captures, among others, security-related scenarios where the leader defends an infrastructure against subsequent attacks by the follower. Indeed, Stackelberg planning has been applied to the analysis of email infrastructure security. At web scale, however, the planning tasks involved easily contain tens of thousands of objects, so that grounding becomes the bottleneck. Here we introduce a lifted form of Stackelberg planning to address this. We devise leader-follower search algorithms working at the level of the PDDL-style input model to the extent possible. Our experiments show that, in Stackelberg tasks with many objects, including in particular models of web infrastructure security, our lifted algorithms outperform grounded Stackelberg planning.","PeriodicalId":239898,"journal":{"name":"International Conference on Automated Planning and Scheduling","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134109346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
International Conference on Automated Planning and Scheduling
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1