Constrained Environment Optimization for Prioritized Multi-Agent Navigation

IEEE open journal of control systems Pub Date : 2023-09-15 DOI:10.1109/OJCSYS.2023.3316090

Zhan Gao;Amanda Prorok

{"title":"Constrained Environment Optimization for Prioritized Multi-Agent Navigation","authors":"Zhan Gao;Amanda Prorok","doi":"10.1109/OJCSYS.2023.3316090","DOIUrl":null,"url":null,"abstract":"Traditional approaches for multi-agent navigation consider the environment as a fixed constraint, despite the obvious influence of spatial constraints on agents' performance. Yet hand-designing conducive environments is inefficient and potentially expensive. The goal of this article is to consider the obstacle layout of the environment as a decision variable in a system-level optimization problem. In other words, we aim to find an automated solution that optimizes the obstacle layout to improve the performance of multi-agent navigation, under a variety of realistic constraints. Towards this end, we propose novel problems of \n<italic>unprioritized</i>\n and \n<italic>prioritized environment optimization</i>\n, where the former considers agents unbiasedly and the latter incorporates agent priorities into optimization. We show, through formal proofs, under which conditions the environment can change to guarantee completeness (i.e., all agents reach goals), and analyze the role of agent priorities in the environment optimization. We proceed to impose constraints on the environment optimization that correspond to real-world restrictions on obstacle changes, and formulate it mathematically as a constrained stochastic optimization problem. Since the relationship between agents, environment and performance is challenging to model, we leverage reinforcement learning to develop a model-free solution and a primal-dual mechanism to handle constraints. Distinct information processing architectures are integrated for various implementation scenarios, including online/offline optimization and discrete/continuous environment. Numerical results corroborate the theory and demonstrate the validity and adaptability of our approach.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"337-355"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9973428/10251921.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE open journal of control systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10251921/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Traditional approaches for multi-agent navigation consider the environment as a fixed constraint, despite the obvious influence of spatial constraints on agents' performance. Yet hand-designing conducive environments is inefficient and potentially expensive. The goal of this article is to consider the obstacle layout of the environment as a decision variable in a system-level optimization problem. In other words, we aim to find an automated solution that optimizes the obstacle layout to improve the performance of multi-agent navigation, under a variety of realistic constraints. Towards this end, we propose novel problems of unprioritized and prioritized environment optimization , where the former considers agents unbiasedly and the latter incorporates agent priorities into optimization. We show, through formal proofs, under which conditions the environment can change to guarantee completeness (i.e., all agents reach goals), and analyze the role of agent priorities in the environment optimization. We proceed to impose constraints on the environment optimization that correspond to real-world restrictions on obstacle changes, and formulate it mathematically as a constrained stochastic optimization problem. Since the relationship between agents, environment and performance is challenging to model, we leverage reinforcement learning to develop a model-free solution and a primal-dual mechanism to handle constraints. Distinct information processing architectures are integrated for various implementation scenarios, including online/offline optimization and discrete/continuous environment. Numerical results corroborate the theory and demonstrate the validity and adaptability of our approach.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

多智能体优先导航的约束环境优化

传统的多智能体导航方法将环境视为一个固定的约束，尽管空间约束对智能体的性能有明显的影响。然而，手工设计有利的环境效率低下，而且可能成本高昂。本文的目标是将环境的障碍物布局视为系统级优化问题中的决策变量。换句话说，我们的目标是找到一种自动化解决方案，在各种现实约束下优化障碍物布局，以提高多智能体导航的性能。为此，我们提出了新的无优先级和有优先级的环境优化问题，其中前者无约束地考虑代理，而后者将代理优先级纳入优化中。通过形式化证明，我们展示了在什么条件下环境可以改变以保证完整性（即所有代理都达到目标），并分析了代理优先级在环境优化中的作用。我们继续对环境优化施加与现实世界中对障碍物变化的限制相对应的约束，并将其数学公式化为一个受约束的随机优化问题。由于代理、环境和性能之间的关系很难建模，我们利用强化学习来开发无模型解决方案和处理约束的原对偶机制。不同的信息处理架构集成在各种实现场景中，包括在线/离线优化和离散/连续环境。数值结果证实了该理论，并证明了该方法的有效性和适应性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助