{"title":"Data-Efficient Learning-Based Iterative Optimization Method With Time-Varying Prediction Horizon for Multiagent Collaboration","authors":"Bowen Wang;Xinle Gong;Yafei Wang;Rongtao Xu;Hongcheng Huang","doi":"10.1109/JIOT.2024.3497185","DOIUrl":null,"url":null,"abstract":"Learning-based strategy can be well integrated with model-based optimal control to facilitate cooperative multiagent control through the Internet of Things (IoT). In this work, we propose a data-efficient learning-based iterative optimization method with time-varying prediction horizon (TV-LIO) for multiagent collaboration. Our method builds a multiagent optimization problem by introducing a time-domain guided terminal set and an approximated general cost. We collect the historical agent states at previous iterations as a dataset to reconstruct the general cost and the terminal set iteratively, forming closed-loop data-efficient learning. We consider the influence of the predictive time domain on the optimality and feasibility of the optimization problem and design a time-domain recursive updating mechanism to determine the optimal predictive horizon for each agent at the epoch. The continuous feasibility, stability, and recursive convergence of the proposed method are analyzed theoretically. Unlike the traditional optimization approaches that rely on a preplaned reference path, the proposed method integrates the trajectory planning and tracking control for multiple agents. After several iterations, the general cost of the optimization problem monotonically decreases and the optimal states are finally obtained. The proposed approach is validated and the results demonstrate that our approach can obtain the optimal-cost strategy and trajectories with optimizing time domains for the multiagent system.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 6","pages":"7577-7589"},"PeriodicalIF":8.9000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10752569/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Learning-based strategy can be well integrated with model-based optimal control to facilitate cooperative multiagent control through the Internet of Things (IoT). In this work, we propose a data-efficient learning-based iterative optimization method with time-varying prediction horizon (TV-LIO) for multiagent collaboration. Our method builds a multiagent optimization problem by introducing a time-domain guided terminal set and an approximated general cost. We collect the historical agent states at previous iterations as a dataset to reconstruct the general cost and the terminal set iteratively, forming closed-loop data-efficient learning. We consider the influence of the predictive time domain on the optimality and feasibility of the optimization problem and design a time-domain recursive updating mechanism to determine the optimal predictive horizon for each agent at the epoch. The continuous feasibility, stability, and recursive convergence of the proposed method are analyzed theoretically. Unlike the traditional optimization approaches that rely on a preplaned reference path, the proposed method integrates the trajectory planning and tracking control for multiple agents. After several iterations, the general cost of the optimization problem monotonically decreases and the optimal states are finally obtained. The proposed approach is validated and the results demonstrate that our approach can obtain the optimal-cost strategy and trajectories with optimizing time domains for the multiagent system.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.