首页 > 最新文献

IET Control Theory & Applications最新文献

英文 中文
Innovative hull cleaning robot design and control by Laguerre base model predictive control for impedance and vibration management 利用拉盖尔基础模型预测控制进行阻抗和振动管理的创新型船体清洁机器人设计与控制
Pub Date : 2024-07-11 DOI: 10.1049/cth2.12716
Vahid Madanipour, Farid Najafi
The intricate and unpredictable nature of underwater environments and disturbances necessitates the use of model predictive control for the effective operation and inspection of remotely operated vehicles (ROVs). This paper presented an innovative suspension system for a hull‐cleaning robot to control impedance while reducing the vibration of ROV brushes in the presence of environmental disturbances and uncertainties. The use of a model predictive controller that utilizes Laguerre functions results significant reduction in tracking time, and the efficiency of the proposed controller is demonstrated through successful impedance tracking in Z‐direction and vibration reduction in Z and Y directions of the robot in an uncertain environment with disturbance. A prototype robot is built and the controller performance is validated in a real condition and modal analysis theory output with experimental data. The results highlight the effectiveness of the designed suspension system and the developed MPC for real‐world applications where environmental conditions are unpredictable or subject to change while the robot is needed to clean the surface perfectly without scratching the hull.
水下环境和干扰错综复杂且不可预测,因此有必要使用模型预测控制来有效操作和检查遥控潜水器(ROV)。本文介绍了一种用于船体清洁机器人的创新悬挂系统,该系统可在环境干扰和不确定性的情况下控制阻抗,同时减少遥控潜水器刷子的振动。利用拉盖尔函数的模型预测控制器大大缩短了跟踪时间,并通过在不确定的干扰环境中成功实现机器人 Z 方向的阻抗跟踪以及 Z 和 Y 方向的减振,证明了所提控制器的效率。我们制作了一个机器人原型,并在真实条件下验证了控制器的性能,同时将模态分析理论输出与实验数据相结合。结果凸显了所设计的悬挂系统和所开发的 MPC 在实际应用中的有效性,在实际应用中,环境条件是不可预测的或可能发生变化的,而机器人需要在不刮伤船体的情况下完美地清洁表面。
{"title":"Innovative hull cleaning robot design and control by Laguerre base model predictive control for impedance and vibration management","authors":"Vahid Madanipour, Farid Najafi","doi":"10.1049/cth2.12716","DOIUrl":"https://doi.org/10.1049/cth2.12716","url":null,"abstract":"The intricate and unpredictable nature of underwater environments and disturbances necessitates the use of model predictive control for the effective operation and inspection of remotely operated vehicles (ROVs). This paper presented an innovative suspension system for a hull‐cleaning robot to control impedance while reducing the vibration of ROV brushes in the presence of environmental disturbances and uncertainties. The use of a model predictive controller that utilizes Laguerre functions results significant reduction in tracking time, and the efficiency of the proposed controller is demonstrated through successful impedance tracking in Z‐direction and vibration reduction in Z and Y directions of the robot in an uncertain environment with disturbance. A prototype robot is built and the controller performance is validated in a real condition and modal analysis theory output with experimental data. The results highlight the effectiveness of the designed suspension system and the developed MPC for real‐world applications where environmental conditions are unpredictable or subject to change while the robot is needed to clean the surface perfectly without scratching the hull.","PeriodicalId":502998,"journal":{"name":"IET Control Theory & Applications","volume":"34 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141658789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hybrid energy storage array group control strategy for wind power smoothing 用于风电平滑的混合储能阵列群控制策略
Pub Date : 2024-07-06 DOI: 10.1049/cth2.12698
Tong Tong, Le Wei, Yuanye Chen, Fang Fang
With the increase of wind power generation, the safety and economy of power system operations are greatly influenced by the intermittency and fluctuation of wind power. To take the advantage of the complementary characteristics between different energy storage devices, a Hybrid Energy Storage System (HESS) consisting of Battery Energy Storage System (BESS) and Flywheel Energy Storage System (FESS) can alleviate the uncertainty of wind power. This article has proposed a coordinated control strategy through group consensus algorithm based on Model Predictive Control (MPC) for Hybrid Energy Storage Array (HESA) to smooth wind power fluctuations. To allocate power commands to the FESS and BESS, the fluctuation of wind power output is extracted with different frequency domain characteristics as instructions by Empirical Mode Decomposition (EMD) technology. Moreover, a group consensus algorithm based on MPC is proposed to complete the adaptive power allocation of energy storage units. Eventually, the actual wind farm data is used for the simulation to verify the effect of control strategy proposed in this paper. It can be seen that the developed group consensus algorithm based on MPC can cope with different frequency power commands, avoid overcharging and discharging of energy storage media, and smooth wind power effectively.
随着风力发电量的增加,电力系统运行的安全性和经济性受到风力发电间歇性和波动性的极大影响。为了利用不同储能设备之间的互补性,由电池储能系统(BESS)和飞轮储能系统(FESS)组成的混合储能系统(HESS)可以缓解风力发电的不确定性。本文提出了一种基于模型预测控制(MPC)的混合储能阵列(HESA)群组共识算法协调控制策略,以平滑风电波动。为了将功率指令分配给 FESS 和 BESS,利用经验模式分解(EMD)技术提取了具有不同频域特征的风电输出波动。此外,还提出了一种基于 MPC 的群体共识算法,以完成储能单元的自适应功率分配。最后,利用实际风电场数据进行仿真,验证本文提出的控制策略的效果。可以看出,所开发的基于 MPC 的群组共识算法能够应对不同频率的功率指令,避免储能介质的过充和过放,并有效平滑风功率。
{"title":"A hybrid energy storage array group control strategy for wind power smoothing","authors":"Tong Tong, Le Wei, Yuanye Chen, Fang Fang","doi":"10.1049/cth2.12698","DOIUrl":"https://doi.org/10.1049/cth2.12698","url":null,"abstract":"With the increase of wind power generation, the safety and economy of power system operations are greatly influenced by the intermittency and fluctuation of wind power. To take the advantage of the complementary characteristics between different energy storage devices, a Hybrid Energy Storage System (HESS) consisting of Battery Energy Storage System (BESS) and Flywheel Energy Storage System (FESS) can alleviate the uncertainty of wind power. This article has proposed a coordinated control strategy through group consensus algorithm based on Model Predictive Control (MPC) for Hybrid Energy Storage Array (HESA) to smooth wind power fluctuations. To allocate power commands to the FESS and BESS, the fluctuation of wind power output is extracted with different frequency domain characteristics as instructions by Empirical Mode Decomposition (EMD) technology. Moreover, a group consensus algorithm based on MPC is proposed to complete the adaptive power allocation of energy storage units. Eventually, the actual wind farm data is used for the simulation to verify the effect of control strategy proposed in this paper. It can be seen that the developed group consensus algorithm based on MPC can cope with different frequency power commands, avoid overcharging and discharging of energy storage media, and smooth wind power effectively.","PeriodicalId":502998,"journal":{"name":"IET Control Theory & Applications","volume":" 46","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141673003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal data injection attack design for spacecraft systems via a model free Q‐learning approach 通过无模型 Q-learning 方法优化航天器系统的数据注入攻击设计
Pub Date : 2024-06-14 DOI: 10.1049/cth2.12685
Huanhuan Yuan, Mengbi Wang, Chao Xi
This paper aims to analyse the dynamic response of a corrupted spacecraft rendezvous system from the perspective of attacker. The optimal data injection attack problem is formulated by constructing a tradeoff cost function in a quadratic form. First, the optimal attack strategy and associated sufficient condition for its existence are derived similar to optimal control for attacker without being detected. Breaking the assumption in most existing works, the goal of this paper is to explore the optimal attack strategy without knowing system matrices. A model free Q‐learning approach is designed with the application to solve attacker's optimization problem. Critic network and action network are used to adaptive tuning the value and action for attacker in a forward time. For a more practical situation, a model free attack strategy design is implemented only based on measured input/output data. Finally, the simulation results on the spacecraft system are presented to show the effectiveness of the proposed method for model free attack strategy design.
本文旨在从攻击者的角度分析被破坏的航天器会合系统的动态响应。通过构建二次函数形式的权衡成本函数,提出了最优数据注入攻击问题。首先,推导出最优攻击策略及其存在的相关充分条件,类似于攻击者在不被检测到的情况下的最优控制。本文打破了大多数现有著作中的假设,目标是在不知道系统矩阵的情况下探索最优攻击策略。本文设计了一种无模型 Q-learning 方法,用于解决攻击者的优化问题。批评网络和行动网络用于在前向时间内自适应地调整攻击者的值和行动。在更实际的情况下,仅根据测量的输入/输出数据实施无模型攻击策略设计。最后,介绍了航天器系统的仿真结果,以说明所提出的无模型攻击策略设计方法的有效性。
{"title":"Optimal data injection attack design for spacecraft systems via a model free Q‐learning approach","authors":"Huanhuan Yuan, Mengbi Wang, Chao Xi","doi":"10.1049/cth2.12685","DOIUrl":"https://doi.org/10.1049/cth2.12685","url":null,"abstract":"This paper aims to analyse the dynamic response of a corrupted spacecraft rendezvous system from the perspective of attacker. The optimal data injection attack problem is formulated by constructing a tradeoff cost function in a quadratic form. First, the optimal attack strategy and associated sufficient condition for its existence are derived similar to optimal control for attacker without being detected. Breaking the assumption in most existing works, the goal of this paper is to explore the optimal attack strategy without knowing system matrices. A model free Q‐learning approach is designed with the application to solve attacker's optimization problem. Critic network and action network are used to adaptive tuning the value and action for attacker in a forward time. For a more practical situation, a model free attack strategy design is implemented only based on measured input/output data. Finally, the simulation results on the spacecraft system are presented to show the effectiveness of the proposed method for model free attack strategy design.","PeriodicalId":502998,"journal":{"name":"IET Control Theory & Applications","volume":"51 48","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141339645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lightweight environment sensing algorithm for intelligent driving based on improved YOLOv7 基于改进型 YOLOv7 的智能驾驶轻量级环境感知算法
Pub Date : 2024-06-09 DOI: 10.1049/cth2.12704
Guoyong Qian, Dongbo Xie, Dawei Bi, Qi Wang, Liqing Chen, Hai Wang
Accurately and quickly detecting obstacles ahead is a prerequisite for intelligent driving. The combined detection scheme of light detection and ranging (LiDAR) and the camera is far more capable of coping with complex road conditions than a single sensor. However, immediately afterward, ensuring the real‐time performance of the sensing algorithms through a significantly increased amount of computation has become a new challenge. For this purpose, the paper introduces an improved dynamic obstacle detection algorithm based on YOLOv7 (You Only Look Once version 7) to overcome the drawbacks of slow and unstable detection of traditional methods. Concretely, Mobilenetv3 supplants the backbone network utilized in the original YOLOv7 architecture, thereby achieving a reduction in computational overhead. It integrates a specialized layer for the detection of small‐scale targets and incorporates a convolutional block attention module to enhance detection efficacy for diminutive obstacles. Furthermore, the framework adopts the Efficient Intersection over Union Loss function, which is specifically designed to mitigate the issue of mutual occlusion among detected objects. On a dataset consisting of 27,362 labelled KITTI data samples, the improved YOLOv7 algorithm achieves 92.6% mean average precision and 82 frames per second, which reduces the Model_size by 85.9% and loses only 1.5% accuracy compared with the traditional YOLOv7 algorithm. In addition, this paper builds a virtual scene to test the improved algorithm and fuses LiDAR and camera data. Experimental results conducted on a test vehicle equipped with a camera and LiDAR sensor demonstrate the effectiveness and significant performance of the method. The improved obstacle detection algorithm proposed in this research can significantly reduce the computational cost of the environment perception task, meet the requirements of real‐world applications, and is crucial for achieving safer and smarter driving.
准确、快速地探测前方障碍物是智能驾驶的先决条件。光探测与测距(LiDAR)和摄像头的组合探测方案远比单一传感器更能应对复杂的路况。然而,紧接着,通过大幅增加计算量来确保传感算法的实时性能就成了新的挑战。为此,本文介绍了一种基于 YOLOv7(You Only Look Once version 7)的改进型动态障碍物检测算法,以克服传统方法检测速度慢和不稳定的缺点。具体来说,Mobilenetv3 取代了原有 YOLOv7 架构中使用的主干网络,从而减少了计算开销。它集成了一个专门用于检测小型目标的层,并加入了一个卷积块注意力模块,以提高对小型障碍物的检测效率。此外,该框架还采用了 "Efficient Intersection over Union Loss "函数,该函数专门用于缓解检测对象之间的相互遮挡问题。在由 27,362 个带标签的 KITTI 数据样本组成的数据集上,改进后的 YOLOv7 算法达到了 92.6% 的平均精度和 82 帧/秒的速度,与传统的 YOLOv7 算法相比,模型大小减少了 85.9%,精度仅降低了 1.5%。此外,本文还建立了一个虚拟场景来测试改进算法,并融合了激光雷达和摄像头数据。在装有摄像头和激光雷达传感器的测试车辆上进行的实验结果证明了该方法的有效性和显著性能。本研究提出的改进型障碍物检测算法能显著降低环境感知任务的计算成本,满足实际应用的要求,对实现更安全、更智能的驾驶至关重要。
{"title":"Lightweight environment sensing algorithm for intelligent driving based on improved YOLOv7","authors":"Guoyong Qian, Dongbo Xie, Dawei Bi, Qi Wang, Liqing Chen, Hai Wang","doi":"10.1049/cth2.12704","DOIUrl":"https://doi.org/10.1049/cth2.12704","url":null,"abstract":"Accurately and quickly detecting obstacles ahead is a prerequisite for intelligent driving. The combined detection scheme of light detection and ranging (LiDAR) and the camera is far more capable of coping with complex road conditions than a single sensor. However, immediately afterward, ensuring the real‐time performance of the sensing algorithms through a significantly increased amount of computation has become a new challenge. For this purpose, the paper introduces an improved dynamic obstacle detection algorithm based on YOLOv7 (You Only Look Once version 7) to overcome the drawbacks of slow and unstable detection of traditional methods. Concretely, Mobilenetv3 supplants the backbone network utilized in the original YOLOv7 architecture, thereby achieving a reduction in computational overhead. It integrates a specialized layer for the detection of small‐scale targets and incorporates a convolutional block attention module to enhance detection efficacy for diminutive obstacles. Furthermore, the framework adopts the Efficient Intersection over Union Loss function, which is specifically designed to mitigate the issue of mutual occlusion among detected objects. On a dataset consisting of 27,362 labelled KITTI data samples, the improved YOLOv7 algorithm achieves 92.6% mean average precision and 82 frames per second, which reduces the Model_size by 85.9% and loses only 1.5% accuracy compared with the traditional YOLOv7 algorithm. In addition, this paper builds a virtual scene to test the improved algorithm and fuses LiDAR and camera data. Experimental results conducted on a test vehicle equipped with a camera and LiDAR sensor demonstrate the effectiveness and significant performance of the method. The improved obstacle detection algorithm proposed in this research can significantly reduce the computational cost of the environment perception task, meet the requirements of real‐world applications, and is crucial for achieving safer and smarter driving.","PeriodicalId":502998,"journal":{"name":"IET Control Theory & Applications","volume":" 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141367446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Smart line planning method for power transmission based on D3QN‐PER algorithm 基于 D3QN-PER 算法的智能输电线路规划方法
Pub Date : 2024-06-09 DOI: 10.1049/cth2.12689
Guojun Nan, Zixiang Shen, Haibo Du, Lanlin Yu, Wenwu Zhu
The planning of power transmission line projects encompasses vast and complex geographical terrains. To address the complexity of transmission line planning and achieve lower line costs, this study proposes a novel intelligent line planning method. For the first time, it combines the Dueling Double Deep Q Network (D3QN) with the prioritized experience replay (PER) mechanism. First, correlate the reward function with metrics such as line length, number of corner points, and geographical environmental data, which are pertinent to the construction costs of power transmission line. Second, the D3QN algorithm is formulated by integrating Double DQN and Dueling DQN. The network's input information is divided into two components during training, aligning with the characteristics of power transmission line planning projects. Finally, the convergence efficiency of the algorithm is improved by using the PER mechanism for the problem of cost difference due to the different number of corner points in the planning path. In order to test the feasibility of the algorithm, we conducted experiments using real maps. Compared with the traditional ant colony optimization (ACO) algorithm, the D3QN‐PER deep reinforcement learning algorithm reduces the line length by more than 4% and the number of corner points by more than 60%.
输电线路项目规划涉及广阔而复杂的地理地形。为解决输电线路规划的复杂性并降低线路成本,本研究提出了一种新颖的智能线路规划方法。它首次将决斗双深 Q 网络(D3QN)与优先经验重放(PER)机制相结合。首先,将奖励函数与线路长度、转角点数量和地理环境数据等指标相关联,这些指标与输电线路的建设成本息息相关。其次,通过整合双DQN和决斗DQN,制定了D3QN算法。在训练过程中,根据输电线路规划项目的特点,将网络的输入信息分为两部分。最后,针对规划路径中角点数量不同导致的成本差异问题,利用 PER 机制提高了算法的收敛效率。为了检验算法的可行性,我们使用真实地图进行了实验。与传统的蚁群优化(ACO)算法相比,D3QN-PER 深度强化学习算法的线路长度减少了 4% 以上,角点数量减少了 60% 以上。
{"title":"Smart line planning method for power transmission based on D3QN‐PER algorithm","authors":"Guojun Nan, Zixiang Shen, Haibo Du, Lanlin Yu, Wenwu Zhu","doi":"10.1049/cth2.12689","DOIUrl":"https://doi.org/10.1049/cth2.12689","url":null,"abstract":"The planning of power transmission line projects encompasses vast and complex geographical terrains. To address the complexity of transmission line planning and achieve lower line costs, this study proposes a novel intelligent line planning method. For the first time, it combines the Dueling Double Deep Q Network (D3QN) with the prioritized experience replay (PER) mechanism. First, correlate the reward function with metrics such as line length, number of corner points, and geographical environmental data, which are pertinent to the construction costs of power transmission line. Second, the D3QN algorithm is formulated by integrating Double DQN and Dueling DQN. The network's input information is divided into two components during training, aligning with the characteristics of power transmission line planning projects. Finally, the convergence efficiency of the algorithm is improved by using the PER mechanism for the problem of cost difference due to the different number of corner points in the planning path. In order to test the feasibility of the algorithm, we conducted experiments using real maps. Compared with the traditional ant colony optimization (ACO) algorithm, the D3QN‐PER deep reinforcement learning algorithm reduces the line length by more than 4% and the number of corner points by more than 60%.","PeriodicalId":502998,"journal":{"name":"IET Control Theory & Applications","volume":" 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141367311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced LSTM‐DQN algorithm for a two‐player zero‐sum game in three‐dimensional space 三维空间双人零和博弈的增强型 LSTM-DQN 算法
Pub Date : 2024-05-14 DOI: 10.1049/cth2.12677
Bo Lu, L. Ru, Maolong Lv, Shiguang Hu, Hongguo Zhang, Zilong Zhao
To tackle the challenges presented by the two‐player zero sum game (TZSG) in three‐dimensional space, this study introduces an enhanced deep Q‐learning (DQN) algorithm that utilizes long short term memory (LSTM) network. The primary objective of this algorithm is to enhance the temporal correlation of the TZSG in three‐dimensional space. Additionally, it incorporates the hindsight experience replay (HER) mechanism to improve the learning efficiency of the network and mitigate the issue of the “sparse reward” that arises from prolonged training of intelligence in solving the TZSG in the three‐dimensional. Furthermore, this method enhances the convergence and stability of the overall solution.An intelligent training environment centred around an airborne agent and its mutual pursuit interaction scenario was designed to proposed approach's effectiveness. The algorithm training and comparison results show that the LSTM‐DQN‐HER algorithm outperforms similar algorithm in solving the TZSG in three‐dimensional space. In conclusion, this paper presents an improved DQN algorithm based on LSTM and incorporates the HER mechanism to address the challenges posed by the TZSG in three‐dimensional space. The proposed algorithm enhances the solution's temporal correlation, learning efficiency, convergence, and stability. The simulation results confirm its superior performance in solving the TZSG in three‐dimensional space.
为应对三维空间中的双人零和博弈(TZSG)所带来的挑战,本研究引入了一种利用长短期记忆(LSTM)网络的增强型深度 Q-learning (DQN)算法。该算法的主要目标是增强三维空间中 TZSG 的时间相关性。此外,它还结合了事后经验重放(HER)机制,以提高网络的学习效率,并缓解在解决三维空间中的 TZSG 时,由于长时间的智能训练而产生的 "奖励稀疏 "问题。此外,该方法还增强了整体求解的收敛性和稳定性。为了验证所提方法的有效性,我们设计了一个以机载代理及其相互追逐交互场景为中心的智能训练环境。算法训练和对比结果表明,LSTM-DQN-HER 算法在求解三维空间中的 TZSG 时优于同类算法。总之,本文提出了一种基于 LSTM 并结合 HER 机制的改进 DQN 算法,以解决三维空间中的 TZSG 所带来的挑战。所提出的算法增强了解的时间相关性、学习效率、收敛性和稳定性。仿真结果证实了该算法在求解三维空间中的 TZSG 时的卓越性能。
{"title":"Enhanced LSTM‐DQN algorithm for a two‐player zero‐sum game in three‐dimensional space","authors":"Bo Lu, L. Ru, Maolong Lv, Shiguang Hu, Hongguo Zhang, Zilong Zhao","doi":"10.1049/cth2.12677","DOIUrl":"https://doi.org/10.1049/cth2.12677","url":null,"abstract":"To tackle the challenges presented by the two‐player zero sum game (TZSG) in three‐dimensional space, this study introduces an enhanced deep Q‐learning (DQN) algorithm that utilizes long short term memory (LSTM) network. The primary objective of this algorithm is to enhance the temporal correlation of the TZSG in three‐dimensional space. Additionally, it incorporates the hindsight experience replay (HER) mechanism to improve the learning efficiency of the network and mitigate the issue of the “sparse reward” that arises from prolonged training of intelligence in solving the TZSG in the three‐dimensional. Furthermore, this method enhances the convergence and stability of the overall solution.An intelligent training environment centred around an airborne agent and its mutual pursuit interaction scenario was designed to proposed approach's effectiveness. The algorithm training and comparison results show that the LSTM‐DQN‐HER algorithm outperforms similar algorithm in solving the TZSG in three‐dimensional space. In conclusion, this paper presents an improved DQN algorithm based on LSTM and incorporates the HER mechanism to address the challenges posed by the TZSG in three‐dimensional space. The proposed algorithm enhances the solution's temporal correlation, learning efficiency, convergence, and stability. The simulation results confirm its superior performance in solving the TZSG in three‐dimensional space.","PeriodicalId":502998,"journal":{"name":"IET Control Theory & Applications","volume":"91 16","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140978382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust optimal tracking control of multiple autonomous underwater vehicles subject to uncertain disturbances 受不确定干扰影响的多自主水下航行器的鲁棒优化跟踪控制
Pub Date : 2024-05-10 DOI: 10.1049/cth2.12671
Guan Huang, Zhuo Zhang, Weisheng Yan, Rongxin Cui, Shouxu Zhang, Xinxin Guo
This paper considers the problem of robust optimal tracking control of multiple autonomous underwater Vehicles (AUVs) subject to uncertain external disturbances. First, the Takagi‐Sugeno (T‐S) fuzzy based technique is utilized to convert the high‐order nonlinear multi‐AUV system into a series of linearized subsystems. Second, a novel fully distributed sliding mode control (FDSMC) strategy is proposed to attenuate the disturbances. Meanwhile, the leader‐following consensus and the nearly optimization of the energy‐cost function for the multi‐AUV system can be achieved simultaneously through the designed optimal nominal control protocol. Moreover, the proposed control strategy has more mild constraints on the communication topologies. Finally, the effectiveness of the proposed FDSMC strategy is verified by numerical simulation studies.
本文探讨了受不确定外部干扰影响的多自主水下航行器(AUV)的鲁棒性优化跟踪控制问题。首先,利用基于高木-菅野(Takagi-Sugeno,T-S)模糊技术将高阶非线性多自主潜航器系统转换为一系列线性化子系统。其次,提出了一种新颖的全分布式滑模控制(FDSMC)策略来减弱干扰。同时,通过设计的最优标称控制协议,可同时实现多无人飞行器系统的领导-跟随共识和能量成本函数的近乎最优化。此外,所提出的控制策略对通信拓扑的约束更为温和。最后,通过数值模拟研究验证了所提出的 FDSMC 策略的有效性。
{"title":"Robust optimal tracking control of multiple autonomous underwater vehicles subject to uncertain disturbances","authors":"Guan Huang, Zhuo Zhang, Weisheng Yan, Rongxin Cui, Shouxu Zhang, Xinxin Guo","doi":"10.1049/cth2.12671","DOIUrl":"https://doi.org/10.1049/cth2.12671","url":null,"abstract":"This paper considers the problem of robust optimal tracking control of multiple autonomous underwater Vehicles (AUVs) subject to uncertain external disturbances. First, the Takagi‐Sugeno (T‐S) fuzzy based technique is utilized to convert the high‐order nonlinear multi‐AUV system into a series of linearized subsystems. Second, a novel fully distributed sliding mode control (FDSMC) strategy is proposed to attenuate the disturbances. Meanwhile, the leader‐following consensus and the nearly optimization of the energy‐cost function for the multi‐AUV system can be achieved simultaneously through the designed optimal nominal control protocol. Moreover, the proposed control strategy has more mild constraints on the communication topologies. Finally, the effectiveness of the proposed FDSMC strategy is verified by numerical simulation studies.","PeriodicalId":502998,"journal":{"name":"IET Control Theory & Applications","volume":" 80","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140990657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DQN based coverage control for multi‐agent system in line intersection region 基于 DQN 的线路交叉区域多代理系统覆盖控制
Pub Date : 2024-04-22 DOI: 10.1049/cth2.12670
Zuo Lei, Tengfei Zhang, Zhang Jinqi, Yan Maode
Generally, the coverage control is studied in a convex region, in which the agent kinematics and the coverage environment both have strong limitations. It is difficult to directly apply these results to practical scenarios, such as the road environment or indoor environment. In this study, the multi‐agent coverage control problems in a line intersection region is investigated, where the agents can only move along the given lines. To present the agents motion in this line intersection region, the moving directions and velocities of the agents are analyzed in the first part. Then, the coverage control model for the multi‐agent system in line intersection region is presented, in which the cost function is provided based on the agent's minimum moving distance and the agent motions are used as the constraints. To solve this constrained coverage problem, the deep Q‐learning network (DQN) is employed to find the optimal positions for each agent in the line intersection region. In final, numerical simulations are presented to validate the feasibility and effectiveness of proposed approaches.
一般来说,覆盖控制是在一个凸区域内研究的,在这个区域内,代理运动学和覆盖环境都有很大的局限性。这些结果很难直接应用于实际场景,如道路环境或室内环境。在本研究中,研究了线形交叉区域中的多机器人覆盖控制问题,在该区域中,机器人只能沿着给定的线移动。为了呈现代理在该线路交叉区域的运动情况,首先分析了代理的移动方向和速度。然后,提出了线交叉区域内多代理系统的覆盖控制模型,其中成本函数基于代理的最小移动距离,代理运动作为约束条件。为了解决这个受约束的覆盖问题,采用了深度 Q-learning 网络(DQN)来找到每个代理在线路交叉区域的最佳位置。最后,通过数值模拟验证了建议方法的可行性和有效性。
{"title":"DQN based coverage control for multi‐agent system in line intersection region","authors":"Zuo Lei, Tengfei Zhang, Zhang Jinqi, Yan Maode","doi":"10.1049/cth2.12670","DOIUrl":"https://doi.org/10.1049/cth2.12670","url":null,"abstract":"Generally, the coverage control is studied in a convex region, in which the agent kinematics and the coverage environment both have strong limitations. It is difficult to directly apply these results to practical scenarios, such as the road environment or indoor environment. In this study, the multi‐agent coverage control problems in a line intersection region is investigated, where the agents can only move along the given lines. To present the agents motion in this line intersection region, the moving directions and velocities of the agents are analyzed in the first part. Then, the coverage control model for the multi‐agent system in line intersection region is presented, in which the cost function is provided based on the agent's minimum moving distance and the agent motions are used as the constraints. To solve this constrained coverage problem, the deep Q‐learning network (DQN) is employed to find the optimal positions for each agent in the line intersection region. In final, numerical simulations are presented to validate the feasibility and effectiveness of proposed approaches.","PeriodicalId":502998,"journal":{"name":"IET Control Theory & Applications","volume":"5 11","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140675105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple‐missile fixed‐time integrated guidance and control design with multi‐stage interconnected observers under impact angle and input saturation constraints 在撞击角和输入饱和约束条件下,利用多级互联观测器进行多枚导弹定时综合制导与控制设计
Pub Date : 2024-04-12 DOI: 10.1049/cth2.12658
Dingye Zhang, Hang Yu, Keren Dai, Wenjun Yi, He Zhang, Zhiming Lei
In this paper, a novel three‐dimensional fixed‐time integrated guidance and control (IGC) scheme with multi‐stage interconnected observers is proposed for cooperative attacks using multiple missiles against a maneuvering target under impact angle and input saturation constraints. External disturbances, modeling errors, and aerodynamic parameter variations are considered as system uncertainties and a three‐channel fully coupled IGC model for multiple missiles is established. The IGC system is designed optimally based on fixed‐time stability theory, sliding mode control, and the backstepping technique. Three inter‐cascaded fixed‐time disturbance observers based on an improved super‐twisting algorithm are designed to estimate and compensate for system uncertainties. Second‐order command filters are used to constrain virtual control signals, and additional filtering error subsystems are introduced to compensate for the tracking errors of filters. System stability and uniformly ultimately fixed‐time boundedness of all states are proven using the Lyapunov stability theory. Finally, the limits of the acceleration components of the maneuvering target perpendicular to the line of sight direction are derived. The effectiveness of the designed IGC scheme and the ability of multi‐stage interconnected observers to sense disturbances with each other are verified through simulations.
本文提出了一种新型三维固定时间综合制导与控制(IGC)方案,该方案具有多级互联观测器,适用于在撞击角和输入饱和约束条件下使用多枚导弹对机动目标进行合作攻击。外部干扰、建模误差和空气动力参数变化被视为系统的不确定性因素,并建立了多枚导弹的三通道全耦合 IGC 模型。基于定时稳定性理论、滑模控制和反步进技术,对 IGC 系统进行了优化设计。设计了三个基于改进的超扭曲算法的级联固定时间扰动观测器,以估计和补偿系统的不确定性。使用二阶指令滤波器来约束虚拟控制信号,并引入额外的滤波误差子系统来补偿滤波器的跟踪误差。利用 Lyapunov 稳定性理论证明了系统稳定性和所有状态的均匀最终固定时间约束性。最后,推导出了机动目标垂直于视线方向的加速度分量极限。通过仿真验证了所设计的 IGC 方案的有效性以及多级互联观测器相互感知干扰的能力。
{"title":"Multiple‐missile fixed‐time integrated guidance and control design with multi‐stage interconnected observers under impact angle and input saturation constraints","authors":"Dingye Zhang, Hang Yu, Keren Dai, Wenjun Yi, He Zhang, Zhiming Lei","doi":"10.1049/cth2.12658","DOIUrl":"https://doi.org/10.1049/cth2.12658","url":null,"abstract":"In this paper, a novel three‐dimensional fixed‐time integrated guidance and control (IGC) scheme with multi‐stage interconnected observers is proposed for cooperative attacks using multiple missiles against a maneuvering target under impact angle and input saturation constraints. External disturbances, modeling errors, and aerodynamic parameter variations are considered as system uncertainties and a three‐channel fully coupled IGC model for multiple missiles is established. The IGC system is designed optimally based on fixed‐time stability theory, sliding mode control, and the backstepping technique. Three inter‐cascaded fixed‐time disturbance observers based on an improved super‐twisting algorithm are designed to estimate and compensate for system uncertainties. Second‐order command filters are used to constrain virtual control signals, and additional filtering error subsystems are introduced to compensate for the tracking errors of filters. System stability and uniformly ultimately fixed‐time boundedness of all states are proven using the Lyapunov stability theory. Finally, the limits of the acceleration components of the maneuvering target perpendicular to the line of sight direction are derived. The effectiveness of the designed IGC scheme and the ability of multi‐stage interconnected observers to sense disturbances with each other are verified through simulations.","PeriodicalId":502998,"journal":{"name":"IET Control Theory & Applications","volume":"7 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140710599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Differential graphical game‐based multi‐agent tracking control using integral reinforcement learning 利用积分强化学习实现基于差分图形游戏的多代理跟踪控制
Pub Date : 2024-04-12 DOI: 10.1049/cth2.12667
Yaning Guo, Qi Sun, Yintao Wang, Quan Pan
This paper studies the cooperative tracking control problem of interacted multi‐agent systems (MASs) under undirected communication. Based on differential graphical game theory, the MAS tracking control problem is formulated as an infinite horizon cooperative differential graphical game‐theoretic tracking control framework, where a multi‐objective optimization problem is designed and then cast into a Pareto‐equivalent single‐objective optimization problem using a scalarization method. Necessary and sufficient conditions for the existence of the Pareto‐optimal strategy to the game theoretic tracking control are established, where it has been proven that the solution to the integral Bellman optimality equation leads to Pareto‐optimal strategy. Then, an off‐policy integral reinforcement learning scheme to find optimal control strategy using a pure data‐driven manner is developed, which consumes less computation efforts than the traditional learning scheme. Simulated results are conducted to validate the effectiveness of the proposed game and IRL‐based tracking control method.
本文研究了无定向通信条件下交互式多代理系统(MAS)的合作跟踪控制问题。基于微分图式博弈论,将 MAS 跟踪控制问题表述为一个无限视界合作微分图式博弈论跟踪控制框架,设计了一个多目标优化问题,并利用标量化方法将其转化为帕累托最优单目标优化问题。建立了博弈论跟踪控制帕累托最优策略存在的必要条件和充分条件,证明了积分贝尔曼最优方程的解会导致帕累托最优策略。然后,开发了一种非策略积分强化学习方案,以纯数据驱动的方式找到最优控制策略,与传统学习方案相比计算量更小。模拟结果验证了所提出的博弈和基于 IRL 的跟踪控制方法的有效性。
{"title":"Differential graphical game‐based multi‐agent tracking control using integral reinforcement learning","authors":"Yaning Guo, Qi Sun, Yintao Wang, Quan Pan","doi":"10.1049/cth2.12667","DOIUrl":"https://doi.org/10.1049/cth2.12667","url":null,"abstract":"This paper studies the cooperative tracking control problem of interacted multi‐agent systems (MASs) under undirected communication. Based on differential graphical game theory, the MAS tracking control problem is formulated as an infinite horizon cooperative differential graphical game‐theoretic tracking control framework, where a multi‐objective optimization problem is designed and then cast into a Pareto‐equivalent single‐objective optimization problem using a scalarization method. Necessary and sufficient conditions for the existence of the Pareto‐optimal strategy to the game theoretic tracking control are established, where it has been proven that the solution to the integral Bellman optimality equation leads to Pareto‐optimal strategy. Then, an off‐policy integral reinforcement learning scheme to find optimal control strategy using a pure data‐driven manner is developed, which consumes less computation efforts than the traditional learning scheme. Simulated results are conducted to validate the effectiveness of the proposed game and IRL‐based tracking control method.","PeriodicalId":502998,"journal":{"name":"IET Control Theory & Applications","volume":"75 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140710997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IET Control Theory & Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1