Shihang Gao, Xu Yang, Jian Huang, Rang Tu, Tao Zhang, Qing Li
Heating, ventilation and air conditioning (HVAC) chilled water systems offer significant potential for energy saving and reinforcement learning (RL) methods have been extensively studied and validated for optimizing HVAC energy consumption. However, RL's low sample efficiency and reliance on randomized exploration limit its practical application. To enhance the robustness and stability of RL-based energy optimization methods, an RL optimization approach based on a mechanism-data hybrid-driven model is proposed, derived from the MBPO (Model-based policy optimization) scheme. Firstly, a novel end-to-end HVAC chilled water system model is developed to serve as the foundation for the hybrid model design. Second, a hybrid-driven RL environment model framework is introduced, combining a mechanistic model with a probabilistic neural network. The mechanistic component provides generalization capabilities, while the data-driven component offers adaptability. Third, improvements to MBPO are proposed, including double policy optimization and adaptive branch rollout, further to enhance dynamic environmental adaptability and model utilization efficiency. Finally, comparative and ablation experiments conducted using both simulation environments and measured data demonstrate that the proposed method achieves higher learning efficiency and improved robustness.
{"title":"Hybrid-Driven Model-Based Reinforcement Learning Approach for Energy Consumption Optimization of HVAC Chilled Water Systems","authors":"Shihang Gao, Xu Yang, Jian Huang, Rang Tu, Tao Zhang, Qing Li","doi":"10.1049/cth2.70104","DOIUrl":"https://doi.org/10.1049/cth2.70104","url":null,"abstract":"<p>Heating, ventilation and air conditioning (HVAC) chilled water systems offer significant potential for energy saving and reinforcement learning (RL) methods have been extensively studied and validated for optimizing HVAC energy consumption. However, RL's low sample efficiency and reliance on randomized exploration limit its practical application. To enhance the robustness and stability of RL-based energy optimization methods, an RL optimization approach based on a mechanism-data hybrid-driven model is proposed, derived from the MBPO (Model-based policy optimization) scheme. Firstly, a novel end-to-end HVAC chilled water system model is developed to serve as the foundation for the hybrid model design. Second, a hybrid-driven RL environment model framework is introduced, combining a mechanistic model with a probabilistic neural network. The mechanistic component provides generalization capabilities, while the data-driven component offers adaptability. Third, improvements to MBPO are proposed, including double policy optimization and adaptive branch rollout, further to enhance dynamic environmental adaptability and model utilization efficiency. Finally, comparative and ablation experiments conducted using both simulation environments and measured data demonstrate that the proposed method achieves higher learning efficiency and improved robustness.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"20 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.70104","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146058079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper investigates dynamic instantaneous-integration event-triggered control for networked non-linear rail vehicle suspension systems over the train communication network (TCN). The primary goal is to ensure the desired suspension performance while efficiently utilizing the TCN resources. Firstly, a 2-DOF non-linear model is developed, with the non-linearities approximated by a neural network using a dissipativity-learning method. To efficiently conserve TCN resources during signal transmission, this paper proposes a novel dynamic instantaneous-integral fusion event-triggered (DIFT) scheme. Compared with event-triggered schemes based solely on instantaneous or accumulated state errors, the DIFT condition uses a fusion of the instantaneous error and the integral of the state error and compares it to a dynamic threshold that depends on the sampled state at the triggering instant and an auxiliary dynamic variable. This integrated approach ensures robust, Zeno-free event-triggered updates while maintaining desired control performance and communication efficiency. Furthermore, stability and dissipativity conditions based on a looped Lyapunov function are proposed to guarantee the stability and dissipative performance of the closed-loop suspension system. Then, the desired controller, the event-triggered parameter and the updating law for weight are co-designed. Simulation experiments are conducted to validate the effectiveness of the proposed method.
{"title":"Novel Dynamic Instantaneous–Integral Fusion Event-Triggered Control of Networked Non-Linear Rail Vehicle Active Suspension","authors":"J. Wang, Y. Lin, B. Fu, Q. Wu","doi":"10.1049/cth2.70105","DOIUrl":"https://doi.org/10.1049/cth2.70105","url":null,"abstract":"<p>This paper investigates dynamic instantaneous-integration event-triggered control for networked non-linear rail vehicle suspension systems over the train communication network (TCN). The primary goal is to ensure the desired suspension performance while efficiently utilizing the TCN resources. Firstly, a 2-DOF non-linear model is developed, with the non-linearities approximated by a neural network using a dissipativity-learning method. To efficiently conserve TCN resources during signal transmission, this paper proposes a novel dynamic instantaneous-integral fusion event-triggered (DIFT) scheme. Compared with event-triggered schemes based solely on instantaneous or accumulated state errors, the DIFT condition uses a fusion of the instantaneous error and the integral of the state error and compares it to a dynamic threshold that depends on the sampled state at the triggering instant and an auxiliary dynamic variable. This integrated approach ensures robust, Zeno-free event-triggered updates while maintaining desired control performance and communication efficiency. Furthermore, stability and dissipativity conditions based on a looped Lyapunov function are proposed to guarantee the stability and dissipative performance of the closed-loop suspension system. Then, the desired controller, the event-triggered parameter and the updating law for weight are co-designed. Simulation experiments are conducted to validate the effectiveness of the proposed method.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"20 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.70105","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146083266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Li Yu, Yong Ding, Peng Wang, Jialong Zhang, Lei Ye
Tiltrotor UAV is a hybrid aircraft that enjoys the advantages of both the conventional fixed-wing and helicopter in a single platform. The design of a practical and efficient flight control system for this novel aircraft with a complex configuration is still a challenge. This paper proposes a practical fixed-time control scheme for a novel distributed tiltrotor UAV to improve flight stability. Based on the analysis of the control principle, the dynamic model and control allocation are derived to address the coordinated control problems with redundant actuators. A fast fixed-time sliding mode attitude controller is subsequently designed to improve the response speed and robustness of the control system. Furthermore, a smooth mode transition strategy is developed to guarantee the stable conversion between fixed-wing and helicopter modes. Experiments are conducted under different modes to investigate the effectiveness of the proposed flight control scheme.
{"title":"Flight Control of a Novel Distributed Tiltrotor UAV With Experiments","authors":"Li Yu, Yong Ding, Peng Wang, Jialong Zhang, Lei Ye","doi":"10.1049/cth2.70102","DOIUrl":"https://doi.org/10.1049/cth2.70102","url":null,"abstract":"<p>Tiltrotor UAV is a hybrid aircraft that enjoys the advantages of both the conventional fixed-wing and helicopter in a single platform. The design of a practical and efficient flight control system for this novel aircraft with a complex configuration is still a challenge. This paper proposes a practical fixed-time control scheme for a novel distributed tiltrotor UAV to improve flight stability. Based on the analysis of the control principle, the dynamic model and control allocation are derived to address the coordinated control problems with redundant actuators. A fast fixed-time sliding mode attitude controller is subsequently designed to improve the response speed and robustness of the control system. Furthermore, a smooth mode transition strategy is developed to guarantee the stable conversion between fixed-wing and helicopter modes. Experiments are conducted under different modes to investigate the effectiveness of the proposed flight control scheme.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"20 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.70102","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146057970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This study presents the development of a robust second-order sliding mode controller (SOSMC) that incorporates position error constraints into the control design. To achieve this, a barrier Lyapunov function (BLF) is employed, enabling the enforcement of predefined bounds on tracking errors throughout the system's evolution. This approach ensures that errors remain within previously known limits, thereby improving safety and reliability in practical applications, particularly robotic systems. The proposed control scheme guarantees the existence of a sliding mode and achieves exponential convergence of the tracking errors, even in the presence of bounded disturbances and model uncertainties. Hence, the main contribution of this study is the integration of the BLF into the SOSMC framework, which not only maintains robustness but also addresses the critical issue of constraint satisfaction. This is often overlooked in traditional sliding mode designs. The effectiveness and improved performance of the proposed controller are validated through simulation studies conducted on a three-degree-of-freedom robotic manipulator and through experiments on a six-degree-of-freedom robotic manipulator. Comparative results demonstrate that, unlike a conventional SOSMC without error constraints, the proposed controller successfully maintains position errors within the specified limits while preserving fast convergence and robustness. These findings highlight the significant benefits of incorporating barrier Lyapunov functions in sliding mode control strategies for systems with strict performance and safety requirements.
{"title":"Exponential Convergent Second-Order Sliding Mode Control Based on Barrier Lyapunov Function of State Constraint Robotic Manipulators","authors":"Luis Pantoja-Garcia, Isaac Chairez","doi":"10.1049/cth2.70106","DOIUrl":"https://doi.org/10.1049/cth2.70106","url":null,"abstract":"<p>This study presents the development of a robust second-order sliding mode controller (SOSMC) that incorporates position error constraints into the control design. To achieve this, a barrier Lyapunov function (BLF) is employed, enabling the enforcement of predefined bounds on tracking errors throughout the system's evolution. This approach ensures that errors remain within previously known limits, thereby improving safety and reliability in practical applications, particularly robotic systems. The proposed control scheme guarantees the existence of a sliding mode and achieves exponential convergence of the tracking errors, even in the presence of bounded disturbances and model uncertainties. Hence, the main contribution of this study is the integration of the BLF into the SOSMC framework, which not only maintains robustness but also addresses the critical issue of constraint satisfaction. This is often overlooked in traditional sliding mode designs. The effectiveness and improved performance of the proposed controller are validated through simulation studies conducted on a three-degree-of-freedom robotic manipulator and through experiments on a six-degree-of-freedom robotic manipulator. Comparative results demonstrate that, unlike a conventional SOSMC without error constraints, the proposed controller successfully maintains position errors within the specified limits while preserving fast convergence and robustness. These findings highlight the significant benefits of incorporating barrier Lyapunov functions in sliding mode control strategies for systems with strict performance and safety requirements.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"20 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.70106","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146002547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Space robotic systems are essential for on-orbit servicing missions, including satellite refueling, in-space assembly of large structures, and active debris removal. This paper presents a robust, task-specific optimal trajectory planning framework for dual-arm space robots. The method unifies two key problems—point-to-point grasping and continuous trajectory tracking—within an efficient optimization framework. For point-to-point grasping, the objective is to maximize the gradient of the distance between the predicted and target end-effector positions, yielding a time-optimal trajectory. For continuous trajectory tracking, the approach minimizes spacecraft base attitude disturbance while ensuring bounded end-effector tracking error. Additional objectives, such as manipulability and energy efficiency, are incorporated as weighted terms. Physical constraints on joint angles, velocities, accelerations, and self-collision avoidance in both planning problems are formulated as linear constraints. Task-specific constraints are also integrated: an approaching cone constraint for grasping and trajectory relaxation error bounds for tracking. Both problems are cast as convex optimization formulations, enabling efficient real-time solutions. The robustness of the method is demonstrated under challenging conditions, including high initial momentum in grasping and gravity-gradient-induced momentum in tracking. Extensive comparative simulations on a highly redundant 14-degree-of-freedom (14-DoF) dual-arm space robot validate the superior effectiveness, efficiency, and robustness of the proposed approach.
{"title":"Task-Specific Optimal Trajectory Planning of Dual-arm Space Robot Based on Convex Optimization","authors":"Run Li, Fan Wu, Ang Li, Ming Liu","doi":"10.1049/cth2.70098","DOIUrl":"https://doi.org/10.1049/cth2.70098","url":null,"abstract":"<p>Space robotic systems are essential for on-orbit servicing missions, including satellite refueling, in-space assembly of large structures, and active debris removal. This paper presents a robust, task-specific optimal trajectory planning framework for dual-arm space robots. The method unifies two key problems—point-to-point grasping and continuous trajectory tracking—within an efficient optimization framework. For point-to-point grasping, the objective is to maximize the gradient of the distance between the predicted and target end-effector positions, yielding a time-optimal trajectory. For continuous trajectory tracking, the approach minimizes spacecraft base attitude disturbance while ensuring bounded end-effector tracking error. Additional objectives, such as manipulability and energy efficiency, are incorporated as weighted terms. Physical constraints on joint angles, velocities, accelerations, and self-collision avoidance in both planning problems are formulated as linear constraints. Task-specific constraints are also integrated: an approaching cone constraint for grasping and trajectory relaxation error bounds for tracking. Both problems are cast as convex optimization formulations, enabling efficient real-time solutions. The robustness of the method is demonstrated under challenging conditions, including high initial momentum in grasping and gravity-gradient-induced momentum in tracking. Extensive comparative simulations on a highly redundant 14-degree-of-freedom (14-DoF) dual-arm space robot validate the superior effectiveness, efficiency, and robustness of the proposed approach.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"20 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.70098","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146002054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Davut Izci, Serdar Ekinci, Emre Çelik, Murat Uyar, Mohit Bajaj, Vojtech Blazek, Olena Rubanenko
This study presents a novel exponential proportional-derivative controller with filter (exp-PDN) for stabilising the nonlinear and underactuated ball and beam system. Unlike conventional PID-based approaches, the proposed controller removes the integral term, resulting in faster transient responses and improved robustness. It incorporates nonlinear exponential shaping of both the error and its derivative, along with a filtered derivative path for enhanced noise handling. A custom multi-objective cost function, comprising the squared error, settling time, and percent overshoot, is proposed to evaluate control performance. The quadratic interpolation optimiser (QIO), a recently developed metaheuristic based on analytical interpolation, is employed to optimise the controller parameters. To validate its effectiveness, the exp-PDN controller is compared against five state-of-the-art metaheuristic algorithms: QIO, spider wasp optimiser, komodo mlipir algorithm, golden eagle optimiser, and slime mould algorithm. The QIO-optimised exp-PDN achieves the best performance, with the lowest cost value (0.3211), minimal overshoot (5.52%), fast rise time (0.97 s), and smallest steady-state error (4.1643 × 10−4). Further comparisons with QIO-optimised phase-lead and PID-with-filter controllers demonstrate the superiority of the proposed method in both transient and steady-state behaviour. In summary, this work advances the control of nonlinear unstable systems by delivering a structurally simple yet highly responsive control architecture. The combination of dual-channel exponential shaping and efficient metaheuristic optimisation results in state-of-the-art closed-loop performance, highlighting the practical value of the proposed exp-PDN framework for real-world control applications.
{"title":"Design of Novel Exponential PDN Controller via Quadratic Interpolation Optimiser for Nonlinear and Unstable Ball and Beam System","authors":"Davut Izci, Serdar Ekinci, Emre Çelik, Murat Uyar, Mohit Bajaj, Vojtech Blazek, Olena Rubanenko","doi":"10.1049/cth2.70107","DOIUrl":"https://doi.org/10.1049/cth2.70107","url":null,"abstract":"<p>This study presents a novel exponential proportional-derivative controller with filter (exp-PDN) for stabilising the nonlinear and underactuated ball and beam system. Unlike conventional PID-based approaches, the proposed controller removes the integral term, resulting in faster transient responses and improved robustness. It incorporates nonlinear exponential shaping of both the error and its derivative, along with a filtered derivative path for enhanced noise handling. A custom multi-objective cost function, comprising the squared error, settling time, and percent overshoot, is proposed to evaluate control performance. The quadratic interpolation optimiser (QIO), a recently developed metaheuristic based on analytical interpolation, is employed to optimise the controller parameters. To validate its effectiveness, the exp-PDN controller is compared against five state-of-the-art metaheuristic algorithms: QIO, spider wasp optimiser, komodo mlipir algorithm, golden eagle optimiser, and slime mould algorithm. The QIO-optimised exp-PDN achieves the best performance, with the lowest cost value (0.3211), minimal overshoot (5.52%), fast rise time (0.97 s), and smallest steady-state error (4.1643 × 10<sup>−</sup><sup>4</sup>). Further comparisons with QIO-optimised phase-lead and PID-with-filter controllers demonstrate the superiority of the proposed method in both transient and steady-state behaviour. In summary, this work advances the control of nonlinear unstable systems by delivering a structurally simple yet highly responsive control architecture. The combination of dual-channel exponential shaping and efficient metaheuristic optimisation results in state-of-the-art closed-loop performance, highlighting the practical value of the proposed exp-PDN framework for real-world control applications.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"20 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.70107","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145964084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Wireless sensor networks (WSNs) are highly vulnerable to malware attacks due to resource constraints. However, existing propagation models often overlook the heterogeneous security levels in hierarchically protected WSNs, leading to inaccurate representations of dynamics across network layers. This paper proposes a novel malware propagation model specifically for hierarchically protected WSNs to capture the distinct defense capabilities of heterogeneous nodes. Based on epidemiological theory, nodes are stratified into high protection level (HPL) and low protection level (LPL) categories within a susceptible-exposed-infected-recovered (SEIR) differential equation framework. We rigorously derive the basic reproduction number (