首页 > 最新文献

ISA transactions最新文献

英文 中文
Reinforcement learning-based pitch control for wind turbines using double deep Q-networks. 基于强化学习的双深q网络风力发电机俯仰控制。
IF 6.5 Pub Date : 2026-01-09 DOI: 10.1016/j.isatra.2026.01.015
Víctor Espinoza, Carolina Ormaza, Christian Tutivén, Yolanda Vidal

This paper presents a novel reinforcement learning-based framework for pitch control in wind turbines, addressing the challenges posed by high nonlinearity and uncertainties in wind dynamics. Traditional pitch control methods, such as proportional-integral controllers, often struggle with adaptability and load mitigation under fluctuating wind conditions. To enhance control efficiency, the proposed framework employs a two-stage approach: policy transfer and policy refinement. In the policy transfer stage, a proportional-integral controller is used to estimate an initial control policy. This policy is then refined using the double deep Q-Network algorithm. Furthermore, a novel reward function is introduced as an improved version of a recent formulation in the literature, ensuring better alignment with practical operational scenarios. Simulations demonstrate that the proposed controller outperforms the industry-standard ROSCO controller, achieving a 15.87% reduction in power fluctuations, while preserving comparable structural load levels, with slight reductions in some cases. These results highlight the potential of reinforcement learning-based control to enhance wind turbine power regulation without compromising structural safety in terms of loads and vibrations.

本文提出了一种新的基于强化学习的风力发电机俯仰控制框架,解决了风力力学中高度非线性和不确定性带来的挑战。传统的俯仰控制方法,如比例积分控制器,在波动风条件下往往难以适应和减轻负荷。为了提高控制效率,该框架采用了两阶段的方法:策略转移和策略细化。在策略转移阶段,采用比例积分控制器对初始控制策略进行估计。然后使用双深度Q-Network算法对该策略进行细化。此外,本文还引入了一种新的奖励函数,作为文献中最近公式的改进版本,以确保更好地与实际操作场景保持一致。仿真表明,所提出的控制器优于行业标准的ROSCO控制器,在保持可比结构负载水平的同时,实现了15.87%的功率波动降低,在某些情况下略有降低。这些结果突出了基于强化学习的控制的潜力,可以在不影响负载和振动方面的结构安全的情况下增强风力涡轮机的功率调节。
{"title":"Reinforcement learning-based pitch control for wind turbines using double deep Q-networks.","authors":"Víctor Espinoza, Carolina Ormaza, Christian Tutivén, Yolanda Vidal","doi":"10.1016/j.isatra.2026.01.015","DOIUrl":"https://doi.org/10.1016/j.isatra.2026.01.015","url":null,"abstract":"<p><p>This paper presents a novel reinforcement learning-based framework for pitch control in wind turbines, addressing the challenges posed by high nonlinearity and uncertainties in wind dynamics. Traditional pitch control methods, such as proportional-integral controllers, often struggle with adaptability and load mitigation under fluctuating wind conditions. To enhance control efficiency, the proposed framework employs a two-stage approach: policy transfer and policy refinement. In the policy transfer stage, a proportional-integral controller is used to estimate an initial control policy. This policy is then refined using the double deep Q-Network algorithm. Furthermore, a novel reward function is introduced as an improved version of a recent formulation in the literature, ensuring better alignment with practical operational scenarios. Simulations demonstrate that the proposed controller outperforms the industry-standard ROSCO controller, achieving a 15.87% reduction in power fluctuations, while preserving comparable structural load levels, with slight reductions in some cases. These results highlight the potential of reinforcement learning-based control to enhance wind turbine power regulation without compromising structural safety in terms of loads and vibrations.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fixed-time safety tracking control of QUAV under external disturbances using robust high-order time-varying control barrier function. 基于鲁棒高阶时变控制屏障函数的外部扰动下QUAV的固定时间安全跟踪控制。
IF 6.5 Pub Date : 2026-01-09 DOI: 10.1016/j.isatra.2026.01.014
Lan Duo, Hongtao Yin, Ping Fu, Ang Li

This study proposes a robust fixed-time safety control scheme for a quadrotor unmanned aerial vehicle (QUAV) operating in environments with obstacles and external disturbances. Firstly, a robust high-order time-varying control barrier function (R-HO-TV-CBF) is designed as an obstacle avoidance constraint. This method achieves robustness without relying about any prior assumptions on the bounds of the disturbance and its derivative, effectively handles dynamic obstacle avoidance, suppresses oscillation and ensures optimal trajectory tracking. Subsequently, a nonsingular fixed-time sliding mode control (NFxSMC) method is developed to guarantee fixed-time convergence, independent of the initial state. Finally, simulation results demonstrate the superior performance of the proposed scheme in obstacle avoidance, disturbance rejection and error convergence.

针对具有障碍物和外界干扰的四旋翼无人机,提出了一种鲁棒固定时间安全控制方案。首先,设计了鲁棒高阶时变控制障碍函数(R-HO-TV-CBF)作为避障约束;该方法不依赖于对扰动及其导数边界的任何先验假设,实现了鲁棒性,有效地处理了动态避障,抑制了振荡,保证了最优轨迹跟踪。在此基础上,提出了一种非奇异固定时间滑模控制(NFxSMC)方法,以保证系统与初始状态无关的固定时间收敛性。仿真结果表明,该方法在避障、抗干扰和误差收敛方面具有较好的性能。
{"title":"Fixed-time safety tracking control of QUAV under external disturbances using robust high-order time-varying control barrier function.","authors":"Lan Duo, Hongtao Yin, Ping Fu, Ang Li","doi":"10.1016/j.isatra.2026.01.014","DOIUrl":"https://doi.org/10.1016/j.isatra.2026.01.014","url":null,"abstract":"<p><p>This study proposes a robust fixed-time safety control scheme for a quadrotor unmanned aerial vehicle (QUAV) operating in environments with obstacles and external disturbances. Firstly, a robust high-order time-varying control barrier function (R-HO-TV-CBF) is designed as an obstacle avoidance constraint. This method achieves robustness without relying about any prior assumptions on the bounds of the disturbance and its derivative, effectively handles dynamic obstacle avoidance, suppresses oscillation and ensures optimal trajectory tracking. Subsequently, a nonsingular fixed-time sliding mode control (NFxSMC) method is developed to guarantee fixed-time convergence, independent of the initial state. Finally, simulation results demonstrate the superior performance of the proposed scheme in obstacle avoidance, disturbance rejection and error convergence.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed fixed/predefined-time optimization for multi-agent systems: new exponential-function-based algorithms. 多智能体系统的分布式固定/预定义时间优化:基于指数函数的新算法。
IF 6.5 Pub Date : 2026-01-09 DOI: 10.1016/j.isatra.2026.01.013
Luke Li, Qintao Gan, Ruihong Li, Qiaokun Kang, Yuanlong Liu

Distributed convex optimization with time-varying or time-invariant cost functions remains one of the central challenges in multi-agent systems (MASs). Achieving efficient distributed optimization within a fixed/predefined-time continues to face difficulties such as restrictive assumptions and high computational complexity. This paper proposes innovative distributed optimization frameworks to address the aforementioned limitations. For time-invariant optimization problems, an estimator-based two-stage distributed protocol is introduced, which achieves both inter-agent consensus and convergence to the global optimum within a fixed/predefined-time. Notably, this protocol only requires strong convexity of the global cost function, thereby relaxing the constraints on local functions. For time-varying scenarios, an enhanced zero-gradient-sum (ZGS) framework is developed by integrating a zeroing neural network (ZNN) with sliding mode control. This framework not only eliminates dependence on initial conditions but also implicitly computes the Hessian inverse through ZNN dynamics, effectively avoiding the O(n3) computational burden associated with explicit matrix inversion. Numerical simulations validate the superior convergence speed and broad applicability of our method, attesting to its great potential for distributed optimization.

时变或定常代价函数的分布凸优化是多智能体系统(MASs)的核心挑战之一。在固定/预定义的时间内实现高效的分布式优化仍然面临诸如限制性假设和高计算复杂性等困难。本文提出了创新的分布式优化框架来解决上述限制。针对时不变优化问题,引入了一种基于估计量的两阶段分布式优化协议,该协议既能实现智能体间的一致,又能在固定/预定义的时间内收敛到全局最优。值得注意的是,该协议只要求全局代价函数具有强凸性,从而放松了对局部函数的约束。针对时变情况,将调零神经网络(ZNN)与滑模控制相结合,提出了一种改进型零梯度和框架。该框架不仅消除了对初始条件的依赖,而且通过ZNN动力学隐式计算Hessian逆,有效避免了显式矩阵反演带来的O(n3)计算负担。数值仿真结果表明,该方法具有较快的收敛速度和广泛的适用性,证明了该方法在分布式优化中的巨大潜力。
{"title":"Distributed fixed/predefined-time optimization for multi-agent systems: new exponential-function-based algorithms.","authors":"Luke Li, Qintao Gan, Ruihong Li, Qiaokun Kang, Yuanlong Liu","doi":"10.1016/j.isatra.2026.01.013","DOIUrl":"https://doi.org/10.1016/j.isatra.2026.01.013","url":null,"abstract":"<p><p>Distributed convex optimization with time-varying or time-invariant cost functions remains one of the central challenges in multi-agent systems (MASs). Achieving efficient distributed optimization within a fixed/predefined-time continues to face difficulties such as restrictive assumptions and high computational complexity. This paper proposes innovative distributed optimization frameworks to address the aforementioned limitations. For time-invariant optimization problems, an estimator-based two-stage distributed protocol is introduced, which achieves both inter-agent consensus and convergence to the global optimum within a fixed/predefined-time. Notably, this protocol only requires strong convexity of the global cost function, thereby relaxing the constraints on local functions. For time-varying scenarios, an enhanced zero-gradient-sum (ZGS) framework is developed by integrating a zeroing neural network (ZNN) with sliding mode control. This framework not only eliminates dependence on initial conditions but also implicitly computes the Hessian inverse through ZNN dynamics, effectively avoiding the O(n<sup>3</sup>) computational burden associated with explicit matrix inversion. Numerical simulations validate the superior convergence speed and broad applicability of our method, attesting to its great potential for distributed optimization.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145986118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fully distributed output formation-containment control of heterogeneous multi-agent systems via adaptive observer approach. 基于自适应观测器的异构多智能体系统全分布式输出控制。
IF 6.5 Pub Date : 2026-01-09 DOI: 10.1016/j.isatra.2026.01.012
Xiaojie Sun, Jie Zhang, Yu Hou, Jian-An Wang, Da-Wei Ding

For heterogeneous linear multi-agent systems (MASs) in which not all followers have access to the information of multiple leaders, this paper studies the distributed output formation-containment problem. By converting the research question into the cooperative output regulation problem, a fully distributed control framework via a fully adaptive observer approach is developed. Since a part of the followers cannot directly access the states of leaders, a new observer is designed for reconstructing the system matrices and state information. Specifically, by introducing adaptive gains in place of fixed coupling coefficients in the observer design, the global topological information is no longer needed. Moreover, the conventional regulator equations cannot be directly applied for controller synthesis due to the time-varying estimated system matrices of the leaders. To overcome this challenge, the adaptive regulator equations are developed to dynamically compute the controller gains. Subsequently, both state and output feedback control strategies are presented to achieve the desired output formation-containment control objective. The validation of the proposed method is performed in the simulation results with practical and comparative examples.

对于并非所有follower都能获得多个leader信息的异构线性多智能体系统(MASs),研究了分布式输出约束问题。通过将研究问题转化为协同输出调节问题,建立了基于全自适应观测器方法的全分布式控制框架。由于部分follower不能直接访问leader的状态,设计了一个新的观测器来重构系统矩阵和状态信息。具体来说,通过在观测器设计中引入自适应增益来代替固定的耦合系数,不再需要全局拓扑信息。此外,由于前导系统估计矩阵时变,传统的调节器方程不能直接应用于控制器综合。为了克服这一挑战,开发了自适应调节器方程来动态计算控制器增益。在此基础上,提出了状态反馈控制策略和输出反馈控制策略,以实现期望的输出编队控制目标。通过实际算例和对比算例的仿真验证了所提方法的有效性。
{"title":"Fully distributed output formation-containment control of heterogeneous multi-agent systems via adaptive observer approach.","authors":"Xiaojie Sun, Jie Zhang, Yu Hou, Jian-An Wang, Da-Wei Ding","doi":"10.1016/j.isatra.2026.01.012","DOIUrl":"https://doi.org/10.1016/j.isatra.2026.01.012","url":null,"abstract":"<p><p>For heterogeneous linear multi-agent systems (MASs) in which not all followers have access to the information of multiple leaders, this paper studies the distributed output formation-containment problem. By converting the research question into the cooperative output regulation problem, a fully distributed control framework via a fully adaptive observer approach is developed. Since a part of the followers cannot directly access the states of leaders, a new observer is designed for reconstructing the system matrices and state information. Specifically, by introducing adaptive gains in place of fixed coupling coefficients in the observer design, the global topological information is no longer needed. Moreover, the conventional regulator equations cannot be directly applied for controller synthesis due to the time-varying estimated system matrices of the leaders. To overcome this challenge, the adaptive regulator equations are developed to dynamically compute the controller gains. Subsequently, both state and output feedback control strategies are presented to achieve the desired output formation-containment control objective. The validation of the proposed method is performed in the simulation results with practical and comparative examples.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic event-triggered prescribed-time observer via parametric Lyapunov equations. 通过参数李雅普诺夫方程的动态事件触发的规定时间观测器。
IF 6.5 Pub Date : 2026-01-08 DOI: 10.1016/j.isatra.2026.01.006
Jiatong He, Jing Chen, Gang Zheng, Heng Liu, Jinde Cao, Liuxiao Guo, Jiancheng Zhang

This paper is concerned with a prescribed-time observer (PTO) for a class of linear systems by using a novel dynamic event-triggered mechanism (DETM) with bounded triggering conditions. Traditional PTOs often require real-time updates of the measurement output signal, leading to high communication costs. Event-triggered approaches can address this issue. However, event-triggered approaches hardly achieve prescribed-time convergence. In order to achieve fast state estimation while maintaining low communication costs, a dynamic event-triggered prescribed-time observer (DETPTO) is developed by using a parametric Lyapunov equation (PLE), in which the estimation of all system states can also be achieved within a prescribed-time T. To achieve this goal, a DETM is proposed by integrating both estimation error and measurement error, ensuring that the observer updates the measurement output only when necessary rather than continuously. Moreover, it is proven that the proposed DETPTO can avoid the Zeno behavior. Finally, numerical simulations validate the effectiveness of the proposed methods.

利用一种具有有界触发条件的动态事件触发机制,研究一类线性系统的规定时间观测器。传统的pto往往需要实时更新测量输出信号,导致通信成本高。事件触发的方法可以解决这个问题。然而,事件触发的方法很难达到规定的时间收敛性。为了在保持低通信成本的同时实现快速状态估计,利用参数Lyapunov方程(PLE)开发了一种动态事件触发的规定时间观测器(DETPTO),该观测器也可以在规定时间t内实现对系统所有状态的估计。为了实现这一目标,提出了一种综合估计误差和测量误差的DETM。确保观测器仅在必要时才更新测量输出,而不是连续更新。此外,还证明了所提出的DETPTO可以避免芝诺行为。最后,通过数值仿真验证了所提方法的有效性。
{"title":"Dynamic event-triggered prescribed-time observer via parametric Lyapunov equations.","authors":"Jiatong He, Jing Chen, Gang Zheng, Heng Liu, Jinde Cao, Liuxiao Guo, Jiancheng Zhang","doi":"10.1016/j.isatra.2026.01.006","DOIUrl":"https://doi.org/10.1016/j.isatra.2026.01.006","url":null,"abstract":"<p><p>This paper is concerned with a prescribed-time observer (PTO) for a class of linear systems by using a novel dynamic event-triggered mechanism (DETM) with bounded triggering conditions. Traditional PTOs often require real-time updates of the measurement output signal, leading to high communication costs. Event-triggered approaches can address this issue. However, event-triggered approaches hardly achieve prescribed-time convergence. In order to achieve fast state estimation while maintaining low communication costs, a dynamic event-triggered prescribed-time observer (DETPTO) is developed by using a parametric Lyapunov equation (PLE), in which the estimation of all system states can also be achieved within a prescribed-time T. To achieve this goal, a DETM is proposed by integrating both estimation error and measurement error, ensuring that the observer updates the measurement output only when necessary rather than continuously. Moreover, it is proven that the proposed DETPTO can avoid the Zeno behavior. Finally, numerical simulations validate the effectiveness of the proposed methods.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145954503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-dimensional optimized extraction chirplet transform: Algorithm and applications. 高维优化提取啁啾变换:算法与应用。
IF 6.5 Pub Date : 2026-01-07 DOI: 10.1016/j.isatra.2026.01.003
Long Ge, Dezun Zhao, Tianyang Wang, Lingli Cui

Under time-varying operating conditions, vibration signals from rotating machinery typically contain dense and non-proportional frequency components, which make it difficult for current time-frequency analysis (TFA) techniques to achieve high-precision and highly concentrated time-frequency representations (TFRs). Accordingly, the high-dimensional optimized extraction chirplet transform (HOECT) is proposed. In the HOECT, a window parameter adaptive optimization criterion is first designed to dynamically match local characteristics of the signal; second, the component matching CT (CMCT) is improved based on the proposed criterion, and its results are mapped into the time-frequency-chirprate domain to separate frequency components; finally, based on estimated local frequency and chirprate, the smoothing extraction operator (SEO) is constructed in the frequency-chirprate domain, which significantly enhances the energy concentration. In addition, a novel connected domain analysis (CDA) algorithm is proposed to effectively suppress strong noise interference and highlight key features of signal components. Simulation results illustrate that the HOECT can accurately characterize dense and non-proportional frequency components with high energy concentration. The superiority of the HOECT in operational state characterization and fault feature identification is further verified through two vibration signals of planetary gearboxes. Additionally, the whale sound signal results indicate that the proposed method exhibits strong robustness and applicability in analyzing complex non-stationary signals.

在时变工况下,旋转机械的振动信号通常包含密集且不成比例的频率成分,这使得当前的时频分析技术难以实现高精度和高度集中的时频表示。据此,提出了高维优化提取小波变换(HOECT)。在HOECT中,首先设计了窗口参数自适应优化准则来动态匹配信号的局部特征;其次,根据提出的准则对分量匹配CT (CMCT)进行改进,并将其结果映射到时-频-chirprate域中分离频率分量;最后,基于估计的局部频率和chirprate,在频率-chirprate域中构造平滑提取算子(SEO),显著提高了能量集中度。此外,提出了一种新的连通域分析(CDA)算法,可以有效地抑制强噪声干扰,突出信号成分的关键特征。仿真结果表明,HOECT能准确表征能量浓度高的密集非比例频率分量。通过行星齿轮箱的两个振动信号,进一步验证了HOECT在运行状态表征和故障特征识别方面的优越性。此外,对鲸声信号的分析结果表明,该方法具有较强的鲁棒性和对复杂非平稳信号分析的适用性。
{"title":"High-dimensional optimized extraction chirplet transform: Algorithm and applications.","authors":"Long Ge, Dezun Zhao, Tianyang Wang, Lingli Cui","doi":"10.1016/j.isatra.2026.01.003","DOIUrl":"https://doi.org/10.1016/j.isatra.2026.01.003","url":null,"abstract":"<p><p>Under time-varying operating conditions, vibration signals from rotating machinery typically contain dense and non-proportional frequency components, which make it difficult for current time-frequency analysis (TFA) techniques to achieve high-precision and highly concentrated time-frequency representations (TFRs). Accordingly, the high-dimensional optimized extraction chirplet transform (HOECT) is proposed. In the HOECT, a window parameter adaptive optimization criterion is first designed to dynamically match local characteristics of the signal; second, the component matching CT (CMCT) is improved based on the proposed criterion, and its results are mapped into the time-frequency-chirprate domain to separate frequency components; finally, based on estimated local frequency and chirprate, the smoothing extraction operator (SEO) is constructed in the frequency-chirprate domain, which significantly enhances the energy concentration. In addition, a novel connected domain analysis (CDA) algorithm is proposed to effectively suppress strong noise interference and highlight key features of signal components. Simulation results illustrate that the HOECT can accurately characterize dense and non-proportional frequency components with high energy concentration. The superiority of the HOECT in operational state characterization and fault feature identification is further verified through two vibration signals of planetary gearboxes. Additionally, the whale sound signal results indicate that the proposed method exhibits strong robustness and applicability in analyzing complex non-stationary signals.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cooperative control for FWID-EVs with active suspension under extreme conditions by using off-policy safe reinforcement learning. 基于非策略安全强化学习的fwidd - ev主动悬架极端工况协同控制
IF 6.5 Pub Date : 2026-01-07 DOI: 10.1016/j.isatra.2026.01.010
Zhijiang Gao, Pak Kin Wong, Fuqiu Zhou, Jing Zhao

The unbalanced distribution of tyre normal forces induced by lateral load transfer poses a significant threat to vehicle handling stability under extreme conditions. However, the mitigation of such imbalance is inherently constrained by the suspension support capability. This work proposes a novel off-policy safe reinforcement learning-based cooperative (OSRC) controller that coordinates direct yaw control (DYC) and active suspension system (ASS) to optimize the distribution of tyre normal forces and enhance the handling stability of the four-wheel independent drive electric vehicle (FWID-EV) under extreme conditions. First, a DYC-ASS cooperative architecture is proposed to optimize the distribution of tyre normal forces in the FWID-EV system. Subsequently, a combined tyre-slip-angle and phase-portrait (CTP) method is proposed to characterize the relationship between the tyre force saturation and the lateral stability region (LSR). Based on a two-player game, the OSRC controller is proposed to achieve the cooperation between the DYC and the ASS and enhance the handling stability of the FWID-EV under extreme conditions. Finally, the effectiveness of the proposed OSRC controller is examined via the rapid control prototyping tests.

侧向载荷传递引起的轮胎法向力的不平衡分布对极端工况下车辆的操纵稳定性构成重大威胁。然而,这种不平衡的缓解受到悬架支撑能力的固有约束。本文提出了一种新型的非策略安全强化学习协同(OSRC)控制器,该控制器协调直接偏航控制(DYC)和主动悬架系统(ASS),以优化轮胎法向力的分布,提高四轮独立驱动电动汽车(FWID-EV)在极端条件下的操纵稳定性。首先,提出DYC-ASS协同架构,优化FWID-EV系统中轮胎法向力的分布;随后,提出了轮胎滑移角与相位画像相结合的方法来表征轮胎力饱和度与横向稳定区(LSR)的关系。为了实现DYC与ASS的协同工作,提高fwidd - ev在极端工况下的操控稳定性,提出了基于二人博弈的OSRC控制器。最后,通过快速控制原型测试验证了所提出的OSRC控制器的有效性。
{"title":"Cooperative control for FWID-EVs with active suspension under extreme conditions by using off-policy safe reinforcement learning.","authors":"Zhijiang Gao, Pak Kin Wong, Fuqiu Zhou, Jing Zhao","doi":"10.1016/j.isatra.2026.01.010","DOIUrl":"https://doi.org/10.1016/j.isatra.2026.01.010","url":null,"abstract":"<p><p>The unbalanced distribution of tyre normal forces induced by lateral load transfer poses a significant threat to vehicle handling stability under extreme conditions. However, the mitigation of such imbalance is inherently constrained by the suspension support capability. This work proposes a novel off-policy safe reinforcement learning-based cooperative (OSRC) controller that coordinates direct yaw control (DYC) and active suspension system (ASS) to optimize the distribution of tyre normal forces and enhance the handling stability of the four-wheel independent drive electric vehicle (FWID-EV) under extreme conditions. First, a DYC-ASS cooperative architecture is proposed to optimize the distribution of tyre normal forces in the FWID-EV system. Subsequently, a combined tyre-slip-angle and phase-portrait (CTP) method is proposed to characterize the relationship between the tyre force saturation and the lateral stability region (LSR). Based on a two-player game, the OSRC controller is proposed to achieve the cooperation between the DYC and the ASS and enhance the handling stability of the FWID-EV under extreme conditions. Finally, the effectiveness of the proposed OSRC controller is examined via the rapid control prototyping tests.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145954491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prescribed-time fully distributed optimization for time-varying costs: Zero-gradient-sum scheme. 时变成本的规定时间全分布优化:零梯度和方案。
IF 6.5 Pub Date : 2026-01-06 DOI: 10.1016/j.isatra.2026.01.007
Ningning Mao, Shuai Liu, Yuan Liu

This article introduces a novel distributed optimization algorithm for time-varying objectives that achieves prescribed-time convergence. The method is developed based on zero-gradient-sum (ZGS) principles and employs a sliding-mode control framework. Specifically, the controller is designed to satisfy the ZGS condition within a prescribed time horizon, ensuring exact convergence by the user-specified deadline. A key contribution of this research is the introduction of adaptive parameters with time-varying scaling functions. This innovation addresses a fundamental limitation in existing methods by eliminating their dependency on global network information. Consequently, the algorithm achieves truly distributed control without requiring knowledge of Laplacian matrix eigenvalues. The algorithm offers significant advantages including complete freedom from initial condition constraints and local minimization requirements, full independence from global topological information, and rigorous prescribed-time convergence guarantees. Theoretical analysis establishes the convergence properties through an integrated approach that combines optimization theory with Lyapunov stability analysis. Numerical simulations demonstrate the algorithm's effectiveness and superior performance in handling time-varying optimization problems compared to existing approaches.

本文介绍了一种新的时变目标分布式优化算法,该算法能达到规定的时间收敛性。该方法基于零梯度和(ZGS)原理,采用滑模控制框架。具体来说,控制器被设计为在规定的时间范围内满足ZGS条件,确保在用户指定的截止日期前精确收敛。本研究的一个关键贡献是引入了具有时变尺度函数的自适应参数。这一创新消除了现有方法对全球网络信息的依赖,从而解决了现有方法的一个基本限制。因此,该算法在不需要了解拉普拉斯矩阵特征值的情况下,实现了真正意义上的分布式控制。该算法具有完全不受初始条件约束和局部最小化要求的约束、完全独立于全局拓扑信息、严格的规定时间收敛保证等显著优点。理论分析采用优化理论与李雅普诺夫稳定性分析相结合的方法,建立了算法的收敛性。数值仿真结果表明,该算法在处理时变优化问题时具有较好的有效性和性能。
{"title":"Prescribed-time fully distributed optimization for time-varying costs: Zero-gradient-sum scheme.","authors":"Ningning Mao, Shuai Liu, Yuan Liu","doi":"10.1016/j.isatra.2026.01.007","DOIUrl":"https://doi.org/10.1016/j.isatra.2026.01.007","url":null,"abstract":"<p><p>This article introduces a novel distributed optimization algorithm for time-varying objectives that achieves prescribed-time convergence. The method is developed based on zero-gradient-sum (ZGS) principles and employs a sliding-mode control framework. Specifically, the controller is designed to satisfy the ZGS condition within a prescribed time horizon, ensuring exact convergence by the user-specified deadline. A key contribution of this research is the introduction of adaptive parameters with time-varying scaling functions. This innovation addresses a fundamental limitation in existing methods by eliminating their dependency on global network information. Consequently, the algorithm achieves truly distributed control without requiring knowledge of Laplacian matrix eigenvalues. The algorithm offers significant advantages including complete freedom from initial condition constraints and local minimization requirements, full independence from global topological information, and rigorous prescribed-time convergence guarantees. Theoretical analysis establishes the convergence properties through an integrated approach that combines optimization theory with Lyapunov stability analysis. Numerical simulations demonstrate the algorithm's effectiveness and superior performance in handling time-varying optimization problems compared to existing approaches.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145954512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bumpless transfer for distributed generation units with mode-dependent controllers. 具有模式依赖控制器的分布式发电机组的无颠簸传输。
IF 6.5 Pub Date : 2026-01-06 DOI: 10.1016/j.isatra.2025.12.048
Pablo E Muñoz, Ricardo J Mantz, Sergio A González

Distributed generation units (DGUs) in grid-connected microgrids are typically controlled as current sources. Nevertheless, when the microgrids are isolated, some DGUs must switch their control law to operate as voltage sources. Switching between controllers implies unwanted transients at the DGUs output power due to mismatches between controller's outputs, challenging both voltage and frequency transients and potentially leading to instability. This work proposes a bumpless-transfer scheme based on sliding-mode reference conditioning for the DGUs to achieve a smooth transition between grid-connected and island modes in electrical microgrids. By guaranteeing the establishment of a sliding regime, the suggested bumpless algorithm reduces to zero the mismatch between the DGU controllers' outputs when the switch is performed. In this manner, jumps in the control action and the associated unwanted transients are avoided regardless of when the control law switches. Results show good and robust performance in the face of both islanding and reconnection processes.

并网微电网中的分布式发电机组通常作为电流源进行控制。然而,当微电网处于隔离状态时,一些dgu必须切换其控制律作为电压源运行。控制器之间的切换意味着由于控制器输出之间的不匹配,在DGUs输出功率上存在不必要的瞬态,挑战电压和频率瞬态,并可能导致不稳定。本文提出了一种基于滑模参考调节的无碰撞转换方案,以实现微电网并网模式和孤岛模式之间的平滑过渡。通过保证滑动状态的建立,建议的无颠簸算法在切换时将DGU控制器输出之间的不匹配减小到零。通过这种方式,无论控制律何时切换,都可以避免控制动作中的跳跃和相关的不必要的瞬变。结果表明,该方法在孤岛和重连接过程中都具有良好的鲁棒性。
{"title":"Bumpless transfer for distributed generation units with mode-dependent controllers.","authors":"Pablo E Muñoz, Ricardo J Mantz, Sergio A González","doi":"10.1016/j.isatra.2025.12.048","DOIUrl":"https://doi.org/10.1016/j.isatra.2025.12.048","url":null,"abstract":"<p><p>Distributed generation units (DGUs) in grid-connected microgrids are typically controlled as current sources. Nevertheless, when the microgrids are isolated, some DGUs must switch their control law to operate as voltage sources. Switching between controllers implies unwanted transients at the DGUs output power due to mismatches between controller's outputs, challenging both voltage and frequency transients and potentially leading to instability. This work proposes a bumpless-transfer scheme based on sliding-mode reference conditioning for the DGUs to achieve a smooth transition between grid-connected and island modes in electrical microgrids. By guaranteeing the establishment of a sliding regime, the suggested bumpless algorithm reduces to zero the mismatch between the DGU controllers' outputs when the switch is performed. In this manner, jumps in the control action and the associated unwanted transients are avoided regardless of when the control law switches. Results show good and robust performance in the face of both islanding and reconnection processes.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145947173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coordinated neural adaptive active power control of wind turbines considering pitch system load reduction. 考虑桨距系统减载的风力发电机组协调神经自适应有功控制。
IF 6.5 Pub Date : 2026-01-05 DOI: 10.1016/j.isatra.2026.01.005
Xuguo Jiao, Hao Luo, Bo Fan, Xia Chen, Jingbo Zhao, Ruchang Yin

Active power control (APC) is an effective way to address the instability problem caused by high wind energy penetration in power systems. This study presents a coordinated APC scheme to reduce the pitch system's loads while ensuring accurate active power tracking performance. Firstly, a pitch activation limitation parameter updated by wind speed is designed to expand the rotor speed regulation range. Subsequently, a neural network (NN)-based controller with a segmented weight updating mechanism is designed to achieve smooth and stable pitch angle variations, effectively reducing the pitch system's loads. Furthermore, we develop a robust differentiator to estimate derivatives of the rotor speed and wind speed thereby avoiding the use of additional sensors. Finally, simulations on the OpenFAST platform demonstrate the effectiveness of our method.

有功功率控制(APC)是解决风电高渗透引起的电力系统不稳定问题的有效途径。本文提出了一种协调的APC方案,以减少俯仰系统的负载,同时保证准确的有功功率跟踪性能。首先,设计了基于风速更新的桨距激活限制参数,扩大了转子的调速范围;在此基础上,设计了基于神经网络的分段权值更新控制器,实现了俯仰角的平稳变化,有效降低了俯仰系统的负载。此外,我们开发了一个鲁棒微分器来估计转子速度和风速的导数,从而避免使用额外的传感器。最后,在OpenFAST平台上进行了仿真,验证了该方法的有效性。
{"title":"Coordinated neural adaptive active power control of wind turbines considering pitch system load reduction.","authors":"Xuguo Jiao, Hao Luo, Bo Fan, Xia Chen, Jingbo Zhao, Ruchang Yin","doi":"10.1016/j.isatra.2026.01.005","DOIUrl":"https://doi.org/10.1016/j.isatra.2026.01.005","url":null,"abstract":"<p><p>Active power control (APC) is an effective way to address the instability problem caused by high wind energy penetration in power systems. This study presents a coordinated APC scheme to reduce the pitch system's loads while ensuring accurate active power tracking performance. Firstly, a pitch activation limitation parameter updated by wind speed is designed to expand the rotor speed regulation range. Subsequently, a neural network (NN)-based controller with a segmented weight updating mechanism is designed to achieve smooth and stable pitch angle variations, effectively reducing the pitch system's loads. Furthermore, we develop a robust differentiator to estimate derivatives of the rotor speed and wind speed thereby avoiding the use of additional sensors. Finally, simulations on the OpenFAST platform demonstrate the effectiveness of our method.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145947175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ISA transactions
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1