首页 > 最新文献

IEEE Control Systems Letters最新文献

英文 中文
Distributed Fenchel Dual Gradient Algorithm for Constrained Convex Optimization Over Digraphs 有向图上约束凸优化的分布式Fenchel对偶梯度算法
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-13 DOI: 10.1109/LCSYS.2025.3632307
Yanan Zhu;Wenwu Yu;Guanghui Wen
This letter provides the first rigorous theoretical analysis for the Distributed Fenchel Dual Gradient (DFDG) algorithm, a continuous-time method for solving distributed convex optimization problems with local set constraints over digraphs. The DFDG algorithm, originally proposed in our prior work (Zhu et al., 2020), transforms the primal problem into its Fenchel dual and solves it using a two-time-scale dynamical system. This letter provides a more comprehensive explanation of the algorithm’s design mechanism and formally establishes its convergence properties. Under strong convexity and Lipschitz continuity assumptions, Lyapunov stability theory is employed to prove the asymptotic convergence to the optimal solutions of both the primal and its dual problems. This analysis provides rigorous guarantees for a class of dual-based algorithms over digraphs, filling a critical gap in the existing literature.
这封信为分布式Fenchel对偶梯度(DFDG)算法提供了第一个严格的理论分析,这是一种连续时间方法,用于解决有向图上具有局部集约束的分布式凸优化问题。DFDG算法最初是在我们之前的工作中提出的(Zhu et al., 2020),它将原始问题转换为Fenchel对偶,并使用双时间尺度动力系统来解决它。这封信提供了一个更全面的解释算法的设计机制,并正式建立了其收敛性质。在强凸性和Lipschitz连续性假设下,利用Lyapunov稳定性理论证明了该问题的最优解及其对偶问题的渐近收敛性。这种分析为有向图上的一类基于双重的算法提供了严格的保证,填补了现有文献中的一个关键空白。
{"title":"Distributed Fenchel Dual Gradient Algorithm for Constrained Convex Optimization Over Digraphs","authors":"Yanan Zhu;Wenwu Yu;Guanghui Wen","doi":"10.1109/LCSYS.2025.3632307","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3632307","url":null,"abstract":"This letter provides the first rigorous theoretical analysis for the Distributed Fenchel Dual Gradient (DFDG) algorithm, a continuous-time method for solving distributed convex optimization problems with local set constraints over digraphs. The DFDG algorithm, originally proposed in our prior work (Zhu et al., 2020), transforms the primal problem into its Fenchel dual and solves it using a two-time-scale dynamical system. This letter provides a more comprehensive explanation of the algorithm’s design mechanism and formally establishes its convergence properties. Under strong convexity and Lipschitz continuity assumptions, Lyapunov stability theory is employed to prove the asymptotic convergence to the optimal solutions of both the primal and its dual problems. This analysis provides rigorous guarantees for a class of dual-based algorithms over digraphs, filling a critical gap in the existing literature.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2537-2542"},"PeriodicalIF":2.0,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145560763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Observability and State Estimation for Smooth and Nonsmooth Differential Algebraic Equation Systems 光滑与非光滑微分代数方程组的可观测性与状态估计
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-07 DOI: 10.1109/LCSYS.2025.3630241
Hesham Abdelfattah;Sameh A. Eisa;Peter Stechlinski
In this letter, we extend the sensitivity-based rank condition (SERC) test for local observability to another class of systems, namely smooth and nonsmooth differential-algebraic equation (DAE) systems of index-1. The newly introduced test for DAEs, which we call the lexicographic SERC (L-SERC) observability test, utilizes the theory of lexicographic differentiation to compute sensitivity information. Moreover, the newly introduced L-SERC observability test can judges which states are observable and which are not. Additionally, we introduce a novel sensitivity-based extended Kalman filter (S-EKF) algorithm for state estimation, applicable to both smooth and nonsmooth DAE systems. Finally, we apply the newly developed S-EKF to estimate the states of a wind turbine power system model.
本文将基于灵敏度的秩条件(SERC)检验推广到另一类系统,即指标-1的光滑和非光滑微分代数方程(DAE)系统。新引入的DAEs测试,我们称之为词典SERC (L-SERC)可观察性测试,利用词典区分理论来计算灵敏度信息。此外,新引入的L-SERC可观察性检验可以判断哪些状态是可观察的,哪些状态是不可观察的。此外,我们引入了一种新的基于灵敏度的扩展卡尔曼滤波(S-EKF)算法用于状态估计,适用于光滑和非光滑DAE系统。最后,我们将新开发的S-EKF应用于风力发电系统模型的状态估计。
{"title":"Observability and State Estimation for Smooth and Nonsmooth Differential Algebraic Equation Systems","authors":"Hesham Abdelfattah;Sameh A. Eisa;Peter Stechlinski","doi":"10.1109/LCSYS.2025.3630241","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3630241","url":null,"abstract":"In this letter, we extend the sensitivity-based rank condition (SERC) test for local observability to another class of systems, namely smooth and nonsmooth differential-algebraic equation (DAE) systems of index-1. The newly introduced test for DAEs, which we call the lexicographic SERC (L-SERC) observability test, utilizes the theory of lexicographic differentiation to compute sensitivity information. Moreover, the newly introduced L-SERC observability test can judges which states are observable and which are not. Additionally, we introduce a novel sensitivity-based extended Kalman filter (S-EKF) algorithm for state estimation, applicable to both smooth and nonsmooth DAE systems. Finally, we apply the newly developed S-EKF to estimate the states of a wind turbine power system model.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2507-2512"},"PeriodicalIF":2.0,"publicationDate":"2025-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145510102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feedback for Improved Hierarchical Reinforcement Learning With Timed Subgoals 带定时子目标的改进层次强化学习反馈
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-04 DOI: 10.1109/LCSYS.2025.3629008
Yajie Bao;Dan Shen;Genshe Chen;Hao Xu;Samson Badlia;Simon Khan;Erik Blasch;Khanh Pham
Hierarchical reinforcement learning (RL) aims to improve sample efficiency by decomposing complex long-horizon tasks into fast low-level myopic and slower high-level non-myopic subtasks. However, the unilateral nested policy structure in current goal-conditioned hierarchical RL (HRL) methods sets subgoals at the high level without considering feedback from the low level, which significantly degrades the performance of high-level subgoal generation and sampling efficiency. Hindsight action relabeling further weakens subgoal settings by submitting to low-level reachability. Inspired by feedback control of dynamic systems, we present Feedback for Improved HRL with Timed Subgoals (FIHTS), a mechanism allowing feedback control of subgoal generation for improved HRL. Unlike current HRL, FIHTS enables both the high level to set subgoals and the low level to receive rewards based on subgoal achievement. Our experiments in various challenging dynamic RL environments show that our FIHTS method achieves higher success rates with higher sample efficiency than existing subgoal-based HRL methods.
分层强化学习(Hierarchical reinforcement learning, RL)的目的是通过将复杂的长视距任务分解为快速的低视距子任务和缓慢的高视距非视距子任务来提高样本效率。然而,当前基于目标条件的分层RL (HRL)方法中的单边嵌套策略结构在高层设置子目标,而不考虑低层的反馈,严重降低了高层子目标生成的性能和采样效率。后见之明的行动重新标记通过提交低水平的可达性进一步削弱了子目标设置。受动态系统反馈控制的启发,我们提出了带定时子目标的改进HRL反馈控制机制(FIHTS),该机制允许对改进HRL的子目标生成进行反馈控制。与当前的HRL不同,fhts允许高层设置子目标,也允许低层根据子目标的实现获得奖励。我们在各种具有挑战性的动态RL环境中的实验表明,我们的方法比现有的基于子目标的HRL方法获得了更高的成功率和更高的样本效率。
{"title":"Feedback for Improved Hierarchical Reinforcement Learning With Timed Subgoals","authors":"Yajie Bao;Dan Shen;Genshe Chen;Hao Xu;Samson Badlia;Simon Khan;Erik Blasch;Khanh Pham","doi":"10.1109/LCSYS.2025.3629008","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3629008","url":null,"abstract":"Hierarchical reinforcement learning (RL) aims to improve sample efficiency by decomposing complex long-horizon tasks into fast low-level myopic and slower high-level non-myopic subtasks. However, the unilateral nested policy structure in current goal-conditioned hierarchical RL (HRL) methods sets subgoals at the high level without considering feedback from the low level, which significantly degrades the performance of high-level subgoal generation and sampling efficiency. Hindsight action relabeling further weakens subgoal settings by submitting to low-level reachability. Inspired by feedback control of dynamic systems, we present Feedback for Improved HRL with Timed Subgoals (FIHTS), a mechanism allowing feedback control of subgoal generation for improved HRL. Unlike current HRL, FIHTS enables both the high level to set subgoals and the low level to receive rewards based on subgoal achievement. Our experiments in various challenging dynamic RL environments show that our FIHTS method achieves higher success rates with higher sample efficiency than existing subgoal-based HRL methods.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2501-2506"},"PeriodicalIF":2.0,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145510104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward O(log(1/ϵ)/ϵ) Computational Complexity for PL Functions in Decentralized Stochastic Optimization With Communication Noise 带通信噪声的分散随机优化PL函数的O(log(1/ λ)/ λ)计算复杂度
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-10-30 DOI: 10.1109/LCSYS.2025.3627045
Soham Mukherjee;Mrityunjoy Chakraborty
A decentralized stochastic optimization problem is considered, in which a network of nodes collaborate over noisy communication links to minimize a global objective function. Recently, the authors in Mukherjee and Chakraborty (2025) incorporated a skipping technique in the Noisy Consensus + Stochastic Gradient Descent (SGD) framework to address this problem, and showed that their proposed skipping technique helps improve the ${mathrm { O}}text {(}1/epsilon ^{3}text {)}$ computational complexity obtained in previous works to ${mathrm { O}}text {(}1/epsilon ^{2}text {)}$ under a general smoothness assumption. In this letter, we consider the algorithm proposed in Mukherjee and Chakraborty (2025) to show how the computational complexity can be further improved to ${mathrm { O}}text {(}{mathrm { log}}text {(}1/epsilon text {)}/epsilon text {)}$ when the Polyak-Lojasiewicz (PL) condition is satisfied in addition to the smoothness assumption. The obtained ${mathrm { O}}text {(}{mathrm { log}}text {(}1/epsilon text {)}/epsilon text {)}$ rate in the current work is also an Improvement over the ${mathrm { O}}text {(}1/epsilon ^{2}text {)}$ rate obtained in previous works under the strong-convexity assumption (which is known to be stricter than the PL condition), and matches the $Omega text {(}1/epsilon text {)}$ lower bound for the number of stochastic gradient computations for the considered problem class up to an extra ${mathrm { log}}text {(}1/epsilon text {)}$ factor. Last but not least, the ${mathrm { O}}text {(}{mathrm { log}}text {(}1/epsilon text {)}/epsilon text {)}$ computational cost is achieved while retaining the ${mathrm { O}}text {(}{mathrm { log}}^{2}text {(}1/epsilon text {)}/epsilon ^{2}text {)}$ rate for the number of iterations and communication rounds, which is at par with the results obtained in previous works which consider strong-convexity, up to logarithmic factors. A numerical experiment is conducted corroborate theoretical results.
研究了一个分散的随机优化问题,其中节点网络在有噪声的通信链路上协作以最小化全局目标函数。最近,Mukherjee和Chakraborty(2025)的作者在噪声一致+随机梯度下降(SGD)框架中引入了一种跳过技术来解决这个问题,并表明他们提出的跳过技术有助于在一般平滑假设下将先前工作中获得的${mathrm { O}}text {(}1/epsilon ^{3}text {)}$计算复杂度提高到${mathrm { O}}text {(}1/epsilon ^{2}text {)}$。在这封信中,我们考虑了Mukherjee和Chakraborty(2025)中提出的算法,以展示在平滑假设之外满足Polyak-Lojasiewicz (PL)条件时如何将计算复杂度进一步提高到${mathrm { O}}text {(}{mathrm { log}}text {(}1/epsilon text {)}/epsilon text {)}$。在当前工作中获得的${mathrm { O}}text {(}{mathrm { log}}text {(}1/epsilon text {)}/epsilon text {)}$比率也是在强凸性假设下(已知比PL条件更严格)的先前工作中获得的${mathrm { O}}text {(}1/epsilon ^{2}text {)}$比率的改进,并且与所考虑的问题类别的随机梯度计算次数的$Omega text {(}1/epsilon text {)}$下界匹配到一个额外的${mathrm { log}}text {(}1/epsilon text {)}$因子。最后但并非最不重要的是,${mathrm { O}}text {(}{mathrm { log}}text {(}1/epsilon text {)}/epsilon text {)}$计算成本是在保持迭代次数和通信轮数的${mathrm { O}}text {(}{mathrm { log}}^{2}text {(}1/epsilon text {)}/epsilon ^{2}text {)}$速率的同时实现的,这与之前考虑强凸性的工作中获得的结果相当,直到对数因子。数值实验验证了理论结果。
{"title":"Toward O(log(1/ϵ)/ϵ) Computational Complexity for PL Functions in Decentralized Stochastic Optimization With Communication Noise","authors":"Soham Mukherjee;Mrityunjoy Chakraborty","doi":"10.1109/LCSYS.2025.3627045","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3627045","url":null,"abstract":"A decentralized stochastic optimization problem is considered, in which a network of nodes collaborate over noisy communication links to minimize a global objective function. Recently, the authors in Mukherjee and Chakraborty (2025) incorporated a skipping technique in the Noisy Consensus + Stochastic Gradient Descent (SGD) framework to address this problem, and showed that their proposed skipping technique helps improve the <inline-formula> <tex-math>${mathrm { O}}text {(}1/epsilon ^{3}text {)}$ </tex-math></inline-formula> computational complexity obtained in previous works to <inline-formula> <tex-math>${mathrm { O}}text {(}1/epsilon ^{2}text {)}$ </tex-math></inline-formula> under a general smoothness assumption. In this letter, we consider the algorithm proposed in Mukherjee and Chakraborty (2025) to show how the computational complexity can be further improved to <inline-formula> <tex-math>${mathrm { O}}text {(}{mathrm { log}}text {(}1/epsilon text {)}/epsilon text {)}$ </tex-math></inline-formula> when the Polyak-Lojasiewicz (PL) condition is satisfied in addition to the smoothness assumption. The obtained <inline-formula> <tex-math>${mathrm { O}}text {(}{mathrm { log}}text {(}1/epsilon text {)}/epsilon text {)}$ </tex-math></inline-formula> rate in the current work is also an Improvement over the <inline-formula> <tex-math>${mathrm { O}}text {(}1/epsilon ^{2}text {)}$ </tex-math></inline-formula> rate obtained in previous works under the strong-convexity assumption (which is known to be stricter than the PL condition), and matches the <inline-formula> <tex-math>$Omega text {(}1/epsilon text {)}$ </tex-math></inline-formula> lower bound for the number of stochastic gradient computations for the considered problem class up to an extra <inline-formula> <tex-math>${mathrm { log}}text {(}1/epsilon text {)}$ </tex-math></inline-formula> factor. Last but not least, the <inline-formula> <tex-math>${mathrm { O}}text {(}{mathrm { log}}text {(}1/epsilon text {)}/epsilon text {)}$ </tex-math></inline-formula> computational cost is achieved while retaining the <inline-formula> <tex-math>${mathrm { O}}text {(}{mathrm { log}}^{2}text {(}1/epsilon text {)}/epsilon ^{2}text {)}$ </tex-math></inline-formula> rate for the number of iterations and communication rounds, which is at par with the results obtained in previous works which consider strong-convexity, up to logarithmic factors. A numerical experiment is conducted corroborate theoretical results.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2477-2482"},"PeriodicalIF":2.0,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A General Framework for Detectability in Stochastic Discrete-Event Systems 随机离散事件系统可检测性的一般框架
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-10-30 DOI: 10.1109/LCSYS.2025.3626794
Jun Chen;Feng Lin
This letter proposes a new framework to capture detectability property in stochastic discrete-event systems. A new notion, name Partition-based Detectability or P-Detectability, is proposed based on partitions of the system state space, rather than the state space itself. In other words, the proposed P-Detectability focuses on the system capability to detect certain state group from other state groups, while ignoring the ambiguity between individual states within the same state group. As a consequence, the proposed P-Detectability allows users to define customized public and cover to ignore irrelevant ambiguity. Compared to existing notions such as A-Detectability and A-Diagnosabiltiy, the proposed notion is shown to be more general. A necessary and sufficient condition to verify P-Detectability, together with a testing algorithm, are developed.
本文提出了一个捕捉随机离散事件系统可检测性的新框架。提出了一个新的概念,称为基于分区的可探测性或p -可探测性,它基于系统状态空间的分区,而不是状态空间本身。换句话说,所提出的p -可探测性侧重于系统从其他状态组中检测特定状态组的能力,而忽略了同一状态组中单个状态之间的模糊性。因此,建议的p -可探测性允许用户定义定制的公共和覆盖,以忽略不相关的歧义。与现有的a -可检测性和a -可诊断性等概念相比,本文提出的概念更为普遍。给出了验证p -可探测性的充分必要条件,并给出了检验算法。
{"title":"A General Framework for Detectability in Stochastic Discrete-Event Systems","authors":"Jun Chen;Feng Lin","doi":"10.1109/LCSYS.2025.3626794","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3626794","url":null,"abstract":"This letter proposes a new framework to capture detectability property in stochastic discrete-event systems. A new notion, name Partition-based Detectability or P-Detectability, is proposed based on partitions of the system state space, rather than the state space itself. In other words, the proposed P-Detectability focuses on the system capability to detect certain state group from other state groups, while ignoring the ambiguity between individual states within the same state group. As a consequence, the proposed P-Detectability allows users to define customized public and cover to ignore irrelevant ambiguity. Compared to existing notions such as A-Detectability and A-Diagnosabiltiy, the proposed notion is shown to be more general. A necessary and sufficient condition to verify P-Detectability, together with a testing algorithm, are developed.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2465-2470"},"PeriodicalIF":2.0,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Finite- and Fixed-Time Gradient Flows for Constrained Optimization via Control Barrier Functions 基于控制势垒函数的有限和固定时间梯度流约束优化
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-10-30 DOI: 10.1109/LCSYS.2025.3627044
Baby Diana;Shyam Kamal;Sandip Ghosh;Thach Ngoc Dinh
This letter presents a novel finite- and fixed-time convergent framework for solving constrained convex optimization problems using safe gradient flow dynamics. In the existing literature, finite- and fixed-time gradient flow methods have primarily addressed equality constraints by formulating Lagrangian-based gradient dynamics or a scaled augmented Lagrangian formulation. In contrast, the proposed method augments the gradient flow with constraint-driven control inputs derived using control barrier function (CBF) techniques, enabling simultaneous handling of equality and inequality constraints without relying on penalty function formulations. This formulation treats decision variables as system states and Lagrange multipliers as control inputs, enabling the feasible set to act as an invariant safe set throughout the system’s evolution. The approach ensures strict feasibility and guarantees convergence to the optimal solution in both finite- and fixed- time. Rigorous Lyapunov-based analysis establishes theoretical stability guarantees, and numerical simulations demonstrate rapid, exact convergence for problems involving both equality and inequality constraints.
这封信提出了一个新的有限和固定时间收敛框架解决约束凸优化问题,使用安全梯度流动动力学。在现有的文献中,有限时间和固定时间梯度流方法主要通过建立基于拉格朗日的梯度动力学或缩放增广拉格朗日公式来解决等式约束。相比之下,所提出的方法通过使用控制屏障函数(CBF)技术推导的约束驱动控制输入来增强梯度流,从而可以同时处理相等和不等式约束,而无需依赖于惩罚函数公式。该公式将决策变量作为系统状态,拉格朗日乘数作为控制输入,使可行集在整个系统演化过程中充当不变的安全集。该方法具有严格的可行性,并保证在有限时间和固定时间内收敛到最优解。严格的李雅普诺夫分析建立了理论稳定性保证,数值模拟证明了涉及相等和不等式约束的问题的快速,精确收敛。
{"title":"Finite- and Fixed-Time Gradient Flows for Constrained Optimization via Control Barrier Functions","authors":"Baby Diana;Shyam Kamal;Sandip Ghosh;Thach Ngoc Dinh","doi":"10.1109/LCSYS.2025.3627044","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3627044","url":null,"abstract":"This letter presents a novel finite- and fixed-time convergent framework for solving constrained convex optimization problems using safe gradient flow dynamics. In the existing literature, finite- and fixed-time gradient flow methods have primarily addressed equality constraints by formulating Lagrangian-based gradient dynamics or a scaled augmented Lagrangian formulation. In contrast, the proposed method augments the gradient flow with constraint-driven control inputs derived using control barrier function (CBF) techniques, enabling simultaneous handling of equality and inequality constraints without relying on penalty function formulations. This formulation treats decision variables as system states and Lagrange multipliers as control inputs, enabling the feasible set to act as an invariant safe set throughout the system’s evolution. The approach ensures strict feasibility and guarantees convergence to the optimal solution in both finite- and fixed- time. Rigorous Lyapunov-based analysis establishes theoretical stability guarantees, and numerical simulations demonstrate rapid, exact convergence for problems involving both equality and inequality constraints.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2483-2488"},"PeriodicalIF":2.0,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145510103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Power Flow Security Maximization via Inverse Chance Constrained Optimization 基于逆机会约束优化的潮流安全最大化
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-10-28 DOI: 10.1109/LCSYS.2025.3626276
Shenglu Wang;Kairui Feng;Mengqi Xue;Yue Song
The chance-constrained optimal power flow (CC-OPF) essentially finds the low-cost generation dispatch scheme ensuring operational constraints are met with a specified probability, termed the security level. While the security level is a crucial input parameter, how it shapes the CC-OPF feasibility boundary has not been revealed. Changing the security level from a parameter to a decision variable, this letter proposes a security maximization approach based on the chance constrained DC-OPF model, termed inverse CC-OPF (ICC-OPF), that seeks the maximum security level achievable by the system. To efficiently solve the ICC-OPF, we design a Newton-Raphson-like iteration algorithm leveraging the duality-based sensitivity analysis of an associated surrogate problem. Numerical experiments validate the proposed approach, revealing complex feasibility boundaries for security levels that underscore the importance of coordinating security levels across multiple chance constraints.
机会约束最优潮流(CC-OPF)本质上是寻找低成本的发电调度方案,以确保运行约束在特定的概率下得到满足,称为安全级别。虽然安全级别是一个关键的输入参数,但它如何塑造CC-OPF可行性边界尚未揭示。将安全级别从参数更改为决策变量,这封信提出了一种基于机会约束DC-OPF模型的安全最大化方法,称为逆CC-OPF (ICC-OPF),该方法寻求系统可实现的最大安全级别。为了有效地解决ICC-OPF问题,我们设计了一种类似newton - raphson的迭代算法,利用相关代理问题的基于对偶性的灵敏度分析。数值实验验证了所提出的方法,揭示了安全级别的复杂可行性边界,强调了跨多个机会约束协调安全级别的重要性。
{"title":"Power Flow Security Maximization via Inverse Chance Constrained Optimization","authors":"Shenglu Wang;Kairui Feng;Mengqi Xue;Yue Song","doi":"10.1109/LCSYS.2025.3626276","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3626276","url":null,"abstract":"The chance-constrained optimal power flow (CC-OPF) essentially finds the low-cost generation dispatch scheme ensuring operational constraints are met with a specified probability, termed the security level. While the security level is a crucial input parameter, how it shapes the CC-OPF feasibility boundary has not been revealed. Changing the security level from a parameter to a decision variable, this letter proposes a security maximization approach based on the chance constrained DC-OPF model, termed inverse CC-OPF (ICC-OPF), that seeks the maximum security level achievable by the system. To efficiently solve the ICC-OPF, we design a Newton-Raphson-like iteration algorithm leveraging the duality-based sensitivity analysis of an associated surrogate problem. Numerical experiments validate the proposed approach, revealing complex feasibility boundaries for security levels that underscore the importance of coordinating security levels across multiple chance constraints.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2489-2494"},"PeriodicalIF":2.0,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-Driven Control Based on Virtual Reference Feedback Tuning With Optimal Reference Model for Non-Minimum Phase Multivariable Systems 非最小相位多变量系统基于最优参考模型虚拟参考反馈整定的数据驱动控制
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-10-28 DOI: 10.1109/LCSYS.2025.3626272
Mohammad Jeddi;Mohammad Hossein Khademi;Ali Khaki-Sedigh
Data-driven control methodologies design controllers directly from data without explicit system models. Among these methods, the Virtual Reference Feedback Tuning (VRFT) method uses offline data collected from an unknown system. However, challenges arise in selecting an appropriate reference model and in handling non-minimum phase (NMP) transmission zeros. This letter extends an optimal reference-model selection algorithm from single-input single-output (SISO) to multivariable systems and introduces a one-shot method for identifying and incorporating NMP transmission zeros. An appropriate cost function is formulated for multivariable systems, and evolutionary optimization techniques are applied for reference-model selection. Simulation results validate the effectiveness of the proposed approach, contributing to improved data-driven control design for multivariable systems.
数据驱动控制方法直接从数据设计控制器,不需要明确的系统模型。在这些方法中,虚拟参考反馈调谐(VRFT)方法使用从未知系统收集的离线数据。然而,在选择合适的参考模型和处理非最小相位(NMP)传输零点方面出现了挑战。本文将单输入单输出(SISO)的最佳参考模型选择算法扩展到多变量系统,并介绍了一种一次性识别和合并NMP传输零点的方法。针对多变量系统建立了合适的成本函数,并采用进化优化技术进行参考模型选择。仿真结果验证了该方法的有效性,有助于改进多变量系统的数据驱动控制设计。
{"title":"Data-Driven Control Based on Virtual Reference Feedback Tuning With Optimal Reference Model for Non-Minimum Phase Multivariable Systems","authors":"Mohammad Jeddi;Mohammad Hossein Khademi;Ali Khaki-Sedigh","doi":"10.1109/LCSYS.2025.3626272","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3626272","url":null,"abstract":"Data-driven control methodologies design controllers directly from data without explicit system models. Among these methods, the Virtual Reference Feedback Tuning (VRFT) method uses offline data collected from an unknown system. However, challenges arise in selecting an appropriate reference model and in handling non-minimum phase (NMP) transmission zeros. This letter extends an optimal reference-model selection algorithm from single-input single-output (SISO) to multivariable systems and introduces a one-shot method for identifying and incorporating NMP transmission zeros. An appropriate cost function is formulated for multivariable systems, and evolutionary optimization techniques are applied for reference-model selection. Simulation results validate the effectiveness of the proposed approach, contributing to improved data-driven control design for multivariable systems.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2459-2464"},"PeriodicalIF":2.0,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Policy Gradient Bounds in Multitask LQR 多任务LQR中的策略梯度边界
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-10-27 DOI: 10.1109/LCSYS.2025.3625957
Charis Stamouli;Leonardo F. Toso;Anastasios Tsiamis;George J. Pappas;James Anderson
We analyze the performance of policy gradient in multitask linear quadratic regulation (LQR), where the system and cost parameters differ across tasks. The main goal of multitask LQR is to find a controller with satisfactory performance on every task. Prior analyses on relevant contexts fail to capture closed-loop task similarities, resulting in conservative performance guarantees. To account for such similarities, we propose bisimulation-based measures of task heterogeneity. Our measures employ new bisimulation functions to bound the cost gradient distance between a pair of tasks in closed loop with a common stabilizing controller. Employing these measures, we derive suboptimality bounds for both the multitask optimal controller and the asymptotic policy gradient controller with respect to each of the tasks. We further provide conditions under which the policy gradient iterates remain stabilizing for every system. For multiple random sets of certain tasks, we observe that our bisimulation-based measures improve upon baseline measures of task heterogeneity dramatically.
本文分析了多任务线性二次调节(LQR)中不同任务间系统参数和成本参数不同的策略梯度的性能。多任务LQR的主要目标是找到对每个任务都具有满意性能的控制器。先前对相关上下文的分析未能捕获闭环任务相似性,导致保守的性能保证。为了解释这种相似性,我们提出了基于双模拟的任务异质性测量方法。我们的方法采用新的双仿真函数,用一个通用的稳定控制器约束闭环中一对任务之间的代价梯度距离。利用这些度量,我们推导了多任务最优控制器和渐近策略梯度控制器相对于每个任务的次优性界。我们进一步提供了每个系统的策略梯度迭代保持稳定的条件。对于特定任务的多个随机集,我们观察到我们基于双模拟的测量显著改善了任务异质性的基线测量。
{"title":"Policy Gradient Bounds in Multitask LQR","authors":"Charis Stamouli;Leonardo F. Toso;Anastasios Tsiamis;George J. Pappas;James Anderson","doi":"10.1109/LCSYS.2025.3625957","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3625957","url":null,"abstract":"We analyze the performance of policy gradient in multitask linear quadratic regulation (LQR), where the system and cost parameters differ across tasks. The main goal of multitask LQR is to find a controller with satisfactory performance on every task. Prior analyses on relevant contexts fail to capture closed-loop task similarities, resulting in conservative performance guarantees. To account for such similarities, we propose bisimulation-based measures of task heterogeneity. Our measures employ new bisimulation functions to bound the cost gradient distance between a pair of tasks in closed loop with a common stabilizing controller. Employing these measures, we derive suboptimality bounds for both the multitask optimal controller and the asymptotic policy gradient controller with respect to each of the tasks. We further provide conditions under which the policy gradient iterates remain stabilizing for every system. For multiple random sets of certain tasks, we observe that our bisimulation-based measures improve upon baseline measures of task heterogeneity dramatically.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2495-2500"},"PeriodicalIF":2.0,"publicationDate":"2025-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed Active-Constrained Consensus of Second-Order Multi-Agent Systems 二阶多智能体系统的分布式主动约束一致性
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-10-27 DOI: 10.1109/LCSYS.2025.3626269
Mengxiang Zeng;Peng Lin
Most of the existing studies on position constraints integrate the constraint conditions into the agent dynamics. In contrast, the active constraints considered in this letter are implemented through control algorithms, ensuring that the agents’ states remain within their constraint sets without altering the system dynamics. Therefore, this letter explores the active-constrained consensus problem with nonconvex velocity and convex position constraints. The position constraints are fundamentally different from the velocity constraints, because the velocity constraint sets contain the origin, whereas the position constraint sets may not. The key difficulty is how to deal with the coupling between these two fundamentally different constraints. By performing a series of model transformations and utilizing the convexity of the system, it is demonstrated that the active-constrained consensus can be attained. Finally, simulation examples show the validity of the conclusions.
现有的关于位置约束的研究大多将约束条件整合到智能体动力学中。相比之下,本文中考虑的主动约束是通过控制算法实现的,确保智能体的状态保持在约束集内,而不会改变系统动力学。因此,本文探讨了具有非凸速度和凸位置约束的主动约束一致性问题。位置约束从根本上不同于速度约束,因为速度约束集包含原点,而位置约束集可能不包含原点。关键的困难是如何处理这两个根本不同的约束之间的耦合。通过对模型进行一系列的变换,利用系统的凸性,可以得到主动约束的一致性。最后,通过仿真算例验证了结论的有效性。
{"title":"Distributed Active-Constrained Consensus of Second-Order Multi-Agent Systems","authors":"Mengxiang Zeng;Peng Lin","doi":"10.1109/LCSYS.2025.3626269","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3626269","url":null,"abstract":"Most of the existing studies on position constraints integrate the constraint conditions into the agent dynamics. In contrast, the active constraints considered in this letter are implemented through control algorithms, ensuring that the agents’ states remain within their constraint sets without altering the system dynamics. Therefore, this letter explores the active-constrained consensus problem with nonconvex velocity and convex position constraints. The position constraints are fundamentally different from the velocity constraints, because the velocity constraint sets contain the origin, whereas the position constraint sets may not. The key difficulty is how to deal with the coupling between these two fundamentally different constraints. By performing a series of model transformations and utilizing the convexity of the system, it is demonstrated that the active-constrained consensus can be attained. Finally, simulation examples show the validity of the conclusions.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2471-2476"},"PeriodicalIF":2.0,"publicationDate":"2025-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Control Systems Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1