首页 > 最新文献

IEEE Control Systems Letters最新文献

英文 中文
Gradient-Based Co-Design of Nonlinear Optimal Regulators With Stability Guarantee 具有稳定保证的非线性最优调节器的梯度协同设计
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-19 DOI: 10.1109/LCSYS.2025.3634657
Hikaru Hoshino
This letter proposes a gradient-based control co-design method for nonlinear optimal regulator problems, where physical design parameters and feedback controllers are optimized simultaneously. The proposed method is based on Galerkin approximations of the Hamilton–Jacobi–Bellman equation in a policy iteration framework. The key idea is to evaluate closed-loop performance as the expected cost over a prescribed distribution of initial states, which enables sensitivity analysis and gradient-based updates of the design parameters, while the controller is improved through policy iteration. As a result, the proposed method overcomes restrictive structural assumptions such as system equivalence, thereby avoiding conservatism and allowing flexible incorporation of design-dependent costs. Moreover, closed-loop stability is ensured at every iteration of the co-design procedure by embedding a recursive admissibility verification that combines two complementary Lyapunov conditions. The effectiveness of the proposed method is demonstrated through an example of a load positioning system.
这封信提出了一种基于梯度的非线性最优调节器问题的控制协同设计方法,其中物理设计参数和反馈控制器同时优化。该方法基于策略迭代框架中Hamilton-Jacobi-Bellman方程的Galerkin近似。其关键思想是将闭环性能评估为初始状态在规定分布上的预期成本,从而实现灵敏度分析和基于梯度的设计参数更新,同时通过策略迭代改进控制器。因此,所提出的方法克服了限制性结构假设,如系统等效,从而避免了保守性,并允许灵活地纳入设计相关成本。此外,通过嵌入结合两个互补Lyapunov条件的递归可容许性验证,确保了协同设计过程的每次迭代的闭环稳定性。通过负载定位系统的算例验证了该方法的有效性。
{"title":"Gradient-Based Co-Design of Nonlinear Optimal Regulators With Stability Guarantee","authors":"Hikaru Hoshino","doi":"10.1109/LCSYS.2025.3634657","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3634657","url":null,"abstract":"This letter proposes a gradient-based control co-design method for nonlinear optimal regulator problems, where physical design parameters and feedback controllers are optimized simultaneously. The proposed method is based on Galerkin approximations of the Hamilton–Jacobi–Bellman equation in a policy iteration framework. The key idea is to evaluate closed-loop performance as the expected cost over a prescribed distribution of initial states, which enables sensitivity analysis and gradient-based updates of the design parameters, while the controller is improved through policy iteration. As a result, the proposed method overcomes restrictive structural assumptions such as system equivalence, thereby avoiding conservatism and allowing flexible incorporation of design-dependent costs. Moreover, closed-loop stability is ensured at every iteration of the co-design procedure by embedding a recursive admissibility verification that combines two complementary Lyapunov conditions. The effectiveness of the proposed method is demonstrated through an example of a load positioning system.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2561-2566"},"PeriodicalIF":2.0,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11260446","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145612065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal Multimarginal Schrödinger Bridge: Minimum Spanning Tree Over Measure-Valued Vertices 最优多边际Schrödinger桥:最小生成树在测量值顶点
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-19 DOI: 10.1109/LCSYS.2025.3634944
Georgiy A. Bondar;Abhishek Halder
The Multimarginal Schrödinger Bridge (MSB) finds the optimal coupling among a collection of random vectors with known statistics and a known correlation structure. In the MSB formulation, this correlation structure is specified a priori as an undirected connected graph with measure-valued vertices. In this letter, we formulate and solve the problem of finding the optimal MSB in the sense we seek the optimal coupling over all possible graph structures. We find that computing the optimal MSB amounts to solving the minimum spanning tree problem over measure-valued vertices. We show that the resulting problem can be solved in two steps. The first step constructs a complete graph with edge weight equal to a sum of the optimal value of the corresponding bimarginal SB and the entropies of the endpoints. The second step solves a minimum spanning tree problem over that weighted graph. Numerical experiments illustrate the proposed solution.
Multimarginal Bridge (MSB)找到具有已知统计量和已知相关结构的随机向量集合之间的最优耦合。在MSB公式中,这种关联结构被先验地指定为具有测量值顶点的无向连接图。在这封信中,我们制定并解决了寻找最优MSB的问题,在某种意义上,我们寻求所有可能的图结构的最优耦合。我们发现计算最优MSB相当于解决测量值顶点上的最小生成树问题。我们表明,由此产生的问题可以分两步解决。第一步构造一个完全图,其边权等于相应的双边SB的最优值与端点的熵的和。第二步解决加权图上的最小生成树问题。数值实验验证了所提出的解决方案。
{"title":"Optimal Multimarginal Schrödinger Bridge: Minimum Spanning Tree Over Measure-Valued Vertices","authors":"Georgiy A. Bondar;Abhishek Halder","doi":"10.1109/LCSYS.2025.3634944","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3634944","url":null,"abstract":"The Multimarginal Schrödinger Bridge (MSB) finds the optimal coupling among a collection of random vectors with known statistics and a known correlation structure. In the MSB formulation, this correlation structure is specified a priori as an undirected connected graph with measure-valued vertices. In this letter, we formulate and solve the problem of finding the optimal MSB in the sense we seek the optimal coupling over all possible graph structures. We find that computing the optimal MSB amounts to solving the minimum spanning tree problem over measure-valued vertices. We show that the resulting problem can be solved in two steps. The first step constructs a complete graph with edge weight equal to a sum of the optimal value of the corresponding bimarginal SB and the entropies of the endpoints. The second step solves a minimum spanning tree problem over that weighted graph. Numerical experiments illustrate the proposed solution.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2555-2560"},"PeriodicalIF":2.0,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145612053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Output Feedback Quantized Tracking Control for a Class of Nonlinear Systems With Sensor Failures 一类传感器失效非线性系统的输出反馈量化跟踪控制
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-19 DOI: 10.1109/LCSYS.2025.3635516
Hanfeng Li;Min Li
The primary objective of this letter is to develop an adaptive output feedback quantized control scheme for strict-feedback nonlinear systems. Three technical challenges must be overcome in addressing this problem. First, it is necessary to compensate for the unknown parameter associated with sensor failures, but the existing compensating method is applicable only within the backstepping framework, which complicates the overall design process. Second, sensor failures lead to only corrupted output being available and prevent the observer and controller from utilizing the true system output. Third, the nature of input quantization renders the actual controller discontinuous, which unavoidably introduces an undesirable quantization error and thus poses a challenge in mitigating its effect. To circumvent these difficulties, this letter establishes a new compensation scheme and a transformation of the control signal to address the effects of sensor failures and input quantization. A concise dynamic gain approach is proposed to construct a novel adaptive observer by using only the corrupted output signal. It is shown that, with the derived output feedback quantized controller, all closed-loop signals are bounded and the tracking error converges to an adjustable region. Two simulation examples are presented to demonstrate the effectiveness of the proposed scheme.
本文的主要目的是为严格反馈非线性系统开发一种自适应输出反馈量化控制方案。要解决这一问题,必须克服三个技术挑战。首先,需要对与传感器故障相关的未知参数进行补偿,但现有的补偿方法仅适用于退步框架,使整体设计过程复杂化。其次,传感器故障导致只有损坏的输出可用,并阻止观测器和控制器利用真实的系统输出。第三,输入量化的性质使得实际控制器不连续,这不可避免地引入了不希望的量化误差,从而对减轻其影响提出了挑战。为了克服这些困难,本文建立了一种新的补偿方案和控制信号的转换,以解决传感器故障和输入量化的影响。提出了一种简明的动态增益方法,仅利用损坏的输出信号构建一种新的自适应观测器。结果表明,在导出的输出反馈量化控制器下,所有闭环信号都是有界的,跟踪误差收敛到一个可调区域。通过两个仿真实例验证了该方法的有效性。
{"title":"Output Feedback Quantized Tracking Control for a Class of Nonlinear Systems With Sensor Failures","authors":"Hanfeng Li;Min Li","doi":"10.1109/LCSYS.2025.3635516","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3635516","url":null,"abstract":"The primary objective of this letter is to develop an adaptive output feedback quantized control scheme for strict-feedback nonlinear systems. Three technical challenges must be overcome in addressing this problem. First, it is necessary to compensate for the unknown parameter associated with sensor failures, but the existing compensating method is applicable only within the backstepping framework, which complicates the overall design process. Second, sensor failures lead to only corrupted output being available and prevent the observer and controller from utilizing the true system output. Third, the nature of input quantization renders the actual controller discontinuous, which unavoidably introduces an undesirable quantization error and thus poses a challenge in mitigating its effect. To circumvent these difficulties, this letter establishes a new compensation scheme and a transformation of the control signal to address the effects of sensor failures and input quantization. A concise dynamic gain approach is proposed to construct a novel adaptive observer by using only the corrupted output signal. It is shown that, with the derived output feedback quantized controller, all closed-loop signals are bounded and the tracking error converges to an adjustable region. Two simulation examples are presented to demonstrate the effectiveness of the proposed scheme.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2573-2578"},"PeriodicalIF":2.0,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145612070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Control Allocation for Nonlinear Systems via a Sensitivity Approach 基于灵敏度方法的非线性系统动态控制分配
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-17 DOI: 10.1109/LCSYS.2025.3633369
S. Akbari;S. Galeani;G. Manca;M. Sassano
The objective of this letter is to develop a dynamic control allocation framework for nonlinear systems subject to periodic exogenous signals. The proposed approach combines gradient-based optimization with parameter-dependent sensitivity dynamics, enabling an extension of control allocation beyond conventional strategies, tailored to constant references and instantaneous costs; this methodology systematically addresses periodic reference trajectories and integral cost functionals, while preserving the desirable property of output invisibility, similarly to the linear settings. The proposed technique is implemented and validated on a nonlinear mechanical system.
这封信的目的是为受周期性外生信号影响的非线性系统开发一个动态控制分配框架。所提出的方法将基于梯度的优化与参数依赖的灵敏度动力学相结合,使控制分配的扩展超出了常规策略,可以根据恒定的参考和瞬时成本进行定制;该方法系统地处理周期参考轨迹和积分成本函数,同时保留输出不可见性的理想特性,类似于线性设置。该方法在一个非线性机械系统上进行了实现和验证。
{"title":"Dynamic Control Allocation for Nonlinear Systems via a Sensitivity Approach","authors":"S. Akbari;S. Galeani;G. Manca;M. Sassano","doi":"10.1109/LCSYS.2025.3633369","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3633369","url":null,"abstract":"The objective of this letter is to develop a dynamic control allocation framework for nonlinear systems subject to periodic exogenous signals. The proposed approach combines gradient-based optimization with parameter-dependent sensitivity dynamics, enabling an extension of control allocation beyond conventional strategies, tailored to constant references and instantaneous costs; this methodology systematically addresses periodic reference trajectories and integral cost functionals, while preserving the desirable property of output invisibility, similarly to the linear settings. The proposed technique is implemented and validated on a nonlinear mechanical system.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2549-2554"},"PeriodicalIF":2.0,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145560765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Position Tracking Control of Electro-Hydraulic Actuators: Elimination of Velocity Measurements 电液执行器的鲁棒位置跟踪控制:消除速度测量
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-17 DOI: 10.1109/LCSYS.2025.3633925
Sule Taskingollu;Alper Bayrak;Erman Selim;Enver Tatlicioglu;Erkan Zergeroglu
This letter presents a robust backstepping type controller formulation for the position tracking control of engineering systems actuated via electro-hydraulic actuators (EHAs). Specifically, a robust controller that does not require accurate knowledge of the system parameters and uses only position measurements is proposed. A filtered based approach is applied to remove the velocity dependency of the controller formulation. Stability of the closed loop system and the uniform boundedness of the tracking error signals are ensured via Lyapunov based arguments. The overall performance of the proposed method is illustrated, initially through physics-based MATLAB/Simscape studies, and then experimentally on a 1 degree of freedom (dof) EHA test-bed and a 2 dof robotic arm.
本文提出了一种鲁棒反步式控制器公式,用于通过电液致动器(EHAs)驱动的工程系统的位置跟踪控制。具体而言,提出了一种不需要精确了解系统参数并仅使用位置测量的鲁棒控制器。采用基于滤波的方法去除控制器公式的速度依赖性。通过基于李亚普诺夫的参数保证了闭环系统的稳定性和跟踪误差信号的一致有界性。首先通过基于物理的MATLAB/Simscape研究说明了所提出方法的总体性能,然后在1自由度EHA试验台和2自由度机械臂上进行了实验。
{"title":"Robust Position Tracking Control of Electro-Hydraulic Actuators: Elimination of Velocity Measurements","authors":"Sule Taskingollu;Alper Bayrak;Erman Selim;Enver Tatlicioglu;Erkan Zergeroglu","doi":"10.1109/LCSYS.2025.3633925","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3633925","url":null,"abstract":"This letter presents a robust backstepping type controller formulation for the position tracking control of engineering systems actuated via electro-hydraulic actuators (EHAs). Specifically, a robust controller that does not require accurate knowledge of the system parameters and uses only position measurements is proposed. A filtered based approach is applied to remove the velocity dependency of the controller formulation. Stability of the closed loop system and the uniform boundedness of the tracking error signals are ensured via Lyapunov based arguments. The overall performance of the proposed method is illustrated, initially through physics-based MATLAB/Simscape studies, and then experimentally on a 1 degree of freedom (dof) EHA test-bed and a 2 dof robotic arm.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2543-2548"},"PeriodicalIF":2.0,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145560764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Instability of Nesterov’s ODE Under Non-Conservative Vector Fields 非保守向量场下Nesterov ODE的不稳定性
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-17 DOI: 10.1109/LCSYS.2025.3633320
Daniel E. Ochoa;Mahmoud Abdelgalil;Jorge I. Poveda
We study the instability of Nesterov’s ODE in non-conservative settings, where the driving term is not necessarily the gradient of a potential function. While convergence properties under Nesterov’s ODE are well-characterized for settings with gradient-based driving terms, we show that the presence of arbitrarily small non-conservative terms can lead to instability. To resolve the instability issue, we study a regularization mechanism based on restarting. For this mechanism, we establish novel explicit bounds on the resetting period that ensure the decrease of a suitable Lyapunov function, thereby guaranteeing stability and “accelerated” convergence rates under suitable smoothness and monotonicity properties on the driving term. Numerical simulations support our results.
我们研究了非保守情况下Nesterov ODE的不稳定性,其中驱动项不一定是势函数的梯度。虽然Nesterov ODE下的收敛性质对于基于梯度的驱动项的设置具有很好的特征,但我们表明任意小的非保守项的存在可能导致不稳定。为了解决不稳定性问题,我们研究了一种基于重启的正则化机制。对于该机制,我们在复位周期上建立了新的显式边界,保证了适当的Lyapunov函数的减小,从而保证了在驱动项上适当的平滑性和单调性下的稳定性和“加速”收敛速率。数值模拟支持我们的结果。
{"title":"On the Instability of Nesterov’s ODE Under Non-Conservative Vector Fields","authors":"Daniel E. Ochoa;Mahmoud Abdelgalil;Jorge I. Poveda","doi":"10.1109/LCSYS.2025.3633320","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3633320","url":null,"abstract":"We study the instability of Nesterov’s ODE in non-conservative settings, where the driving term is not necessarily the gradient of a potential function. While convergence properties under Nesterov’s ODE are well-characterized for settings with gradient-based driving terms, we show that the presence of arbitrarily small non-conservative terms can lead to instability. To resolve the instability issue, we study a regularization mechanism based on restarting. For this mechanism, we establish novel explicit bounds on the resetting period that ensure the decrease of a suitable Lyapunov function, thereby guaranteeing stability and “accelerated” convergence rates under suitable smoothness and monotonicity properties on the driving term. Numerical simulations support our results.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2639-2644"},"PeriodicalIF":2.0,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145674666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Model Predictive Control for Spacecraft Rendezvous Under Sector-Bounded Nonlinearities 扇形有界非线性条件下航天器交会鲁棒模型预测控制
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-13 DOI: 10.1109/LCSYS.2025.3632763
Ákos M. Bokor;Felix Biertümpfe;Peter Seiler;Roland Tóth
This letter proposes a robust tube-based model predictive control approach for spacecraft rendezvous subject to sector-bounded nonlinearities and bounded disturbances. Unlike existing methods that design the feedback controller and tube tightening parameters sequentially, we jointly optimize both through a convex Linear Matrix Inequality using static quadratic constraints. This eliminates the conservatism inherent in two-step design procedures, while maintaining computational tractability for real-time implementation. The approach is validated through a CubeSat docking simulation with tight operational constraints, showing around 70% fuel savings, 21% reduction in average computational time and smaller tube sizes compared to LQR-based fixed-gain methods.
针对扇形有界非线性和有界扰动,提出了一种基于管的鲁棒模型预测控制方法。与现有的依次设计反馈控制器和管紧参数的方法不同,本文采用静态二次约束,通过凸线性矩阵不等式对两者进行联合优化。这消除了两步设计过程中固有的保守性,同时保持了实时实现的计算可跟踪性。与基于lqr的固定增益方法相比,该方法在严格操作约束的CubeSat对接模拟中得到了验证,节省了约70%的燃料,平均计算时间减少了21%,并且管尺寸更小。
{"title":"Robust Model Predictive Control for Spacecraft Rendezvous Under Sector-Bounded Nonlinearities","authors":"Ákos M. Bokor;Felix Biertümpfe;Peter Seiler;Roland Tóth","doi":"10.1109/LCSYS.2025.3632763","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3632763","url":null,"abstract":"This letter proposes a robust tube-based model predictive control approach for spacecraft rendezvous subject to sector-bounded nonlinearities and bounded disturbances. Unlike existing methods that design the feedback controller and tube tightening parameters sequentially, we jointly optimize both through a convex Linear Matrix Inequality using static quadratic constraints. This eliminates the conservatism inherent in two-step design procedures, while maintaining computational tractability for real-time implementation. The approach is validated through a CubeSat docking simulation with tight operational constraints, showing around 70% fuel savings, 21% reduction in average computational time and smaller tube sizes compared to LQR-based fixed-gain methods.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2567-2572"},"PeriodicalIF":2.0,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11245213","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145612069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed Fenchel Dual Gradient Algorithm for Constrained Convex Optimization Over Digraphs 有向图上约束凸优化的分布式Fenchel对偶梯度算法
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-13 DOI: 10.1109/LCSYS.2025.3632307
Yanan Zhu;Wenwu Yu;Guanghui Wen
This letter provides the first rigorous theoretical analysis for the Distributed Fenchel Dual Gradient (DFDG) algorithm, a continuous-time method for solving distributed convex optimization problems with local set constraints over digraphs. The DFDG algorithm, originally proposed in our prior work (Zhu et al., 2020), transforms the primal problem into its Fenchel dual and solves it using a two-time-scale dynamical system. This letter provides a more comprehensive explanation of the algorithm’s design mechanism and formally establishes its convergence properties. Under strong convexity and Lipschitz continuity assumptions, Lyapunov stability theory is employed to prove the asymptotic convergence to the optimal solutions of both the primal and its dual problems. This analysis provides rigorous guarantees for a class of dual-based algorithms over digraphs, filling a critical gap in the existing literature.
这封信为分布式Fenchel对偶梯度(DFDG)算法提供了第一个严格的理论分析,这是一种连续时间方法,用于解决有向图上具有局部集约束的分布式凸优化问题。DFDG算法最初是在我们之前的工作中提出的(Zhu et al., 2020),它将原始问题转换为Fenchel对偶,并使用双时间尺度动力系统来解决它。这封信提供了一个更全面的解释算法的设计机制,并正式建立了其收敛性质。在强凸性和Lipschitz连续性假设下,利用Lyapunov稳定性理论证明了该问题的最优解及其对偶问题的渐近收敛性。这种分析为有向图上的一类基于双重的算法提供了严格的保证,填补了现有文献中的一个关键空白。
{"title":"Distributed Fenchel Dual Gradient Algorithm for Constrained Convex Optimization Over Digraphs","authors":"Yanan Zhu;Wenwu Yu;Guanghui Wen","doi":"10.1109/LCSYS.2025.3632307","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3632307","url":null,"abstract":"This letter provides the first rigorous theoretical analysis for the Distributed Fenchel Dual Gradient (DFDG) algorithm, a continuous-time method for solving distributed convex optimization problems with local set constraints over digraphs. The DFDG algorithm, originally proposed in our prior work (Zhu et al., 2020), transforms the primal problem into its Fenchel dual and solves it using a two-time-scale dynamical system. This letter provides a more comprehensive explanation of the algorithm’s design mechanism and formally establishes its convergence properties. Under strong convexity and Lipschitz continuity assumptions, Lyapunov stability theory is employed to prove the asymptotic convergence to the optimal solutions of both the primal and its dual problems. This analysis provides rigorous guarantees for a class of dual-based algorithms over digraphs, filling a critical gap in the existing literature.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2537-2542"},"PeriodicalIF":2.0,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145560763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Observability and State Estimation for Smooth and Nonsmooth Differential Algebraic Equation Systems 光滑与非光滑微分代数方程组的可观测性与状态估计
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-07 DOI: 10.1109/LCSYS.2025.3630241
Hesham Abdelfattah;Sameh A. Eisa;Peter Stechlinski
In this letter, we extend the sensitivity-based rank condition (SERC) test for local observability to another class of systems, namely smooth and nonsmooth differential-algebraic equation (DAE) systems of index-1. The newly introduced test for DAEs, which we call the lexicographic SERC (L-SERC) observability test, utilizes the theory of lexicographic differentiation to compute sensitivity information. Moreover, the newly introduced L-SERC observability test can judges which states are observable and which are not. Additionally, we introduce a novel sensitivity-based extended Kalman filter (S-EKF) algorithm for state estimation, applicable to both smooth and nonsmooth DAE systems. Finally, we apply the newly developed S-EKF to estimate the states of a wind turbine power system model.
本文将基于灵敏度的秩条件(SERC)检验推广到另一类系统,即指标-1的光滑和非光滑微分代数方程(DAE)系统。新引入的DAEs测试,我们称之为词典SERC (L-SERC)可观察性测试,利用词典区分理论来计算灵敏度信息。此外,新引入的L-SERC可观察性检验可以判断哪些状态是可观察的,哪些状态是不可观察的。此外,我们引入了一种新的基于灵敏度的扩展卡尔曼滤波(S-EKF)算法用于状态估计,适用于光滑和非光滑DAE系统。最后,我们将新开发的S-EKF应用于风力发电系统模型的状态估计。
{"title":"Observability and State Estimation for Smooth and Nonsmooth Differential Algebraic Equation Systems","authors":"Hesham Abdelfattah;Sameh A. Eisa;Peter Stechlinski","doi":"10.1109/LCSYS.2025.3630241","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3630241","url":null,"abstract":"In this letter, we extend the sensitivity-based rank condition (SERC) test for local observability to another class of systems, namely smooth and nonsmooth differential-algebraic equation (DAE) systems of index-1. The newly introduced test for DAEs, which we call the lexicographic SERC (L-SERC) observability test, utilizes the theory of lexicographic differentiation to compute sensitivity information. Moreover, the newly introduced L-SERC observability test can judges which states are observable and which are not. Additionally, we introduce a novel sensitivity-based extended Kalman filter (S-EKF) algorithm for state estimation, applicable to both smooth and nonsmooth DAE systems. Finally, we apply the newly developed S-EKF to estimate the states of a wind turbine power system model.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2507-2512"},"PeriodicalIF":2.0,"publicationDate":"2025-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145510102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feedback for Improved Hierarchical Reinforcement Learning With Timed Subgoals 带定时子目标的改进层次强化学习反馈
IF 2 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2025-11-04 DOI: 10.1109/LCSYS.2025.3629008
Yajie Bao;Dan Shen;Genshe Chen;Hao Xu;Samson Badlia;Simon Khan;Erik Blasch;Khanh Pham
Hierarchical reinforcement learning (RL) aims to improve sample efficiency by decomposing complex long-horizon tasks into fast low-level myopic and slower high-level non-myopic subtasks. However, the unilateral nested policy structure in current goal-conditioned hierarchical RL (HRL) methods sets subgoals at the high level without considering feedback from the low level, which significantly degrades the performance of high-level subgoal generation and sampling efficiency. Hindsight action relabeling further weakens subgoal settings by submitting to low-level reachability. Inspired by feedback control of dynamic systems, we present Feedback for Improved HRL with Timed Subgoals (FIHTS), a mechanism allowing feedback control of subgoal generation for improved HRL. Unlike current HRL, FIHTS enables both the high level to set subgoals and the low level to receive rewards based on subgoal achievement. Our experiments in various challenging dynamic RL environments show that our FIHTS method achieves higher success rates with higher sample efficiency than existing subgoal-based HRL methods.
分层强化学习(Hierarchical reinforcement learning, RL)的目的是通过将复杂的长视距任务分解为快速的低视距子任务和缓慢的高视距非视距子任务来提高样本效率。然而,当前基于目标条件的分层RL (HRL)方法中的单边嵌套策略结构在高层设置子目标,而不考虑低层的反馈,严重降低了高层子目标生成的性能和采样效率。后见之明的行动重新标记通过提交低水平的可达性进一步削弱了子目标设置。受动态系统反馈控制的启发,我们提出了带定时子目标的改进HRL反馈控制机制(FIHTS),该机制允许对改进HRL的子目标生成进行反馈控制。与当前的HRL不同,fhts允许高层设置子目标,也允许低层根据子目标的实现获得奖励。我们在各种具有挑战性的动态RL环境中的实验表明,我们的方法比现有的基于子目标的HRL方法获得了更高的成功率和更高的样本效率。
{"title":"Feedback for Improved Hierarchical Reinforcement Learning With Timed Subgoals","authors":"Yajie Bao;Dan Shen;Genshe Chen;Hao Xu;Samson Badlia;Simon Khan;Erik Blasch;Khanh Pham","doi":"10.1109/LCSYS.2025.3629008","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3629008","url":null,"abstract":"Hierarchical reinforcement learning (RL) aims to improve sample efficiency by decomposing complex long-horizon tasks into fast low-level myopic and slower high-level non-myopic subtasks. However, the unilateral nested policy structure in current goal-conditioned hierarchical RL (HRL) methods sets subgoals at the high level without considering feedback from the low level, which significantly degrades the performance of high-level subgoal generation and sampling efficiency. Hindsight action relabeling further weakens subgoal settings by submitting to low-level reachability. Inspired by feedback control of dynamic systems, we present Feedback for Improved HRL with Timed Subgoals (FIHTS), a mechanism allowing feedback control of subgoal generation for improved HRL. Unlike current HRL, FIHTS enables both the high level to set subgoals and the low level to receive rewards based on subgoal achievement. Our experiments in various challenging dynamic RL environments show that our FIHTS method achieves higher success rates with higher sample efficiency than existing subgoal-based HRL methods.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2501-2506"},"PeriodicalIF":2.0,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145510104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Control Systems Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1