首页 > 最新文献

IEEE open journal of control systems最新文献

英文 中文
Novel Bounds for Incremental Hessian Estimation With Application to Zeroth-Order Federated Learning 应用于零阶联合学习的增量赫赛斯估计新界限
Pub Date : 2024-04-15 DOI: 10.1109/OJCSYS.2024.3388374
Alessio Maritan;Luca Schenato;Subhrakanti Dey
The Hessian matrix conveys important information about the curvature, spectrum and partial derivatives of a function, and is required in a variety of tasks. However, computing the exact Hessian is prohibitively expensive for high-dimensional input spaces, and is just impossible in zeroth-order optimization, where the objective function is a black-box of which only input-output pairs are known. In this work we address this relevant problem by providing a rigorous analysis of an Hessian estimator available in the literature, allowing it to be used as a provably accurate replacement of the true Hessian matrix. The Hessian estimator is randomized and incremental, and its computation requires only point function evaluations. We provide non-asymptotic convergence bounds on the estimation error and derive the minimum number of function queries needed to achieve a desired accuracy with arbitrarily high probability. In the second part of the paper we show a practical application of our results, introducing a novel optimization algorithm suitable for non-convex and black-box federated learning. The algorithm only requires clients to evaluate their local functions at certain input points, and builds a sufficiently accurate estimate of the global Hessian matrix in a distributed way. The algorithm exploits inexact cubic regularization to escape saddle points and guarantees convergence with optimal iteration complexity and high probability. Numerical results show that the proposed algorithm outperforms the existing zeroth-order federated algorithms in both convex and non-convex problems. Furthermore, we achieve similar performance to state-of-the-art algorithms for federated convex optimization that use exact gradients and Hessian matrices.
黑森矩阵传达了函数的曲率、频谱和偏导数等重要信息,在各种任务中都需要使用。然而,计算精确的黑森矩阵对于高维输入空间来说过于昂贵,而且在零阶优化中也是不可能的,因为在零阶优化中,目标函数是一个黑箱,只有输入输出对是已知的。在这项工作中,我们通过对文献中的一个黑森估计器进行严格分析,解决了这一相关问题,使其可以用作真正黑森矩阵的可证明精确替代物。黑森估计器是随机和增量的,其计算只需要点函数求值。我们提供了估计误差的非渐近收敛边界,并推导出了以任意高的概率达到预期精度所需的最小函数查询次数。在论文的第二部分,我们展示了我们成果的实际应用,介绍了一种适用于非凸和黑箱联合学习的新型优化算法。该算法只要求客户在特定输入点评估其局部函数,并以分布式方式建立足够精确的全局赫塞斯矩阵估计值。该算法利用非精确立方正则化来摆脱鞍点,并保证以最佳迭代复杂度和高概率收敛。数值结果表明,在凸问题和非凸问题上,所提出的算法都优于现有的零阶联合算法。此外,我们还取得了与使用精确梯度和黑森矩阵的最先进联合凸优化算法类似的性能。
{"title":"Novel Bounds for Incremental Hessian Estimation With Application to Zeroth-Order Federated Learning","authors":"Alessio Maritan;Luca Schenato;Subhrakanti Dey","doi":"10.1109/OJCSYS.2024.3388374","DOIUrl":"https://doi.org/10.1109/OJCSYS.2024.3388374","url":null,"abstract":"The Hessian matrix conveys important information about the curvature, spectrum and partial derivatives of a function, and is required in a variety of tasks. However, computing the exact Hessian is prohibitively expensive for high-dimensional input spaces, and is just impossible in zeroth-order optimization, where the objective function is a black-box of which only input-output pairs are known. In this work we address this relevant problem by providing a rigorous analysis of an Hessian estimator available in the literature, allowing it to be used as a provably accurate replacement of the true Hessian matrix. The Hessian estimator is randomized and incremental, and its computation requires only point function evaluations. We provide non-asymptotic convergence bounds on the estimation error and derive the minimum number of function queries needed to achieve a desired accuracy with arbitrarily high probability. In the second part of the paper we show a practical application of our results, introducing a novel optimization algorithm suitable for non-convex and black-box federated learning. The algorithm only requires clients to evaluate their local functions at certain input points, and builds a sufficiently accurate estimate of the global Hessian matrix in a distributed way. The algorithm exploits inexact cubic regularization to escape saddle points and guarantees convergence with optimal iteration complexity and high probability. Numerical results show that the proposed algorithm outperforms the existing zeroth-order federated algorithms in both convex and non-convex problems. Furthermore, we achieve similar performance to state-of-the-art algorithms for federated convex optimization that use exact gradients and Hessian matrices.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"173-189"},"PeriodicalIF":0.0,"publicationDate":"2024-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10499850","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140818786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations 从安全专家演示中学习鲁棒输出控制障碍函数
Pub Date : 2024-04-04 DOI: 10.1109/OJCSYS.2024.3385348
Lars Lindemann;Alexander Robey;Lejun Jiang;Satyajeet Das;Stephen Tu;Nikolai Matni
This paper addresses learning safe output feedback control laws from partial observations of expert demonstrations. We assume that a model of the system dynamics and a state estimator are available along with corresponding error bounds, e.g., estimated from data in practice. We first propose robust output control barrier functions (ROCBFs) as a means to guarantee safety, as defined through controlled forward invariance of a safe set. We then formulate an optimization problem to learn ROCBFs from expert demonstrations that exhibit safe system behavior, e.g., data collected from a human operator or an expert controller. When the parametrization of the ROCBF is linear, then we show that, under mild assumptions, the optimization problem is convex. Along with the optimization problem, we provide verifiable conditions in terms of the density of the data, smoothness of the system model and state estimator, and the size of the error bounds that guarantee validity of the obtained ROCBF. Towards obtaining a practical control algorithm, we propose an algorithmic implementation of our theoretical framework that accounts for assumptions made in our framework in practice. We validate our algorithm in the autonomous driving simulator CARLA and demonstrate how to learn safe control laws from simulated RGB camera images.
本文探讨了从专家示范的部分观测结果中学习安全输出反馈控制法的问题。我们假定系统动力学模型和状态估计器以及相应的误差边界(例如,从实际数据中估计的误差边界)是可用的。我们首先提出了鲁棒输出控制障碍函数(ROCBFs),作为保证安全性的一种手段,通过安全集的受控前向不变性来定义。然后,我们提出了一个优化问题,即从显示安全系统行为的专家演示(例如,从人类操作员或专家控制器收集的数据)中学习 ROCBFs。当 ROCBF 的参数化为线性时,我们将证明,在温和的假设条件下,优化问题是凸性的。除了优化问题,我们还提供了数据密度、系统模型和状态估计的平滑性以及误差边界大小等方面的可验证条件,以保证所获得的 ROCBF 的有效性。为了获得实用的控制算法,我们提出了理论框架的算法实现方法,在实践中考虑到了框架中的假设。我们在自动驾驶模拟器 CARLA 中验证了我们的算法,并演示了如何从模拟的 RGB 摄像头图像中学习安全控制法则。
{"title":"Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations","authors":"Lars Lindemann;Alexander Robey;Lejun Jiang;Satyajeet Das;Stephen Tu;Nikolai Matni","doi":"10.1109/OJCSYS.2024.3385348","DOIUrl":"https://doi.org/10.1109/OJCSYS.2024.3385348","url":null,"abstract":"This paper addresses learning safe output feedback control laws from partial observations of expert demonstrations. We assume that a model of the system dynamics and a state estimator are available along with corresponding error bounds, e.g., estimated from data in practice. We first propose robust output control barrier functions (ROCBFs) as a means to guarantee safety, as defined through controlled forward invariance of a safe set. We then formulate an optimization problem to learn ROCBFs from expert demonstrations that exhibit safe system behavior, e.g., data collected from a human operator or an expert controller. When the parametrization of the ROCBF is linear, then we show that, under mild assumptions, the optimization problem is convex. Along with the optimization problem, we provide verifiable conditions in terms of the density of the data, smoothness of the system model and state estimator, and the size of the error bounds that guarantee validity of the obtained ROCBF. Towards obtaining a practical control algorithm, we propose an algorithmic implementation of our theoretical framework that accounts for assumptions made in our framework in practice. We validate our algorithm in the autonomous driving simulator CARLA and demonstrate how to learn safe control laws from simulated RGB camera images.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"158-172"},"PeriodicalIF":0.0,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10491341","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140818787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
$mathcal {H}_{2}$- and $mathcal {H}_infty$-Optimal Model Predictive Controllers for Robust Legged Locomotion 用于稳健腿部运动的 $mathcal {H}_{2}$- 和 $mathcal {H}_infty$ 最佳模型预测控制器
Pub Date : 2024-03-31 DOI: 10.1109/OJCSYS.2024.3407999
Abhishek Pandala;Aaron D. Ames;Kaveh Akbari Hamed
This paper formally develops robust optimal predictive control solutions that can accommodate disturbances and stabilize periodic legged locomotion. To this end, we build upon existing optimization-based control paradigms, particularly quadratic programming (QP)-based model predictive controllers (MPCs). We present conditions under which the closed-loop reduced-order systems (i.e., template models) with MPC have the continuous differentiability property on an open neighborhood of gaits. We then linearize the resulting discrete-time, closed-loop nonlinear template system around the gait to obtain a linear time-varying (LTV) system. This periodic LTV system is further transformed into a linear system with a constant state-transition matrix using discrete-time Floquet transform. The system is then analyzed to accommodate parametric uncertainties and to synthesize robust optimal $mathcal {H}_{2}$ and $mathcal {H}_infty$ feedback controllers via linear matrix inequalities (LMIs). The paper then extends the theoretical results to the single rigid body (SRB) template dynamics and numerically verifies them. The proposed robust optimal predictive controllers are used in a layered control structure, where the optimal reduced-order trajectories are provided to a full-order nonlinear whole-body controller (WBC) for tracking at the low level. The developed layered controllers are numerically and experimentally validated for the robust locomotion of the A1 quadrupedal robot subject to various disturbances and uneven terrains. Our numerical results suggest that the $mathcal {H}_{2}$- and $mathcal {H}_infty$-optimal MPC controllers significantly improve the robust stability of the gaits compared to the normal MPC.
本文正式提出了稳健的最优预测控制解决方案,可适应干扰并稳定周期性的腿部运动。为此,我们借鉴了现有的基于优化的控制范式,特别是基于二次编程(QP)的模型预测控制器(MPC)。我们提出了使用 MPC 的闭环降阶系统(即模板模型)在步态的开放邻域上具有连续可微分特性的条件。然后,我们将所得到的围绕步态的离散时间闭环非线性模板系统线性化,得到一个线性时变(LTV)系统。利用离散时间 Floquet 变换,可将此周期性 LTV 系统进一步转换为具有恒定状态转换矩阵的线性系统。然后分析该系统以适应参数不确定性,并通过线性矩阵不等式(LMI)合成鲁棒的最优 $mathcal {H}_{2}$ 和 $mathcal {H}_infty$ 反馈控制器。然后,本文将理论结果扩展到单刚体(SRB)模板动力学,并对其进行了数值验证。所提出的鲁棒最优预测控制器被用于分层控制结构中,其中最优的降阶轨迹被提供给全阶非线性全身控制器 (WBC),用于低层次的跟踪。针对 A1 四足机器人在各种干扰和不平地形下的鲁棒运动,对所开发的分层控制器进行了数值和实验验证。数值结果表明,与普通 MPC 相比,$mathcal {H}_{2}$- 和 $mathcal {H}_infty$- 最佳 MPC 控制器显著提高了步态的鲁棒稳定性。
{"title":"$mathcal {H}_{2}$- and $mathcal {H}_infty$-Optimal Model Predictive Controllers for Robust Legged Locomotion","authors":"Abhishek Pandala;Aaron D. Ames;Kaveh Akbari Hamed","doi":"10.1109/OJCSYS.2024.3407999","DOIUrl":"https://doi.org/10.1109/OJCSYS.2024.3407999","url":null,"abstract":"This paper formally develops robust optimal predictive control solutions that can accommodate disturbances and stabilize periodic legged locomotion. To this end, we build upon existing optimization-based control paradigms, particularly quadratic programming (QP)-based model predictive controllers (MPCs). We present conditions under which the closed-loop reduced-order systems (i.e., template models) with MPC have the continuous differentiability property on an open neighborhood of gaits. We then linearize the resulting discrete-time, closed-loop nonlinear template system around the gait to obtain a linear time-varying (LTV) system. This periodic LTV system is further transformed into a linear system with a constant state-transition matrix using discrete-time Floquet transform. The system is then analyzed to accommodate parametric uncertainties and to synthesize robust optimal \u0000<inline-formula><tex-math>$mathcal {H}_{2}$</tex-math></inline-formula>\u0000 and \u0000<inline-formula><tex-math>$mathcal {H}_infty$</tex-math></inline-formula>\u0000 feedback controllers via linear matrix inequalities (LMIs). The paper then extends the theoretical results to the single rigid body (SRB) template dynamics and numerically verifies them. The proposed robust optimal predictive controllers are used in a layered control structure, where the optimal reduced-order trajectories are provided to a full-order nonlinear whole-body controller (WBC) for tracking at the low level. The developed layered controllers are numerically and experimentally validated for the robust locomotion of the A1 quadrupedal robot subject to various disturbances and uneven terrains. Our numerical results suggest that the \u0000<inline-formula><tex-math>$mathcal {H}_{2}$</tex-math></inline-formula>\u0000- and \u0000<inline-formula><tex-math>$mathcal {H}_infty$</tex-math></inline-formula>\u0000-optimal MPC controllers significantly improve the robust stability of the gaits compared to the normal MPC.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"225-238"},"PeriodicalIF":0.0,"publicationDate":"2024-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10543084","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141315179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Efficient Solution to Optimal Motion Planning With Provable Safety and Convergence 具有可证明安全性和收敛性的最优运动规划高效解决方案
Pub Date : 2024-03-18 DOI: 10.1109/OJCSYS.2024.3378055
PANAGIOTIS ROUSSEAS;Charalampos Bechlioulis;Kostas Kyriakopoulos
An innovative solution to the optimal motion planning problem is presented in this work. A novel parametrized actor structure is proposed, which guarantees safe and convergent navigation by construction. Concurrently, an efficient scheme for optimizing a mixed state and energy cost function is formulated. The proposed method inherits the positive traits of continuous methods, while at the same time providing sub-optimal –but close to optimal– results significantly faster and in more complex workspaces than previous ones. The scheme is demonstrated to outperform established relevant methods, while at the same time being competitive w.r.t. execution time. Extensive simulations to validate the effectiveness of the method are presented, along with relevant technical proofs for safety and convergence.
本研究提出了优化运动规划问题的创新解决方案。本文提出了一种新颖的参数化角色结构,通过构造保证了安全和收敛的导航。同时,还提出了优化混合状态和能量成本函数的高效方案。所提出的方法继承了连续方法的积极特征,同时在更复杂的工作空间中,比以前的方法更快地提供次优但接近最优的结果。事实证明,该方案优于已有的相关方法,同时在执行时间上也具有竞争力。本文还介绍了大量仿真,以验证该方法的有效性,以及安全性和收敛性的相关技术证明。
{"title":"An Efficient Solution to Optimal Motion Planning With Provable Safety and Convergence","authors":"PANAGIOTIS ROUSSEAS;Charalampos Bechlioulis;Kostas Kyriakopoulos","doi":"10.1109/OJCSYS.2024.3378055","DOIUrl":"https://doi.org/10.1109/OJCSYS.2024.3378055","url":null,"abstract":"An innovative solution to the optimal motion planning problem is presented in this work. A novel parametrized actor structure is proposed, which guarantees safe and convergent navigation by construction. Concurrently, an efficient scheme for optimizing a mixed state and energy cost function is formulated. The proposed method inherits the positive traits of continuous methods, while at the same time providing sub-optimal –but close to optimal– results significantly faster and in more complex workspaces than previous ones. The scheme is demonstrated to outperform established relevant methods, while at the same time being competitive w.r.t. execution time. Extensive simulations to validate the effectiveness of the method are presented, along with relevant technical proofs for safety and convergence.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"143-157"},"PeriodicalIF":0.0,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10473133","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140345509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-Excited Dynamics of Discrete-Time Lur'e Systems With Affinely Constrained, Piecewise-C$^{1}$ Feedback Nonlinearities 具有 Affinely 约束、Piecewise-C$^{1}$ 反馈非线性的离散时间 Lur'e 系统的自激动力学
Pub Date : 2024-03-16 DOI: 10.1109/OJCSYS.2024.3402050
Juan A. Paredes;Omran Kouba;Dennis S. Bernstein
Self-excited systems (SES) arise in numerous applications, such as fluid-structure interaction, combustion, and biochemical systems. In support of system identification and digital control of SES, this paper analyzes discrete-time Lur'e systems with affinely constrained, piecewise-C$^{1}$ feedback nonlinearities. In particular, a novel feature of the discrete-time Lur'e system considered in this paper is the structural assumption that the linear dynamics possess a zero at 1. This assumption ensures that the Lur'e system have a unique equilibrium for each constant, exogenous input and prevents the system from having an additional equilibrium with a nontrivial domain of attraction. The main result provides sufficient conditions under which a discrete-time Lur'e system is self-excited in the sense that its response is 1) nonconvergent for almost all initial conditions, and 2) bounded for all initial conditions. Sufficient conditions for 1) include the instability and nonsingularity of the linearized, closed-loop dynamics at the unique equilibrium and their nonsingularity almost everywhere. Sufficient conditions for 2) include asymptotic stability of the linear dynamics of the Lur'e system and their feedback interconnection with linear mappings that correspond to the affine constraints that bound the nonlinearity, as well as the feasibility of a linear matrix inequality.
自激系统(SES)出现在流固耦合、燃烧和生化系统等众多应用中。为了支持 SES 的系统识别和数字控制,本文分析了具有仿射约束、片断 C$^{1}$ 反馈非线性的离散时间 Lur'e 系统。特别是,本文所考虑的离散时间 Lur'e 系统的一个新特征是线性动力学在 1 处有零点的结构假设。这一假设确保 Lur'e 系统对每个恒定的外生输入都有一个唯一的平衡,并防止系统有一个额外的具有非小吸引域的平衡。主要结果提供了离散时间 Lur'e 系统自激的充分条件,在此条件下,该系统的响应 1)对于几乎所有初始条件都是非收敛的;2)对于所有初始条件都是有界的。1)的充分条件包括线性化闭环动力学在唯一平衡点的不稳定性和非奇异性,以及几乎所有地方的非奇异性。2) 的充分条件包括 Lur'e 系统线性动力学的渐近稳定性及其与约束非线性的仿射约束相对应的线性映射的反馈互连,以及线性矩阵不等式的可行性。
{"title":"Self-Excited Dynamics of Discrete-Time Lur'e Systems With Affinely Constrained, Piecewise-C$^{1}$ Feedback Nonlinearities","authors":"Juan A. Paredes;Omran Kouba;Dennis S. Bernstein","doi":"10.1109/OJCSYS.2024.3402050","DOIUrl":"https://doi.org/10.1109/OJCSYS.2024.3402050","url":null,"abstract":"Self-excited systems (SES) arise in numerous applications, such as fluid-structure interaction, combustion, and biochemical systems. In support of system identification and digital control of SES, this paper analyzes discrete-time Lur'e systems with affinely constrained, piecewise-C\u0000<inline-formula><tex-math>$^{1}$</tex-math></inline-formula>\u0000 feedback nonlinearities. In particular, a novel feature of the discrete-time Lur'e system considered in this paper is the structural assumption that the linear dynamics possess a zero at 1. This assumption ensures that the Lur'e system have a unique equilibrium for each constant, exogenous input and prevents the system from having an additional equilibrium with a nontrivial domain of attraction. The main result provides sufficient conditions under which a discrete-time Lur'e system is self-excited in the sense that its response is 1) nonconvergent for almost all initial conditions, and 2) bounded for all initial conditions. Sufficient conditions for 1) include the instability and nonsingularity of the linearized, closed-loop dynamics at the unique equilibrium and their nonsingularity almost everywhere. Sufficient conditions for 2) include asymptotic stability of the linear dynamics of the Lur'e system and their feedback interconnection with linear mappings that correspond to the affine constraints that bound the nonlinearity, as well as the feasibility of a linear matrix inequality.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"214-224"},"PeriodicalIF":0.0,"publicationDate":"2024-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10531633","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141304041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model-Free Change Point Detection for Mixing Processes 混合过程的无模型变化点检测
Pub Date : 2024-03-08 DOI: 10.1109/OJCSYS.2024.3398530
Hao Chen;Abhishek Gupta;Yin Sun;Ness Shroff
This paper considers the change point detection problem under dependent samples. In particular, we provide performance guarantees for the MMD-CUSUM test under exponentially $alpha$, $beta$, and fast $phi$-mixing processes, which significantly expands its utility beyond the i.i.d. and Markovian cases used in previous studies. We obtain lower bounds for average-run-length ($ {mathtt {ARL}}$) and upper bounds for average-detection-delay ($ {mathtt {ADD}}$) in terms of the threshold parameter. We show that the MMD-CUSUM test enjoys the same level of performance as the i.i.d. case under fast $phi$-mixing processes. The MMD-CUSUM test also achieves strong performance under exponentially $alpha$/$beta$-mixing processes, which are significantly more relaxed than existing results. The MMD-CUSUM test statistic adapts to different settings without modifications, rendering it a completely data-driven, dependence-agnostic change point detection scheme. Numerical simulations are provided at the end to evaluate our findings.
本文考虑了依赖样本下的变化点检测问题。特别是,我们为指数$alpha$、$beta$和快速$phi$混合过程下的 MMD-CUSUM 检验提供了性能保证,这大大扩展了它的用途,使其超越了以往研究中使用的 i.i.d. 和马尔可夫情况。我们得到了平均运行长度($ {mathtt {ARL}}$)的下限和平均检测延迟($ {mathtt {ADD}}$)的上限。我们证明,在快速 $phi$ 混合过程中,MMD-CUSUM 检验与 i.i.d. 检验具有相同的性能水平。在指数$alpha$/$beta$混合过程下,MMD-CUSUM 检验也取得了很好的性能,这比现有结果要宽松得多。MMD-CUSUM 检验统计量无需修改即可适应不同的设置,使其成为一种完全由数据驱动、与依赖性无关的变化点检测方案。最后还提供了数值模拟来评估我们的发现。
{"title":"Model-Free Change Point Detection for Mixing Processes","authors":"Hao Chen;Abhishek Gupta;Yin Sun;Ness Shroff","doi":"10.1109/OJCSYS.2024.3398530","DOIUrl":"https://doi.org/10.1109/OJCSYS.2024.3398530","url":null,"abstract":"This paper considers the change point detection problem under dependent samples. In particular, we provide performance guarantees for the MMD-CUSUM test under exponentially \u0000<inline-formula><tex-math>$alpha$</tex-math></inline-formula>\u0000, \u0000<inline-formula><tex-math>$beta$</tex-math></inline-formula>\u0000, and fast \u0000<inline-formula><tex-math>$phi$</tex-math></inline-formula>\u0000-mixing processes, which significantly expands its utility beyond the i.i.d. and Markovian cases used in previous studies. We obtain lower bounds for average-run-length (\u0000<inline-formula><tex-math>$ {mathtt {ARL}}$</tex-math></inline-formula>\u0000) and upper bounds for average-detection-delay (\u0000<inline-formula><tex-math>$ {mathtt {ADD}}$</tex-math></inline-formula>\u0000) in terms of the threshold parameter. We show that the MMD-CUSUM test enjoys the same level of performance as the i.i.d. case under fast \u0000<inline-formula><tex-math>$phi$</tex-math></inline-formula>\u0000-mixing processes. The MMD-CUSUM test also achieves strong performance under exponentially \u0000<inline-formula><tex-math>$alpha$</tex-math></inline-formula>\u0000/\u0000<inline-formula><tex-math>$beta$</tex-math></inline-formula>\u0000-mixing processes, which are significantly more relaxed than existing results. The MMD-CUSUM test statistic adapts to different settings without modifications, rendering it a completely data-driven, dependence-agnostic change point detection scheme. Numerical simulations are provided at the end to evaluate our findings.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"202-213"},"PeriodicalIF":0.0,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10522896","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141304042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Navigation Systems May Deteriorate Stability in Traffic Networks 导航系统可能会降低交通网络的稳定性
Pub Date : 2024-03-06 DOI: 10.1109/OJCSYS.2024.3397270
Gianluca Bianchin;Fabio Pasqualetti
Advanced traffic navigation systems, which provide routing recommendations to drivers based on real-time congestion information, are nowadays widely adopted by roadway transportation users. Yet, the emerging effects on the traffic dynamics originating from the widespread adoption of these technologies have remained largely unexplored until now. In this paper, we propose a dynamic model where drivers imitate the path preferences of previous drivers, and we study the properties of its equilibrium points. Our model is a dynamic generalization of the classical traffic assignment framework, and extends it by accounting for dynamics both in the path decision process and in the network's traffic flows. We show that, when travelers learn shortest paths by imitating other travelers, the overall traffic system benefits from this mechanism and transfers the maximum admissible amount of traffic demand. On the other hand, we demonstrate that, when the travel delay functions are not sufficiently steep or the rates at which drivers imitate previous travelers are not adequately chosen, the trajectories of the traffic system may fail to converge to an equilibrium point, thus compromising asymptotic stability. Illustrative numerical simulations combined with empirical data from highway sensors illustrate our findings.
先进的交通导航系统可根据实时拥堵信息向驾驶员提供路线建议,如今已被道路交通用户广泛采用。然而,迄今为止,这些工具的广泛应用对交通动态产生的新影响在很大程度上仍未得到探讨。在本文中,我们提出了一个动态模型,即驾驶员模仿先前驾驶员的路径偏好,并研究了其均衡点的特性。我们的模型是对经典交通分配框架的动态概括,并通过考虑路径决策过程和网络交通流的动态变化对其进行了扩展。我们的研究表明,当出行者通过模仿其他出行者学习最短路径时,整个交通系统会从这一机制中获益,并最大限度地转移可容许的交通需求量。另一方面,我们也证明了当出行延迟函数不够陡峭或驾驶者模仿前人的比率选择不当时,交通系统的轨迹可能无法收敛到均衡点,从而失去渐近稳定性。结合高速公路传感器的经验数据进行的数值模拟说明了我们的发现。
{"title":"Navigation Systems May Deteriorate Stability in Traffic Networks","authors":"Gianluca Bianchin;Fabio Pasqualetti","doi":"10.1109/OJCSYS.2024.3397270","DOIUrl":"10.1109/OJCSYS.2024.3397270","url":null,"abstract":"Advanced traffic navigation systems, which provide routing recommendations to drivers based on real-time congestion information, are nowadays widely adopted by roadway transportation users. Yet, the emerging effects on the traffic dynamics originating from the widespread adoption of these technologies have remained largely unexplored until now. In this paper, we propose a dynamic model where drivers imitate the path preferences of previous drivers, and we study the properties of its equilibrium points. Our model is a dynamic generalization of the classical \u0000<italic>traffic assignment framework</i>\u0000, and extends it by accounting for dynamics both in the path decision process and in the network's traffic flows. We show that, when travelers learn shortest paths by imitating other travelers, the overall traffic system benefits from this mechanism and transfers the maximum admissible amount of traffic demand. On the other hand, we demonstrate that, when the travel delay functions are not sufficiently steep or the rates at which drivers imitate previous travelers are not adequately chosen, the trajectories of the traffic system may fail to converge to an equilibrium point, thus compromising asymptotic stability. Illustrative numerical simulations combined with empirical data from highway sensors illustrate our findings.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"239-252"},"PeriodicalIF":0.0,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10520878","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141221343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonstationary Stochastic Bandits: UCB Policies and Minimax Regret 非平稳随机强盗:UCB 政策和最小遗憾
Pub Date : 2024-03-05 DOI: 10.1109/OJCSYS.2024.3372929
Lai Wei;Vaibhav Srivastava
We study the nonstationary stochastic Multi-Armed Bandit (MAB) problem in which the distributions of rewards associated with arms are assumed to be time-varying and the total variation in the expected rewards is subject to a variation budget. The regret of a policy is defined by the difference in the expected cumulative reward obtained using the policy and using an oracle that selects the arm with the maximum mean reward at each time. We characterize the performance of the proposed policies in terms of the worst-case regret, which is the supremum of the regret over the set of reward distribution sequences satisfying the variation budget. We design Upper-Confidence Bound (UCB)-based policies with three different approaches, namely, periodic resetting, sliding observation window, and discount factor, and show that they are order-optimal with respect to the minimax regret, i.e., the minimum worst-case regret achieved by any policy. We also relax the sub-Gaussian assumption on reward distributions and develop robust versions of the proposed policies that can handle heavy-tailed reward distributions and maintain their performance guarantees.
我们研究的是非平稳随机多臂匪徒(MAB)问题,其中假定与臂相关的奖励分布是时变的,并且预期奖励的总变化受制于变化预算。一个策略的遗憾度是指使用该策略与使用一个每次选择平均奖励最大的手臂的神谕所获得的预期累积奖励之差。我们用最坏情况下的遗憾值来描述所提策略的性能,即满足变化预算的奖励分布序列集合上遗憾值的上确值。我们设计了基于置信度上限(UCB)的策略,采用了三种不同的方法,即周期性重置、滑动观察窗口和贴现因子,并证明它们在最小遗憾(即任何策略都能达到的最小最坏情况遗憾)方面都是有序最优的。我们还放宽了奖赏分布的亚高斯假设,并开发了可处理重尾奖赏分布并保持其性能保证的鲁棒版本。
{"title":"Nonstationary Stochastic Bandits: UCB Policies and Minimax Regret","authors":"Lai Wei;Vaibhav Srivastava","doi":"10.1109/OJCSYS.2024.3372929","DOIUrl":"https://doi.org/10.1109/OJCSYS.2024.3372929","url":null,"abstract":"We study the nonstationary stochastic Multi-Armed Bandit (MAB) problem in which the distributions of rewards associated with arms are assumed to be time-varying and the total variation in the expected rewards is subject to a variation budget. The regret of a policy is defined by the difference in the expected cumulative reward obtained using the policy and using an oracle that selects the arm with the maximum mean reward at each time. We characterize the performance of the proposed policies in terms of the worst-case regret, which is the supremum of the regret over the set of reward distribution sequences satisfying the variation budget. We design Upper-Confidence Bound (UCB)-based policies with three different approaches, namely, periodic resetting, sliding observation window, and discount factor, and show that they are order-optimal with respect to the minimax regret, i.e., the minimum worst-case regret achieved by any policy. We also relax the sub-Gaussian assumption on reward distributions and develop robust versions of the proposed policies that can handle heavy-tailed reward distributions and maintain their performance guarantees.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"128-142"},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10460198","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140342701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Computationally-Efficient Data-Driven Safe Optimal Algorithm Through Control Merging 通过控制合并实现计算高效的数据驱动安全最优算法
Pub Date : 2024-02-22 DOI: 10.1109/OJCSYS.2024.3368850
Marjan Khaledi;Bahare Kiumarsi
This article presents a proactive approach to resolving the conflict between safety and optimality for continuous-time (CT) safety-critical systems with unknown dynamics. The presented method guarantees safety and performance specifications by combining two controllers: a safe controller and an optimal controller. On the one hand, the safe controller is designed using only input and state data measurements and without requiring the state derivative data, which are typically required in data-driven control of CT systems. State derivative measurement is costly, and its approximation introduces noise to the system. On the other hand, the optimal controller is learned using a low-complexity one-shot optimization problem, which again does not rely on prior knowledge of the system dynamics and state derivative data. Compared to existing optimal control learning methods for CT systems, which are typically iterative, a one-shot optimization is considerably more sample-efficient and computationally efficient. The share of optimal and safe controllers in the overall control policy is obtained by solving a computationally efficient optimization problem involving a scalar variable in a data-driven manner. It is shown that the contribution of the safe controller dominates that of the optimal controller when the system's state is close to the safety boundaries, and this domination drops as the system trajectories move away from the safety boundaries. In this case, the optimal controller contributes more to the overall controller. The feasibility and stability of the proposed controller are shown. Finally, the simulation results show the efficacy of the proposed approach.
本文提出了一种积极主动的方法,用于解决具有未知动态的连续时间(CT)安全关键型系统的安全性与最优性之间的冲突。本文提出的方法通过结合两个控制器(安全控制器和最优控制器)来保证安全性和性能指标。一方面,安全控制器的设计只需测量输入和状态数据,无需状态导数数据,而状态导数数据通常是 CT 系统数据驱动控制所必需的。状态导数测量的成本很高,而且其近似值会给系统带来噪声。另一方面,最优控制器是通过低复杂度的单次优化问题来学习的,这同样不依赖于系统动态和状态导数数据的先验知识。现有的 CT 系统最优控制学习方法通常是迭代式的,与之相比,一次优化的样本效率和计算效率要高得多。通过以数据驱动的方式求解一个涉及标量变量的计算高效的优化问题,可以获得最优和安全控制器在整体控制策略中的份额。结果表明,当系统状态接近安全边界时,安全控制器的贡献比最优控制器的贡献大,而当系统轨迹远离安全边界时,安全控制器的贡献就会下降。在这种情况下,最优控制器对整体控制器的贡献更大。仿真结果表明了拟议控制器的可行性和稳定性。最后,仿真结果表明了所提方法的有效性。
{"title":"A Computationally-Efficient Data-Driven Safe Optimal Algorithm Through Control Merging","authors":"Marjan Khaledi;Bahare Kiumarsi","doi":"10.1109/OJCSYS.2024.3368850","DOIUrl":"https://doi.org/10.1109/OJCSYS.2024.3368850","url":null,"abstract":"This article presents a proactive approach to resolving the conflict between safety and optimality for continuous-time (CT) safety-critical systems with unknown dynamics. The presented method guarantees safety and performance specifications by combining two controllers: a safe controller and an optimal controller. On the one hand, the safe controller is designed using only input and state data measurements and without requiring the state derivative data, which are typically required in data-driven control of CT systems. State derivative measurement is costly, and its approximation introduces noise to the system. On the other hand, the optimal controller is learned using a low-complexity one-shot optimization problem, which again does not rely on prior knowledge of the system dynamics and state derivative data. Compared to existing optimal control learning methods for CT systems, which are typically iterative, a one-shot optimization is considerably more sample-efficient and computationally efficient. The share of optimal and safe controllers in the overall control policy is obtained by solving a computationally efficient optimization problem involving a scalar variable in a data-driven manner. It is shown that the contribution of the safe controller dominates that of the optimal controller when the system's state is close to the safety boundaries, and this domination drops as the system trajectories move away from the safety boundaries. In this case, the optimal controller contributes more to the overall controller. The feasibility and stability of the proposed controller are shown. Finally, the simulation results show the efficacy of the proposed approach.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"118-127"},"PeriodicalIF":0.0,"publicationDate":"2024-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10443513","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140328981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multiplex Approach Against Disturbance Propagation in Nonlinear Networks With Delays 在有延迟的非线性网络中对抗干扰传播的多重方法
Pub Date : 2024-01-26 DOI: 10.1109/OJCSYS.2024.3359089
Shihao Xie;Giovanni Russo
We consider both leaderless and leader-follower, possibly nonlinear, networks affected by time-varying communication delays. For such systems, we give a set of sufficient conditions that guarantee the convergence of the network towards some desired behaviour while simultaneously ensuring the rejection of polynomial disturbances and the non-amplification of other classes of disturbances across the network. To fulfill these desired properties, and prove our main results, we propose the use of a control protocol that implements a multiplex architecture. The use of our results for control protocol design is then illustrated in the context of formation control. The protocols are validated both in-silico and via an experimental set-up with real robots. All experiments confirm the effectiveness of our approach.
我们考虑了受时变通信延迟影响的无领导者网络和领导者-追随者网络(可能是非线性网络)。对于此类系统,我们给出了一系列充分条件,以保证网络向某些期望行为收敛,同时确保拒绝多项式干扰,并确保其他类别的干扰不会在整个网络中放大。为了实现这些预期特性并证明我们的主要结果,我们建议使用一种实现多路复用架构的控制协议。然后,我们以编队控制为背景,说明了我们的结果在控制协议设计中的应用。这些协议既在内部进行了验证,也通过真实机器人的实验装置进行了验证。所有实验都证实了我们方法的有效性。
{"title":"A Multiplex Approach Against Disturbance Propagation in Nonlinear Networks With Delays","authors":"Shihao Xie;Giovanni Russo","doi":"10.1109/OJCSYS.2024.3359089","DOIUrl":"https://doi.org/10.1109/OJCSYS.2024.3359089","url":null,"abstract":"We consider both leaderless and leader-follower, possibly nonlinear, networks affected by time-varying communication delays. For such systems, we give a set of sufficient conditions that guarantee the convergence of the network towards some desired behaviour while simultaneously ensuring the rejection of polynomial disturbances and the non-amplification of other classes of disturbances across the network. To fulfill these desired properties, and prove our main results, we propose the use of a control protocol that implements a multiplex architecture. The use of our results for control protocol design is then illustrated in the context of formation control. The protocols are validated both in-silico and via an experimental set-up with real robots. All experiments confirm the effectiveness of our approach.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"87-101"},"PeriodicalIF":0.0,"publicationDate":"2024-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10415106","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139937154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE open journal of control systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1