首页 > 最新文献

IEEE open journal of control systems最新文献

英文 中文
Training Reflexes Using Adaptive Feedforward Control 使用自适应前馈控制训练反射
Pub Date : 2023-10-09 DOI: 10.1109/OJCSYS.2023.3322906
Erick Mejia Uzeda;Mireille E. Broucke
We consider the problem of mixed feedforward and feedback based disturbance rejection, where the feedforward measurement only provides a partial reconstruction of the disturbance. In doing so, we pose a new biologically relevant disturbance rejection problem which puts the role of feedforward measurements at the forefront. Based on the architecture of the human brain, we propose a design that utilizes an adaptive internal model operating on a fast timescale that, in turn, trains the correct feedforward gains on a slow timescale. As such, the training of reflexes in biological systems can be explained by leveraging the theory of adaptive feedforward control. It is proven that our design provides an arbitrary level of disturbance attenuation, and the benefits of using reflexes are illustrated via a multitude of simulations.
我们考虑了基于前馈和反馈的混合扰动抑制问题,其中前馈测量仅提供扰动的部分重建。在这样做的过程中,我们提出了一个新的与生物学相关的干扰抑制问题,该问题将前馈测量的作用放在首位。基于人脑的结构,我们提出了一种设计,该设计利用了在快速时间尺度上运行的自适应内部模型,进而在慢速时间尺度上训练正确的前馈增益。因此,生物系统中反射的训练可以通过利用自适应前馈控制的理论来解释。事实证明,我们的设计提供了任意水平的干扰衰减,并通过大量模拟说明了使用反射的好处。
{"title":"Training Reflexes Using Adaptive Feedforward Control","authors":"Erick Mejia Uzeda;Mireille E. Broucke","doi":"10.1109/OJCSYS.2023.3322906","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3322906","url":null,"abstract":"We consider the problem of mixed feedforward and feedback based disturbance rejection, where the feedforward measurement only provides a partial reconstruction of the disturbance. In doing so, we pose a new biologically relevant disturbance rejection problem which puts the role of feedforward measurements at the forefront. Based on the architecture of the human brain, we propose a design that utilizes an adaptive internal model operating on a fast timescale that, in turn, trains the correct feedforward gains on a slow timescale. As such, the training of reflexes in biological systems can be explained by leveraging the theory of adaptive feedforward control. It is proven that our design provides an arbitrary level of disturbance attenuation, and the benefits of using reflexes are illustrated via a multitude of simulations.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"396-409"},"PeriodicalIF":0.0,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9973428/10274847.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50376167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Adaptive Control for Singularly Perturbed Systems 奇异扰动系统的自适应控制
Pub Date : 2023-10-06 DOI: 10.1109/OJCSYS.2023.3322636
Kameron Eves;John Valasek
Singularly perturbed systems are a class of mathematical systems that are not well approximated by their limits and can be used to model plants with multiple fast and slow states. Multiple-timescale systems are very common in engineering applications, but adaptive control can be sensitive to timescale effects. Recently a method called [K]control of Adaptive Multiple-timescale Systems (KAMS) has shown improved performance and increased robustness for singularly perturbed systems, but it has only been studied on systems using adaptive control for the slow states. This article extends KAMS to the general case when adaptive control is used to stabilize both the slow and fast states simultaneously. This causes complex interactions between the fast state reference model and the manifold to which the fast states converge. It is proven that under certain conditions the system still converges to the reference model despite these complex interactions. This method is demonstrated on a nonlinear, nonstandard, numerical example.
奇异扰动系统是一类不能很好近似其极限的数学系统,可用于模拟具有多种快慢状态的植物。多时标系统在工程应用中非常常见,但自适应控制对时标效应非常敏感。最近,一种名为[K]自适应多时阶系统控制(KAMS)的方法显示,对于奇异扰动系统,KAMS 的性能有所改善,鲁棒性也有所提高,但该方法仅用于对慢速状态进行自适应控制的系统。本文将 KAMS 扩展到使用自适应控制同时稳定慢速和快速状态的一般情况。这将导致快速状态参考模型与快速状态收敛流形之间复杂的相互作用。事实证明,在某些条件下,尽管存在这些复杂的相互作用,系统仍能收敛到参考模型。该方法在一个非线性、非标准的数值示例中得到了验证。
{"title":"Adaptive Control for Singularly Perturbed Systems","authors":"Kameron Eves;John Valasek","doi":"10.1109/OJCSYS.2023.3322636","DOIUrl":"10.1109/OJCSYS.2023.3322636","url":null,"abstract":"Singularly perturbed systems are a class of mathematical systems that are not well approximated by their limits and can be used to model plants with multiple fast and slow states. Multiple-timescale systems are very common in engineering applications, but adaptive control can be sensitive to timescale effects. Recently a method called [K]control of Adaptive Multiple-timescale Systems (KAMS) has shown improved performance and increased robustness for singularly perturbed systems, but it has only been studied on systems using adaptive control for the slow states. This article extends KAMS to the general case when adaptive control is used to stabilize both the slow and fast states simultaneously. This causes complex interactions between the fast state reference model and the manifold to which the fast states converge. It is proven that under certain conditions the system still converges to the reference model despite these complex interactions. This method is demonstrated on a nonlinear, nonstandard, numerical example.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"1-13"},"PeriodicalIF":0.0,"publicationDate":"2023-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10273579","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136008098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-Driven Model Discrimination of Switched Nonlinear Systems With Temporal Logic Inference 具有时序逻辑推理的切换非线性系统的数据驱动模型判别
Pub Date : 2023-10-04 DOI: 10.1109/OJCSYS.2023.3322069
Zeyuan Jin;Nasim Baharisangari;Zhe Xu;Sze Zheng Yong
This article addresses the problem of data-driven model discrimination for unknown switched systems with unknown linear temporal logic (LTL) specifications, representing tasks, that govern their mode sequences, where only sampled data of the unknown dynamics and tasks are available. To tackle this problem, we propose data-driven methods to over-approximate the unknown dynamics and to infer the unknown specifications such that both set-membership models of the unknown dynamics and LTL formulas are guaranteed to include the ground truth model and specification/task. Moreover, we present an optimization-based algorithm for analyzing the distinguishability of a set of learned/inferred model-task pairs as well as a model discrimination algorithm for ruling out model-task pairs from this set that are inconsistent with new observations at run time. Further, we present an approach for reducing the size of inferred specifications to increase the computational efficiency of the model discrimination algorithms.
本文解决了具有未知线性时序逻辑(LTL)规范的未知切换系统的数据驱动模型判别问题,这些系统表示控制其模式序列的任务,其中只有未知动态和任务的采样数据可用。为了解决这个问题,我们提出了数据驱动的方法来过度逼近未知动态和推断未知规范,从而保证未知动态和LTL公式的集合隶属度模型都包含基本真值模型和规范/任务。此外,我们提出了一种基于优化的算法,用于分析一组学习/推断模型任务对的可区分性,以及一种模型判别算法,用于排除与运行时新观测结果不一致的模型任务对。此外,我们提出了一种减小推断规格大小的方法,以提高模型识别算法的计算效率。
{"title":"Data-Driven Model Discrimination of Switched Nonlinear Systems With Temporal Logic Inference","authors":"Zeyuan Jin;Nasim Baharisangari;Zhe Xu;Sze Zheng Yong","doi":"10.1109/OJCSYS.2023.3322069","DOIUrl":"10.1109/OJCSYS.2023.3322069","url":null,"abstract":"This article addresses the problem of data-driven model discrimination for unknown switched systems with unknown linear temporal logic (LTL) specifications, representing tasks, that govern their mode sequences, where only sampled data of the unknown dynamics and tasks are available. To tackle this problem, we propose data-driven methods to over-approximate the unknown dynamics and to infer the unknown specifications such that both set-membership models of the unknown dynamics and LTL formulas are guaranteed to include the ground truth model and specification/task. Moreover, we present an optimization-based algorithm for analyzing the distinguishability of a set of learned/inferred model-task pairs as well as a model discrimination algorithm for ruling out model-task pairs from this set that are inconsistent with new observations at run time. Further, we present an approach for reducing the size of inferred specifications to increase the computational efficiency of the model discrimination algorithms.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"410-424"},"PeriodicalIF":0.0,"publicationDate":"2023-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10271526","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135955069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Innovative Control Design Procedure for Under-Actuated Mechanical Systems: Emphasizing Potential Energy Shaping and Structural Preservation 一种创新的欠驱动机械系统控制设计程序:强调势能成形和结构保护
Pub Date : 2023-09-28 DOI: 10.1109/OJCSYS.2023.3320512
Babak Salamat;Abolfazl Yaghmaei;Gerhard Elsbacher;Andrea M. Tonello;Mohammad Javad Yazdanpanah
In this article, we propose a procedure to solve the controlled design for a class of under-actuated mechanical systems. Our proposed method can be viewed as a sub-method of the IDA-PBC or Controlled Lagrangian approaches, with a particular focus on shaping the potential energy. By emphasizing potential energy shaping, we can effectively tackle the bottleneck presented by the matching equation in these approaches. Moreover, our method leverages a suitable coordinate transformation that is inspired by the physics of the system, further enhancing its efficacy. Therefore, our design procedure is based on a coordinate transformation plus potential energy shaping in the new coordinates, and its existence and possibility of potential energy shaping can be verified via some algebraic calculations, making it constructive. To illustrate the results, we consider the cart-pole system and a recently introduced under-actuated mechanical system named swash mass pendulum (SMP) (Salamat and Tonello, 2021). The SMP consists of a pendulum made of a rigid shaft connected to a pair of cross-shafts where two swash masses can move under the action of servo-mechanisms.
在本文中,我们提出了一种求解一类欠驱动机械系统受控设计的程序。我们提出的方法可以被视为IDA-PBC或受控拉格朗日方法的一个子方法,特别关注势能的形成。通过强调势能塑造,我们可以有效地解决这些方法中匹配方程所带来的瓶颈。此外,我们的方法利用了受系统物理启发的适当坐标变换,进一步提高了其效率。因此,我们的设计程序是基于坐标变换加上新坐标下的势能成形,并且可以通过一些代数计算来验证其势能成形的存在性和可能性,使其具有建设性。为了说明结果,我们考虑了推车杆系统和最近引入的一种称为旋转质量摆(SMP)的欠驱动机械系统(Salamat和Tonello,2021)。SMP由一个由刚性轴制成的摆锤组成,刚性轴连接到一对横轴上,两个旋转斜盘质量可以在伺服机构的作用下移动。
{"title":"An Innovative Control Design Procedure for Under-Actuated Mechanical Systems: Emphasizing Potential Energy Shaping and Structural Preservation","authors":"Babak Salamat;Abolfazl Yaghmaei;Gerhard Elsbacher;Andrea M. Tonello;Mohammad Javad Yazdanpanah","doi":"10.1109/OJCSYS.2023.3320512","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3320512","url":null,"abstract":"In this article, we propose a procedure to solve the controlled design for a class of under-actuated mechanical systems. Our proposed method can be viewed as a sub-method of the IDA-PBC or Controlled Lagrangian approaches, with a particular focus on shaping the potential energy. By emphasizing potential energy shaping, we can effectively tackle the bottleneck presented by the matching equation in these approaches. Moreover, our method leverages a suitable coordinate transformation that is inspired by the physics of the system, further enhancing its efficacy. Therefore, our design procedure is based on a coordinate transformation plus potential energy shaping in the new coordinates, and its existence and possibility of potential energy shaping can be verified via some algebraic calculations, making it constructive. To illustrate the results, we consider the cart-pole system and a recently introduced under-actuated mechanical system named swash mass pendulum (SMP) (Salamat and Tonello, 2021). The SMP consists of a pendulum made of a rigid shaft connected to a pair of cross-shafts where two swash masses can move under the action of servo-mechanisms.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"356-365"},"PeriodicalIF":0.0,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9973428/10266689.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50376165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Global and Local Convergence Analysis of a Bandit Learning Algorithm in Merely Coherent Games 纯相干对策中Bandit学习算法的全局和局部收敛性分析
Pub Date : 2023-09-15 DOI: 10.1109/OJCSYS.2023.3316071
Yuanhanqing Huang;Jianghai Hu
Non-cooperative games serve as a powerful framework for capturing the interactions among self-interested players and have broad applicability in modeling a wide range of practical scenarios, ranging from power management to path planning of self-driving vehicles. Although most existing solution algorithms assume the availability of first-order information or full knowledge of the objectives and others' action profiles, there are situations where the only accessible information at players' disposal is the realized objective function values. In this article, we devise a bandit online learning algorithm that integrates the optimistic mirror descent scheme and multi-point pseudo-gradient estimates. We further prove that the generated actual sequence of play converges a.s. to a critical point if the game under study is globally merely coherent, without resorting to extra Tikhonov regularization terms or additional norm conditions. We also discuss the convergence properties of the proposed bandit learning algorithm in locally merely coherent games. Finally, we illustrate the validity of the proposed algorithm via two two-player minimax problems and a cognitive radio bandwidth allocation game.
非合作游戏是捕捉自利玩家之间互动的强大框架,在建模从电源管理到自动驾驶汽车路径规划的各种实际场景方面具有广泛的适用性。尽管大多数现有的解决方案算法假设一阶信息的可用性或对目标和其他人的行动概况的充分了解,但在某些情况下,玩家唯一可获得的信息是已实现的目标函数值。在本文中,我们设计了一种结合乐观镜像下降方案和多点伪梯度估计的土匪在线学习算法。我们进一步证明,如果所研究的博弈在全局上仅仅是一致的,则生成的实际博弈序列收敛到一个临界点,而不诉诸于额外的Tikhonov正则化项或额外的范数条件。我们还讨论了所提出的土匪学习算法在局部纯相干对策中的收敛性。最后,我们通过两个两人极小极大问题和一个认知无线电带宽分配博弈来说明所提出算法的有效性。
{"title":"Global and Local Convergence Analysis of a Bandit Learning Algorithm in Merely Coherent Games","authors":"Yuanhanqing Huang;Jianghai Hu","doi":"10.1109/OJCSYS.2023.3316071","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3316071","url":null,"abstract":"Non-cooperative games serve as a powerful framework for capturing the interactions among self-interested players and have broad applicability in modeling a wide range of practical scenarios, ranging from power management to path planning of self-driving vehicles. Although most existing solution algorithms assume the availability of first-order information or full knowledge of the objectives and others' action profiles, there are situations where the only accessible information at players' disposal is the realized objective function values. In this article, we devise a bandit online learning algorithm that integrates the optimistic mirror descent scheme and multi-point pseudo-gradient estimates. We further prove that the generated actual sequence of play converges a.s. to a critical point if the game under study is globally merely coherent, without resorting to extra Tikhonov regularization terms or additional norm conditions. We also discuss the convergence properties of the proposed bandit learning algorithm in locally merely coherent games. Finally, we illustrate the validity of the proposed algorithm via two two-player minimax problems and a cognitive radio bandwidth allocation game.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"366-379"},"PeriodicalIF":0.0,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9973428/10251984.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50226403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constrained Environment Optimization for Prioritized Multi-Agent Navigation 多智能体优先导航的约束环境优化
Pub Date : 2023-09-15 DOI: 10.1109/OJCSYS.2023.3316090
Zhan Gao;Amanda Prorok
Traditional approaches for multi-agent navigation consider the environment as a fixed constraint, despite the obvious influence of spatial constraints on agents' performance. Yet hand-designing conducive environments is inefficient and potentially expensive. The goal of this article is to consider the obstacle layout of the environment as a decision variable in a system-level optimization problem. In other words, we aim to find an automated solution that optimizes the obstacle layout to improve the performance of multi-agent navigation, under a variety of realistic constraints. Towards this end, we propose novel problems of unprioritized and prioritized environment optimization, where the former considers agents unbiasedly and the latter incorporates agent priorities into optimization. We show, through formal proofs, under which conditions the environment can change to guarantee completeness (i.e., all agents reach goals), and analyze the role of agent priorities in the environment optimization. We proceed to impose constraints on the environment optimization that correspond to real-world restrictions on obstacle changes, and formulate it mathematically as a constrained stochastic optimization problem. Since the relationship between agents, environment and performance is challenging to model, we leverage reinforcement learning to develop a model-free solution and a primal-dual mechanism to handle constraints. Distinct information processing architectures are integrated for various implementation scenarios, including online/offline optimization and discrete/continuous environment. Numerical results corroborate the theory and demonstrate the validity and adaptability of our approach.
传统的多智能体导航方法将环境视为一个固定的约束,尽管空间约束对智能体的性能有明显的影响。然而,手工设计有利的环境效率低下,而且可能成本高昂。本文的目标是将环境的障碍物布局视为系统级优化问题中的决策变量。换句话说,我们的目标是找到一种自动化解决方案,在各种现实约束下优化障碍物布局,以提高多智能体导航的性能。为此,我们提出了新的无优先级和有优先级的环境优化问题,其中前者无约束地考虑代理,而后者将代理优先级纳入优化中。通过形式化证明,我们展示了在什么条件下环境可以改变以保证完整性(即所有代理都达到目标),并分析了代理优先级在环境优化中的作用。我们继续对环境优化施加与现实世界中对障碍物变化的限制相对应的约束,并将其数学公式化为一个受约束的随机优化问题。由于代理、环境和性能之间的关系很难建模,我们利用强化学习来开发无模型解决方案和处理约束的原对偶机制。不同的信息处理架构集成在各种实现场景中,包括在线/离线优化和离散/连续环境。数值结果证实了该理论,并证明了该方法的有效性和适应性。
{"title":"Constrained Environment Optimization for Prioritized Multi-Agent Navigation","authors":"Zhan Gao;Amanda Prorok","doi":"10.1109/OJCSYS.2023.3316090","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3316090","url":null,"abstract":"Traditional approaches for multi-agent navigation consider the environment as a fixed constraint, despite the obvious influence of spatial constraints on agents' performance. Yet hand-designing conducive environments is inefficient and potentially expensive. The goal of this article is to consider the obstacle layout of the environment as a decision variable in a system-level optimization problem. In other words, we aim to find an automated solution that optimizes the obstacle layout to improve the performance of multi-agent navigation, under a variety of realistic constraints. Towards this end, we propose novel problems of \u0000<italic>unprioritized</i>\u0000 and \u0000<italic>prioritized environment optimization</i>\u0000, where the former considers agents unbiasedly and the latter incorporates agent priorities into optimization. We show, through formal proofs, under which conditions the environment can change to guarantee completeness (i.e., all agents reach goals), and analyze the role of agent priorities in the environment optimization. We proceed to impose constraints on the environment optimization that correspond to real-world restrictions on obstacle changes, and formulate it mathematically as a constrained stochastic optimization problem. Since the relationship between agents, environment and performance is challenging to model, we leverage reinforcement learning to develop a model-free solution and a primal-dual mechanism to handle constraints. Distinct information processing architectures are integrated for various implementation scenarios, including online/offline optimization and discrete/continuous environment. Numerical results corroborate the theory and demonstrate the validity and adaptability of our approach.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"337-355"},"PeriodicalIF":0.0,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9973428/10251921.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50376164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Low-Rank Gradient Descent 低阶梯度下降
Pub Date : 2023-09-13 DOI: 10.1109/OJCSYS.2023.3315088
Romain Cosson;Ali Jadbabaie;Anuran Makur;Amirhossein Reisizadeh;Devavrat Shah
Several recent empirical studies demonstrate that important machine learning tasks such as training deep neural networks, exhibit a low-rank structure, where most of the variation in the loss function occurs only in a few directions of the input space. In this article, we leverage such low-rank structure to reduce the high computational cost of canonical gradient-based methods such as gradient descent (GD). Our proposed Low-Rank Gradient Descent (LRGD) algorithm finds an $epsilon$-approximate stationary point of a $p$-dimensional function by first identifying $r leq p$ significant directions, and then estimating the true $p$-dimensional gradient at every iteration by computing directional derivatives only along those $r$ directions. We establish that the “directional oracle complexities” of LRGD for strongly convex and non-convex objective functions are ${mathcal {O}}(r log (1/epsilon) + rp)$ and ${mathcal {O}}(r/epsilon ^{2} + rp)$, respectively. Therefore, when $r ll p$, LRGD provides significant improvement over the known complexities of ${mathcal {O}}(p log (1/epsilon))$ and ${mathcal {O}}(p/epsilon ^{2})$ of GD in the strongly convex and non-convex settings, respectively. Furthermore, we formally characterize the classes of exactly and approximately low-rank functions. Empirically, using real and synthetic data, LRGD provides significant gains over GD when the data has low-rank structure, and in the absence of such structure, LRGD does not degrade performance compared to GD. This suggests that LRGD could be used in practice in any setting in place of GD.
最近的几项实证研究表明,重要的机器学习任务,如训练深度神经网络,表现出低秩结构,其中损失函数的大部分变化仅发生在输入空间的几个方向上。在本文中,我们利用这种低秩结构来降低基于正则梯度的方法(如梯度下降(GD))的高计算成本。我们提出的低阶梯度下降(LRGD)算法通过首先识别$rleq-p$有效方向,然后通过仅沿$r$方向计算方向导数来估计每次迭代时的真实$p$维梯度,从而找到$p$维度函数的$epsilon$近似平稳点。我们确定了强凸和非凸目标函数的LRGD的“方向预言复杂性”分别为${mathcal{O}}(rlog(1/epsilon)+rp)$和${ mathcal{O}}(r/epsilon^{2}+rp)$。因此,当$rll p$时,LRGD在强凸和非凸设置中分别提供了对GD的已知复杂性${mathcal{O}}(plog(1/epsilon))$和${ mathcal{O}}(p/epsilon^{2})$的显著改进。此外,我们形式化地刻画了精确和近似低秩函数的类。从经验上讲,使用真实和合成数据,当数据具有低秩结构时,LRGD比GD提供了显著的增益,并且在没有这种结构的情况下,与GD相比,LRGD不会降低性能。这表明LRGD可以在任何情况下代替GD在实践中使用。
{"title":"Low-Rank Gradient Descent","authors":"Romain Cosson;Ali Jadbabaie;Anuran Makur;Amirhossein Reisizadeh;Devavrat Shah","doi":"10.1109/OJCSYS.2023.3315088","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3315088","url":null,"abstract":"Several recent empirical studies demonstrate that important machine learning tasks such as training deep neural networks, exhibit a low-rank structure, where most of the variation in the loss function occurs only in a few directions of the input space. In this article, we leverage such low-rank structure to reduce the high computational cost of canonical gradient-based methods such as gradient descent (\u0000<monospace>GD</monospace>\u0000). Our proposed \u0000<italic>Low-Rank Gradient Descent</i>\u0000 (\u0000<monospace>LRGD</monospace>\u0000) algorithm finds an \u0000<inline-formula><tex-math>$epsilon$</tex-math></inline-formula>\u0000-approximate stationary point of a \u0000<inline-formula><tex-math>$p$</tex-math></inline-formula>\u0000-dimensional function by first identifying \u0000<inline-formula><tex-math>$r leq p$</tex-math></inline-formula>\u0000 significant directions, and then estimating the true \u0000<inline-formula><tex-math>$p$</tex-math></inline-formula>\u0000-dimensional gradient at every iteration by computing directional derivatives only along those \u0000<inline-formula><tex-math>$r$</tex-math></inline-formula>\u0000 directions. We establish that the “directional oracle complexities” of \u0000<monospace>LRGD</monospace>\u0000 for strongly convex and non-convex objective functions are \u0000<inline-formula><tex-math>${mathcal {O}}(r log (1/epsilon) + rp)$</tex-math></inline-formula>\u0000 and \u0000<inline-formula><tex-math>${mathcal {O}}(r/epsilon ^{2} + rp)$</tex-math></inline-formula>\u0000, respectively. Therefore, when \u0000<inline-formula><tex-math>$r ll p$</tex-math></inline-formula>\u0000, \u0000<monospace>LRGD</monospace>\u0000 provides significant improvement over the known complexities of \u0000<inline-formula><tex-math>${mathcal {O}}(p log (1/epsilon))$</tex-math></inline-formula>\u0000 and \u0000<inline-formula><tex-math>${mathcal {O}}(p/epsilon ^{2})$</tex-math></inline-formula>\u0000 of \u0000<monospace>GD</monospace>\u0000 in the strongly convex and non-convex settings, respectively. Furthermore, we formally characterize the classes of exactly and approximately low-rank functions. Empirically, using real and synthetic data, \u0000<monospace>LRGD</monospace>\u0000 provides significant gains over \u0000<monospace>GD</monospace>\u0000 when the data has low-rank structure, and in the absence of such structure, \u0000<monospace>LRGD</monospace>\u0000 does not degrade performance compared to \u0000<monospace>GD</monospace>\u0000. This suggests that \u0000<monospace>LRGD</monospace>\u0000 could be used in practice in any setting in place of \u0000<monospace>GD</monospace>\u0000.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"380-395"},"PeriodicalIF":0.0,"publicationDate":"2023-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9973428/10250907.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50226404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Global Performance Guarantees for Localized Model Predictive Control 局部模型预测控制的全局性能保证
Pub Date : 2023-08-24 DOI: 10.1109/OJCSYS.2023.3308009
Jing Shuang Li;Carmen Amo Alonso
Recent advances in model predictive control (MPC) leverage local communication constraints to produce localized MPC algorithms whose complexities scale independently of total network size. However, no characterization is available regarding global performance, i.e. whether localized MPC (with communication constraints) performs just as well as global MPC (no communication constraints). In this paper, we provide analysis and guarantees on global performance of localized MPC — in particular, we derive sufficient conditions for optimal global performance in the presence of local communication constraints. We also present an algorithm to determine the communication structure for a given system that will preserve performance while minimizing computational complexity. The effectiveness of the algorithm is verified in simulations, and additional relationships between network properties and performance-preserving communication constraints are characterized. A striking finding is that in a network of 121 coupled pendula, each node only needs to communicate with its immediate neighbors to preserve optimal global performance. Overall, this work offers theoretical understanding on the effect of local communication on global performance, and provides practitioners with the tools necessary to deploy localized model predictive control by establishing a rigorous method of selecting local communication constraints. This work also demonstrates — surprisingly — that the inclusion of severe communication constraints need not compromise global performance.
模型预测控制(MPC)的最新进展利用局部通信约束来产生局部MPC算法,其复杂性与总网络大小无关。然而,没有关于全局性能的特征,即局部MPC(具有通信约束)是否与全局MPC(没有通信约束)一样好。在本文中,我们对局部MPC的全局性能进行了分析和保证——特别是,我们推导了在存在局部通信约束的情况下最优全局性能的充分条件。我们还提出了一种算法来确定给定系统的通信结构,该算法将在最小化计算复杂性的同时保持性能。仿真验证了该算法的有效性,并描述了网络特性和性能保持通信约束之间的额外关系。一个惊人的发现是,在一个由121个耦合钟摆组成的网络中,每个节点只需要与其近邻通信,就可以保持最佳的全局性能。总的来说,这项工作为局部通信对全局性能的影响提供了理论理解,并通过建立严格的局部通信约束选择方法,为从业者提供了部署局部模型预测控制所需的工具。令人惊讶的是,这项工作还表明,包含严格的通信约束不需要损害全局性能。
{"title":"Global Performance Guarantees for Localized Model Predictive Control","authors":"Jing Shuang Li;Carmen Amo Alonso","doi":"10.1109/OJCSYS.2023.3308009","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3308009","url":null,"abstract":"Recent advances in model predictive control (MPC) leverage local communication constraints to produce localized MPC algorithms whose complexities scale independently of total network size. However, no characterization is available regarding global performance, i.e. whether localized MPC (with communication constraints) performs just as well as global MPC (no communication constraints). In this paper, we provide analysis and guarantees on global performance of localized MPC — in particular, we derive sufficient conditions for optimal global performance in the presence of local communication constraints. We also present an algorithm to determine the communication structure for a given system that will preserve performance while minimizing computational complexity. The effectiveness of the algorithm is verified in simulations, and additional relationships between network properties and performance-preserving communication constraints are characterized. A striking finding is that in a network of 121 coupled pendula, each node only needs to communicate with its immediate neighbors to preserve optimal global performance. Overall, this work offers theoretical understanding on the effect of local communication on global performance, and provides practitioners with the tools necessary to deploy localized model predictive control by establishing a rigorous method of selecting local communication constraints. This work also demonstrates — surprisingly — that the inclusion of severe communication constraints need not compromise global performance.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"325-336"},"PeriodicalIF":0.0,"publicationDate":"2023-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9973428/10229197.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50226364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk-Based Security Measure Allocation Against Actuator Attacks 针对执行器攻击的基于风险的安全措施分配
Pub Date : 2023-08-16 DOI: 10.1109/OJCSYS.2023.3305831
Sribalaji C. Anand;André M. H. Teixeira
This article considers the problem of risk-optimal allocation of security measures when the actuators of an uncertain control system are under attack. We consider an adversary injecting false data into the actuator channels. The attack impact is characterized by the maximum performance loss caused by a stealthy adversary with bounded energy. Since the impact is a random variable, due to system uncertainty, we use Conditional Value-at-Risk (CVaR) to characterize the risk associated with the attack. We then consider the problem of allocating security measures to the set of actuators to minimize the risk. We assume that there are only a limited number of security measures available. Under this constraint, we observe that the allocation problem is a mixed-integer optimization problem. Thus we use relaxation techniques to approximate the security allocation problem into a Semi-Definite Program (SDP). We also compare our allocation method $(i)$ across different risk measures: the worst-case measure, the average (nominal) measure, and $(ii)$ across different search algorithms: the exhaustive and the greedy search algorithms. We depict the efficacy of our approach through numerical examples.
本文研究了不确定控制系统执行器受到攻击时安全措施的风险最优分配问题。我们考虑对手将虚假数据注入致动器通道。攻击冲击的特点是由具有有限能量的隐形对手造成的最大性能损失。由于影响是一个随机变量,由于系统的不确定性,我们使用条件风险值(CVaR)来表征与攻击相关的风险。然后,我们考虑将安全措施分配给执行器集以将风险降至最低的问题。我们认为,可用的安全措施数量有限。在这种约束条件下,我们观察到分配问题是一个混合整数优化问题。因此,我们使用松弛技术将安全分配问题近似为半定程序(SDP)。我们还比较了不同风险度量的分配方法$(i)$:最坏情况度量、平均(名义)度量,以及不同搜索算法的分配方法:穷举搜索算法和贪婪搜索算法。我们通过数值例子描述了我们的方法的有效性。
{"title":"Risk-Based Security Measure Allocation Against Actuator Attacks","authors":"Sribalaji C. Anand;André M. H. Teixeira","doi":"10.1109/OJCSYS.2023.3305831","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3305831","url":null,"abstract":"This article considers the problem of risk-optimal allocation of security measures when the actuators of an uncertain control system are under attack. We consider an adversary injecting false data into the actuator channels. The attack impact is characterized by the maximum performance loss caused by a stealthy adversary with bounded energy. Since the impact is a random variable, due to system uncertainty, we use Conditional Value-at-Risk (CVaR) to characterize the risk associated with the attack. We then consider the problem of allocating security measures to the set of actuators to minimize the risk. We assume that there are only a limited number of security measures available. Under this constraint, we observe that the allocation problem is a mixed-integer optimization problem. Thus we use relaxation techniques to approximate the security allocation problem into a Semi-Definite Program (SDP). We also compare our allocation method \u0000<inline-formula><tex-math>$(i)$</tex-math></inline-formula>\u0000 across different risk measures: the worst-case measure, the average (nominal) measure, and \u0000<inline-formula><tex-math>$(ii)$</tex-math></inline-formula>\u0000 across different search algorithms: the exhaustive and the greedy search algorithms. We depict the efficacy of our approach through numerical examples.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"297-309"},"PeriodicalIF":0.0,"publicationDate":"2023-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9973428/10221684.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50226363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Context-Triggered Abstraction-Based Control Design 基于上下文触发的抽象控制设计
Pub Date : 2023-08-16 DOI: 10.1109/OJCSYS.2023.3305835
Satya Prakash Nayak;Lucas N. Egidio;Matteo Della Rossa;Anne-Kathrin Schmuck;Raphael M. Jungers
We consider the problem of automatically synthesizing a hybrid controller for non-linear dynamical systems which ensures that the closed-loop fulfills an arbitrary Linear Temporal Logic specification. Moreover, the specification may take into account logical context switches induced by an external environment or the system itself. Finally, we want to avoid classical brute-force time- and space-discretization for scalability. We achieve these goals by a novel two-layer strategy synthesis approach, where the controller generated in the lower layer provides invariant sets and basins of attraction, which are exploited at the upper logical layer in an abstract way. In order to achieve this, we provide new techniques for both the upper- and lower-level synthesis. Our new methodology allows to leverage both the computing power of state space control techniques and the intelligence of finite game solving for complex specifications, in a scalable way.
我们考虑了非线性动力系统的混合控制器的自动合成问题,该控制器确保闭环满足任意线性时序逻辑规范。此外,该规范可以考虑由外部环境或系统本身引起的逻辑上下文切换。最后,为了可伸缩性,我们希望避免经典的暴力时间和空间离散化。我们通过一种新的两层策略综合方法来实现这些目标,其中在较低层生成的控制器提供不变集和吸引池,在较高的逻辑层以抽象的方式利用它们。为了实现这一点,我们为上层和下层合成提供了新的技术。我们的新方法允许以可扩展的方式利用状态空间控制技术的计算能力和复杂规范的有限对策求解的智能。
{"title":"Context-Triggered Abstraction-Based Control Design","authors":"Satya Prakash Nayak;Lucas N. Egidio;Matteo Della Rossa;Anne-Kathrin Schmuck;Raphael M. Jungers","doi":"10.1109/OJCSYS.2023.3305835","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3305835","url":null,"abstract":"We consider the problem of automatically synthesizing a hybrid controller for non-linear dynamical systems which ensures that the closed-loop fulfills an arbitrary \u0000<italic>Linear Temporal Logic</i>\u0000 specification. Moreover, the specification may take into account logical context switches induced by an external environment or the system itself. Finally, we want to avoid classical brute-force time- and space-discretization for scalability. We achieve these goals by a novel two-layer strategy synthesis approach, where the controller generated in the lower layer provides invariant sets and basins of attraction, which are exploited at the upper logical layer in an abstract way. In order to achieve this, we provide new techniques for both the upper- and lower-level synthesis. Our new methodology allows to leverage both the computing power of state space control techniques and the intelligence of finite game solving for complex specifications, in a scalable way.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"277-296"},"PeriodicalIF":0.0,"publicationDate":"2023-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9973428/10221705.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50226362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE open journal of control systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1