首页 > 最新文献

IEEE open journal of control systems最新文献

英文 中文
Exact Decomposition of Optimal Control Problems via Simultaneous Block Diagonalization of Matrices 最优控制问题的矩阵同时块对角化的精确分解
Pub Date : 2022-12-22 DOI: 10.1109/OJCSYS.2022.3231553
Amirhossein Nazerian;Kshitij Bhatta;Francesco Sorrentino
In this paper, we consider optimal control problems (OCPs) applied to large-scale linear dynamical systems with a large number of states and inputs. We attempt to reduce such problems into a set of independent OCPs of lower dimensions. Our decomposition is ‘exact’ in the sense that it preserves all the information about the original system and the objective function. Previous work in this area has focused on strategies that exploit symmetries of the underlying system and of the objective function. Here, instead, we implement the algebraic method of simultaneous block diagonalization of matrices (SBD), which we show provides advantages both in terms of the dimension of the subproblems that are obtained and of the computation time. We provide practical examples with networked systems that demonstrate the benefits of applying the SBD decomposition over the decomposition method based on group symmetries.
本文研究了具有大量状态和输入的大型线性动力系统的最优控制问题。我们试图将这些问题简化为一组较低维度的独立OCP。我们的分解是“精确的”,因为它保留了关于原始系统和目标函数的所有信息。以前在这一领域的工作集中在利用底层系统和目标函数对称性的策略上。相反,在这里,我们实现了矩阵的同时块对角化(SBD)的代数方法,我们证明了该方法在所获得的子问题的维数和计算时间方面都具有优势。我们提供了网络系统的实际例子,证明了应用SBD分解相对于基于群对称性的分解方法的好处。
{"title":"Exact Decomposition of Optimal Control Problems via Simultaneous Block Diagonalization of Matrices","authors":"Amirhossein Nazerian;Kshitij Bhatta;Francesco Sorrentino","doi":"10.1109/OJCSYS.2022.3231553","DOIUrl":"10.1109/OJCSYS.2022.3231553","url":null,"abstract":"In this paper, we consider optimal control problems (OCPs) applied to large-scale linear dynamical systems with a large number of states and inputs. We attempt to reduce such problems into a set of independent OCPs of lower dimensions. Our decomposition is ‘exact’ in the sense that it preserves all the information about the original system and the objective function. Previous work in this area has focused on strategies that exploit symmetries of the underlying system and of the objective function. Here, instead, we implement the algebraic method of simultaneous block diagonalization of matrices (SBD), which we show provides advantages both in terms of the dimension of the subproblems that are obtained and of the computation time. We provide practical examples with networked systems that demonstrate the benefits of applying the SBD decomposition over the decomposition method based on group symmetries.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"24-35"},"PeriodicalIF":0.0,"publicationDate":"2022-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9996568","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9111923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The Robotarium: A Remotely-Accessible, Multi-Robot Testbed for Control Research and Education 机器人博物馆:用于控制研究和教育的可远程访问的多机器人试验台
Pub Date : 2022-12-22 DOI: 10.1109/OJCSYS.2022.3231523
Sean Wilson;Magnus Egerstedt
In robotic research and education, the cost in terms of money, expertise, and time required to instantiate and maintain robotic testbeds can prevent researchers and educators from including hardware based experimentation in their laboratories and classrooms. This results in robotic algorithms often being validated by low-fidelity simulation due to the complexity and computational demand required by high-fidelity simulators. Unfortunately, these simulation environments often neglect real world complexities, such as wheel slip, actuator dynamics, computation time, communication delays, and sensor noise. The Robotarium provides a solution to these problems by providing a state-of-the-art, multi-robot research facility to everyone around the world free of charge for academic and educational purposes. This paper discusses the remote usage of the testbed since its opening in 2017, details the testbeds design, and provides a brief tutorial on how to use it.
在机器人研究和教育中,实例化和维护机器人试验台所需的资金、专业知识和时间成本可能会阻碍研究人员和教育工作者在实验室和教室中进行基于硬件的实验。由于高保真度模拟器所需的复杂性和计算需求,这导致机器人算法经常通过低保真度模拟进行验证。不幸的是,这些模拟环境往往忽略了现实世界的复杂性,如车轮打滑、执行器动力学、计算时间、通信延迟和传感器噪声。机器人博物馆为世界各地的每个人免费提供最先进的多机器人研究设施,用于学术和教育目的,从而为这些问题提供了解决方案。本文讨论了自2017年开放以来测试台的远程使用,详细介绍了测试台的设计,并提供了如何使用它的简短教程。
{"title":"The Robotarium: A Remotely-Accessible, Multi-Robot Testbed for Control Research and Education","authors":"Sean Wilson;Magnus Egerstedt","doi":"10.1109/OJCSYS.2022.3231523","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3231523","url":null,"abstract":"In robotic research and education, the cost in terms of money, expertise, and time required to instantiate and maintain robotic testbeds can prevent researchers and educators from including hardware based experimentation in their laboratories and classrooms. This results in robotic algorithms often being validated by low-fidelity simulation due to the complexity and computational demand required by high-fidelity simulators. Unfortunately, these simulation environments often neglect real world complexities, such as wheel slip, actuator dynamics, computation time, communication delays, and sensor noise. The Robotarium provides a solution to these problems by providing a state-of-the-art, multi-robot research facility to everyone around the world free of charge for academic and educational purposes. This paper discusses the remote usage of the testbed since its opening in 2017, details the testbeds design, and provides a brief tutorial on how to use it.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"12-23"},"PeriodicalIF":0.0,"publicationDate":"2022-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9973428/09996578.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50226356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Open Journal of Control Systems Publication Information IEEE控制系统公开期刊出版信息
Pub Date : 2022-12-02 DOI: 10.1109/OJCSYS.2022.3219740
Presents a listing of the editorial board, board of governors, current staff, committee members, and/or society editors for this issue of the publication.
列出本期出版物的编辑委员会、董事会、现任工作人员、委员会成员和/或协会编辑。
{"title":"IEEE Open Journal of Control Systems Publication Information","authors":"","doi":"10.1109/OJCSYS.2022.3219740","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3219740","url":null,"abstract":"Presents a listing of the editorial board, board of governors, current staff, committee members, and/or society editors for this issue of the publication.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"C2-C2"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09969409.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50237546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Control Systems Society Information IEEE控制系统协会信息
Pub Date : 2022-12-02 DOI: 10.1109/OJCSYS.2022.3219735
Presents a listing of the editorial board, board of governors, current staff, committee members, and/or society editors for this issue of the publication.
列出本期出版物的编辑委员会、董事会、现任工作人员、委员会成员和/或协会编辑。
{"title":"IEEE Control Systems Society Information","authors":"","doi":"10.1109/OJCSYS.2022.3219735","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3219735","url":null,"abstract":"Presents a listing of the editorial board, board of governors, current staff, committee members, and/or society editors for this issue of the publication.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"C3-C3"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09969411.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50237544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Velocity Estimation of Robot Manipulators: An Experimental Comparison 机器人速度估计的实验比较
Pub Date : 2022-11-16 DOI: 10.1109/OJCSYS.2022.3222753
Stefan B. Liu;Andrea Giusti;Matthias Althoff
Accurate velocity information is often essential to the control of robot manipulators, especially for precise tracking of fast trajectories. However, joint velocities are rarely directly measured and instead estimated to save costs. While many approaches have been proposed for the velocity estimation of robot joints, no comprehensive experimental evaluation exists, making it difficult to choose the appropriate method. This paper compares multiple estimation methods running on a six degrees-of-freedom manipulator. We evaluate: 1) the estimation error using a ground-truth signal, 2) the closed-loop tracking error, 3) convergence behavior, 4) sensor fault tolerance, 5) implementation and tuning effort. To ensure a fair comparison, we optimally tune the estimators using a genetic algorithm. All estimation methods have a similar estimation error and similar closed-loop tracking performance, except for the nonlinear high-gain observer, which is not accurate enough. Sliding-mode observers can provide a precise velocity estimation despite sensor faults.
精确的速度信息通常对机器人的控制至关重要,尤其是对快速轨迹的精确跟踪。然而,很少直接测量节理速度,而是估计节理速度以节省成本。虽然已经提出了许多方法来估计机器人关节的速度,但没有全面的实验评估,这使得选择合适的方法变得困难。本文比较了在六自由度机械手上运行的多种估计方法。我们评估:1)使用地面实况信号的估计误差,2)闭环跟踪误差,3)收敛行为,4)传感器容错,5)实现和调整工作。为了确保公平的比较,我们使用遗传算法优化估计量。除了非线性高增益观测器不够精确之外,所有的估计方法都具有相似的估计误差和相似的闭环跟踪性能。尽管存在传感器故障,滑模观测器仍能提供精确的速度估计。
{"title":"Velocity Estimation of Robot Manipulators: An Experimental Comparison","authors":"Stefan B. Liu;Andrea Giusti;Matthias Althoff","doi":"10.1109/OJCSYS.2022.3222753","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3222753","url":null,"abstract":"Accurate velocity information is often essential to the control of robot manipulators, especially for precise tracking of fast trajectories. However, joint velocities are rarely directly measured and instead estimated to save costs. While many approaches have been proposed for the velocity estimation of robot joints, no comprehensive experimental evaluation exists, making it difficult to choose the appropriate method. This paper compares multiple estimation methods running on a six degrees-of-freedom manipulator. We evaluate: 1) the estimation error using a ground-truth signal, 2) the closed-loop tracking error, 3) convergence behavior, 4) sensor fault tolerance, 5) implementation and tuning effort. To ensure a fair comparison, we optimally tune the estimators using a genetic algorithm. All estimation methods have a similar estimation error and similar closed-loop tracking performance, except for the nonlinear high-gain observer, which is not accurate enough. Sliding-mode observers can provide a precise velocity estimation despite sensor faults.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"1-11"},"PeriodicalIF":0.0,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9973428/09953534.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50376166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Convex Neural Network-Based Cost Modifications for Learning Model Predictive Control 基于凸神经网络的学习模型预测控制成本修正
Pub Date : 2022-11-10 DOI: 10.1109/OJCSYS.2022.3221063
Katrine Seel;Arash Bahari Kordabad;Sébastien Gros;Jan Tommy Gravdahl
Developing model predictive control (MPC) schemes can be challenging for systems where an accurate model is not available, or too costly to develop. With the increasing availability of data and tools to treat them, learning-based MPC has of late attracted wide attention. It has recently been shown that adapting not only the MPC model, but also its cost function is conducive to achieving optimal closed-loop performance when an accurate model cannot be provided. In the learning context, this modification can be performed via parametrizing the MPC cost and adjusting the parameters via, e.g., reinforcement learning (RL). In this framework, simple cost parametrizations can be effective, but the underlying theory suggests that rich parametrizations in principle can be useful. In this paper, we propose such a cost parametrization using a class of neural networks (NNs) that preserves convexity. This choice avoids creating difficulties when solving the MPC problem via sensitivity-based solvers. In addition, this choice of cost parametrization ensures nominal stability of the resulting MPC scheme. Moreover, we detail how this choice can be applied to economic MPC problems where the cost function is generic and therefore does not necessarily fulfill any specific property.
对于无法获得准确模型或开发成本过高的系统来说,开发模型预测控制(MPC)方案可能具有挑战性。随着数据和治疗工具的可用性不断增加,基于学习的MPC最近引起了广泛关注。最近的研究表明,当无法提供准确的模型时,不仅调整MPC模型,而且调整其成本函数,都有助于实现最佳闭环性能。在学习上下文中,这种修改可以通过参数化MPC成本和通过例如强化学习(RL)调整参数来执行。在这个框架中,简单的成本参数化可能是有效的,但基本理论表明,原则上丰富的参数化可能有用。在本文中,我们使用一类保持凸性的神经网络(NN)提出了这样一种成本参数化。这种选择避免了在通过基于灵敏度的求解器求解MPC问题时产生困难。此外,这种成本参数化的选择确保了所得MPC方案的标称稳定性。此外,我们详细介绍了这种选择如何应用于经济MPC问题,其中成本函数是通用的,因此不一定满足任何特定性质。
{"title":"Convex Neural Network-Based Cost Modifications for Learning Model Predictive Control","authors":"Katrine Seel;Arash Bahari Kordabad;Sébastien Gros;Jan Tommy Gravdahl","doi":"10.1109/OJCSYS.2022.3221063","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3221063","url":null,"abstract":"Developing model predictive control (MPC) schemes can be challenging for systems where an accurate model is not available, or too costly to develop. With the increasing availability of data and tools to treat them, learning-based MPC has of late attracted wide attention. It has recently been shown that adapting not only the MPC model, but also its cost function is conducive to achieving optimal closed-loop performance when an accurate model cannot be provided. In the learning context, this modification can be performed via parametrizing the MPC cost and adjusting the parameters via, e.g., reinforcement learning (RL). In this framework, simple cost parametrizations can be effective, but the underlying theory suggests that rich parametrizations in principle can be useful. In this paper, we propose such a cost parametrization using a class of neural networks (NNs) that preserves convexity. This choice avoids creating difficulties when solving the MPC problem via sensitivity-based solvers. In addition, this choice of cost parametrization ensures nominal stability of the resulting MPC scheme. Moreover, we detail how this choice can be applied to economic MPC problems where the cost function is generic and therefore does not necessarily fulfill any specific property.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"366-379"},"PeriodicalIF":0.0,"publicationDate":"2022-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09944720.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50237542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Learning Discrete-Time Uncertain Nonlinear Systems With Probabilistic Safety and Stability Constraints 具有概率安全和稳定性约束的离散时间不确定非线性系统的学习
Pub Date : 2022-10-21 DOI: 10.1109/OJCSYS.2022.3216545
Iman Salehi;Tyler Taplin;Ashwin P. Dani
This paper presents a discrete-time dynamical system model learning method from demonstration while providing probabilistic guarantees on the safety and stability of the learned model. The controlled dynamic model of a discrete-time system with a zero-mean Gaussian process noise is approximated using an Extreme Learning Machine (ELM) whose parameters are learned subject to chance constraints derived using a discrete-time control barrier function and discrete-time control Lyapunov function in the presence of the ELM reconstruction error. To estimate the ELM parameters a quadratically constrained quadratic program (QCQP) is developed subject to the constraints that are only required to be evaluated at sampled points. Simulations validate that the system model learned using the proposed method can reproduce the demonstrations inside a prescribed safe set while converging to the desired goal location starting from various different initial conditions inside the safe set. Furthermore, it is shown that the learned model can adapt to changes in goal location during reproductions without violating the stability and safety constraints.
本文从演示中提出了一种离散时间动态系统模型学习方法,同时为学习模型的安全性和稳定性提供了概率保证。具有零均值高斯过程噪声的离散时间系统的受控动态模型使用极限学习机(ELM)进行近似,在存在ELM重构误差的情况下,极限学习机的参数在使用离散时间控制屏障函数和离散时间控制李雅普诺夫函数导出的机会约束下进行学习。为了估计ELM参数,开发了一个二次约束二次规划(QCQP),该规划受仅需要在采样点进行评估的约束。仿真验证了使用所提出的方法学习的系统模型可以在规定的安全集中再现演示,同时从安全集中的各种不同初始条件开始收敛到期望的目标位置。此外,研究表明,所学习的模型可以在不违反稳定性和安全约束的情况下适应复制过程中目标位置的变化。
{"title":"Learning Discrete-Time Uncertain Nonlinear Systems With Probabilistic Safety and Stability Constraints","authors":"Iman Salehi;Tyler Taplin;Ashwin P. Dani","doi":"10.1109/OJCSYS.2022.3216545","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3216545","url":null,"abstract":"This paper presents a discrete-time dynamical system model learning method from demonstration while providing probabilistic guarantees on the safety and stability of the learned model. The controlled dynamic model of a discrete-time system with a zero-mean Gaussian process noise is approximated using an Extreme Learning Machine (ELM) whose parameters are learned subject to chance constraints derived using a discrete-time control barrier function and discrete-time control Lyapunov function in the presence of the ELM reconstruction error. To estimate the ELM parameters a quadratically constrained quadratic program (QCQP) is developed subject to the constraints that are only required to be evaluated at sampled points. Simulations validate that the system model learned using the proposed method can reproduce the demonstrations inside a prescribed safe set while converging to the desired goal location starting from various different initial conditions inside the safe set. Furthermore, it is shown that the learned model can adapt to changes in goal location during reproductions without violating the stability and safety constraints.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"354-365"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09926168.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50237541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Mode Reduction for Markov Jump Systems 马尔可夫跳跃系统的模式约简
Pub Date : 2022-10-10 DOI: 10.1109/OJCSYS.2022.3212613
Zhe Du;Laura Balzano;Necmiye Ozay
Switched systems are capable of modeling processes with underlying dynamics that may change abruptly over time. To achieve accurate modeling in practice, one may need a large number of modes, but this may in turn increase the model complexity drastically. Existing work on reducing system complexity mainly considers state space reduction, whereas reducing the number of modes is less studied. In this work, we consider Markov jump linear systems (MJSs), a special class of switched systems where the active mode switches according to a Markov chain, and several issues associated with its mode complexity. Specifically, inspired by clustering techniques from unsupervised learning, we are able to construct a reduced MJS with fewer modes that approximates the original MJS well under various metrics. Furthermore, both theoretically and empirically, we show how one can use the reduced MJS to analyze stability and design controllers with significant reduction in computational cost while achieving guaranteed accuracy.
交换系统能够对具有潜在动态的过程进行建模,这些动态可能随着时间的推移而突然变化。为了在实践中实现准确的建模,可能需要大量的模式,但这反过来可能会大大增加模型的复杂性。现有的降低系统复杂性的工作主要考虑状态空间的减少,而减少模式数量的研究较少。在这项工作中,我们考虑了马尔可夫跳跃线性系统(MJSs),这是一类特殊的切换系统,其中主动模式根据马尔可夫链进行切换,以及与其模式复杂性相关的几个问题。具体来说,受无监督学习的聚类技术的启发,我们能够构建一个具有较少模式的简化MJS,在各种度量下很好地近似原始MJS。此外,无论从理论上还是从经验上,我们都展示了如何使用简化的MJS来分析稳定性并设计控制器,同时显著降低计算成本,同时实现有保证的精度。
{"title":"Mode Reduction for Markov Jump Systems","authors":"Zhe Du;Laura Balzano;Necmiye Ozay","doi":"10.1109/OJCSYS.2022.3212613","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3212613","url":null,"abstract":"Switched systems are capable of modeling processes with underlying dynamics that may change abruptly over time. To achieve accurate modeling in practice, one may need a large number of modes, but this may in turn increase the model complexity drastically. Existing work on reducing system complexity mainly considers state space reduction, whereas reducing the number of modes is less studied. In this work, we consider Markov jump linear systems (MJSs), a special class of switched systems where the active mode switches according to a Markov chain, and several issues associated with its mode complexity. Specifically, inspired by clustering techniques from unsupervised learning, we are able to construct a reduced MJS with fewer modes that approximates the original MJS well under various metrics. Furthermore, both theoretically and empirically, we show how one can use the reduced MJS to analyze stability and design controllers with significant reduction in computational cost while achieving guaranteed accuracy.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"335-353"},"PeriodicalIF":0.0,"publicationDate":"2022-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09913637.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50237540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed Anytime-Feasible Resource Allocation Subject to Heterogeneous Time-Varying Delays 异构时变时滞下的分布式随时可行资源分配
Pub Date : 2022-09-28 DOI: 10.1109/OJCSYS.2022.3210453
Mohammadreza Doostmohammadian;Alireza Aghasi;Apostolos I. Rikos;Andreas Grammenos;Evangelia Kalyvianaki;Christoforos N. Hadjicostis;Karl H. Johansson;Themistoklis Charalambous
This paper considers distributed allocation strategies, formulated as a distributed sum-preserving (fixed-sum) allocation of resources over a multi-agent network in the presence of heterogeneous arbitrary time-varying delays. We propose a double time-scale scenario for unknown delays and a faster single time-scale scenario for known delays. Further, the links among the nodes are considered subject to certain nonlinearities (e.g, quantization and saturation/clipping). We discuss different models for nonlinearities and how they may affect the convergence, sum-preserving feasibility constraint, and solution optimality over general weight-balanced uniformly strongly connected networks and, further, time-delayed undirected networks. Our proposed scheme works in a variety of applications with general non-quadratic strongly-convex smooth objective functions. The non-quadratic part, for example, can be due to additive convex penalty or barrier functions to address the local box constraints. The network can change over time, is not necessarily connected at all times, but is only assumed to be uniformly-connected. The novelty of this work is to address all-time feasible Laplacian gradient solutions in presence of nonlinearities, switching digraph topology (not necessarily all-time connected), and heterogeneous time-varying delays.
本文考虑了分布式分配策略,该策略被表述为在存在异构任意时变延迟的情况下,在多智能体网络上对资源进行分布式保和(固定和)分配。我们提出了未知延迟的双时间尺度场景和已知延迟的更快单时间尺度场景。此外,节点之间的链路被认为受到某些非线性的影响(例如,量化和饱和/削波)。我们讨论了不同的非线性模型,以及它们如何影响一般权重平衡一致强连通网络以及时滞无向网络的收敛性、保和可行性约束和解的最优性。我们提出的方案适用于一般非二次强凸光滑目标函数的各种应用。例如,非二次部分可能是由于附加凸惩罚或障碍函数来解决局部盒约束。网络可以随着时间的推移而变化,不一定总是连接的,但只假设是一致连接的。这项工作的新颖之处在于,在存在非线性、切换有向图拓扑(不一定是全时连接的)和异构时变延迟的情况下,解决了始终可行的拉普拉斯梯度解。
{"title":"Distributed Anytime-Feasible Resource Allocation Subject to Heterogeneous Time-Varying Delays","authors":"Mohammadreza Doostmohammadian;Alireza Aghasi;Apostolos I. Rikos;Andreas Grammenos;Evangelia Kalyvianaki;Christoforos N. Hadjicostis;Karl H. Johansson;Themistoklis Charalambous","doi":"10.1109/OJCSYS.2022.3210453","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3210453","url":null,"abstract":"This paper considers distributed allocation strategies, formulated as a distributed sum-preserving (fixed-sum) allocation of resources over a multi-agent network in the presence of heterogeneous arbitrary time-varying delays. We propose a double time-scale scenario for unknown delays and a faster single time-scale scenario for known delays. Further, the links among the nodes are considered subject to certain nonlinearities (e.g, quantization and saturation/clipping). We discuss different models for nonlinearities and how they may affect the convergence, sum-preserving feasibility constraint, and solution optimality over general weight-balanced uniformly strongly connected networks and, further, time-delayed undirected networks. Our proposed scheme works in a variety of applications with general non-quadratic strongly-convex smooth objective functions. The non-quadratic part, for example, can be due to additive convex penalty or barrier functions to address the local box constraints. The network can change over time, is not necessarily connected at all times, but is only assumed to be uniformly-connected. The novelty of this work is to address all-time feasible Laplacian gradient solutions in presence of nonlinearities, switching digraph topology (not necessarily all-time connected), and heterogeneous time-varying delays.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"255-267"},"PeriodicalIF":0.0,"publicationDate":"2022-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09904851.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50348955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Reinforcement Learning With Safety and Stability Guarantees During Exploration For Linear Systems 线性系统探索过程中具有安全性和稳定性保证的强化学习
Pub Date : 2022-09-28 DOI: 10.1109/OJCSYS.2022.3209945
Zahra Marvi;Bahare Kiumarsi
The satisfaction of the safety and stability properties of reinforcement learning (RL) algorithms has been a long-standing challenge. These properties must be satisfied even during learning, for which exploration is required to collect rich data. However, satisfying the safety of actions when little is known about the system dynamics is a daunting challenge. After all, predicting the consequence of RL actions requires knowing the system dynamics. This paper presents a novel RL scheme that ensures the safety and stability of the linear systems during the exploration and exploitation phases. To do so, a fast and data-efficient model-learning with the convergence guarantee is employed along and simultaneously with an off-policy RL scheme to find the optimal controller. The accurate bound of the model-learning error is derived and its characteristic is employed in the formation of a novel adaptive robustified control barrier function (ARCBF) which guarantees that states of the system remain in the safe set even when the learning is incomplete. Therefore, after satisfaction of a mild rank condition, the noisy input in the exploratory data collection phase and the optimal controller in the exploitation phase are minimally altered such that the ARCBF criterion is satisfied and, therefore, safety is guaranteed in both phases. It is shown that under the proposed RL framework, the model learning error is a vanishing perturbation to the original system. Therefore, a stability guarantee is also provided even in the exploration when noisy random inputs are applied to the system.
增强学习算法的安全性和稳定性一直是一个长期的挑战。即使在学习过程中,也必须满足这些特性,为此需要进行探索以收集丰富的数据。然而,在对系统动力学知之甚少的情况下,满足行动的安全性是一项艰巨的挑战。毕竟,预测RL行为的后果需要了解系统动力学。本文提出了一种新的RL方案,该方案确保了线性系统在勘探和开发阶段的安全性和稳定性。为此,在非策略RL方案的同时,采用了一种具有收敛保证的快速且数据有效的模型学习来寻找最优控制器。导出了模型学习误差的精确界,并将其特性用于形成一种新的自适应鲁棒控制屏障函数(ARCBF),该函数保证了即使在学习不完全的情况下,系统的状态也保持在安全集内。因此,在满足温和秩条件之后,探索数据收集阶段中的噪声输入和开发阶段中的最优控制器被最小程度地改变,从而满足ARCBF标准,并且因此在两个阶段中都保证了安全性。结果表明,在所提出的RL框架下,模型学习误差对原始系统是一个消失的扰动。因此,即使在将噪声随机输入应用于系统的探索中,也提供了稳定性保证。
{"title":"Reinforcement Learning With Safety and Stability Guarantees During Exploration For Linear Systems","authors":"Zahra Marvi;Bahare Kiumarsi","doi":"10.1109/OJCSYS.2022.3209945","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3209945","url":null,"abstract":"The satisfaction of the safety and stability properties of reinforcement learning (RL) algorithms has been a long-standing challenge. These properties must be satisfied even during learning, for which exploration is required to collect rich data. However, satisfying the safety of actions when little is known about the system dynamics is a daunting challenge. After all, predicting the consequence of RL actions requires knowing the system dynamics. This paper presents a novel RL scheme that ensures the safety and stability of the linear systems during the exploration and exploitation phases. To do so, a fast and data-efficient model-learning with the convergence guarantee is employed along and simultaneously with an off-policy RL scheme to find the optimal controller. The accurate bound of the model-learning error is derived and its characteristic is employed in the formation of a novel adaptive robustified control barrier function (ARCBF) which guarantees that states of the system remain in the safe set even when the learning is incomplete. Therefore, after satisfaction of a mild rank condition, the noisy input in the exploratory data collection phase and the optimal controller in the exploitation phase are minimally altered such that the ARCBF criterion is satisfied and, therefore, safety is guaranteed in both phases. It is shown that under the proposed RL framework, the model learning error is a vanishing perturbation to the original system. Therefore, a stability guarantee is also provided even in the exploration when noisy random inputs are applied to the system.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"322-334"},"PeriodicalIF":0.0,"publicationDate":"2022-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09904857.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50237539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
IEEE open journal of control systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1