IEEE Open Journal of Control Systems: Latest Articles

Cross Apprenticeship Learning Framework: Properties and Solution Approaches
Pub Date : 2023-01-09 DOI: 10.1109/OJCSYS.2023.3235248
Ashwin Aravind;Debasish Chatterjee;Ashish Cherukuri
Apprenticeship learning is a framework in which an agent learns a policy to perform a given task in an environment using example trajectories provided by an expert. In the real world, one might have access to expert trajectories in different environments where the system dynamics differ while the learning task is the same. For such scenarios, two types of learning objectives can be defined: one where the learned policy performs very well in one specific environment, and another where it performs well across all environments. To balance these two objectives in a principled way, our work presents the cross apprenticeship learning (CAL) framework. This consists of an optimization problem where an optimal policy for each environment is sought while ensuring that all policies remain close to each other. This nearness is facilitated by one tuning parameter in the optimization problem. We derive properties of the optimizers of the problem as the tuning parameter varies. We identify conditions under which an agent prefers using the policy obtained from CAL over traditional apprenticeship learning. Since the CAL problem is nonconvex, we provide a convex outer approximation. Finally, we demonstrate the attributes of our framework in the context of a navigation task in a windy gridworld environment.
Volume: 2 Pages: 36-48 PDF: https://ieeexplore.ieee.org/iel7/9552933/9973428/10011555.pdf
Citations: 0
Modeling and Characterization of Pre-Charged Collapse-Mode CMUTs
Pub Date : 2023-01-01 DOI: 10.1109/OJUFFC.2023.3240699
M. Saccher, Shinnosuke Kawasaki, J. Klootwijk, R. van Schaijk, Ronald Dekker
Recently, the applications of ultrasound transducers have expanded from high-end diagnostic tools to point-of-care diagnostic devices and wireless power receivers for implantable devices. These new applications additionally require that the transducer technology comply with biocompatibility and manufacturing scalability requirements. In this respect, Capacitive Micromachined Ultrasound Transducers (CMUTs) have a strong advantage compared to conventional PZT-based transducers. However, current CMUTs require a large DC bias voltage for their operation, which limits the miniaturizability of these devices. In this study, we propose a pre-charged collapse-mode CMUT for immersive applications that can operate without an external bias by means of a charge-trapping Al2O3 layer embedded in the dielectrics between the top and bottom electrodes. The built-in charge layer was analytically modeled, and four layer stack combinations were investigated and characterized. The measurement results of the CMUTs were then used to fit the model and to quantify the amount and type of trapped charge. It was found that these devices polarize due to the ferroelectric-like behavior of the Al2O3, and the amount of charge stored in the charge-trapping layer was estimated to be approximately 0.02 C/m². Their acoustic performance shows a transmit sensitivity of 8.8 kPa/V and a receive sensitivity of 13.1 V/MPa. In addition, we show that increasing the charging temperature, the charging duration, and the charging voltage results in a higher amount of stored charge. Finally, results of accelerated lifetime tests (ALT) showed that these devices have a lifetime of more than 2.5 years at body temperature.
Volume: 3 Pages: 14-28
Citations: 3
Exact Decomposition of Optimal Control Problems via Simultaneous Block Diagonalization of Matrices
Pub Date : 2022-12-22 DOI: 10.1109/OJCSYS.2022.3231553
Amirhossein Nazerian;Kshitij Bhatta;Francesco Sorrentino
In this paper, we consider optimal control problems (OCPs) applied to large-scale linear dynamical systems with a large number of states and inputs. We attempt to reduce such problems to a set of independent OCPs of lower dimensions. Our decomposition is ‘exact’ in the sense that it preserves all the information about the original system and the objective function. Previous work in this area has focused on strategies that exploit symmetries of the underlying system and of the objective function. Here, instead, we implement the algebraic method of simultaneous block diagonalization of matrices (SBD), which we show provides advantages both in terms of the dimension of the subproblems that are obtained and of the computation time. We provide practical examples with networked systems that demonstrate the benefits of applying the SBD decomposition over the decomposition method based on group symmetries.
Volume: 2 Pages: 24-35 PDF: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9996568
Citations: 1
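The decomposition idea in the abstract above can be illustrated on a toy problem. The sketch below is not the paper's SBD algorithm: it assumes a hypothetical 2-state system whose state matrix `A` and state-cost matrix `Q` happen to share an orthogonal transformation `T` (chosen by hand here), and checks that `T` diagonalizes both at once, so the problem splits into two independent scalar subproblems. All names and numbers are illustrative assumptions.

```python
import math

def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    return [list(col) for col in zip(*X)]

def change_basis(T, M):
    """Return T^T M T, the matrix M expressed in the basis given by T's columns."""
    return matmul(matmul(transpose(T), M), T)

# Hypothetical toy data: two identical, symmetrically coupled states.
A = [[-1.0, 0.5], [0.5, -1.0]]   # state matrix (assumed)
Q = [[2.0, -1.0], [-1.0, 2.0]]   # state cost weight (assumed)

# Hand-picked orthogonal transformation (sum and difference coordinates).
s = 1.0 / math.sqrt(2.0)
T = [[s, s], [s, -s]]

A_t = change_basis(T, A)  # approximately diag(-0.5, -1.5)
Q_t = change_basis(T, Q)  # approximately diag(1.0, 3.0)
```

In the transformed coordinates each diagonal entry pair `(A_t[i][i], Q_t[i][i])` defines its own scalar subproblem. For larger systems the transformation is not known in advance, which is the part an SBD routine would compute.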
The Robotarium: A Remotely-Accessible, Multi-Robot Testbed for Control Research and Education
Pub Date : 2022-12-22 DOI: 10.1109/OJCSYS.2022.3231523
Sean Wilson;Magnus Egerstedt
In robotic research and education, the cost in terms of money, expertise, and time required to instantiate and maintain robotic testbeds can prevent researchers and educators from including hardware-based experimentation in their laboratories and classrooms. This results in robotic algorithms often being validated by low-fidelity simulation due to the complexity and computational demand required by high-fidelity simulators. Unfortunately, these simulation environments often neglect real-world complexities, such as wheel slip, actuator dynamics, computation time, communication delays, and sensor noise. The Robotarium addresses these problems by providing a state-of-the-art, multi-robot research facility to everyone around the world free of charge for academic and educational purposes. This paper discusses the remote usage of the testbed since its opening in 2017, details the testbed's design, and provides a brief tutorial on how to use it.
Volume: 2 Pages: 12-23 PDF: https://ieeexplore.ieee.org/iel7/9552933/9973428/09996578.pdf
Citations: 0
IEEE Open Journal of Control Systems Publication Information
Pub Date : 2022-12-02 DOI: 10.1109/OJCSYS.2022.3219740
Presents a listing of the editorial board, board of governors, current staff, committee members, and/or society editors for this issue of the publication.
Volume: 1 Pages: C2 PDF: https://ieeexplore.ieee.org/iel7/9552933/9683993/09969409.pdf
Citations: 0
IEEE Control Systems Society Information
Pub Date : 2022-12-02 DOI: 10.1109/OJCSYS.2022.3219735
Presents a listing of the editorial board, board of governors, current staff, committee members, and/or society editors for this issue of the publication.
Volume: 1 Pages: C3 PDF: https://ieeexplore.ieee.org/iel7/9552933/9683993/09969411.pdf
Citations: 0
Velocity Estimation of Robot Manipulators: An Experimental Comparison
Pub Date : 2022-11-16 DOI: 10.1109/OJCSYS.2022.3222753
Stefan B. Liu;Andrea Giusti;Matthias Althoff
Accurate velocity information is often essential to the control of robot manipulators, especially for precise tracking of fast trajectories. However, joint velocities are rarely measured directly and are instead estimated to save costs. While many approaches have been proposed for the velocity estimation of robot joints, no comprehensive experimental evaluation exists, making it difficult to choose the appropriate method. This paper compares multiple estimation methods running on a six-degree-of-freedom manipulator. We evaluate: 1) the estimation error using a ground-truth signal, 2) the closed-loop tracking error, 3) convergence behavior, 4) sensor fault tolerance, and 5) implementation and tuning effort. To ensure a fair comparison, we optimally tune the estimators using a genetic algorithm. All estimation methods have a similar estimation error and similar closed-loop tracking performance, except for the nonlinear high-gain observer, which is not accurate enough. Sliding-mode observers can provide a precise velocity estimation despite sensor faults.
Volume: 2 Pages: 1-11 PDF: https://ieeexplore.ieee.org/iel7/9552933/9973428/09953534.pdf
Citations: 3
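To make the estimation task in the abstract above concrete, here is a minimal sketch of one generic estimator family: a linear high-gain observer for a single joint, discretized with forward Euler. It is not one of the tuned estimators from the paper; the function name, the gains `a1`, `a2`, the parameter `eps`, and the sample time are all illustrative assumptions.

```python
import math

def high_gain_velocity_estimate(positions, dt, eps=0.01, a1=2.0, a2=1.0):
    """Estimate velocity from sampled joint positions.

    Observer (forward-Euler discretization, all parameters assumed):
        x1_hat' = x2_hat + (a1 / eps)    * (y - x1_hat)
        x2_hat' =          (a2 / eps**2) * (y - x1_hat)
    Smaller eps gives faster convergence but amplifies measurement noise,
    and dt must be small relative to eps for the Euler step to stay stable.
    """
    x1_hat = positions[0]  # start from the first measured position
    x2_hat = 0.0           # no prior knowledge of the velocity
    velocities = []
    for y in positions:
        err = y - x1_hat
        # Update both states from the same error value.
        x1_hat, x2_hat = (x1_hat + dt * (x2_hat + (a1 / eps) * err),
                          x2_hat + dt * (a2 / eps ** 2) * err)
        velocities.append(x2_hat)
    return velocities
```

For example, feeding samples of sin(t) taken every 1 ms should return values that settle close to cos(t) after a short transient, with a small lag on the order of 2·eps seconds for these gains.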
Convex Neural Network-Based Cost Modifications for Learning Model Predictive Control
Pub Date : 2022-11-10 DOI: 10.1109/OJCSYS.2022.3221063
Katrine Seel;Arash Bahari Kordabad;Sébastien Gros;Jan Tommy Gravdahl
Developing model predictive control (MPC) schemes can be challenging for systems where an accurate model is not available, or too costly to develop. With the increasing availability of data and tools to treat them, learning-based MPC has of late attracted wide attention. It has recently been shown that adapting not only the MPC model, but also its cost function is conducive to achieving optimal closed-loop performance when an accurate model cannot be provided. In the learning context, this modification can be performed via parametrizing the MPC cost and adjusting the parameters via, e.g., reinforcement learning (RL). In this framework, simple cost parametrizations can be effective, but the underlying theory suggests that rich parametrizations in principle can be useful. In this paper, we propose such a cost parametrization using a class of neural networks (NNs) that preserves convexity. This choice avoids creating difficulties when solving the MPC problem via sensitivity-based solvers. In addition, this choice of cost parametrization ensures nominal stability of the resulting MPC scheme. Moreover, we detail how this choice can be applied to economic MPC problems where the cost function is generic and therefore does not necessarily fulfill any specific property.
Volume: 1 Pages: 366-379 PDF: https://ieeexplore.ieee.org/iel7/9552933/9683993/09944720.pdf
Citations: 7
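The abstract above does not say which network class is used. One well-known convexity-preserving construction is the input convex neural network, in which the weights acting on hidden activations are non-negative and the activations are convex and nondecreasing. The sketch below follows that construction as an assumption; the layer sizes, the random initialization, and the names `make_icnn` and `icnn_cost` are hypothetical and are not taken from the paper.

```python
import random

def _affine(W, x, b):
    """Return W x + b for a list-of-rows matrix W and list vectors x, b."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]

def _relu(v):
    return [max(0.0, vi) for vi in v]

def make_icnn(n_in, n_hidden, seed=0):
    """Random two-hidden-layer network; weights on hidden units are made non-negative."""
    rng = random.Random(seed)

    def mat(rows, cols, nonneg=False):
        return [[abs(rng.gauss(0, 1)) if nonneg else rng.gauss(0, 1)
                 for _ in range(cols)] for _ in range(rows)]

    def vec(n, nonneg=False):
        return [abs(rng.gauss(0, 1)) if nonneg else rng.gauss(0, 1) for _ in range(n)]

    return {
        "Wx0": mat(n_hidden, n_in), "b0": vec(n_hidden),
        "Wz1": mat(n_hidden, n_hidden, nonneg=True),  # must stay >= 0
        "Wx1": mat(n_hidden, n_in), "b1": vec(n_hidden),
        "wz": vec(n_hidden, nonneg=True),             # must stay >= 0
        "wx": vec(n_in),
    }

def icnn_cost(p, x):
    """Scalar cost that is convex in x by construction."""
    z1 = _relu(_affine(p["Wx0"], x, p["b0"]))
    skip = _affine(p["Wx1"], x, [0.0] * len(p["b1"]))  # direct input path
    z2 = _relu([a + s for a, s in zip(_affine(p["Wz1"], z1, p["b1"]), skip)])
    return (sum(w * z for w, z in zip(p["wz"], z2))
            + sum(w * xi for w, xi in zip(p["wx"], x)))
```

Convexity follows because each layer applies a non-negative combination of convex functions plus an affine term, followed by a convex nondecreasing activation. When such a network is trained, the non-negativity of `Wz1` and `wz` has to be maintained, for instance by projecting after each update.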
Learning Discrete-Time Uncertain Nonlinear Systems With Probabilistic Safety and Stability Constraints
Pub Date : 2022-10-21 DOI: 10.1109/OJCSYS.2022.3216545
Iman Salehi;Tyler Taplin;Ashwin P. Dani
This paper presents a method for learning discrete-time dynamical system models from demonstration while providing probabilistic guarantees on the safety and stability of the learned model. The controlled dynamic model of a discrete-time system with a zero-mean Gaussian process noise is approximated using an Extreme Learning Machine (ELM) whose parameters are learned subject to chance constraints derived using a discrete-time control barrier function and discrete-time control Lyapunov function in the presence of the ELM reconstruction error. To estimate the ELM parameters, a quadratically constrained quadratic program (QCQP) is developed subject to constraints that are only required to be evaluated at sampled points. Simulations validate that the system model learned using the proposed method can reproduce the demonstrations inside a prescribed safe set while converging to the desired goal location starting from various initial conditions inside the safe set. Furthermore, it is shown that the learned model can adapt to changes in goal location during reproductions without violating the stability and safety constraints.
Volume: 1 Pages: 354-365 PDF: https://ieeexplore.ieee.org/iel7/9552933/9683993/09926168.pdf
Citations: 2
Mode Reduction for Markov Jump Systems
Pub Date : 2022-10-10 DOI: 10.1109/OJCSYS.2022.3212613
Zhe Du;Laura Balzano;Necmiye Ozay
Switched systems are capable of modeling processes with underlying dynamics that may change abruptly over time. To achieve accurate modeling in practice, one may need a large number of modes, but this may in turn increase the model complexity drastically. Existing work on reducing system complexity mainly considers state space reduction, whereas reducing the number of modes is less studied. In this work, we consider Markov jump linear systems (MJSs), a special class of switched systems where the active mode switches according to a Markov chain, and several issues associated with their mode complexity. Specifically, inspired by clustering techniques from unsupervised learning, we are able to construct a reduced MJS with fewer modes that approximates the original MJS well under various metrics. Furthermore, both theoretically and empirically, we show how one can use the reduced MJS to analyze stability and design controllers with a significant reduction in computational cost while achieving guaranteed accuracy.
Volume: 1 Pages: 335-353 PDF: https://ieeexplore.ieee.org/iel7/9552933/9683993/09913637.pdf
Citations: 0
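As a rough illustration of what a reduced MJS with fewer modes could look like once a clustering of the modes is available, the sketch below aggregates the Markov transition matrix and averages the per-mode dynamics within each cluster. The clustering step itself is omitted, the dynamics are scalar for brevity, and the averaging rule and the name `reduce_mjs` are assumptions made for this example, not the construction from the paper.

```python
def reduce_mjs(P, A, clusters):
    """Build a reduced Markov jump system from a given partition of the modes.

    P        : n x n row-stochastic transition matrix (list of rows)
    A        : list of n scalar mode dynamics (x_next = A[mode] * x)
    clusters : list of lists of mode indices forming a partition of range(n)

    Assumed aggregation rule: the probability of moving from cluster a to
    cluster b is the average, over modes i in a, of the total probability
    of jumping from i into b. Mode dynamics are averaged within a cluster.
    """
    P_red = []
    for src in clusters:
        row = []
        for dst in clusters:
            mass = sum(P[i][j] for i in src for j in dst)
            row.append(mass / len(src))
        P_red.append(row)
    A_red = [sum(A[i] for i in c) / len(c) for c in clusters]
    return P_red, A_red

# Hypothetical 3-mode example in which modes 0 and 1 behave almost identically.
P = [[0.1, 0.1, 0.8],
     [0.1, 0.1, 0.8],
     [0.3, 0.3, 0.4]]
A = [0.9, 1.1, 0.5]
P_red, A_red = reduce_mjs(P, A, clusters=[[0, 1], [2]])
```

Each row of `P_red` still sums to one, so the reduced object is again a valid Markov chain over the clusters. How well it approximates the original system depends on how similar the merged modes are.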