首页 > 最新文献

IEEE open journal of control systems最新文献

英文 中文
Controllability and Observability of Heterogeneous Networked Systems With Non-Uniform Node Dimensions and Distinct Inner-Coupling Matrices 具有非均匀节点维数和不同内耦合矩阵的异构网络系统的可控性和可观察性
Pub Date : 2025-07-10 DOI: 10.1109/OJCSYS.2025.3587537
Aleena Thomas;Abhijith Ajayakumar;Raju K. George
In this paper, controllability and observability of a heterogeneous networked system with Linear Time Invariant (LTI) nodal systems having Multiple-Inputs and Multiple-Outputs (MIMO) aligned in a weighted and directed network topology are studied. Apart from the heterogenity in nodal dynamics, the inner-coupling matrices that quantify the interactions among nodes are also different. In contrast to the existing literature, the system under consideration has distinct node dimensions, which adds to the generality. Necessary and sufficient conditions for controllability and observability as well as certain necessary conditions for controllability of a class of networked systems are established. These conditions show the dependence of network controllability and observability on various node and network-specific factors. As a practical application, a three-sector economy is modelled as a heterogeneous networked system with distinct node dimensions and its controllability is analysed. Computational time in floating point operations (flops) of the proposed methods are estimated, which indicates their efficiency on comparison with the classical conditions. This is illustrated by computational comparison of the existing and proposed schemes, applied to a randomly generated networked system. Also, robustness of the proposed schemes are analysed with the example of randomly generated networked systems. All the results are supported with illustrative numerical examples.
本文研究了具有多输入多输出(MIMO)的线性时不变(LTI)节点系统在加权有向网络拓扑结构中的可控性和可观察性问题。除了节点动力学的异质性外,量化节点间相互作用的内耦合矩阵也不同。与现有文献相比,所考虑的系统具有不同的节点维度,这增加了通用性。建立了一类网络系统的可控性和可观测性的充分必要条件以及可控性的若干必要条件。这些条件表明网络的可控性和可观测性依赖于各种节点和网络特定因素。作为实际应用,本文将三部门经济建模为具有不同节点维度的异构网络系统,并分析了其可控性。对所提方法的浮点运算计算时间进行了估计,并与经典条件进行了比较。这是通过计算比较现有的和提出的方案,应用于一个随机生成的网络系统。并以随机生成的网络系统为例,对所提方案的鲁棒性进行了分析。所有的结果都得到了数值实例的支持。
{"title":"Controllability and Observability of Heterogeneous Networked Systems With Non-Uniform Node Dimensions and Distinct Inner-Coupling Matrices","authors":"Aleena Thomas;Abhijith Ajayakumar;Raju K. George","doi":"10.1109/OJCSYS.2025.3587537","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3587537","url":null,"abstract":"In this paper, controllability and observability of a heterogeneous networked system with Linear Time Invariant (LTI) nodal systems having Multiple-Inputs and Multiple-Outputs (MIMO) aligned in a weighted and directed network topology are studied. Apart from the heterogenity in nodal dynamics, the inner-coupling matrices that quantify the interactions among nodes are also different. In contrast to the existing literature, the system under consideration has distinct node dimensions, which adds to the generality. Necessary and sufficient conditions for controllability and observability as well as certain necessary conditions for controllability of a class of networked systems are established. These conditions show the dependence of network controllability and observability on various node and network-specific factors. As a practical application, a three-sector economy is modelled as a heterogeneous networked system with distinct node dimensions and its controllability is analysed. Computational time in floating point operations (flops) of the proposed methods are estimated, which indicates their efficiency on comparison with the classical conditions. This is illustrated by computational comparison of the existing and proposed schemes, applied to a randomly generated networked system. Also, robustness of the proposed schemes are analysed with the example of randomly generated networked systems. All the results are supported with illustrative numerical examples.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"219-235"},"PeriodicalIF":0.0,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11075535","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144831949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-Series Out-of-Distribution Data Detection in Mechanical Ventilation 机械通气时间序列非分布数据检测
Pub Date : 2025-07-02 DOI: 10.1109/OJCSYS.2025.3585427
L. van de Kamp;B. Hunnekens;T. Oomen;N. van de Wouw
Safe deployment of neural networks to classify time series in safety-critical applications relies on the ability of the classifier to detect data that does not originate from the same distribution as the training data. The aim of this paper is to propose a framework for detecting whether time-series data is sampled from a different distribution than the training data, known as the problem of out-of-distribution (OOD) detection. We propose a novel distance-based OOD method for time-series data using a hierarchical clustering method together with dynamic time-warping to measure the difference between a new data instance and the training set. The method is evaluated in the context of mechanical ventilation, a safety critical application, using both simulated and clinical datasets. Results of the mechanical ventilation use case demonstrate that the proposed approach effectively detects out-of-distribution data and improves classification performance in diverse settings.
在安全关键应用中,安全部署神经网络对时间序列进行分类依赖于分类器检测与训练数据不同分布的数据的能力。本文的目的是提出一个框架,用于检测时间序列数据是否从不同于训练数据的分布中采样,称为out- distribution (OOD)检测问题。我们提出了一种新的基于距离的时间序列数据OOD方法,使用层次聚类方法和动态时间规整来度量新数据实例与训练集之间的差异。该方法在机械通气这一安全关键应用的背景下进行评估,使用模拟和临床数据集。机械通气用例的结果表明,该方法可以有效地检测出分布外数据,并提高了不同设置下的分类性能。
{"title":"Time-Series Out-of-Distribution Data Detection in Mechanical Ventilation","authors":"L. van de Kamp;B. Hunnekens;T. Oomen;N. van de Wouw","doi":"10.1109/OJCSYS.2025.3585427","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3585427","url":null,"abstract":"Safe deployment of neural networks to classify time series in safety-critical applications relies on the ability of the classifier to detect data that does not originate from the same distribution as the training data. The aim of this paper is to propose a framework for detecting whether time-series data is sampled from a different distribution than the training data, known as the problem of <italic>out-of-distribution</i> (OOD) detection. We propose a novel distance-based OOD method for time-series data using a hierarchical clustering method together with dynamic time-warping to measure the difference between a new data instance and the training set. The method is evaluated in the context of mechanical ventilation, a safety critical application, using both simulated and clinical datasets. Results of the mechanical ventilation use case demonstrate that the proposed approach effectively detects out-of-distribution data and improves classification performance in diverse settings.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"236-249"},"PeriodicalIF":0.0,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11066264","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144914214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimistic Algorithms for Safe Linear Bandits Under General Constraints 一般约束下安全线性强盗的乐观算法
Pub Date : 2025-04-07 DOI: 10.1109/OJCSYS.2025.3558118
Spencer Hutchinson;Arghavan Zibaie;Ramtin Pedarsani;Mahnoosh Alizadeh
The stochastic linear bandit problem has emerged as a fundamental building-block in machine learning and control, and a realistic model for many applications. By equipping this classical problem with safety constraints, the safe linear bandit problem further broadens its relevance to safety-critical applications. However, most existing algorithms for safe linear bandits only consider linear constraints, making them inadequate for many real-world applications, which often have non-linear constraints. To alleviate this limitation, we study the problem of safe linear bandits under general (non-linear) constraints. Under a novel constraint regularity condition that is weaker than convexity, we give two algorithms with $tilde{mathcal {O}}(d sqrt{T})$ regret. We then give efficient implementations of these algorithms for several specific settings. Lastly, we give simulation results demonstrating the effectiveness of our algorithms in choosing dynamic pricing signals for a demand response problem under distribution power flow constraints.
随机线性强盗问题已经成为机器学习和控制的基本组成部分,也是许多应用的现实模型。通过为这一经典问题配备安全约束,安全线性强盗问题进一步扩大了其与安全关键应用的相关性。然而,大多数现有的安全线性强盗算法只考虑线性约束,使得它们不适合许多具有非线性约束的实际应用。为了减轻这种限制,我们研究了一般(非线性)约束下的安全线性强盗问题。在一种比凸性更弱的约束规则条件下,给出了两种具有$tilde{mathcal {O}}(d sqrt{T})$遗憾的算法。然后,我们给出了这些算法在几个特定设置下的有效实现。最后,给出了仿真结果,证明了算法在配电网潮流约束下的需求响应问题中选择动态定价信号的有效性。
{"title":"Optimistic Algorithms for Safe Linear Bandits Under General Constraints","authors":"Spencer Hutchinson;Arghavan Zibaie;Ramtin Pedarsani;Mahnoosh Alizadeh","doi":"10.1109/OJCSYS.2025.3558118","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3558118","url":null,"abstract":"The stochastic linear bandit problem has emerged as a fundamental building-block in machine learning and control, and a realistic model for many applications. By equipping this classical problem with safety constraints, the <italic>safe linear bandit problem</i> further broadens its relevance to safety-critical applications. However, most existing algorithms for safe linear bandits only consider <italic>linear constraints</i>, making them inadequate for many real-world applications, which often have non-linear constraints. To alleviate this limitation, we study the problem of safe linear bandits under general (non-linear) constraints. Under a novel constraint regularity condition that is weaker than convexity, we give two algorithms with <inline-formula><tex-math>$tilde{mathcal {O}}(d sqrt{T})$</tex-math></inline-formula> regret. We then give efficient implementations of these algorithms for several specific settings. Lastly, we give simulation results demonstrating the effectiveness of our algorithms in choosing dynamic pricing signals for a demand response problem under distribution power flow constraints.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"103-116"},"PeriodicalIF":0.0,"publicationDate":"2025-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10950393","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143896165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable Distributed Reproduction Numbers of Network Epidemics With Differential Privacy 具有差分隐私的网络流行病的可伸缩分布再现数
Pub Date : 2025-03-30 DOI: 10.1109/OJCSYS.2025.3575305
Bo Chen;Baike She;Calvin Hawkins;Philip E. Paré;Matthew T. Hale
Reproduction numbers are widely used to analyze epidemic spreading processes over networks. However,conventional reproduction numbers of an overall network, which require spreading information from the entire network, do not indicate where an epidemic is spreading. To address this limitation, we first propose a novel class of local distributed reproduction numbers that capture spreading behaviors at the level of individual nodes. We demonstrate how to compute these values in a distributed way and use them to derive new threshold conditions for network spreading analysis. Due to the fact that epidemic data are often collected at multiple geographic or administrative scales, we then define a class of cluster distributed reproduction numbers to describe the spread between groups of nodes such as communities, cities, or states. We further show that the local distributed reproduction numbers can be aggregated to form the cluster distributed reproduction numbers. Unlike conventional network-level reproduction numbers, these distributed measures reveal fine-grained interaction patterns that may raise privacy concerns by exposing the frequency or intensity of interactions across regions. To address this issue, we propose a privacy-enhanced distributed reproduction number framework that implements differential privacy. This framework enables scalable and privacy-preserving analysis of epidemic spreading processes in networked populations through the calculation of privacy-preserving distributed reproduction numbers. Numerical experiments show that while maintaining differential privacy, the private distributed reproduction numbers yield accurate estimates of epidemic spread while also offering more insights than conventional reproduction numbers.
复制数被广泛用于分析网络上的流行病传播过程。但是,需要从整个网络传播信息的整个网络的常规复制数并不能表明流行病正在何处传播。为了解决这一限制,我们首先提出了一类新的局部分布式复制数,它在单个节点的水平上捕捉传播行为。我们演示了如何以分布式方式计算这些值,并利用它们推导出网络传播分析的新阈值条件。由于流行病数据通常是在多个地理或行政尺度上收集的,因此我们定义了一类集群分布式复制数来描述社区、城市或州等节点组之间的传播。进一步证明了局部分布再生产数可以聚合成集群分布再生产数。与传统的网络级复制数不同,这些分布式度量揭示了细粒度交互模式,这些模式可能通过暴露跨区域交互的频率或强度而引起隐私问题。为了解决这个问题,我们提出了一个隐私增强的分布式复制数框架,实现了差分隐私。该框架通过计算保护隐私的分布式复制数,实现了对网络人群中流行病传播过程的可扩展和隐私保护分析。数值实验表明,在保持差异私密性的同时,私有的分布式繁殖数可以准确估计流行病的传播,同时也比传统的繁殖数提供更多的见解。
{"title":"Scalable Distributed Reproduction Numbers of Network Epidemics With Differential Privacy","authors":"Bo Chen;Baike She;Calvin Hawkins;Philip E. Paré;Matthew T. Hale","doi":"10.1109/OJCSYS.2025.3575305","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3575305","url":null,"abstract":"Reproduction numbers are widely used to analyze epidemic spreading processes over networks. However,conventional reproduction numbers of an overall network, which require spreading information from the entire network, do not indicate where an epidemic is spreading. To address this limitation, we first propose a novel class of local distributed reproduction numbers that capture spreading behaviors at the level of individual nodes. We demonstrate how to compute these values in a distributed way and use them to derive new threshold conditions for network spreading analysis. Due to the fact that epidemic data are often collected at multiple geographic or administrative scales, we then define a class of cluster distributed reproduction numbers to describe the spread between groups of nodes such as communities, cities, or states. We further show that the local distributed reproduction numbers can be aggregated to form the cluster distributed reproduction numbers. Unlike conventional network-level reproduction numbers, these distributed measures reveal fine-grained interaction patterns that may raise privacy concerns by exposing the frequency or intensity of interactions across regions. To address this issue, we propose a privacy-enhanced distributed reproduction number framework that implements differential privacy. This framework enables scalable and privacy-preserving analysis of epidemic spreading processes in networked populations through the calculation of privacy-preserving distributed reproduction numbers. Numerical experiments show that while maintaining differential privacy, the private distributed reproduction numbers yield accurate estimates of epidemic spread while also offering more insights than conventional reproduction numbers.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"199-218"},"PeriodicalIF":0.0,"publicationDate":"2025-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11018355","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144524373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Safe and Robust Binary Classification and Fault Detection Using Reinforcement Learning 基于强化学习的安全鲁棒二值分类与故障检测
Pub Date : 2025-03-22 DOI: 10.1109/OJCSYS.2025.3572375
Josh Netter;Kyriakos G. Vamvoudakis;Timothy F. Walsh;Jaideep Ray
In this paper, we propose a learning-based method utilizing the Soft Actor-Critic (SAC) algorithm to train a binary Support Vector Machine (SVM) classifier. This classifier is designed to identify valid input spaces in high-dimensional, highly constrained systems while minimizing the total runtime of offline simulations. The simulations adapt their runtime based on the likelihood that a given training input will be informative to the classifier. Furthermore, we introduce a method for using the trained SAC model to predict whether a desired system input is likely to violate constraints, along with a technique to adjust the input as necessary. Additionally, we explore the potential of this model to detect faults or adversarial attacks within the system. The effectiveness of our approach is demonstrated through various simulations of challenging classification problems and a constrained quadrotor model.
在本文中,我们提出了一种基于学习的方法,利用软Actor-Critic (SAC)算法来训练二进制支持向量机(SVM)分类器。该分类器旨在识别高维,高度约束系统中的有效输入空间,同时最小化离线模拟的总运行时间。仿真根据给定训练输入对分类器提供信息的可能性来调整其运行时间。此外,我们还介绍了一种使用训练好的SAC模型来预测所需系统输入是否可能违反约束的方法,以及一种根据需要调整输入的技术。此外,我们还探索了该模型在检测系统内的故障或对抗性攻击方面的潜力。我们的方法的有效性是通过具有挑战性的分类问题和约束四旋翼模型的各种模拟证明。
{"title":"Safe and Robust Binary Classification and Fault Detection Using Reinforcement Learning","authors":"Josh Netter;Kyriakos G. Vamvoudakis;Timothy F. Walsh;Jaideep Ray","doi":"10.1109/OJCSYS.2025.3572375","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3572375","url":null,"abstract":"In this paper, we propose a learning-based method utilizing the Soft Actor-Critic (SAC) algorithm to train a binary Support Vector Machine (SVM) classifier. This classifier is designed to identify valid input spaces in high-dimensional, highly constrained systems while minimizing the total runtime of offline simulations. The simulations adapt their runtime based on the likelihood that a given training input will be informative to the classifier. Furthermore, we introduce a method for using the trained SAC model to predict whether a desired system input is likely to violate constraints, along with a technique to adjust the input as necessary. Additionally, we explore the potential of this model to detect faults or adversarial attacks within the system. The effectiveness of our approach is demonstrated through various simulations of challenging classification problems and a constrained quadrotor model.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"172-186"},"PeriodicalIF":0.0,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11010134","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144481878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Actor-Critic Based Optimal Regulation for Drift-Free Nonlinear Systems 基于自适应因子评价的无漂移非线性系统最优调节
Pub Date : 2025-03-18 DOI: 10.1109/OJCSYS.2025.3552999
Ashwin P. Dani;Shubhendu Bhasin
In this paper, a continuous-time adaptive actor-critic reinforcement learning (RL) controller is developed for drift-free uncertain nonlinear systems. Practical examples of such systems are image-based visual servoing (IBVS) and wheeled mobile robots (WMR), where the system dynamics include a parametric uncertainty in the control effectiveness matrix with no drift term. The uncertainty in the input term poses a challenge when developing a continuous-time RL controller using existing methods. This paper presents an actor-critic/synchronous policy iteration (PI)-based RL controller with a newly derived constrained concurrent learning (CCL)-based parameter update law for estimating the unknown parameters of the linearly parametrized control effectiveness matrix. The parameter update law ensures that the parameters do not converge to $zero$, avoiding possible loss of stabilization. An infinite-horizon value function minimization objective is achieved by regulating the current states to the desired with near-optimal control efforts. The proposed controller guarantees closed-loop stability, and simulation results in the presence of noise validate the proposed theory using IBVS and WMR examples.
针对无漂移不确定非线性系统,提出了一种连续时间自适应行为者评价强化学习(RL)控制器。此类系统的实际示例是基于图像的视觉伺服(IBVS)和轮式移动机器人(WMR),其中系统动力学包括控制有效性矩阵中没有漂移项的参数不确定性。在使用现有方法开发连续时间RL控制器时,输入项的不确定性提出了挑战。本文提出了一种基于actor-critic/synchronous policy iteration (PI)的RL控制器,该控制器采用了一种新的基于约束并发学习(CCL)的参数更新律来估计线性参数化控制有效性矩阵的未知参数。参数更新律保证了参数不收敛于零,避免了可能的镇定损失。利用接近最优的控制努力将当前状态调节到理想状态,从而实现了无限视界值函数的最小化目标。所提出的控制器保证了闭环稳定性,并且在存在噪声的情况下,使用IBVS和WMR实例的仿真结果验证了所提出的理论。
{"title":"Adaptive Actor-Critic Based Optimal Regulation for Drift-Free Nonlinear Systems","authors":"Ashwin P. Dani;Shubhendu Bhasin","doi":"10.1109/OJCSYS.2025.3552999","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3552999","url":null,"abstract":"In this paper, a continuous-time adaptive actor-critic reinforcement learning (RL) controller is developed for drift-free uncertain nonlinear systems. Practical examples of such systems are image-based visual servoing (IBVS) and wheeled mobile robots (WMR), where the system dynamics include a parametric uncertainty in the control effectiveness matrix with no drift term. The uncertainty in the input term poses a challenge when developing a continuous-time RL controller using existing methods. This paper presents an actor-critic/synchronous policy iteration (PI)-based RL controller with a newly derived constrained concurrent learning (CCL)-based parameter update law for estimating the unknown parameters of the linearly parametrized control effectiveness matrix. The parameter update law ensures that the parameters do not converge to <inline-formula><tex-math>$zero$</tex-math></inline-formula>, avoiding possible loss of stabilization. An infinite-horizon value function minimization objective is achieved by regulating the current states to the desired with near-optimal control efforts. The proposed controller guarantees closed-loop stability, and simulation results in the presence of noise validate the proposed theory using IBVS and WMR examples.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"117-129"},"PeriodicalIF":0.0,"publicationDate":"2025-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10932715","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144072925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online Learning of Stabilizing Controllers Using Noisy Input-Output Data and Prior Knowledge 基于噪声输入输出数据和先验知识的稳定控制器在线学习
Pub Date : 2025-03-15 DOI: 10.1109/OJCSYS.2025.3570578
Nariman Niknejad;Farnaz Adib Yaghmaie;Hamidreza Modares
This paper presents online prior-knowledge-based data-driven approaches for verifying stability and learning a stabilizing dynamic controller for linear stochastic input-output systems. The system is modeled in an autoregressive exogenous (ARX) framework to accommodate cases where states are not fully observable. A key challenge addressed in this article is online stabilizing open-loop unstable systems, where collecting sufficient data for controller learning is impractical due to the risk of failure. To mitigate this, the proposed method integrates uncertain prior knowledge, derived from system physics, with limited available data. Inspired by set-membership system identification, the prior knowledge set is dynamically updated as new data becomes available, reducing conservatism over time. Unlike traditional approaches, this method bypasses explicit system identification, directly designing controllers based on current knowledge and data. A connection between ARX models and behavior theory is established, providing necessary and sufficient stability conditions using strict lossy S-Lemma. Quadratic difference forms serve as a framework for Lyapunov functions, and robust dynamic controllers are synthesized via linear matrix inequalities. The methodology is validated through simulations, including an unstable scalar system visualizing the integration of prior knowledge and data, and a rotary inverted pendulum demonstrating controller effectiveness in a nonlinear, unstable setting.
本文提出了基于先验知识的在线数据驱动方法,用于线性随机输入输出系统的稳定性验证和稳定动态控制器的学习。系统在一个自回归外生(ARX)框架中建模,以适应状态不能完全观察到的情况。本文解决的一个关键挑战是在线稳定开环不稳定系统,其中收集足够的数据用于控制器学习是不切实际的,因为存在故障风险。为了缓解这一问题,该方法将来自系统物理的不确定先验知识与有限的可用数据集成在一起。受集成员系统辨识的启发,先验知识集随着新数据的出现而动态更新,降低了保守性。与传统方法不同,该方法绕过显式系统识别,直接根据当前知识和数据设计控制器。建立了ARX模型与行为理论之间的联系,利用严格有耗s引理给出了稳定性的充分必要条件。二次差分形式作为李雅普诺夫函数的框架,通过线性矩阵不等式合成鲁棒动态控制器。通过仿真验证了该方法,包括一个不稳定标量系统,显示了先验知识和数据的集成,以及一个旋转倒立摆,证明了控制器在非线性、不稳定环境中的有效性。
{"title":"Online Learning of Stabilizing Controllers Using Noisy Input-Output Data and Prior Knowledge","authors":"Nariman Niknejad;Farnaz Adib Yaghmaie;Hamidreza Modares","doi":"10.1109/OJCSYS.2025.3570578","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3570578","url":null,"abstract":"This paper presents online prior-knowledge-based data-driven approaches for verifying stability and learning a stabilizing dynamic controller for linear stochastic input-output systems. The system is modeled in an autoregressive exogenous (ARX) framework to accommodate cases where states are not fully observable. A key challenge addressed in this article is online stabilizing open-loop unstable systems, where collecting sufficient data for controller learning is impractical due to the risk of failure. To mitigate this, the proposed method integrates uncertain prior knowledge, derived from system physics, with limited available data. Inspired by set-membership system identification, the prior knowledge set is dynamically updated as new data becomes available, reducing conservatism over time. Unlike traditional approaches, this method bypasses explicit system identification, directly designing controllers based on current knowledge and data. A connection between ARX models and behavior theory is established, providing necessary and sufficient stability conditions using strict lossy <italic>S</i>-Lemma. Quadratic difference forms serve as a framework for Lyapunov functions, and robust dynamic controllers are synthesized via linear matrix inequalities. The methodology is validated through simulations, including an unstable scalar system visualizing the integration of prior knowledge and data, and a rotary inverted pendulum demonstrating controller effectiveness in a nonlinear, unstable setting.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"156-171"},"PeriodicalIF":0.0,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11005422","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144323194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Relationship Between the Number of Agents and Sparse Observability Index 智能体数量与稀疏可观察性指标的关系
Pub Date : 2025-03-07 DOI: 10.1109/OJCSYS.2025.3567867
T. Shinohara;T. Namerikawa
The state estimation problem in the presence of malicious sensor attacks is commonly referred to as a secure state estimation problem. Central to addressing this problem is the concept of the sparse observability index, defined as the largest integer $ delta$ for which the system remains observable after the removal of any $delta$ sensors. This index plays a critical role in quantifying the resilience of the system, as a higher $delta$ enables unique state reconstruction despite the presence of more compromised sensors. In this study, for undirected multi-agent systems consisting of $ n$ agents, we analyze the relationship between the number of agents $ n$ and the sparse observability index $ delta$ for effective secure state estimation. In particular, we consider four typical graph structures: path, cycle, complete, and complete bipartite graphs. Our analysis reveals that $delta$ does not increase monotonically with $n$, and that resilience is intricately tied to the underlying network structure. Notably, we demonstrate that the system exhibits enhanced resilience when the number of agents $n$ is a prime number, although the specifics of this relationship vary depending on the graph topology.
存在恶意传感器攻击的状态估计问题通常被称为安全状态估计问题。解决这个问题的核心是稀疏可观察性指数的概念,定义为在移除任何$delta$传感器后系统仍然可观察的最大整数$delta$。该指数在量化系统弹性方面起着至关重要的作用,因为尽管存在更多受损的传感器,较高的$delta$可以实现唯一的状态重建。在本研究中,对于由$ n$智能体组成的无向多智能体系统,我们分析了智能体数目$ n$与稀疏可观察性指数$ delta$之间的关系,以获得有效的安全状态估计。特别地,我们考虑了四种典型的图结构:路径图、循环图、完全图和完全二部图。我们的分析表明,$delta$不会随着$n$单调地增加,并且弹性与底层网络结构错综复杂地联系在一起。值得注意的是,我们证明了当代理数量$n$为素数时,系统表现出增强的弹性,尽管这种关系的具体情况取决于图拓扑。
{"title":"Relationship Between the Number of Agents and Sparse Observability Index","authors":"T. Shinohara;T. Namerikawa","doi":"10.1109/OJCSYS.2025.3567867","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3567867","url":null,"abstract":"The state estimation problem in the presence of malicious sensor attacks is commonly referred to as a secure state estimation problem. Central to addressing this problem is the concept of the sparse observability index, defined as the largest integer <inline-formula><tex-math>$ delta$</tex-math></inline-formula> for which the system remains observable after the removal of any <inline-formula><tex-math>$delta$</tex-math></inline-formula> sensors. This index plays a critical role in quantifying the resilience of the system, as a higher <inline-formula><tex-math>$delta$</tex-math></inline-formula> enables unique state reconstruction despite the presence of more compromised sensors. In this study, for undirected multi-agent systems consisting of <inline-formula><tex-math>$ n$</tex-math></inline-formula> agents, we analyze the relationship between the number of agents <inline-formula><tex-math>$ n$</tex-math></inline-formula> and the sparse observability index <inline-formula><tex-math>$ delta$</tex-math></inline-formula> for effective secure state estimation. In particular, we consider four typical graph structures: path, cycle, complete, and complete bipartite graphs. Our analysis reveals that <inline-formula><tex-math>$delta$</tex-math></inline-formula> does not increase monotonically with <inline-formula><tex-math>$n$</tex-math></inline-formula>, and that resilience is intricately tied to the underlying network structure. Notably, we demonstrate that the system exhibits enhanced resilience when the number of agents <inline-formula><tex-math>$n$</tex-math></inline-formula> is a prime number, although the specifics of this relationship vary depending on the graph topology.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"144-155"},"PeriodicalIF":0.0,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10989748","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144170955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Precision Cylinder Gluing With Uncertainty-Aware MPC-Enhanced DDPG 具有不确定意识的mpc -增强型DDPG的精密气缸粘接
Pub Date : 2025-03-01 DOI: 10.1109/OJCSYS.2025.3566323
Liangshun Wu;Junsuo Qu
This paper presents an uncertainty-aware optimization method for high-precision servo control in automotive dosing cylinder gluing. A comprehensive system model captures the interdependent dynamics of mechanical, hydraulic, and servo motor subsystems, formulating the control problem as a Markov Decision Process (MDP). Using Deep Deterministic Policy Gradient (DDPG) reinforcement learning with Model Predictive Control (MPC), the approach combines MPC's optimization capabilities with DDPG's adaptive learning, improving resilience to uncertainties. The DDPG Actor refines the MPC baseline, while uncertainty analysis in the MPC objective anticipates future variations. The Critic evaluates Q-values with uncertainty feedback. Simulations and real-world tests confirm the method's stability, precision, and reliability for high-precision industrial gluing.
提出了一种汽车加药缸上胶高精度伺服控制的不确定性感知优化方法。一个全面的系统模型捕获了机械、液压和伺服电机子系统的相互依赖的动力学,将控制问题表述为马尔可夫决策过程(MDP)。该方法将深度确定性策略梯度(DDPG)强化学习与模型预测控制(MPC)相结合,将MPC的优化能力与DDPG的自适应学习相结合,提高了对不确定性的适应能力。DDPG Actor细化MPC基线,而MPC目标中的不确定性分析预测未来的变化。批评家用不确定性反馈评价q值。仿真和实际测试证实了该方法的稳定性、精度和可靠性,适用于高精度工业胶接。
{"title":"Precision Cylinder Gluing With Uncertainty-Aware MPC-Enhanced DDPG","authors":"Liangshun Wu;Junsuo Qu","doi":"10.1109/OJCSYS.2025.3566323","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3566323","url":null,"abstract":"This paper presents an uncertainty-aware optimization method for high-precision servo control in automotive dosing cylinder gluing. A comprehensive system model captures the interdependent dynamics of mechanical, hydraulic, and servo motor subsystems, formulating the control problem as a Markov Decision Process (MDP). Using Deep Deterministic Policy Gradient (DDPG) reinforcement learning with Model Predictive Control (MPC), the approach combines MPC's optimization capabilities with DDPG's adaptive learning, improving resilience to uncertainties. The DDPG Actor refines the MPC baseline, while uncertainty analysis in the MPC objective anticipates future variations. The Critic evaluates Q-values with uncertainty feedback. Simulations and real-world tests confirm the method's stability, precision, and reliability for high-precision industrial gluing.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"130-143"},"PeriodicalIF":0.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10981597","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144148132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robustness to Modeling Errors in Risk-Sensitive Markov Decision Problems With Markov Risk Measures 使用马尔可夫风险度量的风险敏感马尔可夫决策问题中建模错误的鲁棒性
Pub Date : 2025-02-04 DOI: 10.1109/OJCSYS.2025.3538267
Shiping Shao;Abhishek Gupta
We consider risk-sensitive Markov decision processes (MDPs), where the MDP model is influenced by a parameter which takes values in a compact metric space. These situations arise when the underlying dynamics of the system depend on parameters that drifts over time. For example, mass of a vehicle depends on the number of passengers in the vehicle, which may change from one trip to another. Similarly, the energy demand of a building depends on the local weather, which changes every hour of the day. We identify sufficient conditions under which small perturbations in the model parameters lead to small changes in the optimal value function and optimal policy. This is achieved by establishing the continuity of the value function with respect to the parameters. A direct consequence of this result is that an optimal policy under a specific parameter remains near-optimal if the parameter is perturbed slightly. Implications of the results for data-driven decision-making, decision-making with preference uncertainty, and systems with changing noise distributions are discussed.
我们考虑风险敏感马尔可夫决策过程(MDP),其中MDP模型受到一个参数的影响,该参数在紧致度量空间中取值。当系统的潜在动态依赖于随时间漂移的参数时,就会出现这些情况。例如,车辆的质量取决于车辆中的乘客数量,这可能会在每次旅行中发生变化。同样,建筑物的能源需求也取决于当地的天气,而当地的天气每时每刻都在变化。我们确定了模型参数中的小扰动导致最优值函数和最优策略的小变化的充分条件。这是通过建立值函数相对于参数的连续性来实现的。这个结果的一个直接结果是,在特定参数下的最优策略,如果参数稍微受到扰动,仍然是接近最优的。讨论了结果对数据驱动决策、偏好不确定性决策和噪声分布变化系统的影响。
{"title":"Robustness to Modeling Errors in Risk-Sensitive Markov Decision Problems With Markov Risk Measures","authors":"Shiping Shao;Abhishek Gupta","doi":"10.1109/OJCSYS.2025.3538267","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3538267","url":null,"abstract":"We consider risk-sensitive Markov decision processes (MDPs), where the MDP model is influenced by a parameter which takes values in a compact metric space. These situations arise when the underlying dynamics of the system depend on parameters that drifts over time. For example, mass of a vehicle depends on the number of passengers in the vehicle, which may change from one trip to another. Similarly, the energy demand of a building depends on the local weather, which changes every hour of the day. We identify sufficient conditions under which small perturbations in the model parameters lead to small changes in the optimal value function and optimal policy. This is achieved by establishing the continuity of the value function with respect to the parameters. A direct consequence of this result is that an optimal policy under a specific parameter remains near-optimal if the parameter is perturbed slightly. Implications of the results for data-driven decision-making, decision-making with preference uncertainty, and systems with changing noise distributions are discussed.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"70-82"},"PeriodicalIF":0.0,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10872799","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143570694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE open journal of control systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1