首页 > 最新文献

IEEE open journal of control systems最新文献

英文 中文
Integrating Reinforcement Learning and Model Predictive Control for Mixed- Logical Dynamical Systems 混合逻辑动力系统强化学习与模型预测控制的集成
Pub Date : 2025-08-21 DOI: 10.1109/OJCSYS.2025.3601435
Caio Fabio Oliveira da Silva;Azita Dabiri;Bart De Schutter
This work proposes an approach that integrates reinforcement learning (RL) and model predictive control (MPC) to solve finite-horizon optimal control problems in mixed-logical dynamical systems efficiently. Optimization-based control of such systems with discrete and continuous decision variables entails the online solution of mixed-integer linear programs, which suffer from the curse of dimensionality. In the proposed approach, by repeated interaction with a simulator of the system, a reinforcement learning agent is trained to provide a policy for the discrete decision variables. During online operation, the RL policy simplifies the online optimization problem of the MPC controller from a mixed-integer linear program to a linear program, significantly reducing the computation time. A fundamental contribution of this work is the definition of the decoupled Q-function, which plays a crucial role in making the learning problem tractable in a combinatorial action space. We motivate the use of recurrent neural networks to approximate the decoupled Q-function and show how they can be employed in a reinforcement learning setting. A microgrid system is used as an illustrative example where real-world data is used to demonstrate that the proposed method substantially reduces the maximum online computation time of MPC (up to $20times$) while maintaining high feasibility and average optimality gap lower than 1.1% .
本文提出了一种集成强化学习(RL)和模型预测控制(MPC)的方法来有效地解决混合逻辑动态系统中的有限视界最优控制问题。这类具有离散和连续决策变量的系统的优化控制需要在线求解受维数诅咒的混合整数线性规划。在提出的方法中,通过与系统模拟器的重复交互,训练强化学习代理为离散决策变量提供策略。在线运行时,RL策略将MPC控制器的在线优化问题从混合整数线性规划简化为线性规划,大大减少了计算时间。这项工作的一个基本贡献是解耦q函数的定义,它在使学习问题在组合动作空间中易于处理方面起着至关重要的作用。我们鼓励使用循环神经网络来近似解耦的q函数,并展示如何在强化学习设置中使用它们。以微电网系统为例,使用实际数据证明所提出的方法大大减少了MPC的最大在线计算时间(高达$20times$),同时保持高可行性和平均最优性差距低于1.1%。
{"title":"Integrating Reinforcement Learning and Model Predictive Control for Mixed- Logical Dynamical Systems","authors":"Caio Fabio Oliveira da Silva;Azita Dabiri;Bart De Schutter","doi":"10.1109/OJCSYS.2025.3601435","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3601435","url":null,"abstract":"This work proposes an approach that integrates reinforcement learning (RL) and model predictive control (MPC) to solve finite-horizon optimal control problems in mixed-logical dynamical systems efficiently. Optimization-based control of such systems with discrete and continuous decision variables entails the online solution of mixed-integer linear programs, which suffer from the curse of dimensionality. In the proposed approach, by repeated interaction with a simulator of the system, a reinforcement learning agent is trained to provide a policy for the discrete decision variables. During online operation, the RL policy simplifies the online optimization problem of the MPC controller from a mixed-integer linear program to a linear program, significantly reducing the computation time. A fundamental contribution of this work is the definition of the decoupled Q-function, which plays a crucial role in making the learning problem tractable in a combinatorial action space. We motivate the use of recurrent neural networks to approximate the decoupled Q-function and show how they can be employed in a reinforcement learning setting. A microgrid system is used as an illustrative example where real-world data is used to demonstrate that the proposed method substantially reduces the maximum online computation time of MPC (up to <inline-formula><tex-math>$20times$</tex-math></inline-formula>) while maintaining high feasibility and average optimality gap lower than 1.1% .","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"316-331"},"PeriodicalIF":0.0,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11134093","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145027930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MDP-Based High-Level Decision-Making for Combining Safety and Optimality: Autonomous Overtaking 基于mdp的安全性与最优性相结合的高层决策:自动超车
Pub Date : 2025-08-20 DOI: 10.1109/OJCSYS.2025.3600925
Xue-Fang Wang;Jingjing Jiang;Wen-Hua Chen
This paper presents a novel solution for optimal high-level decision-making in autonomous overtaking on two-lane roads, considering both opposite-direction and same-direction traffic. The proposed solutionaccounts for key factors such as safety and optimality, while also ensuring recursive feasibility and stability.To safely complete overtaking maneuvers, the solution is built on a constrained Markov decision process (MDP) that generates optimal decisions for path planners. By combining MDP with model predictive control (MPC), the approach guarantees recursive feasibility and stability through a baseline control policy that calculates the terminal cost and is incorporated into a constructed Lyapunov function. The proposed solution is validated through five simulated driving scenarios, demonstrating its robustness in handling diverse interactions within dynamic and complex traffic conditions.
提出了一种考虑反向和同向车辆的双车道自动超车高层最优决策的新方法。该方案兼顾了安全性和最优性等关键因素,同时保证了递归的可行性和稳定性。为了安全地完成超车操作,该解决方案建立在约束马尔可夫决策过程(MDP)的基础上,该决策过程为路径规划者生成最优决策。通过将MDP与模型预测控制(MPC)相结合,该方法通过计算终端成本的基线控制策略来保证递归的可行性和稳定性,并将其纳入构建的Lyapunov函数中。通过五个模拟驾驶场景验证了该方案的有效性,证明了其在处理动态和复杂交通条件下的各种交互方面的鲁棒性。
{"title":"MDP-Based High-Level Decision-Making for Combining Safety and Optimality: Autonomous Overtaking","authors":"Xue-Fang Wang;Jingjing Jiang;Wen-Hua Chen","doi":"10.1109/OJCSYS.2025.3600925","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3600925","url":null,"abstract":"This paper presents a novel solution for optimal high-level decision-making in autonomous overtaking on two-lane roads, considering both opposite-direction and same-direction traffic. The proposed solutionaccounts for key factors such as safety and optimality, while also ensuring recursive feasibility and stability.To safely complete overtaking maneuvers, the solution is built on a constrained Markov decision process (MDP) that generates optimal decisions for path planners. By combining MDP with model predictive control (MPC), the approach guarantees recursive feasibility and stability through a baseline control policy that calculates the terminal cost and is incorporated into a constructed Lyapunov function. The proposed solution is validated through five simulated driving scenarios, demonstrating its robustness in handling diverse interactions within dynamic and complex traffic conditions.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"299-315"},"PeriodicalIF":0.0,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11130904","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145021272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature Construction Using Network Control Theory and Rank Encoding for Graph Machine Learning 基于网络控制理论和秩编码的图机学习特征构建
Pub Date : 2025-08-15 DOI: 10.1109/OJCSYS.2025.3599371
Anwar Said;Yifan Wei;Obaid Ullah Ahmad;Mudassir Shabbir;Waseem Abbas;Xenofon Koutsoukos
In this article, we utilize the concept of average controllability in graphs, along with a novel rank encoding method, to enhance the performance of Graph Neural Networks (GNNs) in social network classification tasks. GNNs have proven highly effective in various network-based learning applications and require some form of node features to function. However, their performance is heavily influenced by the expressiveness of these features. In social networks, node features are often unavailable due to privacy constraints or the absence of inherent attributes, making it challenging for GNNs to achieve optimal performance. To address this limitation, we propose two strategies for constructing expressive node features. First, we introduce average controllability along with other centrality metrics (denoted as NCT-EFA) as node-level metrics that capture critical aspects of network topology. Building on this, we develop a rank encoding method that transforms average controllability—or any other graph-theoretic metric—into a fixed-dimensional feature space, thereby improving feature representation. We conduct extensive numerical evaluations using six benchmark GNN models across four social network datasets to compare different node feature construction methods. Our results demonstrate that incorporating average controllability into the feature space significantly improves GNN performance. Moreover, the proposed rank encoding method outperforms traditional one-hot degree encoding, improving the ROC AUC from 68.7% to 73.9% using GraphSAGE on the GitHub Stargazers dataset, underscoring its effectiveness in generating expressive and efficient node representations.
在本文中,我们利用图中平均可控性的概念,以及一种新的秩编码方法,来提高图神经网络(gnn)在社交网络分类任务中的性能。gnn已被证明在各种基于网络的学习应用中非常有效,并且需要某种形式的节点特征才能发挥作用。然而,它们的性能在很大程度上受到这些特征的表现力的影响。在社交网络中,由于隐私约束或缺乏固有属性,节点特征通常不可用,这给gnn实现最佳性能带来了挑战。为了解决这一限制,我们提出了两种构建表达性节点特征的策略。首先,我们将平均可控性与其他中心性指标(表示为NCT-EFA)一起引入,作为捕获网络拓扑关键方面的节点级指标。在此基础上,我们开发了一种秩编码方法,将平均可控性(或任何其他图论度量)转换为固定维特征空间,从而改进特征表示。我们在四个社交网络数据集上使用六个基准GNN模型进行了广泛的数值评估,以比较不同的节点特征构建方法。我们的研究结果表明,将平均可控性纳入特征空间可以显著提高GNN的性能。此外,本文提出的秩编码方法优于传统的单热度编码,在GitHub Stargazers数据集上使用GraphSAGE将ROC AUC从68.7%提高到73.9%,强调了其在生成富有表现力和高效的节点表示方面的有效性。
{"title":"Feature Construction Using Network Control Theory and Rank Encoding for Graph Machine Learning","authors":"Anwar Said;Yifan Wei;Obaid Ullah Ahmad;Mudassir Shabbir;Waseem Abbas;Xenofon Koutsoukos","doi":"10.1109/OJCSYS.2025.3599371","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3599371","url":null,"abstract":"In this article, we utilize the concept of average controllability in graphs, along with a novel rank encoding method, to enhance the performance of Graph Neural Networks (GNNs) in social network classification tasks. GNNs have proven highly effective in various network-based learning applications and require some form of node features to function. However, their performance is heavily influenced by the expressiveness of these features. In social networks, node features are often unavailable due to privacy constraints or the absence of inherent attributes, making it challenging for GNNs to achieve optimal performance. To address this limitation, we propose two strategies for constructing expressive node features. First, we introduce average controllability along with other centrality metrics (denoted as NCT-EFA) as node-level metrics that capture critical aspects of network topology. Building on this, we develop a rank encoding method that transforms average controllability—or any other graph-theoretic metric—into a fixed-dimensional feature space, thereby improving feature representation. We conduct extensive numerical evaluations using six benchmark GNN models across four social network datasets to compare different node feature construction methods. Our results demonstrate that incorporating average controllability into the feature space significantly improves GNN performance. Moreover, the proposed rank encoding method outperforms traditional one-hot degree encoding, improving the ROC AUC from 68.7% to 73.9% using GraphSAGE on the GitHub Stargazers dataset, underscoring its effectiveness in generating expressive and efficient node representations.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"288-298"},"PeriodicalIF":0.0,"publicationDate":"2025-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11126872","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145021271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Capture-the-Flag Differential Game: Attack, Transition and Retreat 夺旗微分博弈:进攻、转换和撤退
Pub Date : 2025-08-14 DOI: 10.1109/OJCSYS.2025.3599473
Eloy Garcia;David W. Casbeer
This paper analyzes the classic game of capture-the-flag, modeled as a conflict between an Attacker and a Defender. The game unfolds in distinct phases with changing objectives: first, the Attacker tries to capture a flag while the Defender attempts to intercept; second, if successful, the Attacker tries to reach a safe zone while the Defender again seeks interception. We mathematically derive the optimal state-feedback strategies for both players and the associated Value function for each phase, rigorously proving their correctness. A key contribution is introducing the transition phase, where we analyze the Defender’s optimal repositioning strategy when flag capture becomes inevitable, preparing it for the game’s second phase. This novel transition connects the game’s stages, critically enabling us to solve the overall Game of Kind – determining the winner from any starting condition – and define the precise circumstances under which the Attacker can both capture the flag and successfully escape to the safe zone.
本文分析了经典的夺旗游戏,将其建模为攻击者和防御者之间的冲突。游戏以不同的阶段展开,目标不断变化:首先,攻击者试图夺取一面旗帜,而防守者试图拦截;其次,如果成功,攻击者尝试到达安全区域,而防御者再次寻求拦截。我们从数学上推导出每个阶段参与者的最佳状态反馈策略和相关的价值函数,严格证明了它们的正确性。一个关键的贡献是引入过渡阶段,在这个阶段,我们分析了当夺旗不可避免时防守者的最佳重新定位策略,为游戏的第二阶段做准备。这种新颖的过渡连接了游戏的各个阶段,使我们能够解决整个“同类游戏”(game of Kind)——在任何起始条件下决定获胜者——并定义攻击者既能夺取旗帜又能成功逃到安全区的精确环境。
{"title":"The Capture-the-Flag Differential Game: Attack, Transition and Retreat","authors":"Eloy Garcia;David W. Casbeer","doi":"10.1109/OJCSYS.2025.3599473","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3599473","url":null,"abstract":"This paper analyzes the classic game of capture-the-flag, modeled as a conflict between an Attacker and a Defender. The game unfolds in distinct phases with changing objectives: first, the Attacker tries to capture a flag while the Defender attempts to intercept; second, if successful, the Attacker tries to reach a safe zone while the Defender again seeks interception. We mathematically derive the optimal state-feedback strategies for both players and the associated Value function for each phase, rigorously proving their correctness. A key contribution is introducing the transition phase, where we analyze the Defender’s optimal repositioning strategy when flag capture becomes inevitable, preparing it for the game’s second phase. This novel transition connects the game’s stages, critically enabling us to solve the overall Game of Kind – determining the winner from any starting condition – and define the precise circumstances under which the Attacker can both capture the flag and successfully escape to the safe zone.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"271-287"},"PeriodicalIF":0.0,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11125922","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144998218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vision-Based Collision Avoidance for Multi-Agent Systems With Intermittent Measurements 基于视觉的间歇测量多智能体系统避碰
Pub Date : 2025-08-13 DOI: 10.1109/OJCSYS.2025.3598673
Mia Scoblic;Camilla Tabasso;Venanzio Cichella;Isaac Kaminer
Collision avoidance is a fundamental aspect of many applications involving autonomous vehicles. Solving this problem becomes especially challenging when the agents involved cannot communicate. In these scenarios, onboard sensors are essential for detecting and avoiding other vehicles or obstacles. However, in many practical applications, sensors have limited range and measurements may be intermittent due to external factors. With this in mind, in this work, we present a novel decentralized vision-based collision avoidance algorithm which does not require communication among the agents and has mild assumptions on the sensing capabilities of the vehicles. Once a collision is detected, the agents replan their trajectories to follow a circular path centered at the point of collision. A feedback control law is designed so that the vehicles can maintain a predefined phase shift along this circle and therefore are able to avoid collisions. A Lyapunov analysis is performed to provide performance bounds and the efficacy of the proposed method is demonstrated through both simulated and experimental results.
避免碰撞是许多自动驾驶汽车应用的一个基本方面。当涉及的代理无法通信时,解决这个问题变得特别具有挑战性。在这些情况下,车载传感器对于探测和避开其他车辆或障碍物至关重要。然而,在许多实际应用中,传感器的测量范围有限,并且由于外部因素的影响,测量可能是间歇性的。考虑到这一点,在这项工作中,我们提出了一种新的基于分散视觉的避碰算法,该算法不需要智能体之间的通信,并且对车辆的感知能力有轻微的假设。一旦检测到碰撞,智能体就会重新规划它们的轨迹,沿着以碰撞点为中心的圆形路径前进。设计了一种反馈控制律,使车辆能够沿此圆保持预定的相移,从而避免碰撞。李雅普诺夫分析提供了性能界限,并通过模拟和实验结果证明了所提出方法的有效性。
{"title":"Vision-Based Collision Avoidance for Multi-Agent Systems With Intermittent Measurements","authors":"Mia Scoblic;Camilla Tabasso;Venanzio Cichella;Isaac Kaminer","doi":"10.1109/OJCSYS.2025.3598673","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3598673","url":null,"abstract":"Collision avoidance is a fundamental aspect of many applications involving autonomous vehicles. Solving this problem becomes especially challenging when the agents involved cannot communicate. In these scenarios, onboard sensors are essential for detecting and avoiding other vehicles or obstacles. However, in many practical applications, sensors have limited range and measurements may be intermittent due to external factors. With this in mind, in this work, we present a novel decentralized vision-based collision avoidance algorithm which does not require communication among the agents and has mild assumptions on the sensing capabilities of the vehicles. Once a collision is detected, the agents replan their trajectories to follow a circular path centered at the point of collision. A feedback control law is designed so that the vehicles can maintain a predefined phase shift along this circle and therefore are able to avoid collisions. A Lyapunov analysis is performed to provide performance bounds and the efficacy of the proposed method is demonstrated through both simulated and experimental results.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"349-359"},"PeriodicalIF":0.0,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11123838","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalized Concentration-Based Performance Guarantees on Sensor Selection for State Estimation 基于广义浓度的状态估计传感器选择性能保证
Pub Date : 2025-08-13 DOI: 10.1109/OJCSYS.2025.3598626
Christopher I. Calle;Shaunak D. Bopardikar
In this work, we apply concentration-based results to the problem of sensor selection for state estimation to provide us with meaningful guarantees on the properties of our selection. We consider a selection of sensors that is randomly chosen with replacement for a stochastic linear dynamical system, and we utilize the Kalman filter to perform state estimation. Our main contributions are four-fold. First, we derive novel matrix concentration inequalities (CIs) for a sum of positive semi-definite random matrices. Second, we provide two algorithms for specifying the parameters required to apply our matrix CIs, a novel statistical tool. Third, we propose two avenues for improving the sample complexity of this statistical tool. Fourth, we provide a procedure for optimizing the semi-definite bounds of our matrix CIs. When our matrix CIs are applied to the problem of sensor selection for state estimation, our final contribution is a procedure for optimizing the filtered state estimation error covariance matrix of the Kalman filter. Finally, we show through simulations that our bounds significantly outperform those of an existing matrix CI and are applicable for a larger parameter regime. Also, we demonstrate the applicability of our matrix CIs for the state estimation of nonlinear dynamical systems.
在这项工作中,我们将基于浓度的结果应用于状态估计的传感器选择问题,为我们提供了对我们选择的属性的有意义的保证。我们考虑用随机线性动力系统替换随机选择的传感器的选择,并利用卡尔曼滤波器进行状态估计。我们的主要贡献有四方面。首先,我们导出了一类正半定随机矩阵和的矩阵浓度不等式。其次,我们提供了两种算法来指定应用我们的矩阵CIs(一种新的统计工具)所需的参数。第三,我们提出了两种方法来提高该统计工具的样本复杂度。第四,我们提供了一个优化矩阵ci的半定界的过程。当我们的矩阵ci应用于状态估计的传感器选择问题时,我们的最终贡献是优化卡尔曼滤波器的滤波状态估计误差协方差矩阵的过程。最后,我们通过模拟表明,我们的边界明显优于现有矩阵CI的边界,并且适用于更大的参数范围。同时,我们也证明了矩阵ci对非线性动力系统状态估计的适用性。
{"title":"Generalized Concentration-Based Performance Guarantees on Sensor Selection for State Estimation","authors":"Christopher I. Calle;Shaunak D. Bopardikar","doi":"10.1109/OJCSYS.2025.3598626","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3598626","url":null,"abstract":"In this work, we apply concentration-based results to the problem of sensor selection for state estimation to provide us with meaningful guarantees on the properties of our selection. We consider a selection of sensors that is randomly chosen with replacement for a stochastic linear dynamical system, and we utilize the Kalman filter to perform state estimation. Our main contributions are four-fold. First, we derive novel matrix concentration inequalities (CIs) for a sum of positive semi-definite random matrices. Second, we provide two algorithms for specifying the parameters required to apply our matrix CIs, a novel statistical tool. Third, we propose two avenues for improving the sample complexity of this statistical tool. Fourth, we provide a procedure for optimizing the semi-definite bounds of our matrix CIs. When our matrix CIs are applied to the problem of sensor selection for state estimation, our final contribution is a procedure for optimizing the filtered state estimation error covariance matrix of the Kalman filter. Finally, we show through simulations that our bounds significantly outperform those of an existing matrix CI and are applicable for a larger parameter regime. Also, we demonstrate the applicability of our matrix CIs for the state estimation of nonlinear dynamical systems.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"250-270"},"PeriodicalIF":0.0,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11123730","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144990018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Controllability and Observability of Heterogeneous Networked Systems With Non-Uniform Node Dimensions and Distinct Inner-Coupling Matrices 具有非均匀节点维数和不同内耦合矩阵的异构网络系统的可控性和可观察性
Pub Date : 2025-07-10 DOI: 10.1109/OJCSYS.2025.3587537
Aleena Thomas;Abhijith Ajayakumar;Raju K. George
In this paper, controllability and observability of a heterogeneous networked system with Linear Time Invariant (LTI) nodal systems having Multiple-Inputs and Multiple-Outputs (MIMO) aligned in a weighted and directed network topology are studied. Apart from the heterogenity in nodal dynamics, the inner-coupling matrices that quantify the interactions among nodes are also different. In contrast to the existing literature, the system under consideration has distinct node dimensions, which adds to the generality. Necessary and sufficient conditions for controllability and observability as well as certain necessary conditions for controllability of a class of networked systems are established. These conditions show the dependence of network controllability and observability on various node and network-specific factors. As a practical application, a three-sector economy is modelled as a heterogeneous networked system with distinct node dimensions and its controllability is analysed. Computational time in floating point operations (flops) of the proposed methods are estimated, which indicates their efficiency on comparison with the classical conditions. This is illustrated by computational comparison of the existing and proposed schemes, applied to a randomly generated networked system. Also, robustness of the proposed schemes are analysed with the example of randomly generated networked systems. All the results are supported with illustrative numerical examples.
本文研究了具有多输入多输出(MIMO)的线性时不变(LTI)节点系统在加权有向网络拓扑结构中的可控性和可观察性问题。除了节点动力学的异质性外,量化节点间相互作用的内耦合矩阵也不同。与现有文献相比,所考虑的系统具有不同的节点维度,这增加了通用性。建立了一类网络系统的可控性和可观测性的充分必要条件以及可控性的若干必要条件。这些条件表明网络的可控性和可观测性依赖于各种节点和网络特定因素。作为实际应用,本文将三部门经济建模为具有不同节点维度的异构网络系统,并分析了其可控性。对所提方法的浮点运算计算时间进行了估计,并与经典条件进行了比较。这是通过计算比较现有的和提出的方案,应用于一个随机生成的网络系统。并以随机生成的网络系统为例,对所提方案的鲁棒性进行了分析。所有的结果都得到了数值实例的支持。
{"title":"Controllability and Observability of Heterogeneous Networked Systems With Non-Uniform Node Dimensions and Distinct Inner-Coupling Matrices","authors":"Aleena Thomas;Abhijith Ajayakumar;Raju K. George","doi":"10.1109/OJCSYS.2025.3587537","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3587537","url":null,"abstract":"In this paper, controllability and observability of a heterogeneous networked system with Linear Time Invariant (LTI) nodal systems having Multiple-Inputs and Multiple-Outputs (MIMO) aligned in a weighted and directed network topology are studied. Apart from the heterogenity in nodal dynamics, the inner-coupling matrices that quantify the interactions among nodes are also different. In contrast to the existing literature, the system under consideration has distinct node dimensions, which adds to the generality. Necessary and sufficient conditions for controllability and observability as well as certain necessary conditions for controllability of a class of networked systems are established. These conditions show the dependence of network controllability and observability on various node and network-specific factors. As a practical application, a three-sector economy is modelled as a heterogeneous networked system with distinct node dimensions and its controllability is analysed. Computational time in floating point operations (flops) of the proposed methods are estimated, which indicates their efficiency on comparison with the classical conditions. This is illustrated by computational comparison of the existing and proposed schemes, applied to a randomly generated networked system. Also, robustness of the proposed schemes are analysed with the example of randomly generated networked systems. All the results are supported with illustrative numerical examples.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"219-235"},"PeriodicalIF":0.0,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11075535","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144831949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-Series Out-of-Distribution Data Detection in Mechanical Ventilation 机械通气时间序列非分布数据检测
Pub Date : 2025-07-02 DOI: 10.1109/OJCSYS.2025.3585427
L. van de Kamp;B. Hunnekens;T. Oomen;N. van de Wouw
Safe deployment of neural networks to classify time series in safety-critical applications relies on the ability of the classifier to detect data that does not originate from the same distribution as the training data. The aim of this paper is to propose a framework for detecting whether time-series data is sampled from a different distribution than the training data, known as the problem of out-of-distribution (OOD) detection. We propose a novel distance-based OOD method for time-series data using a hierarchical clustering method together with dynamic time-warping to measure the difference between a new data instance and the training set. The method is evaluated in the context of mechanical ventilation, a safety critical application, using both simulated and clinical datasets. Results of the mechanical ventilation use case demonstrate that the proposed approach effectively detects out-of-distribution data and improves classification performance in diverse settings.
在安全关键应用中,安全部署神经网络对时间序列进行分类依赖于分类器检测与训练数据不同分布的数据的能力。本文的目的是提出一个框架,用于检测时间序列数据是否从不同于训练数据的分布中采样,称为out- distribution (OOD)检测问题。我们提出了一种新的基于距离的时间序列数据OOD方法,使用层次聚类方法和动态时间规整来度量新数据实例与训练集之间的差异。该方法在机械通气这一安全关键应用的背景下进行评估,使用模拟和临床数据集。机械通气用例的结果表明,该方法可以有效地检测出分布外数据,并提高了不同设置下的分类性能。
{"title":"Time-Series Out-of-Distribution Data Detection in Mechanical Ventilation","authors":"L. van de Kamp;B. Hunnekens;T. Oomen;N. van de Wouw","doi":"10.1109/OJCSYS.2025.3585427","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3585427","url":null,"abstract":"Safe deployment of neural networks to classify time series in safety-critical applications relies on the ability of the classifier to detect data that does not originate from the same distribution as the training data. The aim of this paper is to propose a framework for detecting whether time-series data is sampled from a different distribution than the training data, known as the problem of <italic>out-of-distribution</i> (OOD) detection. We propose a novel distance-based OOD method for time-series data using a hierarchical clustering method together with dynamic time-warping to measure the difference between a new data instance and the training set. The method is evaluated in the context of mechanical ventilation, a safety critical application, using both simulated and clinical datasets. Results of the mechanical ventilation use case demonstrate that the proposed approach effectively detects out-of-distribution data and improves classification performance in diverse settings.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"236-249"},"PeriodicalIF":0.0,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11066264","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144914214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimistic Algorithms for Safe Linear Bandits Under General Constraints 一般约束下安全线性强盗的乐观算法
Pub Date : 2025-04-07 DOI: 10.1109/OJCSYS.2025.3558118
Spencer Hutchinson;Arghavan Zibaie;Ramtin Pedarsani;Mahnoosh Alizadeh
The stochastic linear bandit problem has emerged as a fundamental building-block in machine learning and control, and a realistic model for many applications. By equipping this classical problem with safety constraints, the safe linear bandit problem further broadens its relevance to safety-critical applications. However, most existing algorithms for safe linear bandits only consider linear constraints, making them inadequate for many real-world applications, which often have non-linear constraints. To alleviate this limitation, we study the problem of safe linear bandits under general (non-linear) constraints. Under a novel constraint regularity condition that is weaker than convexity, we give two algorithms with $tilde{mathcal {O}}(d sqrt{T})$ regret. We then give efficient implementations of these algorithms for several specific settings. Lastly, we give simulation results demonstrating the effectiveness of our algorithms in choosing dynamic pricing signals for a demand response problem under distribution power flow constraints.
随机线性强盗问题已经成为机器学习和控制的基本组成部分,也是许多应用的现实模型。通过为这一经典问题配备安全约束,安全线性强盗问题进一步扩大了其与安全关键应用的相关性。然而,大多数现有的安全线性强盗算法只考虑线性约束,使得它们不适合许多具有非线性约束的实际应用。为了减轻这种限制,我们研究了一般(非线性)约束下的安全线性强盗问题。在一种比凸性更弱的约束规则条件下,给出了两种具有$tilde{mathcal {O}}(d sqrt{T})$遗憾的算法。然后,我们给出了这些算法在几个特定设置下的有效实现。最后,给出了仿真结果,证明了算法在配电网潮流约束下的需求响应问题中选择动态定价信号的有效性。
{"title":"Optimistic Algorithms for Safe Linear Bandits Under General Constraints","authors":"Spencer Hutchinson;Arghavan Zibaie;Ramtin Pedarsani;Mahnoosh Alizadeh","doi":"10.1109/OJCSYS.2025.3558118","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3558118","url":null,"abstract":"The stochastic linear bandit problem has emerged as a fundamental building-block in machine learning and control, and a realistic model for many applications. By equipping this classical problem with safety constraints, the <italic>safe linear bandit problem</i> further broadens its relevance to safety-critical applications. However, most existing algorithms for safe linear bandits only consider <italic>linear constraints</i>, making them inadequate for many real-world applications, which often have non-linear constraints. To alleviate this limitation, we study the problem of safe linear bandits under general (non-linear) constraints. Under a novel constraint regularity condition that is weaker than convexity, we give two algorithms with <inline-formula><tex-math>$tilde{mathcal {O}}(d sqrt{T})$</tex-math></inline-formula> regret. We then give efficient implementations of these algorithms for several specific settings. Lastly, we give simulation results demonstrating the effectiveness of our algorithms in choosing dynamic pricing signals for a demand response problem under distribution power flow constraints.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"103-116"},"PeriodicalIF":0.0,"publicationDate":"2025-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10950393","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143896165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable Distributed Reproduction Numbers of Network Epidemics With Differential Privacy 具有差分隐私的网络流行病的可伸缩分布再现数
Pub Date : 2025-03-30 DOI: 10.1109/OJCSYS.2025.3575305
Bo Chen;Baike She;Calvin Hawkins;Philip E. Paré;Matthew T. Hale
Reproduction numbers are widely used to analyze epidemic spreading processes over networks. However,conventional reproduction numbers of an overall network, which require spreading information from the entire network, do not indicate where an epidemic is spreading. To address this limitation, we first propose a novel class of local distributed reproduction numbers that capture spreading behaviors at the level of individual nodes. We demonstrate how to compute these values in a distributed way and use them to derive new threshold conditions for network spreading analysis. Due to the fact that epidemic data are often collected at multiple geographic or administrative scales, we then define a class of cluster distributed reproduction numbers to describe the spread between groups of nodes such as communities, cities, or states. We further show that the local distributed reproduction numbers can be aggregated to form the cluster distributed reproduction numbers. Unlike conventional network-level reproduction numbers, these distributed measures reveal fine-grained interaction patterns that may raise privacy concerns by exposing the frequency or intensity of interactions across regions. To address this issue, we propose a privacy-enhanced distributed reproduction number framework that implements differential privacy. This framework enables scalable and privacy-preserving analysis of epidemic spreading processes in networked populations through the calculation of privacy-preserving distributed reproduction numbers. Numerical experiments show that while maintaining differential privacy, the private distributed reproduction numbers yield accurate estimates of epidemic spread while also offering more insights than conventional reproduction numbers.
复制数被广泛用于分析网络上的流行病传播过程。但是,需要从整个网络传播信息的整个网络的常规复制数并不能表明流行病正在何处传播。为了解决这一限制,我们首先提出了一类新的局部分布式复制数,它在单个节点的水平上捕捉传播行为。我们演示了如何以分布式方式计算这些值,并利用它们推导出网络传播分析的新阈值条件。由于流行病数据通常是在多个地理或行政尺度上收集的,因此我们定义了一类集群分布式复制数来描述社区、城市或州等节点组之间的传播。进一步证明了局部分布再生产数可以聚合成集群分布再生产数。与传统的网络级复制数不同,这些分布式度量揭示了细粒度交互模式,这些模式可能通过暴露跨区域交互的频率或强度而引起隐私问题。为了解决这个问题,我们提出了一个隐私增强的分布式复制数框架,实现了差分隐私。该框架通过计算保护隐私的分布式复制数,实现了对网络人群中流行病传播过程的可扩展和隐私保护分析。数值实验表明,在保持差异私密性的同时,私有的分布式繁殖数可以准确估计流行病的传播,同时也比传统的繁殖数提供更多的见解。
{"title":"Scalable Distributed Reproduction Numbers of Network Epidemics With Differential Privacy","authors":"Bo Chen;Baike She;Calvin Hawkins;Philip E. Paré;Matthew T. Hale","doi":"10.1109/OJCSYS.2025.3575305","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3575305","url":null,"abstract":"Reproduction numbers are widely used to analyze epidemic spreading processes over networks. However,conventional reproduction numbers of an overall network, which require spreading information from the entire network, do not indicate where an epidemic is spreading. To address this limitation, we first propose a novel class of local distributed reproduction numbers that capture spreading behaviors at the level of individual nodes. We demonstrate how to compute these values in a distributed way and use them to derive new threshold conditions for network spreading analysis. Due to the fact that epidemic data are often collected at multiple geographic or administrative scales, we then define a class of cluster distributed reproduction numbers to describe the spread between groups of nodes such as communities, cities, or states. We further show that the local distributed reproduction numbers can be aggregated to form the cluster distributed reproduction numbers. Unlike conventional network-level reproduction numbers, these distributed measures reveal fine-grained interaction patterns that may raise privacy concerns by exposing the frequency or intensity of interactions across regions. To address this issue, we propose a privacy-enhanced distributed reproduction number framework that implements differential privacy. This framework enables scalable and privacy-preserving analysis of epidemic spreading processes in networked populations through the calculation of privacy-preserving distributed reproduction numbers. Numerical experiments show that while maintaining differential privacy, the private distributed reproduction numbers yield accurate estimates of epidemic spread while also offering more insights than conventional reproduction numbers.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"199-218"},"PeriodicalIF":0.0,"publicationDate":"2025-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11018355","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144524373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE open journal of control systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1