首页 > 最新文献

IEEE open journal of control systems最新文献

英文 中文
On the Equivalence of Sensory and Incremental Nonlinear Dynamic Inversion 感官与增量非线性动态反演的等价性
Pub Date : 2026-03-02 DOI: 10.1109/OJCSYS.2026.3669038
S. Hafner;T. De Ponti;E. Smeur
In the aerospace control domain, Nonlinear Dynamic Inversion (NDI)-based control laws are widely spread. As a variation to Incremental Nonlinear Dynamic Inversion (INDI), the sensory Nonlinear Dynamic Inversion (sNDI) method was recently developed. Both methods rely on replacing model knowledge with sensor measurements. However, the methods differ in how the pseudo-controls are allocated: INDI allocates them incrementally, while sNDI allocates them globally, with corresponding advantages and disadvantages. While INDI requires a restoring mechanism in the control allocation due to path dependency issues in overactuated nonlinear systems, sNDI does not experience this problem. In addition to the comparison, the paper demonstrates that both methods lead to identical results if restoring is applied in the control allocation of INDI. Even though sNDI and INDI with restoring can lead to limit cycles for theoretical non-linear overactuated systems, the practical applicability of this approach to transition electrical vertical take-off and landing vehicles (eVTOL) is demonstrated in flight tests of the Variable Skew Quad Plane.
在航空航天控制领域,基于非线性动态反演(NDI)的控制律得到了广泛应用。感觉非线性动态反演(sNDI)方法是增量非线性动态反演(INDI)方法的一种发展。这两种方法都依赖于用传感器测量代替模型知识。然而,这些方法在分配伪控件的方式上有所不同:INDI增量地分配它们,而sNDI全局地分配它们,并具有相应的优点和缺点。由于过度驱动非线性系统中的路径依赖问题,INDI需要在控制分配中使用恢复机制,而sNDI没有遇到这个问题。此外,本文还表明,如果在INDI的控制分配中应用恢复,两种方法的结果是相同的。尽管sNDI和带恢复的INDI可能会导致理论上非线性过驱动系统的极限循环,但该方法在过渡电动垂直起降飞行器(eVTOL)上的实际适用性已在变斜度四面飞机的飞行试验中得到证明。
{"title":"On the Equivalence of Sensory and Incremental Nonlinear Dynamic Inversion","authors":"S. Hafner;T. De Ponti;E. Smeur","doi":"10.1109/OJCSYS.2026.3669038","DOIUrl":"https://doi.org/10.1109/OJCSYS.2026.3669038","url":null,"abstract":"In the aerospace control domain, Nonlinear Dynamic Inversion (NDI)-based control laws are widely spread. As a variation to Incremental Nonlinear Dynamic Inversion (INDI), the sensory Nonlinear Dynamic Inversion (sNDI) method was recently developed. Both methods rely on replacing model knowledge with sensor measurements. However, the methods differ in how the pseudo-controls are allocated: INDI allocates them incrementally, while sNDI allocates them globally, with corresponding advantages and disadvantages. While INDI requires a restoring mechanism in the control allocation due to path dependency issues in overactuated nonlinear systems, sNDI does not experience this problem. In addition to the comparison, the paper demonstrates that both methods lead to identical results if restoring is applied in the control allocation of INDI. Even though sNDI and INDI with restoring can lead to limit cycles for theoretical non-linear overactuated systems, the practical applicability of this approach to transition electrical vertical take-off and landing vehicles (eVTOL) is demonstrated in flight tests of the Variable Skew Quad Plane.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"5 ","pages":"136-148"},"PeriodicalIF":0.0,"publicationDate":"2026-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11417726","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147440621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lyapunov-Based Nonlinear Model Predictive Control of Input-Delayed Functional Electrical Stimulation: Investigative Simulations and Experiments 输入延迟功能电刺激的lyapunov非线性模型预测控制:研究仿真与实验
Pub Date : 2026-02-20 DOI: 10.1109/OJCSYS.2026.3666636
Krysten Lambeth;Ziyue Sun;Ashwin Iyer;Vidisha Ganesh;Nitin Sharma
Existing closed-loop controllers for functional electrical stimulation are prone to exceeding subject-specific stimulation limits, thereby limiting performance and also accelerating stimulation-induced muscle fatigue. In view of these challenges, this paper develops a Lyapunov-based model predictive control method to control knee flexion and extension during input-delayed stimulation. The method incorporates a contractive constraint under an electromechanical delay (EMD) compensation control law that achieves system stability despite an unknown constant input delay, bounded control constraints, and imperfectly estimated model parameters. A Lyapunov stability analysis proves that the Lyapunov constraint renders the closed-loop error ultimately bounded, and gain conditions are provided to guarantee recursive feasibility. LMPC's performance is explored in simulation and experiments and compared against an analytical proportional derivative-dynamic surface controller (PD-DSC) and a proportional-derivative-delay compensation (PD-DC) controller. In simulation, LMPC improved tracking root-mean-square error by 75.57% and 71.71%, on average, compared to PD-DSC and PD-DC, respectively. We observed that incorporating a slackening term often improved LMPC's tracking performance, although strict enforcement of the Lyapunov constraint was superior when there was greater EMD estimation error. Additionally, unlike PD-DSC and PD-DC, LMPC was not destabilized when EMD was overestimated or underestimated, nor did it violate input constraints. In knee extension experiments, LMPC respected input constraints, which PD-DSC did not. The LMPC was also validated in overground walking experiments to test its ability to produce both knee flexion and extension in participants with and without spinal cord injury.
现有的用于功能性电刺激的闭环控制器容易超过受试者特定的刺激极限,从而限制了性能,也加速了刺激引起的肌肉疲劳。针对这些挑战,本文开发了一种基于lyapunov模型的预测控制方法,用于控制输入延迟刺激下的膝关节屈伸。该方法在机电延迟(EMD)补偿控制律下引入了一个收缩约束,在未知的恒定输入延迟、有界控制约束和模型参数不完全估计的情况下仍能实现系统稳定。Lyapunov稳定性分析证明了Lyapunov约束使闭环误差最终有界,并给出了保证递归可行性的增益条件。在仿真和实验中探讨了LMPC的性能,并与解析型比例导数-动态表面控制器(PD-DSC)和比例导数-延迟补偿控制器(PD-DC)进行了比较。在仿真中,与PD-DSC和PD-DC相比,LMPC将跟踪均方根误差平均提高了75.57%和71.71%。我们观察到,加入松弛项通常会提高LMPC的跟踪性能,尽管当EMD估计误差较大时,严格执行Lyapunov约束更为优越。此外,与PD-DSC和PD-DC不同,当EMD被高估或低估时,LMPC不会不稳定,也不会违反输入约束。在膝关节伸展实验中,LMPC尊重输入约束,而PD-DSC没有。LMPC也在地面行走实验中得到验证,以测试其在有或没有脊髓损伤的参与者中产生膝关节屈曲和伸展的能力。
{"title":"Lyapunov-Based Nonlinear Model Predictive Control of Input-Delayed Functional Electrical Stimulation: Investigative Simulations and Experiments","authors":"Krysten Lambeth;Ziyue Sun;Ashwin Iyer;Vidisha Ganesh;Nitin Sharma","doi":"10.1109/OJCSYS.2026.3666636","DOIUrl":"https://doi.org/10.1109/OJCSYS.2026.3666636","url":null,"abstract":"Existing closed-loop controllers for functional electrical stimulation are prone to exceeding subject-specific stimulation limits, thereby limiting performance and also accelerating stimulation-induced muscle fatigue. In view of these challenges, this paper develops a Lyapunov-based model predictive control method to control knee flexion and extension during input-delayed stimulation. The method incorporates a contractive constraint under an electromechanical delay (EMD) compensation control law that achieves system stability despite an unknown constant input delay, bounded control constraints, and imperfectly estimated model parameters. A Lyapunov stability analysis proves that the Lyapunov constraint renders the closed-loop error ultimately bounded, and gain conditions are provided to guarantee recursive feasibility. LMPC's performance is explored in simulation and experiments and compared against an analytical proportional derivative-dynamic surface controller (PD-DSC) and a proportional-derivative-delay compensation (PD-DC) controller. In simulation, LMPC improved tracking root-mean-square error by 75.57% and 71.71%, on average, compared to PD-DSC and PD-DC, respectively. We observed that incorporating a slackening term often improved LMPC's tracking performance, although strict enforcement of the Lyapunov constraint was superior when there was greater EMD estimation error. Additionally, unlike PD-DSC and PD-DC, LMPC was not destabilized when EMD was overestimated or underestimated, nor did it violate input constraints. In knee extension experiments, LMPC respected input constraints, which PD-DSC did not. The LMPC was also validated in overground walking experiments to test its ability to produce both knee flexion and extension in participants with and without spinal cord injury.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"5 ","pages":"121-135"},"PeriodicalIF":0.0,"publicationDate":"2026-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11404180","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147362500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Open Journal of Control Systems Publication Information IEEE控制系统公开杂志出版信息
Pub Date : 2026-02-13 DOI: 10.1109/OJCSYS.2026.3662069
{"title":"IEEE Open Journal of Control Systems Publication Information","authors":"","doi":"10.1109/OJCSYS.2026.3662069","DOIUrl":"https://doi.org/10.1109/OJCSYS.2026.3662069","url":null,"abstract":"","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"5 ","pages":"C2-C2"},"PeriodicalIF":0.0,"publicationDate":"2026-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11396397","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Control Systems Society Information IEEE控制系统学会信息
Pub Date : 2026-02-13 DOI: 10.1109/OJCSYS.2026.3662073
{"title":"IEEE Control Systems Society Information","authors":"","doi":"10.1109/OJCSYS.2026.3662073","DOIUrl":"https://doi.org/10.1109/OJCSYS.2026.3662073","url":null,"abstract":"","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"5 ","pages":"C3-C3"},"PeriodicalIF":0.0,"publicationDate":"2026-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11396398","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Geometry-Aware Edge-State Tracking for Robust Affine Formation Control 鲁棒仿射编队控制的几何感知边缘状态跟踪
Pub Date : 2026-01-28 DOI: 10.1109/OJCSYS.2026.3657987
Zhonggang Li;Raj Thilak Rajan
Affine formation control (AFC) is a subset of formation control methods that enables coordinated multiagent movement while preserving affine relationships, and has recently gained increasing popularity due to its utility across diverse applications. AFC is inherently distributed, where each agent's local controller relies on the relative displacements of neighboring agents. The unavailability of these measurements in practice, due to node or communication failures, leads to a change in the underlying graph topology and subsequently causes instability or sub-optimal performance. In this work, each edge in the graph is modeled using a state-space framework, allowing the corresponding edge-states to be estimated with or without up-to-date measurements. We then propose a Kalman-based estimation framework where we fuse both temporal information from agents' dynamics and spatial information, which is derived from the geometry of the affine formations. We give convergence guarantees and optimality analysis on the proposed algorithm, and numerical validations show the enhanced robustness of AFC against these topology changes in several practical scenarios.
仿射编队控制(AFC)是编队控制方法的一个子集,它可以在保持仿射关系的同时实现多智能体的协调运动,最近由于其在各种应用中的实用性而越来越受欢迎。AFC本质上是分布式的,每个agent的局部控制器依赖于相邻agent的相对位移。在实践中,由于节点或通信故障,这些测量的不可用性会导致底层图拓扑的变化,并随后导致不稳定或次优性能。在这项工作中,图中的每条边都使用状态空间框架建模,允许在有或没有最新测量的情况下估计相应的边缘状态。然后,我们提出了一个基于卡尔曼的估计框架,在该框架中,我们融合了来自智能体动态的时间信息和来自仿射结构几何的空间信息。我们给出了算法的收敛性保证和最优性分析,并在几个实际场景中进行了数值验证,证明了AFC对这些拓扑变化的鲁棒性增强。
{"title":"Geometry-Aware Edge-State Tracking for Robust Affine Formation Control","authors":"Zhonggang Li;Raj Thilak Rajan","doi":"10.1109/OJCSYS.2026.3657987","DOIUrl":"https://doi.org/10.1109/OJCSYS.2026.3657987","url":null,"abstract":"Affine formation control (AFC) is a subset of formation control methods that enables coordinated multiagent movement while preserving affine relationships, and has recently gained increasing popularity due to its utility across diverse applications. AFC is inherently distributed, where each agent's local controller relies on the relative displacements of neighboring agents. The unavailability of these measurements in practice, due to node or communication failures, leads to a change in the underlying graph topology and subsequently causes instability or sub-optimal performance. In this work, each edge in the graph is modeled using a state-space framework, allowing the corresponding edge-states to be estimated with or without up-to-date measurements. We then propose a Kalman-based estimation framework where we fuse both temporal information from agents' dynamics and spatial information, which is derived from the geometry of the affine formations. We give convergence guarantees and optimality analysis on the proposed algorithm, and numerical validations show the enhanced robustness of AFC against these topology changes in several practical scenarios.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"5 ","pages":"107-120"},"PeriodicalIF":0.0,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11364073","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk-Tunable Safe Adaptive Control for Nonlinear Systems Under Dynamical Uncertainties 动态不确定非线性系统的风险可调安全自适应控制
Pub Date : 2026-01-26 DOI: 10.1109/OJCSYS.2026.3657701
Vipul K. Sharma;S. Sivaranjani
We address the problem of safe adaptive control for a class of nonlinear systems with dynamical uncertainties, while satisfying control barrier function (CBF) type safety constraints with user-defined risk tolerances at all times. We develop a model reference adaptive control framework that provably guarantees safety in two stages. In the first stage, we design a safe reference model to generate reference trajectories that satisfy CBF-based safety conditions. However, asymptotically tracking a safe reference trajectory does not automatically guarantee safety at every time step. Therefore, in the second stage, we formulate a chance-constrained optimization problem for the nonlinear system with dynamical uncertainties to track the reference model, while provably guaranteeing CBF-based safety constraint satisfaction at each time step up to a user-defined risk bound. We then provide a risk-tunable sampling-based scenario design approach to tune parameterized controllers that solve this optimization problem. In addition, for the special case of linear dynamics, we provide conditions on the uncertainty samples for the existence of controller parameters that can guarantee safe tracking. We illustrate the performance of our framework on a quadcopter navigation problem with obstacle avoidance constraints.
研究了一类具有动态不确定性的非线性系统的安全自适应控制问题,同时在任何时候都满足控制屏障函数(CBF)类型的安全约束,并具有用户自定义的风险容忍度。我们开发了一个模型参考自适应控制框架,可证明在两个阶段保证安全。在第一阶段,我们设计了一个安全参考模型来生成满足基于cbf的安全条件的参考轨迹。然而,渐近跟踪安全参考轨迹并不能自动保证每个时间步长的安全性。因此,在第二阶段,我们针对具有动态不确定性的非线性系统,提出了一个机会约束优化问题来跟踪参考模型,同时可证明地保证基于cbf的安全约束在每个时间步长都满足用户自定义的风险界。然后,我们提供了一种基于风险可调采样的场景设计方法来调整解决此优化问题的参数化控制器。此外,对于线性动力学的特殊情况,给出了控制器参数存在的不确定性样本的条件,保证了系统的安全跟踪。我们举例说明了我们的框架在具有避障约束的四轴飞行器导航问题上的性能。
{"title":"Risk-Tunable Safe Adaptive Control for Nonlinear Systems Under Dynamical Uncertainties","authors":"Vipul K. Sharma;S. Sivaranjani","doi":"10.1109/OJCSYS.2026.3657701","DOIUrl":"https://doi.org/10.1109/OJCSYS.2026.3657701","url":null,"abstract":"We address the problem of safe adaptive control for a class of nonlinear systems with dynamical uncertainties, while satisfying control barrier function (CBF) type safety constraints with user-defined risk tolerances at all times. We develop a model reference adaptive control framework that provably guarantees safety in two stages. In the first stage, we design a safe reference model to generate reference trajectories that satisfy CBF-based safety conditions. However, asymptotically tracking a safe reference trajectory does not automatically guarantee safety at every time step. Therefore, in the second stage, we formulate a chance-constrained optimization problem for the nonlinear system with dynamical uncertainties to track the reference model, while provably guaranteeing CBF-based safety constraint satisfaction at each time step up to a user-defined risk bound. We then provide a risk-tunable sampling-based scenario design approach to tune parameterized controllers that solve this optimization problem. In addition, for the special case of linear dynamics, we provide conditions on the uncertainty samples for the existence of controller parameters that can guarantee safe tracking. We illustrate the performance of our framework on a quadcopter navigation problem with obstacle avoidance constraints.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"5 ","pages":"91-106"},"PeriodicalIF":0.0,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11363681","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
2025 Index IEEE Open Journal of Control Systems 控制系统开放杂志
Pub Date : 2026-01-14 DOI: 10.1109/OJCSYS.2026.3654320
{"title":"2025 Index IEEE Open Journal of Control Systems","authors":"","doi":"10.1109/OJCSYS.2026.3654320","DOIUrl":"https://doi.org/10.1109/OJCSYS.2026.3654320","url":null,"abstract":"","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"629-639"},"PeriodicalIF":0.0,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11353212","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Welfare and Cost Aggregation for Multi-Agent Control: When to Choose Which Social Cost Function, and Why? 多主体控制的福利和成本聚合:何时选择哪种社会成本函数,为什么?
Pub Date : 2026-01-12 DOI: 10.1109/OJCSYS.2026.3653649
Ilia Shilov;Ezzat Elokda;Sophie Hall;Heinrich H. Nax;Saverio Bolognani
Many multi-agent socio-technical systems rely on aggregating heterogeneous agents’ costs into a social cost function (SCF) to coordinate resource allocation in domains such as energy grids, water allocation, or traffic management. The choice of SCF often entails implicit assumptions and may lead to undesirable outcomes if not rigorously justified. In this paper, we demonstrate that what determines which SCF ought to be used is the degree to which individual costs can be compared across agents and which axioms the aggregation shall fulfill. Drawing on the results from social choice theory, we provide guidance on how this process can be used in control applications. We demonstrate which assumptions about interpersonal utility comparability - ranging from ordinal level comparability to full cardinal comparability - together with a choice of desirable axioms, inform the selection of a correct SCF, be it the classical utilitarian sum, the Nash SCF, or maximin. Thus, fixing comparability level first, then choosing an objective from the compatible class, and reporting both as part of the specification, makes the fairness and efficiency consequences transparent. We demonstrate how the proposed framework can be applied for principled allocations of water, transportation, and energy resources.
许多多智能体社会技术系统依赖于将异质智能体的成本聚合成社会成本函数(SCF)来协调能源网、水分配或交通管理等领域的资源分配。SCF的选择通常需要隐含的假设,如果没有严格的证明,可能会导致不希望的结果。在本文中,我们证明了决定应该使用哪种SCF的是个体成本可以跨代理进行比较的程度,以及聚合应该满足哪些公理。根据社会选择理论的结果,我们提供了如何在控制应用中使用这一过程的指导。我们展示了哪些关于人际效用可比性的假设——从序数水平可比性到完全基数可比性——以及理想公理的选择,为正确的SCF的选择提供了信息,无论是经典的功利和、纳什SCF还是最大值。因此,首先确定可比性级别,然后从兼容类中选择一个目标,并将两者作为规范的一部分进行报告,从而使公平性和效率结果透明。我们演示了拟议的框架如何应用于水、交通和能源资源的原则分配。
{"title":"Welfare and Cost Aggregation for Multi-Agent Control: When to Choose Which Social Cost Function, and Why?","authors":"Ilia Shilov;Ezzat Elokda;Sophie Hall;Heinrich H. Nax;Saverio Bolognani","doi":"10.1109/OJCSYS.2026.3653649","DOIUrl":"https://doi.org/10.1109/OJCSYS.2026.3653649","url":null,"abstract":"Many multi-agent socio-technical systems rely on aggregating heterogeneous agents’ costs into a social cost function (SCF) to coordinate resource allocation in domains such as energy grids, water allocation, or traffic management. The choice of SCF often entails implicit assumptions and may lead to undesirable outcomes if not rigorously justified. In this paper, we demonstrate that what determines which SCF ought to be used is the degree to which individual costs can be compared across agents and which axioms the aggregation shall fulfill. Drawing on the results from social choice theory, we provide guidance on how this process can be used in control applications. We demonstrate which assumptions about interpersonal utility comparability - ranging from ordinal level comparability to full cardinal comparability - together with a choice of desirable axioms, inform the selection of a correct SCF, be it the classical utilitarian sum, the Nash SCF, or maximin. Thus, fixing comparability level first, then choosing an objective from the compatible class, and reporting both as part of the specification, makes the fairness and efficiency consequences transparent. We demonstrate how the proposed framework can be applied for principled allocations of water, transportation, and energy resources.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"5 ","pages":"80-90"},"PeriodicalIF":0.0,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11347468","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Policy Optimization in Multi-Agent Settings Under Partially Observable Environments 部分可观察环境下的多智能体策略优化
Pub Date : 2026-01-12 DOI: 10.1109/OJCSYS.2026.3651197
Ainur Zhaikhan;Malek Khammassi;Ali H. Sayed
This work leverages adaptive social learning to estimate partially observable global states in multi-agent reinforcement learning (MARL) problems. Unlike existing methods, the proposed approach enables the concurrent operation of social learning and reinforcement learning. Specifically, it alternates between a single step of social learning and a single step of MARL, eliminating the need for the time- and computation-intensive two-timescale learning frameworks. Theoretical guarantees are provided to support the effectiveness of the proposed method. Simulation results verify that the performance of the proposed methodology can approach that of reinforcement learning when the true state is known.
这项工作利用自适应社会学习来估计多智能体强化学习(MARL)问题中部分可观察的全局状态。与现有方法不同,该方法能够实现社会学习和强化学习的并行操作。具体来说,它在单步社会学习和单步MARL之间交替,消除了对时间和计算密集型双时间尺度学习框架的需求。为该方法的有效性提供了理论保证。仿真结果表明,当真实状态已知时,所提方法的性能接近强化学习。
{"title":"Policy Optimization in Multi-Agent Settings Under Partially Observable Environments","authors":"Ainur Zhaikhan;Malek Khammassi;Ali H. Sayed","doi":"10.1109/OJCSYS.2026.3651197","DOIUrl":"https://doi.org/10.1109/OJCSYS.2026.3651197","url":null,"abstract":"This work leverages adaptive social learning to estimate partially observable global states in multi-agent reinforcement learning (MARL) problems. Unlike existing methods, the proposed approach enables the concurrent operation of social learning and reinforcement learning. Specifically, it alternates between a single step of social learning and a single step of MARL, eliminating the need for the time- and computation-intensive two-timescale learning frameworks. Theoretical guarantees are provided to support the effectiveness of the proposed method. Simulation results verify that the performance of the proposed methodology can approach that of reinforcement learning when the true state is known.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"5 ","pages":"64-79"},"PeriodicalIF":0.0,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11329162","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Control Systems Society Publication Information IEEE控制系统协会出版信息
Pub Date : 2025-12-17 DOI: 10.1109/OJCSYS.2025.3628513
{"title":"IEEE Control Systems Society Publication Information","authors":"","doi":"10.1109/OJCSYS.2025.3628513","DOIUrl":"https://doi.org/10.1109/OJCSYS.2025.3628513","url":null,"abstract":"","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"4 ","pages":"C3-C3"},"PeriodicalIF":0.0,"publicationDate":"2025-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11303147","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE open journal of control systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1