IFAC Journal of Systems and Control: Latest Articles

Language-aided state estimation
IF 1.8 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date: 2026-01-30 DOI: 10.1016/j.ifacsc.2026.100372
Yuki Miyoshi, Masaki Inoue, Yusuke Fujimoto
Natural language data, such as text and speech, have become readily available through social networking services and chat platforms. By leveraging human observations expressed in natural language, this paper addresses the problem of state estimation for physical systems, in which humans act as sensing agents. To this end, we propose a Language-Aided Particle Filter (LAPF), a particle filter framework that structures human observations via natural language processing and incorporates them into the update step of the state estimation. Finally, the LAPF is applied to the water level estimation problem in an irrigation canal and its effectiveness is demonstrated.
Volume 35, Article 100372
Citations: 0
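The update step described in the LAPF abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that language processing has already reduced a human report to one of three words and that each word maps to a Gaussian likelihood over water level; the `WORD_MODEL` table, the random-walk prediction model and all numeric values are hypothetical and are not taken from the paper.

```python
import math
import random

# Hypothetical word -> (mean, std) likelihood over water level in metres.
# These values are assumptions for illustration, not the paper's model.
WORD_MODEL = {"low": (0.5, 0.3), "normal": (1.0, 0.3), "high": (1.5, 0.3)}


def gaussian_pdf(x, mean, std):
    """Density of a normal distribution at x."""
    return math.exp(-0.5 * ((x - mean) / std) ** 2) / (std * math.sqrt(2.0 * math.pi))


def predict(particles, inflow, noise_std, rng):
    """Propagate each particle through a toy random-walk level model."""
    return [p + inflow + rng.gauss(0.0, noise_std) for p in particles]


def update(particles, weights, word):
    """Reweight particles by the likelihood of the reported word."""
    mean, std = WORD_MODEL[word]
    new_w = [w * gaussian_pdf(p, mean, std) for p, w in zip(particles, weights)]
    total = sum(new_w)
    return [w / total for w in new_w]


def estimate(particles, weights):
    """Weighted mean of the particle set."""
    return sum(p * w for p, w in zip(particles, weights))


def resample(particles, weights, rng):
    """Multinomial resampling; returns new particles with uniform weights."""
    n = len(particles)
    return rng.choices(particles, weights=weights, k=n), [1.0 / n] * n
```

A report of "high" shifts the weighted estimate upward relative to a uniform prior, which is the qualitative behaviour one would expect from a language-derived measurement.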
Compatible realisation of control and identification of direct adaptive control via probing signal auto-elimination
IF 1.8 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date: 2026-01-30 DOI: 10.1016/j.ifacsc.2026.100375
Akira Takakura, Takashi Yokoyama, Takahiro Nozaki, Shuichi Adachi, Hiromitsu Ohmori
The model reference adaptive control system (MRACS) is an adaptive controller that maintains control performance even when the uncertainty of the controlled system’s parameters is high, and its design methodology is well established. In particular, the direct MRACS excels in responsiveness; however, its adjustable parameters do not necessarily converge to their true values. To make the adjustable parameters converge to their true values, a conventional method injects a probing signal to satisfy the persistent excitation (PE) property; however, this compromises the control performance. This study therefore proposes a control error-based probing signal auto-elimination scheme, which adaptively regulates the probing signal based solely on the control error, without a predefined elimination timing. This enables the identification of adjustable parameters during transient phases, while automatically suppressing the probing signal once sufficient tracking performance is achieved. Furthermore, unlike existing probing-based methods, the proposed scheme allows re-injection of the probing signal when performance degradation is detected, thereby achieving a compatible realisation of identification and control within a single framework. The proposed scheme thus contributes to identification and control simultaneously, significantly reducing the tracking error. The validity of the proposed structure was confirmed by simulations under plant variation conditions.
Volume 35, Article 100375
Citations: 0
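The idea of regulating a probing signal from the control error alone can be sketched as follows. The saturating gain law, the reference error `e_ref` and the probing frequencies are assumptions chosen for illustration; the abstract does not state the actual regulation law, so this is not the authors' scheme.

```python
import math


def probing_amplitude(error, a_max=1.0, e_ref=0.1):
    """Hypothetical gain: tends to 0 as the error vanishes, saturates at a_max."""
    e2 = error * error
    return a_max * e2 / (e2 + e_ref * e_ref)


def probing_signal(t, error, freqs=(1.0, 3.0)):
    """Sum of sinusoids scaled by the error-dependent amplitude."""
    amp = probing_amplitude(error)
    return amp * sum(math.sin(2.0 * math.pi * f * t) for f in freqs)
```

Because the amplitude depends only on the current error, the probing signal fades when tracking is good and returns automatically if the error grows again, which mirrors the re-injection behaviour the abstract describes.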
Multisine input signal design for constrained, “plant-friendly” system identification of nonlinear systems
IF 1.8 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date: 2026-01-29 DOI: 10.1016/j.ifacsc.2026.100371
Sarasij Banerjee, Eric Hekler, Daniel E. Rivera
This paper presents a methodology for optimizing “plant-friendly” multisine input signals to identify nonlinear dynamic systems under time-domain input and output constraints, without requiring a global parametric model a priori. The goal is to construct an informative dataset for open-loop, data-driven identification while selecting operational requirements. A weighted optimization framework is proposed to minimize the output crest factor resulting from a data-driven model, with penalties for violating input and output constraints. Model-on-Demand (MoD) estimation is employed to simulate outputs using prior data, effectively predicting nonlinear responses without global modeling. This MoD-based formulation enables evaluating output crest factors and output constraint compliance with modest modeling effort and improved impact. The resulting non-smooth, non-convex problem is solved using the Simultaneous Perturbation Stochastic Approximation (SPSA) algorithm, which perturbs the multisine phase vector to achieve the desired performance efficiently. This method supports the concept of identification test monitoring, as illustrated in this paper. Within the identification test loops, each optimized excitation is applied to gather new estimation data, iteratively refining MoD-based output predictions and improving constraint satisfaction. The method’s effectiveness is demonstrated through a safety-critical case study on a Susceptible-Infected-Recovered (SIR) epidemiological network, showing that the optimized excitation yields highly informative data for identification while keeping the infection spread within safe limits.
Volume 35, Article 100371
Citations: 0
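The crest factor that the multisine abstract refers to is the ratio of a signal's peak magnitude to its RMS value, and it depends strongly on the harmonic phases. The sketch below computes it for a unit-amplitude multisine and compares zero phases with Schroeder phases, a classical closed-form choice. It evaluates the input signal only; the paper's output crest factor, its MoD predictions and its SPSA phase search are not reproduced here, and the function names are illustrative.

```python
import math


def multisine(phases, n_samples=256):
    """One period of a unit-amplitude multisine; harmonic k uses phases[k-1]."""
    return [
        sum(math.cos(2.0 * math.pi * k * t / n_samples + ph)
            for k, ph in enumerate(phases, start=1))
        for t in range(n_samples)
    ]


def crest_factor(signal):
    """Peak magnitude divided by RMS value."""
    peak = max(abs(v) for v in signal)
    rms = math.sqrt(sum(v * v for v in signal) / len(signal))
    return peak / rms


def schroeder_phases(n_harmonics):
    """Schroeder phases for a flat amplitude spectrum."""
    return [-math.pi * k * (k - 1) / n_harmonics
            for k in range(1, n_harmonics + 1)]
```

With K zero-phase harmonics all cosines align at t = 0, so the crest factor equals sqrt(2K); any phase choice that avoids this alignment lowers the peak while leaving the RMS value unchanged.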
Observability analysis and state estimation of wind turbine power systems: A novel sensitivity-based approach
IF 1.8 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date: 2026-01-29 DOI: 10.1016/j.ifacsc.2026.100370
Hesham Abdelfattah, Sameh A. Eisa, Peter Stechlinski
In this paper, we provide a novel framework that enables a sensitivity-based observability test and state estimation algorithm for wind turbine power systems (WTPSs). The provided framework is the first of its kind in the literature, as it is able to deal with state-of-the-art WTPS models that are non-reduced, highly nonlinear differential–algebraic equation systems. Moreover, the framework includes nonsmoothness in both the dynamics and output functions to unify the operational conditions over different wind speed regions. We demonstrate the effectiveness of the proposed framework (thanks to the underlying tools from generalized derivatives theory) on different wind speed profiles, including real-world wind data. We also illustrate how the proposed framework, by the utilization of robust observability analysis during nonsmooth transitions, enables accurate state estimation for cases when the conventional Extended Kalman Filter approach fails.
Volume 35, Article 100370
Citations: 0
Data-driven design of dynamic quantizers applicable to nonminimum phase systems
IF 1.8 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date: 2026-01-23 DOI: 10.1016/j.ifacsc.2026.100369
Yusuke Fujimoto, Yuki Minami
This paper discusses the data-driven design of a dynamic quantizer for control systems with discrete-valued input. We consider a quantizer with a noise-shaping filter that converts the continuous-valued input into the discrete-valued input, and discuss how to optimize the filter to minimize the error between the system outputs with and without quantization. It is known that this output deterioration can be measured by the H∞ norm of a transfer function that depends on both the system and the noise-shaping filter. This paper focuses on data-driven estimation of the H∞ norm from its input–output data, and virtually constructs input–output data for the transfer function. Then the output deterioration is minimized by minimizing this H∞ norm. The effectiveness of the proposed approach is demonstrated through a numerical example.
Volume 35, Article 100369
Citations: 0
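To make the notion of a quantizer with a noise-shaping filter concrete, the sketch below compares a static uniform quantizer with a first-order error-feedback quantizer. The fixed first-order feedback is an assumption chosen for simplicity; the paper optimizes the filter from data via an H∞ criterion, which is not attempted here.

```python
import math


def quantize(v, step=1.0):
    """Static uniform quantizer: round to the nearest multiple of step."""
    return step * math.floor(v / step + 0.5)


def static_quantizer(inputs, step=1.0):
    """Quantize each sample independently."""
    return [quantize(u, step) for u in inputs]


def error_feedback_quantizer(inputs, step=1.0):
    """First-order noise shaping: subtract the previous quantization error."""
    outputs, err = [], 0.0
    for u in inputs:
        v = u - err          # compensate the error made at the previous step
        y = quantize(v, step)
        err = y - v          # quantization error, bounded by step / 2
        outputs.append(y)
    return outputs
```

For the error-feedback version the accumulated difference between quantized and original input telescopes to the latest quantization error, so it stays within half a step, whereas a static quantizer can let that difference grow without bound for a constant input.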
Adaptive optimal resource allocation for isolation interventions: Flattening the curve
IF 1.8 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date: 2026-01-23 DOI: 10.1016/j.ifacsc.2026.100367
Mohamed Arnouss, Yezekael Hayel, Karam Allali
Economic savings achieved through targeted isolation avoid additional disease burdens and effectively address the disease-economy trade-offs in epidemic control. In this study, we use phase-space analysis to derive the explicit solution of the optimal control problem that minimizes the infection peak under a budget limitation. The optimal policy obtained is an adaptive control in which the isolation rate dynamically adjusts according to the current epidemic state. We show that the targeted isolation control policy achieves the same infection peak as transmission reduction policies under equivalent budgets, while avoiding broad socio-economic disruptions. Additionally, we show through numerical simulations that the control resolves the epidemic faster and reduces total infections. This demonstrates that targeted isolation can strike a balance between public health and economic stability, offering actionable insights for public health decisions moving forward.
Volume 35, Article 100367
Citations: 0
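The effect of isolation on the infection peak can be illustrated with a basic SIR simulation in which isolation removes infected individuals at an extra rate. The constant isolation rate, the forward-Euler integration and the parameter values are assumptions for illustration; the paper's state-dependent optimal policy and budget constraint are not modelled.

```python
def simulate_peak(beta, gamma, isolation_rate, i0=0.01, dt=0.1, steps=3000):
    """Forward-Euler SIR run; returns the peak infected fraction.

    isolation_rate is a constant extra removal rate for infected
    individuals (a simplifying assumption, not the paper's policy).
    """
    s, i = 1.0 - i0, i0
    peak = i
    for _ in range(steps):
        new_inf = beta * s * i
        ds = -new_inf
        di = new_inf - (gamma + isolation_rate) * i
        s += dt * ds
        i += dt * di
        peak = max(peak, i)
    return peak
```

For example, comparing `simulate_peak(0.3, 0.1, 0.0)` with `simulate_peak(0.3, 0.1, 0.05)` shows the lower peak obtained when infected individuals leave the infectious pool faster.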
Regularized GLISp for sensor-guided human-in-the-loop optimization
IF 1.8 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date: 2026-01-22 DOI: 10.1016/j.ifacsc.2026.100368
Matteo Cercola, Michele Lomuscio, Dario Piga, Simone Formentin
Human-in-the-loop calibration is often addressed via preference-based optimization, where algorithms learn from pairwise comparisons rather than explicit cost evaluations. While effective, methods such as Preferential Bayesian Optimization or Global optimization based on active preference learning with radial basis functions (GLISp) treat the system as a black box and ignore informative sensor measurements. In this work, we introduce a sensor-guided regularized extension of GLISp that integrates measurable descriptors into the preference-learning loop through a physics-informed hypothesis function and a least-squares regularization term. This injects grey-box structure, combining subjective feedback with quantitative sensor information while preserving the flexibility of preference-based search. Numerical evaluations on an analytical benchmark and on a human-in-the-loop vehicle suspension tuning task show faster convergence and superior final solutions compared to baseline GLISp.
Volume 35, Article 100368
Citations: 0
Stability-constrained policy optimization under unknown rewards
IF 1.8 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date: 2026-01-21 DOI: 10.1016/j.ifacsc.2026.100366
Thomas Banker, Nathan P. Lawrence, Ali Mesbah
A major challenge in reinforcement learning (RL) is guaranteeing an agent’s closed-loop stability under unknown, possibly sparse, reward functions. While model-free RL is flexible to a variety of systems and rewards, model-based control strategies such as optimization-based control naturally accommodate prior system models to provide guarantees on safety and stability. However, these models may not be representative of the true global performance objective, resulting in suboptimal policies. In this paper, we present a policy search RL approach that decouples the stability requirement from the global performance objective. The key idea is to use an optimization-based policy structure as an effective stabilizing parameterization with which the agent can learn to maximize an unknown reward in a model-free fashion. Specifically, the agent employs a predictive control architecture and implicitly learns a stabilizing terminal cost, which is constructed through fixed-point iterations of the discrete algebraic Riccati equation. By implicitly differentiating this fixed-point, derivatives of the stability condition inform policy gradients. The proposed approach is shown to design high-performance, stabilizing policies for various sparse, global performance objectives. Furthermore, the proposed approach can account for uncertainty in the dynamics using the stochastic discrete algebraic Riccati equation to promote robust stability. This work demonstrates a principled policy search RL approach, integrating prior models and system observations in an agent’s design, towards safe and reliable decision-making under uncertainty.
Volume 35, Article 100366
Citations: 0
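The fixed-point iteration of the discrete algebraic Riccati equation mentioned in the abstract can be sketched for a scalar system x_{k+1} = a x_k + b u_k with stage cost q x^2 + r u^2. The scalar setting and the function names are assumptions made to keep the example short; the paper's predictive control architecture and the implicit differentiation of the fixed point are not shown.

```python
def riccati_fixed_point(a, b, q, r, tol=1e-12, max_iter=10_000):
    """Iterate p <- q + a^2 p - (a b p)^2 / (r + b^2 p) until convergence."""
    p = q
    for _ in range(max_iter):
        p_next = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
        if abs(p_next - p) < tol:
            return p_next
        p = p_next
    raise RuntimeError("Riccati iteration did not converge")


def lqr_gain(a, b, r, p):
    """State-feedback gain k for u = -k x associated with the cost p x^2."""
    return a * b * p / (r + b * b * p)
```

For a = 1.2 and b = q = r = 1 the fixed point solves p^2 - 1.44 p - 1 = 0, and the resulting closed loop a - b k has magnitude below one even though the open loop is unstable.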
On continuous-time sparse identification of nonlinear polynomial systems
IF 1.8 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date: 2026-01-15 DOI: 10.1016/j.ifacsc.2026.100365
Mazen Alamir
This paper leverages recent advances in high-order derivative reconstruction from noisy time series and in sparse multivariate polynomial identification to improve the process of parsimoniously identifying, from a small amount of data, unknown Single-Input/Single-Output nonlinear dynamics of relative degree up to 4. The methodology is illustrated on the Electronic Throttle Controlled automotive system.
Volume 35, Article 100365
Citations: 0
Joint state and parameter estimation in quantum systems using cubature Kalman filtering
IF 1.8 Q3 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-14 DOI: 10.1016/j.ifacsc.2026.100363
Eram Taslima, Shyam Kamal, R.K. Saket
This paper addresses the challenge of state estimation for two-level quantum systems governed by stochastic master equations, particularly when key Hamiltonian parameters are unknown. Critical parameters, such as the qubit resonance frequency and the decay rate, play a crucial role in determining the system dynamics; hence, their accurate estimation is essential for reliable state reconstruction. A robust framework based on the cubature Kalman filter (CKF) is developed that handles both correlated and decorrelated noise processes inherent to quantum homodyne measurement. The proposed approach mitigates performance degradation caused by parametric uncertainty, providing enhanced adaptability and robustness. Numerical simulations on a qubit in a cavity show that the CKF-based method achieves better estimation accuracy and faster convergence than the extended Kalman filter.
Citations: 0