Pub Date : 2025-12-15DOI: 10.1109/LCSYS.2025.3644797
Marjan Khaledi;Bahare Kiumarsi
This letter proposes a novel approach to safe navigation in environments with static and dynamic obstacles by embedding control barrier functions (CBFs) within the model predictive control (MPC) framework. Unlike conventional methods that rely on unbounded additive slack variables, the proposed approach enforces each CBF constraint separately, allowing individual flexibility through dedicated slack variables with bounded relaxation weights. These weights modulate the permissible degree of constraint relaxation, ensuring that any safety softening remains quantitatively bounded, systematically tunable, and theoretically consistent with the CBF-based safety guaranties. Furthermore, the feasibility of the proposed approach is guaranteed, and the effectiveness of our method is demonstrated through the simulation results.
{"title":"Safety-Certified Planning and Control in Dynamic Environments via Model Predictive Control","authors":"Marjan Khaledi;Bahare Kiumarsi","doi":"10.1109/LCSYS.2025.3644797","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3644797","url":null,"abstract":"This letter proposes a novel approach to safe navigation in environments with static and dynamic obstacles by embedding control barrier functions (CBFs) within the model predictive control (MPC) framework. Unlike conventional methods that rely on unbounded additive slack variables, the proposed approach enforces each CBF constraint separately, allowing individual flexibility through dedicated slack variables with bounded relaxation weights. These weights modulate the permissible degree of constraint relaxation, ensuring that any safety softening remains quantitatively bounded, systematically tunable, and theoretically consistent with the CBF-based safety guaranties. Furthermore, the feasibility of the proposed approach is guaranteed, and the effectiveness of our method is demonstrated through the simulation results.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2837-2842"},"PeriodicalIF":2.0,"publicationDate":"2025-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11300835","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145830765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-15DOI: 10.1109/LCSYS.2025.3644800
Mohammad Rasoul Narimani;Katherine R. Davis;Daniel K. Molzahn
By providing the optimal operating point that satisfies both the power flow equations and engineering limits, the optimal power flow (OPF) problem is central to power systems operations. While extensive research has focused on computing high-quality OPF solutions, assessing the feasibility of transitioning between operating points remains challenging since the feasible spaces of OPF problems may consist of multiple disconnected components. It is not possible to transition between operating points in different disconnected components without violating OPF constraints. To identify such situations, this letter introduces an algorithm for certifying the infeasibility of transitioning between two operating points within an OPF feasible space. As an indication of potential disconnectedness, the algorithm first seeks an infeasible point on the line connecting a pair of feasible points. The algorithm then certifies disconnectedness by using convex relaxation and bound tightening techniques to show that all points on the plane that is normal to this line are infeasible. Using this algorithm, we provide the first certifications of disconnected feasible spaces for a variety of OPF test cases.
{"title":"Certifying the Nonexistence of Feasible Paths Between Power System Operating Points","authors":"Mohammad Rasoul Narimani;Katherine R. Davis;Daniel K. Molzahn","doi":"10.1109/LCSYS.2025.3644800","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3644800","url":null,"abstract":"By providing the optimal operating point that satisfies both the power flow equations and engineering limits, the optimal power flow (OPF) problem is central to power systems operations. While extensive research has focused on computing high-quality OPF solutions, assessing the feasibility of transitioning between operating points remains challenging since the feasible spaces of OPF problems may consist of multiple disconnected components. It is not possible to transition between operating points in different disconnected components without violating OPF constraints. To identify such situations, this letter introduces an algorithm for certifying the infeasibility of transitioning between two operating points within an OPF feasible space. As an indication of potential disconnectedness, the algorithm first seeks an infeasible point on the line connecting a pair of feasible points. The algorithm then certifies disconnectedness by using convex relaxation and bound tightening techniques to show that all points on the plane that is normal to this line are infeasible. Using this algorithm, we provide the first certifications of disconnected feasible spaces for a variety of OPF test cases.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2987-2992"},"PeriodicalIF":2.0,"publicationDate":"2025-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10DOI: 10.1109/LCSYS.2025.3642534
M. Yusuf Uzun;Yildiray Yildiz
By combining automation accuracy with human adaptability, shared control provides enhanced performance and safety in dynamic, complex environments. Traditional arbitration methods for integrating automation and human inputs often rely on system-specific, parameter-dependent functions that are based on shared control metrics such as trust, workload, or attention. Meanwhile, Control Barrier Functions (CBFs) enforce safety constraints on automated systems but are typically limited to safeguarding plant states. This letter introduces a novel arbitration method based on Control Barrier Functions (CBFs), where shared control metrics such as workload, attention, and trust are expressed as real-time inequality constraints. The resulting quadratic-programming formulation determines the automation assistance input that enforces these constraints while preserving feasibility and safety. This CBF-based arbitration provides a systematic, interpretable, and scalable foundation for safe human–autonomy integration.
{"title":"Arbitration With Control Barrier Functions for Safe Shared Control","authors":"M. Yusuf Uzun;Yildiray Yildiz","doi":"10.1109/LCSYS.2025.3642534","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3642534","url":null,"abstract":"By combining automation accuracy with human adaptability, shared control provides enhanced performance and safety in dynamic, complex environments. Traditional arbitration methods for integrating automation and human inputs often rely on system-specific, parameter-dependent functions that are based on shared control metrics such as trust, workload, or attention. Meanwhile, Control Barrier Functions (CBFs) enforce safety constraints on automated systems but are typically limited to safeguarding plant states. This letter introduces a novel arbitration method based on Control Barrier Functions (CBFs), where shared control metrics such as workload, attention, and trust are expressed as real-time inequality constraints. The resulting quadratic-programming formulation determines the automation assistance input that enforces these constraints while preserving feasibility and safety. This CBF-based arbitration provides a systematic, interpretable, and scalable foundation for safe human–autonomy integration.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2789-2794"},"PeriodicalIF":2.0,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10DOI: 10.1109/LCSYS.2025.3642487
Reetish Padhi;Ion Victor Gosea;Igor Pontes Duff;Serkan Gugercin
We develop the theoretical formulation for a non-intrusive, quadrature-based method for approximate balanced truncation (QuadBT) of linear systems with quadratic outputs, thus extending the applicability of QuadBT, which was originally designed for data-driven balanced truncation of standard linear systems with linear outputs only. The new approach makes use of the time-domain and frequency-domain quadrature-based representation of the system’s infinite Gramians, only implicitly. We show that by sampling solely the extended impulse responses (kernels) of the original system and their derivatives (or the corresponding transfer functions), we construct a reduced-order model that mimics the approximation quality of the intrusive (projection-based) balanced truncation. Although the sampling of the required kernels via input/output simulations or physical experiments is still an open question, we demonstrate a proof of concept for the proposed framework on an example using numerically evaluated data.
{"title":"Data-Driven Balancing Formulation for Linear Systems With Quadratic Outputs","authors":"Reetish Padhi;Ion Victor Gosea;Igor Pontes Duff;Serkan Gugercin","doi":"10.1109/LCSYS.2025.3642487","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3642487","url":null,"abstract":"We develop the theoretical formulation for a non-intrusive, quadrature-based method for approximate balanced truncation (QuadBT) of linear systems with quadratic outputs, thus extending the applicability of QuadBT, which was originally designed for data-driven balanced truncation of standard linear systems with linear outputs only. The new approach makes use of the time-domain and frequency-domain quadrature-based representation of the system’s infinite Gramians, only implicitly. We show that by sampling solely the extended impulse responses (kernels) of the original system and their derivatives (or the corresponding transfer functions), we construct a reduced-order model that mimics the approximation quality of the intrusive (projection-based) balanced truncation. Although the sampling of the required kernels via input/output simulations or physical experiments is still an open question, we demonstrate a proof of concept for the proposed framework on an example using numerically evaluated data.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2843-2848"},"PeriodicalIF":2.0,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145830840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10DOI: 10.1109/LCSYS.2025.3642222
Hossein Gholampour;Logan E. Beaver
Many robotic systems must follow planned paths yet pause safely and resume when people or objects intervene. We present an output–space method for systems whose tracked output can be feedback-linearized to a double integrator (e.g., manipulators). The approach has two parts. Offline, we perform a pre-run reachability check to verify that the motion plan respects speed and acceleration magnitude limits. Online, we apply a quadratic program to track the motion plan under the same limits. We use a one-step reachability test to bound the maximum disturbance the system is capable of rejecting. When the state coincides with the reference path we recover perfect tracking in the deterministic case, and we correct errors using a KKT-inspired weight. We demonstrate that safety stops and unplanned deviations are handled efficiently, and the system returns to the motion plan without replanning. We demonstrate our system’s improved performance over pure pursuit in simulation.
{"title":"Trajectory Tracking With Reachability-Guided Quadratic Programming and Freeze-Resume","authors":"Hossein Gholampour;Logan E. Beaver","doi":"10.1109/LCSYS.2025.3642222","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3642222","url":null,"abstract":"Many robotic systems must follow planned paths yet pause safely and resume when people or objects intervene. We present an output–space method for systems whose tracked output can be feedback-linearized to a double integrator (e.g., manipulators). The approach has two parts. Offline, we perform a pre-run reachability check to verify that the motion plan respects speed and acceleration magnitude limits. Online, we apply a quadratic program to track the motion plan under the same limits. We use a one-step reachability test to bound the maximum disturbance the system is capable of rejecting. When the state coincides with the reference path we recover perfect tracking in the deterministic case, and we correct errors using a KKT-inspired weight. We demonstrate that safety stops and unplanned deviations are handled efficiently, and the system returns to the motion plan without replanning. We demonstrate our system’s improved performance over pure pursuit in simulation.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2795-2800"},"PeriodicalIF":2.0,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10DOI: 10.1109/LCSYS.2025.3642769
Delphine Bresch-Pietri;Jean Auriol
This letter addresses the existence and uniqueness of the solution to a class of Fredholm integral equations associated with scalar Linear Integral Delay Equations (LIDEs). Based on an operator-theoretic framework involving transport partial differential equations, we provide general conditions that guarantee well-posedness of these equations. Leveraging this result, we introduce a constructive approach for the numerical computation of the Lyapunov matrix corresponding to scalar LIDEs with commensurate delays. Specifically, the Lyapunov matrix equations are reformulated as the Fredholm integral equation under consideration, for which the proposed conditions are satisfied under exponential stability of the LIDE. The resulting integral equation can then be discretized, and the linear system solved to efficiently compute the Delay Lyapunov matrix. Numerical examples illustrate the effectiveness and applicability of this methodology.
{"title":"Existence and Uniqueness of the Solution to a Class of Fredholm Integral Equations Related to Difference Equations","authors":"Delphine Bresch-Pietri;Jean Auriol","doi":"10.1109/LCSYS.2025.3642769","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3642769","url":null,"abstract":"This letter addresses the existence and uniqueness of the solution to a class of Fredholm integral equations associated with scalar Linear Integral Delay Equations (LIDEs). Based on an operator-theoretic framework involving transport partial differential equations, we provide general conditions that guarantee well-posedness of these equations. Leveraging this result, we introduce a constructive approach for the numerical computation of the Lyapunov matrix corresponding to scalar LIDEs with commensurate delays. Specifically, the Lyapunov matrix equations are reformulated as the Fredholm integral equation under consideration, for which the proposed conditions are satisfied under exponential stability of the LIDE. The resulting integral equation can then be discretized, and the linear system solved to efficiently compute the Delay Lyapunov matrix. Numerical examples illustrate the effectiveness and applicability of this methodology.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"3113-3118"},"PeriodicalIF":2.0,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-09DOI: 10.1109/LCSYS.2025.3641995
Hang Zhang;Abigail J. Winn;Yuhao Zhang;Xiangru Xu
This letter investigates goal-reaching control synthesis for neural network control systems. A backward reachability framework is developed based on constrained zonotopes, in which the graph set of a ReLU-activated feedforward neural network is encoded as a finite union of constrained zonotopes. Using this representation, under-approximations of backward reachable sets are computed for systems with nonlinear plant models, ensuring the feasibility of the goal-reaching task. Control sequences are then synthesized through an optimization procedure that exploits the under-approximated set. A numerical example demonstrates the effectiveness of the proposed approach.
{"title":"Goal-Reaching Control Synthesis for Neural Network Control Systems via Backward Reachability","authors":"Hang Zhang;Abigail J. Winn;Yuhao Zhang;Xiangru Xu","doi":"10.1109/LCSYS.2025.3641995","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3641995","url":null,"abstract":"This letter investigates goal-reaching control synthesis for neural network control systems. A backward reachability framework is developed based on constrained zonotopes, in which the graph set of a ReLU-activated feedforward neural network is encoded as a finite union of constrained zonotopes. Using this representation, under-approximations of backward reachable sets are computed for systems with nonlinear plant models, ensuring the feasibility of the goal-reaching task. Control sequences are then synthesized through an optimization procedure that exploits the under-approximated set. A numerical example demonstrates the effectiveness of the proposed approach.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2771-2776"},"PeriodicalIF":2.0,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-09DOI: 10.1109/LCSYS.2025.3641989
Zahra Hashemi;Dipankar Maity
This letter explores intelligent scheduling of sensor-to-controller communication in networked control systems, particularly when data transmission incurs a cost. While the optimal controller in a standard linear-quadratic Gaussian (LQG) setup can be computed analytically, determining the optimal times to transmit sensor data remains computationally and analytically challenging. We show that, through reformulation and the introduction of auxiliary binary variables, the scheduling problem can be cast as a computationally efficient mixed-integer linear program (MILP). This formulation not only simplifies the analysis but also reveals structural insights and provides clear decision criterion at each step. Embedding the approach within a model predictive control (MPC) framework enables dynamic adaptation, and we prove that the resulting scheduler performs at least as well as any deterministic strategy (e.g., periodic strategy). Simulation results further demonstrate that our method consistently outperforms traditional periodic scheduling.
{"title":"A Linear Programming Framework for Optimal Event-Triggered LQG Control","authors":"Zahra Hashemi;Dipankar Maity","doi":"10.1109/LCSYS.2025.3641989","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3641989","url":null,"abstract":"This letter explores intelligent scheduling of sensor-to-controller communication in networked control systems, particularly when data transmission incurs a cost. While the optimal controller in a standard linear-quadratic Gaussian (LQG) setup can be computed analytically, determining the optimal times to transmit sensor data remains computationally and analytically challenging. We show that, through reformulation and the introduction of auxiliary binary variables, the scheduling problem can be cast as a computationally efficient mixed-integer linear program (MILP). This formulation not only simplifies the analysis but also reveals structural insights and provides clear decision criterion at each step. Embedding the approach within a model predictive control (MPC) framework enables dynamic adaptation, and we prove that the resulting scheduler performs at least as well as any deterministic strategy (e.g., periodic strategy). Simulation results further demonstrate that our method consistently outperforms traditional periodic scheduling.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2783-2788"},"PeriodicalIF":2.0,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-09DOI: 10.1109/LCSYS.2025.3642051
Jian Zheng;Mario Sznaier
This letter presents a robust data-driven receding-horizon control framework for the discrete-time linear quadratic regulator (LQR) with input constraints. Unlike earlier data-driven approaches that design a controller from initial data and apply it unchanged throughout the trajectory, our method exploits all available execution data in a receding-horizon manner, thereby capturing additional information about the unknown system and enabling less conservative performance. Existing data-driven LQR model predictive control methods rely on over-approximations of the consistency set and $ell _{2}$ descriptions of noise. In contrast, the proposed approach uses exact descriptions of the consistency set under $ell _{infty }$ -bounded noise, leveraging duality to recast the problem into a tractable convex optimization. Further, the proposed controller renders the closed-loop system input-to-state stable. Simulation results demonstrate the effectiveness of the method.
{"title":"Robust Data-Driven Receding-Horizon Control for LQR With Input Constraints","authors":"Jian Zheng;Mario Sznaier","doi":"10.1109/LCSYS.2025.3642051","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3642051","url":null,"abstract":"This letter presents a robust data-driven receding-horizon control framework for the discrete-time linear quadratic regulator (LQR) with input constraints. Unlike earlier data-driven approaches that design a controller from initial data and apply it unchanged throughout the trajectory, our method exploits all available execution data in a receding-horizon manner, thereby capturing additional information about the unknown system and enabling less conservative performance. Existing data-driven LQR model predictive control methods rely on over-approximations of the consistency set and <inline-formula> <tex-math>$ell _{2}$ </tex-math></inline-formula> descriptions of noise. In contrast, the proposed approach uses exact descriptions of the consistency set under <inline-formula> <tex-math>$ell _{infty }$ </tex-math></inline-formula>-bounded noise, leveraging duality to recast the problem into a tractable convex optimization. Further, the proposed controller renders the closed-loop system input-to-state stable. Simulation results demonstrate the effectiveness of the method.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2765-2770"},"PeriodicalIF":2.0,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-08DOI: 10.1109/LCSYS.2025.3641882
Fat-Hy O. Rajab;Olugbenga M. Anubi;Marcos M. Vasconcelos
This letter investigates a two-player coordination game in which the players exhibit heterogeneous levels of bounded rationality. We analyze the log-linear learning dynamics where the probability distribution used to select which of the agents gets to revise its strategy is fixed but not necessarily uniform. The stationary distribution of the resulting Markov chain on the strategy profile space is derived in closed-form as a function of the rationalities and the agent selection probabilities. We proceed by showing that adjusting the selection probabilities can be used to bias the stationary distribution toward the potential-maximizing state. However, this optimization comes at the cost of a reduced convergence rate, whereas the uniform selection probabilities uniquely maximizes the convergence speed irrespective of the players’ rationality levels. A Pareto-optimal probability selection rule is proposed, trading-off the distributional bias with convergence rate. Moreover, it is shown that in coordination games, high levels of rationality sometimes accelerate convergence, whereas in other cases they may paradoxically hinder the convergence rate of log-linear learning dynamics.
{"title":"Log-Linear Learning for Coordination With Heterogeneous Bounded Rationalities","authors":"Fat-Hy O. Rajab;Olugbenga M. Anubi;Marcos M. Vasconcelos","doi":"10.1109/LCSYS.2025.3641882","DOIUrl":"https://doi.org/10.1109/LCSYS.2025.3641882","url":null,"abstract":"This letter investigates a two-player coordination game in which the players exhibit heterogeneous levels of bounded rationality. We analyze the log-linear learning dynamics where the probability distribution used to select which of the agents gets to revise its strategy is fixed but not necessarily uniform. The stationary distribution of the resulting Markov chain on the strategy profile space is derived in closed-form as a function of the rationalities and the agent selection probabilities. We proceed by showing that adjusting the selection probabilities can be used to bias the stationary distribution toward the potential-maximizing state. However, this optimization comes at the cost of a reduced convergence rate, whereas the uniform selection probabilities uniquely maximizes the convergence speed irrespective of the players’ rationality levels. A Pareto-optimal probability selection rule is proposed, trading-off the distributional bias with convergence rate. Moreover, it is shown that in coordination games, high levels of rationality sometimes accelerate convergence, whereas in other cases they may paradoxically hinder the convergence rate of log-linear learning dynamics.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2741-2746"},"PeriodicalIF":2.0,"publicationDate":"2025-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}