首页 > 最新文献

IEEE Control Systems Letters最新文献

英文 中文
Rationality of Learning Algorithms in Repeated Normal-Form Games 重复正态博弈中学习算法的合理性
IF 2.4 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-25 DOI: 10.1109/LCSYS.2024.3486631
Shivam Bajaj;Pranoy Das;Yevgeniy Vorobeychik;Vijay Gupta
Many learning algorithms are known to converge to an equilibrium for specific classes of games if the same learning algorithm is adopted by all agents. However, when the agents are self-interested, a natural question is whether the agents have an incentive to unilaterally shift to an alternative learning algorithm. We capture such incentives as an algorithm’s rationality ratio, which is the ratio of the highest payoff an agent can obtain by unilaterally deviating from a learning algorithm to its payoff from following it. We define a learning algorithm to be c-rational if its rationality ratio is at most c irrespective of the game. We show that popular learning algorithms such as fictitious play and regret-matching are not c-rational for any constant ${mathrm { c}}geq 1$ . We also show that if an agent can only observe the actions of the other agents but not their payoffs, then there are games for which c-rational algorithms do not exist. We then propose a framework that can build upon any existing learning algorithm and establish, under mild assumptions, that our proposed algorithm is (i) c-rational for a given ${mathrm { c}}geq 1$ and (ii) the strategies of the agents converge to an equilibrium, with high probability, if all agents follow it.
众所周知,对于特定类别的博弈,如果所有代理人都采用相同的学习算法,许多学习算法都会趋于均衡。然而,当博弈主体是自利的,一个自然的问题就是博弈主体是否有动机单方面转向另一种学习算法。我们用算法的合理性比率来表示这种动机,即代理人通过单方面偏离学习算法所能获得的最高报酬与遵循该算法所能获得的报酬之比。我们将一种学习算法定义为 c-理性算法,如果它的理性比率至多为 c,则无论博弈情况如何。我们证明,对于任意常数 ${mathrm { c}}geq 1$ 而言,流行的学习算法(如虚构博弈和后悔匹配)都不是 c-理性的。我们还证明,如果一个代理只能观察到其他代理的行动而不能观察到他们的回报,那么就存在不存在 c-理性算法的博弈。然后,我们提出了一个可以建立在任何现有学习算法基础上的框架,并在温和的假设条件下确定了我们提出的算法:(i) 对于给定的 ${mathrm { c}geq 1$ 是 c-合理的;(ii) 如果所有代理人都遵循它,那么代理人的策略就会高概率地收敛到均衡。
{"title":"Rationality of Learning Algorithms in Repeated Normal-Form Games","authors":"Shivam Bajaj;Pranoy Das;Yevgeniy Vorobeychik;Vijay Gupta","doi":"10.1109/LCSYS.2024.3486631","DOIUrl":"https://doi.org/10.1109/LCSYS.2024.3486631","url":null,"abstract":"Many learning algorithms are known to converge to an equilibrium for specific classes of games if the same learning algorithm is adopted by all agents. However, when the agents are self-interested, a natural question is whether the agents have an incentive to unilaterally shift to an alternative learning algorithm. We capture such incentives as an algorithm’s rationality ratio, which is the ratio of the highest payoff an agent can obtain by unilaterally deviating from a learning algorithm to its payoff from following it. We define a learning algorithm to be c-rational if its rationality ratio is at most c irrespective of the game. We show that popular learning algorithms such as fictitious play and regret-matching are not c-rational for any constant \u0000<inline-formula> <tex-math>${mathrm { c}}geq 1$ </tex-math></inline-formula>\u0000. We also show that if an agent can only observe the actions of the other agents but not their payoffs, then there are games for which c-rational algorithms do not exist. We then propose a framework that can build upon any existing learning algorithm and establish, under mild assumptions, that our proposed algorithm is (i) c-rational for a given \u0000<inline-formula> <tex-math>${mathrm { c}}geq 1$ </tex-math></inline-formula>\u0000 and (ii) the strategies of the agents converge to an equilibrium, with high probability, if all agents follow it.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142573620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of Opinion on Disease Transmission With Waterborne Pathogen and Stubborn Community 水传播病原体和顽固群落对疾病传播的影响
IF 2.4 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-24 DOI: 10.1109/LCSYS.2024.3485963
Qingsong Liu;Li Ma
How to control the spread of disease is a challenging problem for human beings, and one of the main reasons is the diversity of disease transmission pathways. Besides, the existing literature shows that community’s opinion plays an important role in controlling the spread of disease. However, the influence of opinions on disease transmission with a waterborne pathogen is unclear. In this letter, we propose a nonlinear dynamic system to analyze the impact of the opinion on disease transmission with a waterborne pathogen and stubborn community. The criteria for determining the global and local stability of the proposed dynamic system are established, respectively. Based on the real data and Email-Eu-Core network, our proposed dynamic system is utilized to show that only the proportion of infected individuals decreased, but diseases and a waterborne pathogen did not disappear, which is completely different from the existing result that the disease disappeared when stubborn communities are introduced.
如何控制疾病的传播对人类来说是一个具有挑战性的问题,其中一个主要原因就是疾病传播途径的多样性。此外,现有文献表明,社会舆论在控制疾病传播方面发挥着重要作用。然而,对于水传播病原体,舆论对疾病传播的影响尚不明确。在这封信中,我们提出了一个非线性动态系统来分析舆论对水媒病原体和顽固社区疾病传播的影响。分别建立了确定所提动态系统全局和局部稳定性的标准。基于真实数据和Email-Eu-Core网络,我们提出的动态系统表明,只有受感染个体的比例下降,但疾病和水传播病原体并没有消失,这与现有的引入顽固社区后疾病消失的结果完全不同。
{"title":"Impact of Opinion on Disease Transmission With Waterborne Pathogen and Stubborn Community","authors":"Qingsong Liu;Li Ma","doi":"10.1109/LCSYS.2024.3485963","DOIUrl":"https://doi.org/10.1109/LCSYS.2024.3485963","url":null,"abstract":"How to control the spread of disease is a challenging problem for human beings, and one of the main reasons is the diversity of disease transmission pathways. Besides, the existing literature shows that community’s opinion plays an important role in controlling the spread of disease. However, the influence of opinions on disease transmission with a waterborne pathogen is unclear. In this letter, we propose a nonlinear dynamic system to analyze the impact of the opinion on disease transmission with a waterborne pathogen and stubborn community. The criteria for determining the global and local stability of the proposed dynamic system are established, respectively. Based on the real data and Email-Eu-Core network, our proposed dynamic system is utilized to show that only the proportion of infected individuals decreased, but diseases and a waterborne pathogen did not disappear, which is completely different from the existing result that the disease disappeared when stubborn communities are introduced.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142555131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Numerical and Lyapunov-Based Investigation of the Effect of Stenosis on Blood Transport Stability Using a Control-Theoretic PDE Model of Cardiovascular Flow 利用心血管流动的控制论 PDE 模型,基于数值和 Lyapunov 对狭窄对血液运输稳定性影响的研究
IF 2.4 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-23 DOI: 10.1109/LCSYS.2024.3484635
Shantanu Singh;Nikolaos Bekiaris-Liberis
We perform various numerical tests to study the effect of (boundary) stenosis on blood flow stability, employing a detailed and accurate, second-order finite-volume scheme for numerically implementing a partial differential equation (PDE) model, using clinically realistic values for the artery’s parameters and the blood inflow. The model consists of a baseline $2times 2$ hetero-directional, nonlinear hyperbolic PDE system, in which, the stenosis’ effect is described by a pressure drop at the outlet of an arterial segment considered. We then study the stability properties (observed in our numerical tests) of a reference trajectory, corresponding to a given time-varying inflow (e.g., a periodic trajectory with period equal to the time interval between two consecutive heartbeats) and stenosis severity, deriving the respective linearized system and constructing a Lyapunov functional. Due to the fact that the linearized system is time varying, with time-varying parameters depending on the reference trajectories themselves (that, in turn, depend in an implicit manner on the stenosis degree), which cannot be derived analytically, we verify the Lyapunov-based stability conditions obtained, numerically. Both the numerical tests and the Lyapunov-based stability analysis show that a reference trajectory is asymptotically stable with a decay rate that decreases as the stenosis severity deteriorates.
我们进行了各种数值测试来研究(边界)狭窄对血流稳定性的影响,采用了详细而精确的二阶有限体积方案来数值化一个偏微分方程(PDE)模型,使用临床上实际的动脉参数值和血液流入量。该模型由一个基线为 2 元/次 2 元的异方向非线性双曲偏微分方程系统组成,其中,狭窄的影响由动脉段出口处的压力降来描述。然后,我们研究了参考轨迹的稳定性(在数值测试中观察到),该轨迹对应于给定的时变流入量(例如,周期轨迹,其周期等于两次连续心跳之间的时间间隔)和狭窄严重程度,推导出各自的线性化系统并构建了 Lyapunov 函数。由于线性化系统是时变的,其时变参数取决于参考轨迹本身(而参考轨迹又以隐含的方式取决于狭窄程度),无法通过分析得出,因此我们通过数值方法验证了所获得的基于 Lyapunov 的稳定性条件。数值测试和基于 Lyapunov 的稳定性分析表明,参考轨迹是渐近稳定的,其衰减率随着狭窄严重程度的恶化而降低。
{"title":"Numerical and Lyapunov-Based Investigation of the Effect of Stenosis on Blood Transport Stability Using a Control-Theoretic PDE Model of Cardiovascular Flow","authors":"Shantanu Singh;Nikolaos Bekiaris-Liberis","doi":"10.1109/LCSYS.2024.3484635","DOIUrl":"https://doi.org/10.1109/LCSYS.2024.3484635","url":null,"abstract":"We perform various numerical tests to study the effect of (boundary) stenosis on blood flow stability, employing a detailed and accurate, second-order finite-volume scheme for numerically implementing a partial differential equation (PDE) model, using clinically realistic values for the artery’s parameters and the blood inflow. The model consists of a baseline \u0000<inline-formula> <tex-math>$2times 2$ </tex-math></inline-formula>\u0000 hetero-directional, nonlinear hyperbolic PDE system, in which, the stenosis’ effect is described by a pressure drop at the outlet of an arterial segment considered. We then study the stability properties (observed in our numerical tests) of a reference trajectory, corresponding to a given time-varying inflow (e.g., a periodic trajectory with period equal to the time interval between two consecutive heartbeats) and stenosis severity, deriving the respective linearized system and constructing a Lyapunov functional. Due to the fact that the linearized system is time varying, with time-varying parameters depending on the reference trajectories themselves (that, in turn, depend in an implicit manner on the stenosis degree), which cannot be derived analytically, we verify the Lyapunov-based stability conditions obtained, numerically. Both the numerical tests and the Lyapunov-based stability analysis show that a reference trajectory is asymptotically stable with a decay rate that decreases as the stenosis severity deteriorates.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142573618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Almost Sure Convergence and Non-Asymptotic Concentration Bounds for Stochastic Mirror Descent Algorithm 随机镜像后裔算法的几乎确定收敛性和非渐近集中限界
IF 2.4 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-16 DOI: 10.1109/LCSYS.2024.3482148
Anik Kumar Paul;Arun D. Mahindrakar;Rachel K. Kalaimani
This letter investigates the convergence and concentration properties of the Stochastic Mirror Descent (SMD) algorithm utilizing biased stochastic subgradients. We establish the almost sure convergence of the algorithm’s iterates under the assumption of diminishing bias. Furthermore, we derive concentration bounds for the discrepancy between the iterates’ function values and the optimal value, based on standard assumptions. Subsequently, leveraging the assumption of Sub-Gaussian noise in stochastic subgradients, we present refined concentration bounds for this discrepancy.
这篇文章研究了利用偏置随机子梯度的随机镜像后裔(SMD)算法的收敛性和集中特性。在偏差递减的假设下,我们确定了算法迭代的几乎确定收敛性。此外,我们还基于标准假设,推导出了迭代函数值与最优值之间差异的集中约束。随后,利用随机子梯度中的亚高斯噪声假设,我们提出了这一差异的精炼集中限。
{"title":"Almost Sure Convergence and Non-Asymptotic Concentration Bounds for Stochastic Mirror Descent Algorithm","authors":"Anik Kumar Paul;Arun D. Mahindrakar;Rachel K. Kalaimani","doi":"10.1109/LCSYS.2024.3482148","DOIUrl":"https://doi.org/10.1109/LCSYS.2024.3482148","url":null,"abstract":"This letter investigates the convergence and concentration properties of the Stochastic Mirror Descent (SMD) algorithm utilizing biased stochastic subgradients. We establish the almost sure convergence of the algorithm’s iterates under the assumption of diminishing bias. Furthermore, we derive concentration bounds for the discrepancy between the iterates’ function values and the optimal value, based on standard assumptions. Subsequently, leveraging the assumption of Sub-Gaussian noise in stochastic subgradients, we present refined concentration bounds for this discrepancy.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142565636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Opinion Dynamics With Set-Based Confidence: Convergence Criteria and Periodic Solutions 基于集合信心的意见动态:收敛标准和周期性解决方案
IF 2.4 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-14 DOI: 10.1109/LCSYS.2024.3479275
Iryna Zabarianska;Anton V. Proskurnikov
This letter introduces a new multidimensional extension of the Hegselmann-Krause (HK) opinion dynamics model, where opinion proximity is not determined by a norm or metric. Instead, each agent trusts opinions within the Minkowski sum $boldsymbol {xi }+boldsymbol {mathcal {O}}$ , where $boldsymbol {xi }$ is the agent’s current opinion and $boldsymbol {mathcal {O}}$ is the confidence set defining acceptable deviations. During each iteration, agents update their opinions by simultaneously averaging the trusted opinions. Unlike traditional HK systems, where $boldsymbol {mathcal {O}}$ is a ball in some norm, our model allows the confidence set to be non-convex and even unbounded. The new model, referred to as SCOD (Set-based Confidence Opinion Dynamics), can exhibit properties absent in the conventional HK model. Some solutions may converge to non-equilibrium points in the state space, while others oscillate periodically. These “pathologies” disappear if the set $boldsymbol {mathcal {O}}$ is symmetric and contains zero in its interior: similar to the usual HK model, the SCOD then converge in a finite number of iterations to one of the equilibrium points. The latter property is also preserved if one agent is “stubborn” and resists changing their opinion, yet still influences the others; however, two stubborn agents can lead to oscillations.
这封信介绍了海格塞曼-克劳斯(Hegselmann-Krause,HK)意见动态模型的一个新的多维扩展,在这个模型中,意见接近度不是由规范或度量决定的。取而代之的是,每个代理信任闵科夫斯基总和 $boldsymbol {xi }+boldsymbol {mathcal {O}}$ 内的意见,其中 $boldsymbol {xi }$ 是代理当前的意见,$boldsymbol {mathcal {O}}$ 是定义可接受偏差的置信度集。在每次迭代过程中,代理通过同时平均可信意见来更新自己的意见。在传统的香港系统中,$boldsymbol {mathcal {O}}$ 是一个符合某种规范的球,而我们的模型则不同,它允许置信度集是非凸的,甚至是无界的。这种新模型被称为 SCOD(基于置信度的意见动态模型),它可以表现出传统 HK 模型所不具备的特性。一些解可能会收敛到状态空间中的非平衡点,而另一些解则会周期性振荡。如果集合$boldsymbol {mathcal {O}}$是对称的,并且其内部包含零,那么这些 "病态 "就会消失:与通常的HK模型类似,SCOD会在有限次数的迭代中收敛到其中一个平衡点。如果一个代理是 "顽固的",不愿意改变自己的观点,但仍会影响其他代理,那么后一个属性也会保留;但是,两个顽固的代理会导致振荡。
{"title":"Opinion Dynamics With Set-Based Confidence: Convergence Criteria and Periodic Solutions","authors":"Iryna Zabarianska;Anton V. Proskurnikov","doi":"10.1109/LCSYS.2024.3479275","DOIUrl":"https://doi.org/10.1109/LCSYS.2024.3479275","url":null,"abstract":"This letter introduces a new multidimensional extension of the Hegselmann-Krause (HK) opinion dynamics model, where opinion proximity is not determined by a norm or metric. Instead, each agent trusts opinions within the Minkowski sum \u0000<inline-formula> <tex-math>$boldsymbol {xi }+boldsymbol {mathcal {O}}$ </tex-math></inline-formula>\u0000, where \u0000<inline-formula> <tex-math>$boldsymbol {xi }$ </tex-math></inline-formula>\u0000 is the agent’s current opinion and \u0000<inline-formula> <tex-math>$boldsymbol {mathcal {O}}$ </tex-math></inline-formula>\u0000 is the confidence set defining acceptable deviations. During each iteration, agents update their opinions by simultaneously averaging the trusted opinions. Unlike traditional HK systems, where \u0000<inline-formula> <tex-math>$boldsymbol {mathcal {O}}$ </tex-math></inline-formula>\u0000 is a ball in some norm, our model allows the confidence set to be non-convex and even unbounded. The new model, referred to as SCOD (Set-based Confidence Opinion Dynamics), can exhibit properties absent in the conventional HK model. Some solutions may converge to non-equilibrium points in the state space, while others oscillate periodically. These “pathologies” disappear if the set \u0000<inline-formula> <tex-math>$boldsymbol {mathcal {O}}$ </tex-math></inline-formula>\u0000 is symmetric and contains zero in its interior: similar to the usual HK model, the SCOD then converge in a finite number of iterations to one of the equilibrium points. The latter property is also preserved if one agent is “stubborn” and resists changing their opinion, yet still influences the others; however, two stubborn agents can lead to oscillations.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10715999","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142552002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computation and Formal Verification of Neural Network Contraction Metrics 神经网络收缩指标的计算与形式验证
IF 2.4 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-11 DOI: 10.1109/LCSYS.2024.3478272
Maxwell Fitzsimmons;Jun Liu
A contraction metric defines a differential Lyapunov-like function that robustly captures the convergence between trajectories. In this letter, we investigate the use of neural networks for computing verifiable contraction metrics. We first prove the existence of a smooth neural network contraction metric within the domain of attraction of an exponentially stable equilibrium point. We then focus on the computation of a neural network contraction metric over a compact invariant set within the domain of attraction certified by a physics-informed neural network Lyapunov function. We consider both partial differential inequality (PDI) and equation (PDE) losses for computation. We show that sufficiently accurate neural approximate solutions to the PDI and PDE are guaranteed to be a contraction metric under mild technical assumptions. We rigorously verify the computed neural network contraction metric using a satisfiability modulo theories solver. Through numerical examples, we demonstrate that the proposed approach outperforms traditional semidefinite programming methods for finding sum-of-squares polynomial contraction metrics.
收缩度量定义了一个类似于李雅普诺夫的微分函数,它能稳健地捕捉轨迹之间的收敛性。在这封信中,我们研究了如何利用神经网络计算可验证的收缩度量。我们首先证明了在指数稳定平衡点的吸引域内存在平滑的神经网络收缩度量。然后,我们将重点放在计算由物理信息神经网络 Lyapunov 函数证明的吸引力域内紧凑不变集上的神经网络收缩度量。我们考虑了计算中的偏微分不等式(PDI)和方程(PDE)损失。我们证明,在温和的技术假设条件下,足够精确的偏微分不等式和偏微分方程的神经近似解保证是一个收缩度量。我们使用满足性模态理论求解器严格验证了计算出的神经网络收缩度量。通过数值示例,我们证明了在寻找平方和多项式收缩指标方面,所提出的方法优于传统的半有限编程方法。
{"title":"Computation and Formal Verification of Neural Network Contraction Metrics","authors":"Maxwell Fitzsimmons;Jun Liu","doi":"10.1109/LCSYS.2024.3478272","DOIUrl":"https://doi.org/10.1109/LCSYS.2024.3478272","url":null,"abstract":"A contraction metric defines a differential Lyapunov-like function that robustly captures the convergence between trajectories. In this letter, we investigate the use of neural networks for computing verifiable contraction metrics. We first prove the existence of a smooth neural network contraction metric within the domain of attraction of an exponentially stable equilibrium point. We then focus on the computation of a neural network contraction metric over a compact invariant set within the domain of attraction certified by a physics-informed neural network Lyapunov function. We consider both partial differential inequality (PDI) and equation (PDE) losses for computation. We show that sufficiently accurate neural approximate solutions to the PDI and PDE are guaranteed to be a contraction metric under mild technical assumptions. We rigorously verify the computed neural network contraction metric using a satisfiability modulo theories solver. Through numerical examples, we demonstrate that the proposed approach outperforms traditional semidefinite programming methods for finding sum-of-squares polynomial contraction metrics.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142452689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pointwise-Sparse Actuator Scheduling for Linear Systems With Controllability Guarantee 具有可控性保证的线性系统的点状稀疏执行器调度
IF 2.4 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-07 DOI: 10.1109/LCSYS.2024.3475886
Luca Ballotta;Geethu Joseph;Irawati Rahul Thete
This letter considers the design of sparse actuator schedules for linear time-invariant systems. An actuator schedule selects, for each time instant, which control inputs act on the system in that instant. We address the optimal scheduling of control inputs under a hard constraint on the number of inputs that can be used at each time. For a sparsely controllable system, we characterize sparse actuator schedules that make the system controllable, and then devise a greedy selection algorithm that guarantees controllability while heuristically providing low control effort. We further show how to enhance our greedy algorithm via Markov chain Monte Carlo-based randomized optimization.
这封信探讨了线性时变系统的稀疏执行器计划的设计问题。执行器计划表为每个时间瞬间选择在该瞬间对系统起作用的控制输入。我们要解决的问题是,在每次可使用的控制输入数量受到硬约束的情况下,如何优化控制输入的调度。对于稀疏可控系统,我们描述了使系统可控的稀疏执行器调度,然后设计了一种贪婪选择算法,在保证可控性的同时,启发式地提供了较低的控制努力。我们进一步展示了如何通过基于马尔科夫链蒙特卡罗的随机优化来增强我们的贪婪算法。
{"title":"Pointwise-Sparse Actuator Scheduling for Linear Systems With Controllability Guarantee","authors":"Luca Ballotta;Geethu Joseph;Irawati Rahul Thete","doi":"10.1109/LCSYS.2024.3475886","DOIUrl":"https://doi.org/10.1109/LCSYS.2024.3475886","url":null,"abstract":"This letter considers the design of sparse actuator schedules for linear time-invariant systems. An actuator schedule selects, for each time instant, which control inputs act on the system in that instant. We address the optimal scheduling of control inputs under a hard constraint on the number of inputs that can be used at each time. For a sparsely controllable system, we characterize sparse actuator schedules that make the system controllable, and then devise a greedy selection algorithm that guarantees controllability while heuristically providing low control effort. We further show how to enhance our greedy algorithm via Markov chain Monte Carlo-based randomized optimization.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2024-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142447055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Negativizability for Nonlinear Estimation in Cyber–Physical Systems 网络物理系统中非线性估计的可否定性
IF 2.4 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-07 DOI: 10.1109/LCSYS.2024.3473789
Camilla Fioravanti;Stefano Panzieri;Gabriele Oliva
This letter introduces a novel fully distributed estimation scheme for nonlinear continuous-time dynamics over directed and strongly connected graphs. Leveraging on the assumption of local negativizability, the proposed approach performs the estimation of the interdependent subsystems of a cyber-physical system, despite the presence of nonlinear dependencies on the dynamics. This transforms the intricate task of nonlinear state estimation by each agent into more manageable local negativizability problems for the design of the estimation gains. A pivotal aspect of the approach is that each agent should be aware of an upper bound on the Lipschitz constant of the overall nonlinear function that characterizes the dynamics. To face this issue, we developed a novel distributed methodology for the estimation of the global Lipschitz constant, starting from the local observations of the system’s nonlinearities. The effectiveness of the proposed scheme is numerically demonstrated through simulations.
这封信介绍了一种新颖的全分布式有向强连接图非线性连续时间动力学估算方案。利用局部可否定性假设,尽管动态存在非线性依赖性,所提出的方法仍能对网络物理系统中相互依赖的子系统进行估计。这就将每个代理进行非线性状态估计的复杂任务转化为更易于管理的局部可否定性问题,以便设计估计增益。该方法的一个关键方面是,每个代理都应了解表征动态特性的整体非线性函数的 Lipschitz 常量的上限。面对这一问题,我们开发了一种新颖的分布式方法,从系统非线性的局部观测出发,估计全局的 Lipschitz 常数。我们通过模拟数值证明了所提方案的有效性。
{"title":"Negativizability for Nonlinear Estimation in Cyber–Physical Systems","authors":"Camilla Fioravanti;Stefano Panzieri;Gabriele Oliva","doi":"10.1109/LCSYS.2024.3473789","DOIUrl":"https://doi.org/10.1109/LCSYS.2024.3473789","url":null,"abstract":"This letter introduces a novel fully distributed estimation scheme for nonlinear continuous-time dynamics over directed and strongly connected graphs. Leveraging on the assumption of local negativizability, the proposed approach performs the estimation of the interdependent subsystems of a cyber-physical system, despite the presence of nonlinear dependencies on the dynamics. This transforms the intricate task of nonlinear state estimation by each agent into more manageable local negativizability problems for the design of the estimation gains. A pivotal aspect of the approach is that each agent should be aware of an upper bound on the Lipschitz constant of the overall nonlinear function that characterizes the dynamics. To face this issue, we developed a novel distributed methodology for the estimation of the global Lipschitz constant, starting from the local observations of the system’s nonlinearities. The effectiveness of the proposed scheme is numerically demonstrated through simulations.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2024-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10706085","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142447233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Design of ϵ-Optimal Strategy for Two-Person Zero-Sum Markov Games 两人零和马尔可夫博弈的ϵ-最优策略设计
IF 2.4 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-04 DOI: 10.1109/LCSYS.2024.3474057
Kaiyun Xie;Junlin Xiong
This letter focuses on designing approximate Nash strategies for the two-person zero-sum Markov game. Using the receding horizon method, the $epsilon $ -optimal strategies are designed to approximate Nash strategies by executing finite Gauss-Seidel iterations. The relationship between the approximation value of $epsilon $ and the number of iterations is also analyzed. Additionally, the $epsilon $ -optimal strategies are designed for two scenarios with imprecise parameters. For scenarios with imprecise values, the value of $epsilon $ is determined based on the errors between imprecise and iteration values. It provides a theoretical basis for efficiently designing $epsilon $ -optimal strategies using heuristic algorithms or approximate dynamic programming. For scenarios with imprecise transition probabilities, the value of $epsilon $ is determined based on the errors between the estimated and practical transition probabilities. It enables the use of pattern recognition technology or other methods to estimate practical transition probabilities for designing $epsilon $ -optimal strategies.
这封信的重点是设计两人零和马尔可夫博弈的近似纳什策略。利用后退视界法,通过执行有限的高斯-赛德尔迭代,设计了$epsilon $最优策略来近似纳什策略。同时还分析了 $epsilon $ 的近似值与迭代次数之间的关系。此外,还针对两种参数不精确的情况设计了$epsilon $最优策略。对于参数值不精确的情况,$epsilon $ 的值是根据不精确值和迭代值之间的误差确定的。它为使用启发式算法或近似动态编程有效设计 $epsilon $ 最佳策略提供了理论基础。对于过渡概率不精确的情况,$epsilon $ 的值是根据估计过渡概率和实际过渡概率之间的误差来确定的。它可以使用模式识别技术或其他方法来估计实际过渡概率,从而设计出最优策略。
{"title":"The Design of ϵ-Optimal Strategy for Two-Person Zero-Sum Markov Games","authors":"Kaiyun Xie;Junlin Xiong","doi":"10.1109/LCSYS.2024.3474057","DOIUrl":"https://doi.org/10.1109/LCSYS.2024.3474057","url":null,"abstract":"This letter focuses on designing approximate Nash strategies for the two-person zero-sum Markov game. Using the receding horizon method, the \u0000<inline-formula> <tex-math>$epsilon $ </tex-math></inline-formula>\u0000-optimal strategies are designed to approximate Nash strategies by executing finite Gauss-Seidel iterations. The relationship between the approximation value of \u0000<inline-formula> <tex-math>$epsilon $ </tex-math></inline-formula>\u0000 and the number of iterations is also analyzed. Additionally, the \u0000<inline-formula> <tex-math>$epsilon $ </tex-math></inline-formula>\u0000-optimal strategies are designed for two scenarios with imprecise parameters. For scenarios with imprecise values, the value of \u0000<inline-formula> <tex-math>$epsilon $ </tex-math></inline-formula>\u0000 is determined based on the errors between imprecise and iteration values. It provides a theoretical basis for efficiently designing \u0000<inline-formula> <tex-math>$epsilon $ </tex-math></inline-formula>\u0000-optimal strategies using heuristic algorithms or approximate dynamic programming. For scenarios with imprecise transition probabilities, the value of \u0000<inline-formula> <tex-math>$epsilon $ </tex-math></inline-formula>\u0000 is determined based on the errors between the estimated and practical transition probabilities. It enables the use of pattern recognition technology or other methods to estimate practical transition probabilities for designing \u0000<inline-formula> <tex-math>$epsilon $ </tex-math></inline-formula>\u0000-optimal strategies.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142442956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting Stubborn Behaviors in Influence Networks: A Model-Based Approach for Resilient Analysis 检测影响网络中的顽固行为:基于模型的弹性分析方法
IF 2.4 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-01 DOI: 10.1109/LCSYS.2024.3472495
Roberta Raineri;Chiara Ravazzi;Giacomo Como;Fabio Fagnani
The wide spread of on-line social networks poses new challenges in information environment and cybersecurity. A key issue is detecting stubborn behaviors to identify leaders and influencers for marketing purposes, or extremists and automatic bots as potential threats. Existing literature typically relies on known network topology and extensive centrality measures computation. However, the size of social networks and their often unknown structure could make social influence computation impractical. We propose a new approach based on opinion dynamics to estimate stubborn agents from data. We consider a DeGroot model in which regular agents adjust their opinions as a linear combination of their neighbors’ opinions, whereas stubborn agents keep their opinions constant over time. We formulate the stubborn nodes identification and their influence estimation problems as a low-rank approximation problem. We then propose an interpolative decomposition algorithm for their solution. We determine sufficient conditions on the model parameters to ensure the algorithm’s resilience to noisy observations. Finally, we corroborate our theoretical analysis through numerical results.
在线社交网络的广泛传播给信息环境和网络安全带来了新的挑战。其中一个关键问题是检测顽固行为,以识别出于营销目的的领导者和有影响力者,或作为潜在威胁的极端分子和自动机器人。现有文献通常依赖于已知的网络拓扑结构和大量的中心性度量计算。然而,社交网络的规模及其通常未知的结构会使社会影响力计算变得不切实际。我们提出了一种基于舆论动态的新方法,以从数据中估计顽固分子。我们考虑了一个 DeGroot 模型,在该模型中,常规代理将其意见调整为其邻居意见的线性组合,而顽固代理则将其意见保持不变。我们将顽固节点识别及其影响估计问题表述为低阶近似问题。然后,我们提出了一种插值分解算法来解决这些问题。我们确定了模型参数的充分条件,以确保算法对噪声观测的适应性。最后,我们通过数值结果证实了我们的理论分析。
{"title":"Detecting Stubborn Behaviors in Influence Networks: A Model-Based Approach for Resilient Analysis","authors":"Roberta Raineri;Chiara Ravazzi;Giacomo Como;Fabio Fagnani","doi":"10.1109/LCSYS.2024.3472495","DOIUrl":"https://doi.org/10.1109/LCSYS.2024.3472495","url":null,"abstract":"The wide spread of on-line social networks poses new challenges in information environment and cybersecurity. A key issue is detecting stubborn behaviors to identify leaders and influencers for marketing purposes, or extremists and automatic bots as potential threats. Existing literature typically relies on known network topology and extensive centrality measures computation. However, the size of social networks and their often unknown structure could make social influence computation impractical. We propose a new approach based on opinion dynamics to estimate stubborn agents from data. We consider a DeGroot model in which regular agents adjust their opinions as a linear combination of their neighbors’ opinions, whereas stubborn agents keep their opinions constant over time. We formulate the stubborn nodes identification and their influence estimation problems as a low-rank approximation problem. We then propose an interpolative decomposition algorithm for their solution. We determine sufficient conditions on the model parameters to ensure the algorithm’s resilience to noisy observations. Finally, we corroborate our theoretical analysis through numerical results.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Control Systems Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1