首页 > 最新文献

arXiv - EE - Systems and Control最新文献

英文 中文
An Ultra-Low Power Wearable BMI System with Continual Learning Capabilities 具有持续学习功能的超低功耗可穿戴式 BMI 系统
Pub Date : 2024-09-16 DOI: arxiv-2409.10654
Lan Mei, Thorir Mar Ingolfsson, Cristian Cioflan, Victor Kartsch, Andrea Cossettini, Xiaying Wang, Luca Benini
Driven by the progress in efficient embedded processing, there is anaccelerating trend toward running machine learning models directly on wearableBrain-Machine Interfaces (BMIs) to improve portability and privacy and maximizebattery life. However, achieving low latency and high classificationperformance remains challenging due to the inherent variability ofelectroencephalographic (EEG) signals across sessions and the limited onboardresources. This work proposes a comprehensive BMI workflow based on a CNN-basedContinual Learning (CL) framework, allowing the system to adapt tointer-session changes. The workflow is deployed on a wearable, parallelultra-low power BMI platform (BioGAP). Our results based on two in-housedatasets, Dataset A and Dataset B, show that the CL workflow improves averageaccuracy by up to 30.36% and 10.17%, respectively. Furthermore, whenimplementing the continual learning on a Parallel Ultra-Low Power (PULP)microcontroller (GAP9), it achieves an energy consumption as low as 0.45mJ perinference and an adaptation time of only 21.5ms, yielding around 25h of batterylife with a small 100mAh, 3.7V battery on BioGAP. Our setup, coupled with thecompact CNN model and on-device CL capabilities, meets users' needs forimproved privacy, reduced latency, and enhanced inter-session performance,offering good promise for smart embedded real-world BMIs.
在高效嵌入式处理技术进步的推动下,直接在可穿戴脑机接口(BMI)上运行机器学习模型以提高便携性和隐私性并最大限度延长电池寿命的趋势正在加速。然而,由于脑电图(EEG)信号在不同会话中固有的可变性和有限的板载资源,实现低延迟和高分类性能仍然具有挑战性。这项研究基于基于 CNN 的持续学习(CL)框架,提出了一种全面的 BMI 工作流程,使系统能够适应会话间的变化。该工作流程部署在一个可穿戴的并行超低功耗 BMI 平台(BioGAP)上。我们基于两个室内数据集(数据集 A 和数据集 B)得出的结果表明,CL 工作流将平均准确率分别提高了 30.36% 和 10.17%。此外,在并行超低功耗(PULP)微控制器(GAP9)上实现持续学习时,每次推理的能耗低至 0.45mJ,适应时间仅为 21.5ms,使用 BioGAP 上的 100mAh 3.7V 小电池可获得约 25h 的电池寿命。我们的设置加上紧凑的 CNN 模型和设备上的 CL 功能,满足了用户对提高隐私性、减少延迟和增强会话间性能的需求,为智能嵌入式实际 BMI 提供了良好的前景。
{"title":"An Ultra-Low Power Wearable BMI System with Continual Learning Capabilities","authors":"Lan Mei, Thorir Mar Ingolfsson, Cristian Cioflan, Victor Kartsch, Andrea Cossettini, Xiaying Wang, Luca Benini","doi":"arxiv-2409.10654","DOIUrl":"https://doi.org/arxiv-2409.10654","url":null,"abstract":"Driven by the progress in efficient embedded processing, there is an\u0000accelerating trend toward running machine learning models directly on wearable\u0000Brain-Machine Interfaces (BMIs) to improve portability and privacy and maximize\u0000battery life. However, achieving low latency and high classification\u0000performance remains challenging due to the inherent variability of\u0000electroencephalographic (EEG) signals across sessions and the limited onboard\u0000resources. This work proposes a comprehensive BMI workflow based on a CNN-based\u0000Continual Learning (CL) framework, allowing the system to adapt to\u0000inter-session changes. The workflow is deployed on a wearable, parallel\u0000ultra-low power BMI platform (BioGAP). Our results based on two in-house\u0000datasets, Dataset A and Dataset B, show that the CL workflow improves average\u0000accuracy by up to 30.36% and 10.17%, respectively. Furthermore, when\u0000implementing the continual learning on a Parallel Ultra-Low Power (PULP)\u0000microcontroller (GAP9), it achieves an energy consumption as low as 0.45mJ per\u0000inference and an adaptation time of only 21.5ms, yielding around 25h of battery\u0000life with a small 100mAh, 3.7V battery on BioGAP. Our setup, coupled with the\u0000compact CNN model and on-device CL capabilities, meets users' needs for\u0000improved privacy, reduced latency, and enhanced inter-session performance,\u0000offering good promise for smart embedded real-world BMIs.","PeriodicalId":501175,"journal":{"name":"arXiv - EE - Systems and Control","volume":"26 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142264040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Underapproximating Safe Domains of Attraction for Discrete-Time Systems Using Implicit Representations of Backward Reachable Sets 使用后向可达集的隐式表示法低估离散时间系统的安全吸引域
Pub Date : 2024-09-16 DOI: arxiv-2409.10657
Mohamed Serry, Jun Liu
Analyzing and certifying stability and attractivity of nonlinear systems is atopic of research interest that has been extensively investigated by controltheorists and engineers for many years. Despite that, accurately estimatingdomains of attraction for nonlinear systems remains a challenging task, whereavailable estimation approaches are either conservative or limited tolow-dimensional systems. In this work, we propose an iterative approach toaccurately underapproximate safe (i.e., state-constrained) domains ofattraction for general discrete-time autonomous nonlinear systems. Our approachrelies on implicit representations of safe backward reachable sets of saferegions of attraction, where such regions can be be easily constructed using,e.g., quadratic Lyapunov functions. The iterations of our approach aremonotonic (in the sense of set inclusion), where each iteration results in asafe region of attraction, given as a sublevel set, that underapproximates thesafe domain of attraction. The sublevel set representations of the resultingregions of attraction can be efficiently utilized in verifying the inclusion ofgiven points of interest in the safe domain of attraction. We illustrate ourapproach through two numerical examples, involving two- and four-dimensionalnonlinear systems.
分析和验证非线性系统的稳定性和吸引力是控制理论家和工程师多年来一直广泛研究的课题。尽管如此,准确估计非线性系统的吸引力域仍然是一项具有挑战性的任务,现有的估计方法要么保守,要么局限于低维系统。在这项工作中,我们提出了一种迭代方法,用于准确低估一般离散时间自主非线性系统的安全(即状态受限)吸引域。我们的方法依赖于安全吸引域的安全后向可达集的隐式表示,在这种情况下,可以使用二次李亚普诺夫函数等方法轻松构建此类区域。我们方法的迭代是单调的(在集合包含的意义上),每次迭代的结果都是作为子级集给出的安全吸引区域,它与安全吸引域的近似程度较低。在验证给定兴趣点是否包含在安全吸引域中时,可以有效利用所得到的吸引区域的子级集合表示。我们通过两个涉及二维和四维非线性系统的数值示例来说明我们的方法。
{"title":"Underapproximating Safe Domains of Attraction for Discrete-Time Systems Using Implicit Representations of Backward Reachable Sets","authors":"Mohamed Serry, Jun Liu","doi":"arxiv-2409.10657","DOIUrl":"https://doi.org/arxiv-2409.10657","url":null,"abstract":"Analyzing and certifying stability and attractivity of nonlinear systems is a\u0000topic of research interest that has been extensively investigated by control\u0000theorists and engineers for many years. Despite that, accurately estimating\u0000domains of attraction for nonlinear systems remains a challenging task, where\u0000available estimation approaches are either conservative or limited to\u0000low-dimensional systems. In this work, we propose an iterative approach to\u0000accurately underapproximate safe (i.e., state-constrained) domains of\u0000attraction for general discrete-time autonomous nonlinear systems. Our approach\u0000relies on implicit representations of safe backward reachable sets of safe\u0000regions of attraction, where such regions can be be easily constructed using,\u0000e.g., quadratic Lyapunov functions. The iterations of our approach are\u0000monotonic (in the sense of set inclusion), where each iteration results in a\u0000safe region of attraction, given as a sublevel set, that underapproximates the\u0000safe domain of attraction. The sublevel set representations of the resulting\u0000regions of attraction can be efficiently utilized in verifying the inclusion of\u0000given points of interest in the safe domain of attraction. We illustrate our\u0000approach through two numerical examples, involving two- and four-dimensional\u0000nonlinear systems.","PeriodicalId":501175,"journal":{"name":"arXiv - EE - Systems and Control","volume":"27 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142264041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uniform Ergodicity and Ergodic-Risk Constrained Policy Optimization 均匀遍历性与遍历风险约束政策优化
Pub Date : 2024-09-16 DOI: arxiv-2409.10767
Shahriar Talebi, Na Li
In stochastic systems, risk-sensitive control balances performance withresilience to less likely events. Although existing methods rely onfinite-horizon risk criteria, this paper introduces textit{limiting-riskcriteria} that capture long-term cumulative risks through probabilisticlimiting theorems. Extending the Linear Quadratic Regulation (LQR) framework,we incorporate constraints on these limiting-risk criteria derived from theasymptotic behavior of cumulative costs, accounting for extreme deviations.Using tailored Functional Central Limit Theorems (FCLT), we demonstrate thatthe time-correlated terms in the limiting-risk criteria converge under strongergodicity, and establish conditions for convergence in non-stationary settingswhile characterizing the distribution and providing explicit formulations forthe limiting variance of the risk functional. The FCLT is developed by applyingergodic theory for Markov chains and obtaining textit{uniform ergodicity} ofthe controlled process. For quadratic risk functionals on linear dynamics, inaddition to internal stability, the uniform ergodicity requires the (possiblyheavy-tailed) dynamic noise to have a finite fourth moment. This offers a clearpath to quantifying long-term uncertainty. We also propose a primal-dualconstrained policy optimization method that optimizes the average performancewhile ensuring limiting-risk constraints are satisfied. Our framework offers apractical, theoretically guaranteed approach for long-term risk-sensitivecontrol, backed by convergence guarantees and validations through simulations.
在随机系统中,对风险敏感的控制可以在性能与对较小概率事件的应变能力之间取得平衡。虽然现有的方法依赖于无限视距风险标准,但本文引入了textit{限制风险标准},通过概率限制定理捕捉长期累积风险。通过扩展线性四则运算(LQR)框架,我们在这些极限风险标准中加入了从累积成本渐近行为中得出的约束条件,并考虑到了极端偏差。通过使用量身定制的函数中心极限定理(FCLT),我们证明了极限风险标准中的时间相关项会在较强的正态性条件下收敛,并建立了在非稳态环境下收敛的条件,同时描述了风险函数的分布特征,并为风险函数的极限方差提供了明确的公式。FCLT 是通过应用马尔可夫链的正交理论并获得受控过程的 "均匀正交性"(textit{uniform ergodicity})而发展起来的。对于线性动态的二次风险函数,除了内部稳定性之外,均匀遍历性还要求(可能是重尾的)动态噪声具有有限的第四矩。这为量化长期不确定性提供了一条清晰的途径。我们还提出了一种基元-双约束策略优化方法,在确保满足极限风险约束的同时优化平均性能。我们的框架为长期风险敏感控制提供了一种实用的、理论上有保证的方法,并有收敛性保证和模拟验证。
{"title":"Uniform Ergodicity and Ergodic-Risk Constrained Policy Optimization","authors":"Shahriar Talebi, Na Li","doi":"arxiv-2409.10767","DOIUrl":"https://doi.org/arxiv-2409.10767","url":null,"abstract":"In stochastic systems, risk-sensitive control balances performance with\u0000resilience to less likely events. Although existing methods rely on\u0000finite-horizon risk criteria, this paper introduces textit{limiting-risk\u0000criteria} that capture long-term cumulative risks through probabilistic\u0000limiting theorems. Extending the Linear Quadratic Regulation (LQR) framework,\u0000we incorporate constraints on these limiting-risk criteria derived from the\u0000asymptotic behavior of cumulative costs, accounting for extreme deviations.\u0000Using tailored Functional Central Limit Theorems (FCLT), we demonstrate that\u0000the time-correlated terms in the limiting-risk criteria converge under strong\u0000ergodicity, and establish conditions for convergence in non-stationary settings\u0000while characterizing the distribution and providing explicit formulations for\u0000the limiting variance of the risk functional. The FCLT is developed by applying\u0000ergodic theory for Markov chains and obtaining textit{uniform ergodicity} of\u0000the controlled process. For quadratic risk functionals on linear dynamics, in\u0000addition to internal stability, the uniform ergodicity requires the (possibly\u0000heavy-tailed) dynamic noise to have a finite fourth moment. This offers a clear\u0000path to quantifying long-term uncertainty. We also propose a primal-dual\u0000constrained policy optimization method that optimizes the average performance\u0000while ensuring limiting-risk constraints are satisfied. Our framework offers a\u0000practical, theoretically guaranteed approach for long-term risk-sensitive\u0000control, backed by convergence guarantees and validations through simulations.","PeriodicalId":501175,"journal":{"name":"arXiv - EE - Systems and Control","volume":"118 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142264035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Proximal Gradient Dynamics: Monotonicity, Exponential Convergence, and Applications 近端梯度动力学:单调性、指数收敛及应用
Pub Date : 2024-09-16 DOI: arxiv-2409.10664
Anand Gokhale, Alexander Davydov, Francesco Bullo
In this letter, we study the proximal gradient dynamics. Thisrecently-proposed continuous-time dynamics solves optimization problems whosecost functions are separable into a nonsmooth convex and a smooth component.First, we show that the cost function decreases monotonically along thetrajectories of the proximal gradient dynamics. We then introduce a newcondition that guarantees exponential convergence of the cost function to itsoptimal value, and show that this condition implies the proximalPolyak-{L}ojasiewicz condition. We also show that the proximalPolyak-{L}ojasiewicz condition guarantees exponential convergence of the costfunction. Moreover, we extend these results to time-varying optimizationproblems, providing bounds for equilibrium tracking. Finally, we discussapplications of these findings, including the LASSO problem, quadraticoptimization with polytopic constraints, and certain matrix based problems.
在这封信中,我们研究了近似梯度动力学。首先,我们证明了成本函数沿着近似梯度动力学的轨迹单调递减。然后,我们引入一个新条件,保证成本函数指数收敛到其最优值,并证明这个条件意味着近似波利亚克-{L}ojasiewicz 条件。我们还证明,proximalPolyak-{L}ojasiewicz 条件保证了成本函数的指数收敛。此外,我们还将这些结果扩展到了时变优化问题,为均衡跟踪提供了边界。最后,我们讨论了这些发现的应用,包括 LASSO 问题、带多点约束的二次优化以及某些基于矩阵的问题。
{"title":"Proximal Gradient Dynamics: Monotonicity, Exponential Convergence, and Applications","authors":"Anand Gokhale, Alexander Davydov, Francesco Bullo","doi":"arxiv-2409.10664","DOIUrl":"https://doi.org/arxiv-2409.10664","url":null,"abstract":"In this letter, we study the proximal gradient dynamics. This\u0000recently-proposed continuous-time dynamics solves optimization problems whose\u0000cost functions are separable into a nonsmooth convex and a smooth component.\u0000First, we show that the cost function decreases monotonically along the\u0000trajectories of the proximal gradient dynamics. We then introduce a new\u0000condition that guarantees exponential convergence of the cost function to its\u0000optimal value, and show that this condition implies the proximal\u0000Polyak-{L}ojasiewicz condition. We also show that the proximal\u0000Polyak-{L}ojasiewicz condition guarantees exponential convergence of the cost\u0000function. Moreover, we extend these results to time-varying optimization\u0000problems, providing bounds for equilibrium tracking. Finally, we discuss\u0000applications of these findings, including the LASSO problem, quadratic\u0000optimization with polytopic constraints, and certain matrix based problems.","PeriodicalId":501175,"journal":{"name":"arXiv - EE - Systems and Control","volume":"35 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142264037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
XLM for Autonomous Driving Systems: A Comprehensive Review 用于自动驾驶系统的 XLM:全面回顾
Pub Date : 2024-09-16 DOI: arxiv-2409.10484
Sonda Fourati, Wael Jaafar, Noura Baccar, Safwan Alfattani
Large Language Models (LLMs) have showcased remarkable proficiency in variousinformation-processing tasks. These tasks span from extracting data andsummarizing literature to generating content, predictive modeling,decision-making, and system controls. Moreover, Vision Large Models (VLMs) andMultimodal LLMs (MLLMs), which represent the next generation of languagemodels, a.k.a., XLMs, can combine and integrate many data modalities with thestrength of language understanding, thus advancing several information-basedsystems, such as Autonomous Driving Systems (ADS). Indeed, by combininglanguage communication with multimodal sensory inputs, e.g., panoramic imagesand LiDAR or radar data, accurate driving actions can be taken. In thiscontext, we provide in this survey paper a comprehensive overview of thepotential of XLMs towards achieving autonomous driving. Specifically, we reviewthe relevant literature on ADS and XLMs, including their architectures, tools,and frameworks. Then, we detail the proposed approaches to deploy XLMs forautonomous driving solutions. Finally, we provide the related challenges to XLMdeployment for ADS and point to future research directions aiming to enable XLMadoption in future ADS frameworks.
大型语言模型(LLM)在各种信息处理任务中表现出了非凡的能力。这些任务包括提取数据、总结文献、生成内容、预测建模、决策和系统控制。此外,代表下一代语言模型(又称 XLM)的视觉大模型(VLM)和多模态 LLM(MLLM)可以将多种数据模态与语言理解能力相结合,从而推动自动驾驶系统(ADS)等基于信息的系统的发展。事实上,通过将语言交流与多模态感官输入(如全景图像、激光雷达或雷达数据)相结合,可以采取准确的驾驶行动。在此背景下,我们在本调查报告中全面概述了 XLM 在实现自动驾驶方面的潜力。具体来说,我们回顾了 ADS 和 XLM 的相关文献,包括其架构、工具和框架。然后,我们详细介绍了为自动驾驶解决方案部署 XLM 的建议方法。最后,我们提出了为 ADS 部署 XLM 所面临的相关挑战,并指出了未来的研究方向,旨在使 XLM 在未来的 ADS 框架中得到采用。
{"title":"XLM for Autonomous Driving Systems: A Comprehensive Review","authors":"Sonda Fourati, Wael Jaafar, Noura Baccar, Safwan Alfattani","doi":"arxiv-2409.10484","DOIUrl":"https://doi.org/arxiv-2409.10484","url":null,"abstract":"Large Language Models (LLMs) have showcased remarkable proficiency in various\u0000information-processing tasks. These tasks span from extracting data and\u0000summarizing literature to generating content, predictive modeling,\u0000decision-making, and system controls. Moreover, Vision Large Models (VLMs) and\u0000Multimodal LLMs (MLLMs), which represent the next generation of language\u0000models, a.k.a., XLMs, can combine and integrate many data modalities with the\u0000strength of language understanding, thus advancing several information-based\u0000systems, such as Autonomous Driving Systems (ADS). Indeed, by combining\u0000language communication with multimodal sensory inputs, e.g., panoramic images\u0000and LiDAR or radar data, accurate driving actions can be taken. In this\u0000context, we provide in this survey paper a comprehensive overview of the\u0000potential of XLMs towards achieving autonomous driving. Specifically, we review\u0000the relevant literature on ADS and XLMs, including their architectures, tools,\u0000and frameworks. Then, we detail the proposed approaches to deploy XLMs for\u0000autonomous driving solutions. Finally, we provide the related challenges to XLM\u0000deployment for ADS and point to future research directions aiming to enable XLM\u0000adoption in future ADS frameworks.","PeriodicalId":501175,"journal":{"name":"arXiv - EE - Systems and Control","volume":"32 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142264042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Power Oscillation Damping Controllers for Grid-Forming Power Converters in Modern PowerSystems 现代电力系统中并网型电力转换器的功率振荡阻尼控制器
Pub Date : 2024-09-16 DOI: arxiv-2409.10726
Elia Mateu-Barriendos, Onur Alican, Javier Renedo, Carlos Collados-Rodriguez, Macarena Martin, Edgar Nuño, Eduardo Prieto-Araujo, Oriol Gomis-Bellmunt
Inter-area oscillations have been extensively studied in conventional powersystems dominated by synchronous machines, as well as methods to mitigate them.Several publications have addressed Power Oscillation Damping (POD) controllersin grid-following voltage source converters (GFOL). However, the performance ofPOD controllers for Grid-Forming voltage source converters (GFOR) in modernpower systems with increased penetration of power electronics requires furtherinvestigation. This paper investigates the performance of GFORs andsupplementary POD controllers in the damping of electromechanical oscillationsin modern power systems. This paper proposes POD controllers in GFORs bysupplementary modulation of active- and reactive-power injections of theconverter and both simultaneously (POD- P, POD-Q and POD-PQ, respectively). Theproposed POD controllers use the frequency imposed by the GFOR as the inputsignal, which has a simple implementation and it eliminates the need foradditional measurements. Eigenvalue-sensitivity methods using a synthetic testsystem are applied to the design of POD controllers in GFORs, which is usefulwhen limited information of the power system is available. This paperdemonstrates the effectiveness of POD controllers in GFOR converters to dampelectromechanical oscillations, by small-signal stability analysis andnon-linear time-domain simulations in a small test system and in a large-scalepower system.
在以同步电机为主导的传统电力系统中,人们已经广泛研究了区域间振荡以及缓解振荡的方法。然而,在电力电子技术渗透率不断提高的现代电力系统中,电网电压源变换器(GFOR)的功率振荡抑制(POD)控制器的性能还需要进一步研究。本文研究了现代电力系统中 GFOR 和辅助 POD 控制器在抑制机电振荡方面的性能。本文提出了 GFOR 中的 POD 控制器,即同时对变流器的有功功率注入和无功功率注入进行补充调制(分别为 POD-P、POD-Q 和 POD-PQ)。拟议的 POD 控制器使用 GFOR 施加的频率作为输入信号,实现简单,无需额外测量。使用合成测试系统的特征值灵敏度方法被应用于 GFOR 中 POD 控制器的设计,这在电力系统信息有限的情况下非常有用。本文通过在小型测试系统和大型电力系统中进行小信号稳定性分析和非线性时域仿真,证明了 POD 控制器在 GFOR 变流器中抑制机电振荡的有效性。
{"title":"Power Oscillation Damping Controllers for Grid-Forming Power Converters in Modern PowerSystems","authors":"Elia Mateu-Barriendos, Onur Alican, Javier Renedo, Carlos Collados-Rodriguez, Macarena Martin, Edgar Nuño, Eduardo Prieto-Araujo, Oriol Gomis-Bellmunt","doi":"arxiv-2409.10726","DOIUrl":"https://doi.org/arxiv-2409.10726","url":null,"abstract":"Inter-area oscillations have been extensively studied in conventional power\u0000systems dominated by synchronous machines, as well as methods to mitigate them.\u0000Several publications have addressed Power Oscillation Damping (POD) controllers\u0000in grid-following voltage source converters (GFOL). However, the performance of\u0000POD controllers for Grid-Forming voltage source converters (GFOR) in modern\u0000power systems with increased penetration of power electronics requires further\u0000investigation. This paper investigates the performance of GFORs and\u0000supplementary POD controllers in the damping of electromechanical oscillations\u0000in modern power systems. This paper proposes POD controllers in GFORs by\u0000supplementary modulation of active- and reactive-power injections of the\u0000converter and both simultaneously (POD- P, POD-Q and POD-PQ, respectively). The\u0000proposed POD controllers use the frequency imposed by the GFOR as the input\u0000signal, which has a simple implementation and it eliminates the need for\u0000additional measurements. Eigenvalue-sensitivity methods using a synthetic test\u0000system are applied to the design of POD controllers in GFORs, which is useful\u0000when limited information of the power system is available. This paper\u0000demonstrates the effectiveness of POD controllers in GFOR converters to damp\u0000electromechanical oscillations, by small-signal stability analysis and\u0000non-linear time-domain simulations in a small test system and in a large-scale\u0000power system.","PeriodicalId":501175,"journal":{"name":"arXiv - EE - Systems and Control","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142264324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning 利用深度强化学习消除不确定性,实现安全的社交导航
Pub Date : 2024-09-16 DOI: arxiv-2409.10655
Daniel Flögel, Marcos Gómez Villafañe, Joshua Ransiek, Sören Hohmann
Autonomous mobile robots are increasingly employed in pedestrian-richenvironments where safe navigation and appropriate human interaction arecrucial. While Deep Reinforcement Learning (DRL) enables socially integratedrobot behavior, challenges persist in novel or perturbed scenarios to indicatewhen and why the policy is uncertain. Unknown uncertainty in decision-makingcan lead to collisions or human discomfort and is one reason why safe andrisk-aware navigation is still an open problem. This work introduces a novelapproach that integrates aleatoric, epistemic, and predictive uncertaintyestimation into a DRL-based navigation framework for uncertainty estimates indecision-making. We, therefore, incorporate Observation-Dependent Variance(ODV) and dropout into the Proximal Policy Optimization (PPO) algorithm. Fordifferent types of perturbations, we compare the ability of Deep Ensembles andMonte-Carlo Dropout (MC-Dropout) to estimate the uncertainties of the policy.In uncertain decision-making situations, we propose to change the robot'ssocial behavior to conservative collision avoidance. The results show that theODV-PPO algorithm converges faster with better generalization and disentanglesthe aleatoric and epistemic uncertainties. In addition, the MC-Dropout approachis more sensitive to perturbations and capable to correlate the uncertaintytype to the perturbation type better. With the proposed safe action selectionscheme, the robot can navigate in perturbed environments with fewer collisions.
自主移动机器人越来越多地应用于行人密集的环境中,在这种环境中,安全导航和适当的人机交互至关重要。虽然深度强化学习(DRL)能够实现机器人行为的社会整合,但在新颖或受干扰的场景中,要指明何时以及为何策略不确定,仍然存在挑战。决策中未知的不确定性可能导致碰撞或人类不适,这也是安全和风险感知导航仍是一个未决问题的原因之一。这项工作介绍了一种新方法,它将估计不确定性、认识不确定性和预测不确定性估计整合到基于 DRL 的导航框架中,用于不确定性估计的优柔寡断决策。因此,我们在近端策略优化(PPO)算法中加入了观测依赖方差(ODV)和遗漏(Dropout)。针对不同类型的扰动,我们比较了深度集合(Deep Ensembles)和蒙特卡洛剔除(Monte-Carlo Dropout,MC-Dropout)估计策略不确定性的能力。结果表明,ODV-PPO 算法收敛速度更快,泛化能力更强,并能区分不确定性和认识不确定性。此外,MC-Dropout 方法对扰动更敏感,能够更好地将不确定性类型与扰动类型相关联。利用所提出的安全行动选择方案,机器人可以在扰动环境中以更少的碰撞进行导航。
{"title":"Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning","authors":"Daniel Flögel, Marcos Gómez Villafañe, Joshua Ransiek, Sören Hohmann","doi":"arxiv-2409.10655","DOIUrl":"https://doi.org/arxiv-2409.10655","url":null,"abstract":"Autonomous mobile robots are increasingly employed in pedestrian-rich\u0000environments where safe navigation and appropriate human interaction are\u0000crucial. While Deep Reinforcement Learning (DRL) enables socially integrated\u0000robot behavior, challenges persist in novel or perturbed scenarios to indicate\u0000when and why the policy is uncertain. Unknown uncertainty in decision-making\u0000can lead to collisions or human discomfort and is one reason why safe and\u0000risk-aware navigation is still an open problem. This work introduces a novel\u0000approach that integrates aleatoric, epistemic, and predictive uncertainty\u0000estimation into a DRL-based navigation framework for uncertainty estimates in\u0000decision-making. We, therefore, incorporate Observation-Dependent Variance\u0000(ODV) and dropout into the Proximal Policy Optimization (PPO) algorithm. For\u0000different types of perturbations, we compare the ability of Deep Ensembles and\u0000Monte-Carlo Dropout (MC-Dropout) to estimate the uncertainties of the policy.\u0000In uncertain decision-making situations, we propose to change the robot's\u0000social behavior to conservative collision avoidance. The results show that the\u0000ODV-PPO algorithm converges faster with better generalization and disentangles\u0000the aleatoric and epistemic uncertainties. In addition, the MC-Dropout approach\u0000is more sensitive to perturbations and capable to correlate the uncertainty\u0000type to the perturbation type better. With the proposed safe action selection\u0000scheme, the robot can navigate in perturbed environments with fewer collisions.","PeriodicalId":501175,"journal":{"name":"arXiv - EE - Systems and Control","volume":"47 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142264039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trajectory-Oriented Control Using Gradient Descent: An Unconventional Approach 使用梯度下降的轨迹导向控制:非常规方法
Pub Date : 2024-09-16 DOI: arxiv-2409.10662
Ramin Esmzad, Hamidreza Modares
In this work, we introduce a novel gradient descent-based approach foroptimizing control systems, leveraging a new representation of stableclosed-loop dynamics as a function of two matrices i.e. the step size ordirection matrix and value matrix of the Lyapunov cost function. Thisformulation provides a new framework for analyzing and designing feedbackcontrol laws. We show that any stable closed-loop system can be expressed inthis form with appropriate values for the step size and value matrices.Furthermore, we show that this parameterization of the closed-loop system isequivalent to a linear quadratic regulator for appropriately chosen weightingmatrices. We also show that trajectories can be shaped using this approach toachieve a desired closed-loop behavior.
在这项工作中,我们介绍了一种基于梯度下降的优化控制系统的新方法,该方法利用稳定闭环动态的新表示法作为两个矩阵(即步长方向矩阵和 Lyapunov 成本函数的值矩阵)的函数。这种表述为分析和设计反馈控制法提供了一个新框架。我们证明,任何稳定的闭环系统都可以用这种形式表示,步长矩阵和值矩阵可以取适当的值。此外,我们还证明,对于适当选择的加权矩阵,闭环系统的这种参数化等价于线性二次调节器。我们还证明,使用这种方法可以塑造轨迹,从而实现理想的闭环行为。
{"title":"Trajectory-Oriented Control Using Gradient Descent: An Unconventional Approach","authors":"Ramin Esmzad, Hamidreza Modares","doi":"arxiv-2409.10662","DOIUrl":"https://doi.org/arxiv-2409.10662","url":null,"abstract":"In this work, we introduce a novel gradient descent-based approach for\u0000optimizing control systems, leveraging a new representation of stable\u0000closed-loop dynamics as a function of two matrices i.e. the step size or\u0000direction matrix and value matrix of the Lyapunov cost function. This\u0000formulation provides a new framework for analyzing and designing feedback\u0000control laws. We show that any stable closed-loop system can be expressed in\u0000this form with appropriate values for the step size and value matrices.\u0000Furthermore, we show that this parameterization of the closed-loop system is\u0000equivalent to a linear quadratic regulator for appropriately chosen weighting\u0000matrices. We also show that trajectories can be shaped using this approach to\u0000achieve a desired closed-loop behavior.","PeriodicalId":501175,"journal":{"name":"arXiv - EE - Systems and Control","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142264038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Direct Data-Driven Discounted Infinite Horizon Linear Quadratic Regulator with Robustness Guarantees 具有鲁棒性保证的直接数据驱动贴现无限视距线性二次调节器
Pub Date : 2024-09-16 DOI: arxiv-2409.10703
Ramin Esmzad, Hamidreza Modares
This paper presents a one-shot learning approach with performance androbustness guarantees for the linear quadratic regulator (LQR) control ofstochastic linear systems. Even though data-based LQR control has been widelyconsidered, existing results suffer either from data hungriness due to theinherently iterative nature of the optimization formulation (e.g., valuelearning or policy gradient reinforcement learning algorithms) or from a lackof robustness guarantees in one-shot non-iterative algorithms. To avoid datahungriness while ensuing robustness guarantees, an adaptive dynamic programmingformalization of the LQR is presented that relies on solving a Bellmaninequality. The control gain and the value function are directly learned byusing a control-oriented approach that characterizes the closed-loop systemusing data and a decision variable from which the control is obtained. Thisclosed-loop characterization is noise-dependent. The effect of the closed-loopsystem noise on the Bellman inequality is considered to ensure both robuststability and suboptimal performance despite ignoring the measurement noise. Toensure robust stability, it is shown that this system characterization leads toa closed-loop system with multiplicative and additive noise, enabling theapplication of distributional robust control techniques. The analysis of thesuboptimality gap reveals that robustness can be achieved without the need forregularization or parameter tuning. The simulation results on the active carsuspension problem demonstrate the superiority of the proposed method in termsof robustness and performance gap compared to existing methods.
本文针对随机线性系统的线性二次调节器(LQR)控制,提出了一种具有性能和稳健性保证的单次学习方法。尽管基于数据的 LQR 控制已被广泛考虑,但现有结果要么因优化公式固有的迭代性质(如值学习或策略梯度强化学习算法)而存在数据饥饿问题,要么因单次非迭代算法缺乏鲁棒性保证而受到影响。为了在保证鲁棒性的同时避免数据混乱,本文提出了一种 LQR 的自适应动态编程形式化,它依赖于贝尔曼方程的求解。控制增益和价值函数是通过使用面向控制的方法直接学习的,这种方法使用数据和决策变量来描述闭环系统,并从中获得控制。这种闭环特性取决于噪声。我们考虑了闭环系统噪声对贝尔曼不等式的影响,以确保鲁棒稳定性和次优性能,尽管忽略了测量噪声。为了确保鲁棒稳定性,研究表明这种系统特性会导致闭环系统出现乘法和加法噪声,从而使分布式鲁棒控制技术的应用成为可能。对次优差距的分析表明,鲁棒性可以在不需要规则化或参数调整的情况下实现。对主动汽车悬架问题的仿真结果表明,与现有方法相比,所提方法在鲁棒性和性能差距方面更具优势。
{"title":"Direct Data-Driven Discounted Infinite Horizon Linear Quadratic Regulator with Robustness Guarantees","authors":"Ramin Esmzad, Hamidreza Modares","doi":"arxiv-2409.10703","DOIUrl":"https://doi.org/arxiv-2409.10703","url":null,"abstract":"This paper presents a one-shot learning approach with performance and\u0000robustness guarantees for the linear quadratic regulator (LQR) control of\u0000stochastic linear systems. Even though data-based LQR control has been widely\u0000considered, existing results suffer either from data hungriness due to the\u0000inherently iterative nature of the optimization formulation (e.g., value\u0000learning or policy gradient reinforcement learning algorithms) or from a lack\u0000of robustness guarantees in one-shot non-iterative algorithms. To avoid data\u0000hungriness while ensuing robustness guarantees, an adaptive dynamic programming\u0000formalization of the LQR is presented that relies on solving a Bellman\u0000inequality. The control gain and the value function are directly learned by\u0000using a control-oriented approach that characterizes the closed-loop system\u0000using data and a decision variable from which the control is obtained. This\u0000closed-loop characterization is noise-dependent. The effect of the closed-loop\u0000system noise on the Bellman inequality is considered to ensure both robust\u0000stability and suboptimal performance despite ignoring the measurement noise. To\u0000ensure robust stability, it is shown that this system characterization leads to\u0000a closed-loop system with multiplicative and additive noise, enabling the\u0000application of distributional robust control techniques. The analysis of the\u0000suboptimality gap reveals that robustness can be achieved without the need for\u0000regularization or parameter tuning. The simulation results on the active car\u0000suspension problem demonstrate the superiority of the proposed method in terms\u0000of robustness and performance gap compared to existing methods.","PeriodicalId":501175,"journal":{"name":"arXiv - EE - Systems and Control","volume":"31 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142264325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mitigating Partial Observability in Adaptive Traffic Signal Control with Transformers 利用变压器缓解自适应交通信号控制中的部分可观测性
Pub Date : 2024-09-16 DOI: arxiv-2409.10693
Xiaoyu Wang, Ayal Taitler, Scott Sanner, Baher Abdulhai
Efficient traffic signal control is essential for managing urbantransportation, minimizing congestion, and improving safety and sustainability.Reinforcement Learning (RL) has emerged as a promising approach to enhancingadaptive traffic signal control (ATSC) systems, allowing controllers to learnoptimal policies through interaction with the environment. However, challengesarise due to partial observability (PO) in traffic networks, where agents havelimited visibility, hindering effectiveness. This paper presents theintegration of Transformer-based controllers into ATSC systems to address POeffectively. We propose strategies to enhance training efficiency andeffectiveness, demonstrating improved coordination capabilities in real-worldscenarios. The results showcase the Transformer-based model's ability tocapture significant information from historical observations, leading to bettercontrol policies and improved traffic flow. This study highlights the potentialof leveraging the advanced Transformer architecture to enhance urbantransportation management.
高效的交通信号控制对于管理城市交通、减少拥堵、提高安全性和可持续性至关重要。强化学习(RL)已成为增强自适应交通信号控制系统(ATSC)的一种有前途的方法,它允许控制人员通过与环境的交互来学习最优策略。然而,由于交通网络中的部分可观测性(PO),代理的可视性有限,从而阻碍了系统的有效性,因此挑战也随之而来。本文介绍了如何将基于变压器的控制器集成到 ATSC 系统中,以有效解决部分可观测性问题。我们提出了提高训练效率和效果的策略,并在实际场景中展示了改进的协调能力。研究结果表明,基于变压器的模型能够从历史观测中获取重要信息,从而制定出更好的控制策略并改善交通流量。这项研究强调了利用先进的 Transformer 架构加强城市交通管理的潜力。
{"title":"Mitigating Partial Observability in Adaptive Traffic Signal Control with Transformers","authors":"Xiaoyu Wang, Ayal Taitler, Scott Sanner, Baher Abdulhai","doi":"arxiv-2409.10693","DOIUrl":"https://doi.org/arxiv-2409.10693","url":null,"abstract":"Efficient traffic signal control is essential for managing urban\u0000transportation, minimizing congestion, and improving safety and sustainability.\u0000Reinforcement Learning (RL) has emerged as a promising approach to enhancing\u0000adaptive traffic signal control (ATSC) systems, allowing controllers to learn\u0000optimal policies through interaction with the environment. However, challenges\u0000arise due to partial observability (PO) in traffic networks, where agents have\u0000limited visibility, hindering effectiveness. This paper presents the\u0000integration of Transformer-based controllers into ATSC systems to address PO\u0000effectively. We propose strategies to enhance training efficiency and\u0000effectiveness, demonstrating improved coordination capabilities in real-world\u0000scenarios. The results showcase the Transformer-based model's ability to\u0000capture significant information from historical observations, leading to better\u0000control policies and improved traffic flow. This study highlights the potential\u0000of leveraging the advanced Transformer architecture to enhance urban\u0000transportation management.","PeriodicalId":501175,"journal":{"name":"arXiv - EE - Systems and Control","volume":"92 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142264036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - EE - Systems and Control
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1