首页 > 最新文献

IEEE open journal of control systems最新文献

英文 中文
Human Trust in Robots: A Survey on Trust Models and Their Controls/Robotics Applications 人类对机器人的信任:信任模型及其控制/机器人应用调查
Pub Date : 2023-12-20 DOI: 10.1109/OJCSYS.2023.3345090
Yue Wang;Fangjian Li;Huanfei Zheng;Longsheng Jiang;Maziar Fooladi Mahani;Zhanrui Liao
Trust model is a topic that first gained interest in organizational studies and then human factors in automation. Thanks to recent advances in human-robot interaction (HRI) and human-autonomy teaming, human trust in robots has gained growing interest among researchers and practitioners. This article focuses on a survey of computational models of human-robot trust and their applications in robotics and robot controls. The motivation is to provide an overview of the state-of-the-art computational methods to quantify trust so as to provide feedback and situational awareness in HRI. Different from other existing survey papers on human-robot trust models, we seek to provide in-depth coverage of the trust model categorization, formulation, and analysis, with a focus on their utilization in robotics and robot controls. The paper starts with a discussion of the difference between human-robot trust with general agent-agent trust, interpersonal trust, and human trust in automation and machines. A list of impacting factors for human-robot trust and different trust measurement approaches, and their corresponding scales are summarized. We then review existing computational human-robot trust models and discuss the pros and cons of each category of models. These include performance-centric algebraic, time-series, Markov decision process (MDP)/Partially Observable MDP (POMDP)-based, Gaussian-based, and dynamic Bayesian network (DBN)-based trust models. Following the summary of each computational human-robot trust model, we examine its utilization in robot control applications, if any. We also enumerate the main limitations and open questions in this field and discuss potential future research directions.
信任模型是一个首先在组织研究中引起人们兴趣的话题,随后在自动化领域的人为因素中也引起了人们的兴趣。由于近年来人机交互(HRI)和人机协作的进步,人类对机器人的信任越来越受到研究人员和从业人员的关注。本文重点探讨人机信任的计算模型及其在机器人学和机器人控制中的应用。其目的是概述量化信任的最新计算方法,以便在人机交互中提供反馈和态势感知。与其他现有的关于人机信任模型的调查论文不同,我们力求深入介绍信任模型的分类、制定和分析,重点关注其在机器人学和机器人控制中的应用。本文首先讨论了人类-机器人信任与一般代理-代理信任、人际信任以及人类对自动化和机器的信任之间的区别。本文总结了人与机器人信任的影响因素、不同的信任测量方法及其相应的尺度。然后,我们回顾了现有的计算型人机信任模型,并讨论了各类模型的优缺点。这些模型包括以性能为中心的代数模型、时间序列模型、基于马尔可夫决策过程(MDP)/部分可观测 MDP(POMDP)的模型、基于高斯模型和基于动态贝叶斯网络(DBN)的信任模型。在对每种计算型人机信任模型进行总结后,我们将考察其在机器人控制应用中的使用情况(如果有的话)。我们还列举了这一领域的主要局限性和未决问题,并讨论了潜在的未来研究方向。
{"title":"Human Trust in Robots: A Survey on Trust Models and Their Controls/Robotics Applications","authors":"Yue Wang;Fangjian Li;Huanfei Zheng;Longsheng Jiang;Maziar Fooladi Mahani;Zhanrui Liao","doi":"10.1109/OJCSYS.2023.3345090","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3345090","url":null,"abstract":"Trust model is a topic that first gained interest in organizational studies and then human factors in automation. Thanks to recent advances in human-robot interaction (HRI) and human-autonomy teaming, human trust in robots has gained growing interest among researchers and practitioners. This article focuses on a survey of computational models of human-robot trust and their applications in robotics and robot controls. The motivation is to provide an overview of the state-of-the-art computational methods to quantify trust so as to provide feedback and situational awareness in HRI. Different from other existing survey papers on human-robot trust models, we seek to provide in-depth coverage of the trust model categorization, formulation, and analysis, with a focus on their utilization in robotics and robot controls. The paper starts with a discussion of the difference between human-robot trust with general agent-agent trust, interpersonal trust, and human trust in automation and machines. A list of impacting factors for human-robot trust and different trust measurement approaches, and their corresponding scales are summarized. We then review existing computational human-robot trust models and discuss the pros and cons of each category of models. These include performance-centric algebraic, time-series, Markov decision process (MDP)/Partially Observable MDP (POMDP)-based, Gaussian-based, and dynamic Bayesian network (DBN)-based trust models. Following the summary of each computational human-robot trust model, we examine its utilization in robot control applications, if any. We also enumerate the main limitations and open questions in this field and discuss potential future research directions.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"58-86"},"PeriodicalIF":0.0,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10366819","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139431049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Control Systems Society Information 电气和电子工程师学会控制系统协会信息
Pub Date : 2023-12-15 DOI: 10.1109/OJCSYS.2023.3315635
{"title":"IEEE Control Systems Society Information","authors":"","doi":"10.1109/OJCSYS.2023.3315635","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3315635","url":null,"abstract":"","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"C3-C3"},"PeriodicalIF":0.0,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10361563","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138678663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Open Journal of Control Systems Publication Information IEEE Open Journal of Control Systems 出版信息
Pub Date : 2023-12-15 DOI: 10.1109/OJCSYS.2023.3315631
{"title":"IEEE Open Journal of Control Systems Publication Information","authors":"","doi":"10.1109/OJCSYS.2023.3315631","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3315631","url":null,"abstract":"","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"C2-C2"},"PeriodicalIF":0.0,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10361595","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138713667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human Modeling and Passivity Analysis for Semi-Autonomous Multi-Robot Navigation in Three Dimensions 用于半自主多机器人三维导航的人体建模和被动性分析
Pub Date : 2023-12-15 DOI: 10.1109/OJCSYS.2023.3343598
Takeshi Hatanaka;Takahiro Mochizuki;Takumi Sumino;José M. Maestre;Nikhil Chopra
In this article, we study a one-human-multiple-robot interaction for human-enabled multi-robot navigation in three dimensions. We employ two fully distributed control architectures designed based on human passivity and human passivity shortage. The first half of this article focuses on human modeling and analysis for the passivity-based control architecture through human operation data on a 3-D human-in-the-loop simulator. Specifically, we compare virtual reality (VR) interfaces with a traditional interface, and examine the impacts that VR technology has on human properties in terms of model accuracy, performance, passivity and workload, demonstrating that VR interfaces have a positive effect on all aspects. In contrast to 1-D operation, we confirm that operators hardly attain passivity regardless of the network structure, even with the VR interfaces. We thus take the passivity-shortage-based control architecture and analyze the degree of passivity shortage. We then observe through user studies that operators tend to meet the degree of shortage needed to prove closed-loop stability.
在这篇文章中,我们研究了一个人与多个机器人的互动,以实现人类支持的多机器人三维导航。我们采用了基于人的被动性和人的被动性不足而设计的两种全分布式控制架构。本文的前半部分侧重于通过三维人在环模拟器上的人类操作数据,对基于被动性的控制架构进行人类建模和分析。具体而言,我们将虚拟现实(VR)界面与传统界面进行了比较,并从模型准确性、性能、被动性和工作量等方面考察了 VR 技术对人体特性的影响,结果表明 VR 界面对各方面都有积极影响。与一维操作相比,我们证实,无论网络结构如何,即使使用 VR 界面,操作员也很难达到被动状态。因此,我们采用了基于被动性-短时控制的架构,并分析了被动性不足的程度。然后,我们通过用户研究发现,操作员往往能达到证明闭环稳定性所需的短缺程度。
{"title":"Human Modeling and Passivity Analysis for Semi-Autonomous Multi-Robot Navigation in Three Dimensions","authors":"Takeshi Hatanaka;Takahiro Mochizuki;Takumi Sumino;José M. Maestre;Nikhil Chopra","doi":"10.1109/OJCSYS.2023.3343598","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3343598","url":null,"abstract":"In this article, we study a one-human-multiple-robot interaction for human-enabled multi-robot navigation in three dimensions. We employ two fully distributed control architectures designed based on human passivity and human passivity shortage. The first half of this article focuses on human modeling and analysis for the passivity-based control architecture through human operation data on a 3-D human-in-the-loop simulator. Specifically, we compare virtual reality (VR) interfaces with a traditional interface, and examine the impacts that VR technology has on human properties in terms of model accuracy, performance, passivity and workload, demonstrating that VR interfaces have a positive effect on all aspects. In contrast to 1-D operation, we confirm that operators hardly attain passivity regardless of the network structure, even with the VR interfaces. We thus take the passivity-shortage-based control architecture and analyze the degree of passivity shortage. We then observe through user studies that operators tend to meet the degree of shortage needed to prove closed-loop stability.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"45-57"},"PeriodicalIF":0.0,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10361530","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139431037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Computational Framework for Optimal Adaptive Function Allocation in a Human-Autonomy Teaming Scenario 人类与自动驾驶机器人组队场景中优化自适应功能分配的计算框架
Pub Date : 2023-12-06 DOI: 10.1109/OJCSYS.2023.3340034
Sooyung Byeon;Joonwon Choi;Inseok Hwang
This article proposes a quantitative framework for optimally allocating task functions in human-autonomy teaming (HAT). HAT involves cooperation between humans and autonomous agents to achieve common goals. As humans and autonomous agents possess different capabilities, function allocation plays a crucial role in ensuring effective HAT. However, designing the best adaptive function allocation remains a challenge, as existing methods often rely on qualitative rules and intensive human-subject studies. To address this limitation, we propose a computational function allocation approach that leverages cognitive engineering, computational work models, and optimization techniques. The proposed optimal adaptive function allocation method is composed of three main elements: 1) analyze the teamwork to identify a set of all possible function allocations within a team construction, 2) numerically simulate the teamwork in temporal semantics to explore the interaction of the team with complex environments using the identified function allocations in a trial-and-error manner, and 3) optimize the adaptive function allocation with respect to a given situation such as physical conditions, available information resources, and human mental workload. For the optimization, we utilize performance metrics such as task performance, human mental workload, and coherency in function allocations. To illustrate the effectiveness of the proposed framework, we present a simulated HAT scenario involving a human work model and drone fleet for last-mile delivery in disaster relief operations.
本文提出了一个定量框架,用于优化人类-自主团队合作(HAT)中的任务功能分配。HAT 涉及人类与自主代理之间为实现共同目标而开展的合作。由于人类和自主代理拥有不同的能力,功能分配在确保有效的 HAT 中起着至关重要的作用。然而,设计最佳的自适应功能分配仍然是一项挑战,因为现有的方法通常依赖于定性规则和密集的人体研究。为了解决这一局限性,我们提出了一种计算功能分配方法,该方法利用了认知工程学、计算工作模型和优化技术。所提出的最优自适应功能分配方法由三个主要元素组成:1)分析团队工作,确定团队建设中所有可能的功能分配集合;2)在时间语义上对团队工作进行数值模拟,以试错方式探索团队与复杂环境的互动,并使用确定的功能分配;3)针对给定情况(如物理条件、可用信息资源和人的心理工作量)优化自适应功能分配。在优化过程中,我们利用任务性能、人的精神工作量和功能分配一致性等性能指标。为了说明所提框架的有效性,我们提出了一个模拟的 HAT 场景,其中涉及人类工作模型和无人机队,用于救灾行动中的最后一英里配送。
{"title":"A Computational Framework for Optimal Adaptive Function Allocation in a Human-Autonomy Teaming Scenario","authors":"Sooyung Byeon;Joonwon Choi;Inseok Hwang","doi":"10.1109/OJCSYS.2023.3340034","DOIUrl":"https://doi.org/10.1109/OJCSYS.2023.3340034","url":null,"abstract":"This article proposes a quantitative framework for optimally allocating task functions in human-autonomy teaming (HAT). HAT involves cooperation between humans and autonomous agents to achieve common goals. As humans and autonomous agents possess different capabilities, function allocation plays a crucial role in ensuring effective HAT. However, designing the best adaptive function allocation remains a challenge, as existing methods often rely on qualitative rules and intensive human-subject studies. To address this limitation, we propose a computational function allocation approach that leverages cognitive engineering, computational work models, and optimization techniques. The proposed optimal adaptive function allocation method is composed of three main elements: 1) analyze the teamwork to identify a set of all possible function allocations within a team construction, 2) numerically simulate the teamwork in temporal semantics to explore the interaction of the team with complex environments using the identified function allocations in a trial-and-error manner, and 3) optimize the adaptive function allocation with respect to a given situation such as physical conditions, available information resources, and human mental workload. For the optimization, we utilize performance metrics such as task performance, human mental workload, and coherency in function allocations. To illustrate the effectiveness of the proposed framework, we present a simulated HAT scenario involving a human work model and drone fleet for last-mile delivery in disaster relief operations.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"32-44"},"PeriodicalIF":0.0,"publicationDate":"2023-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10345767","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139041247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Ratio of Reactive to Active Power in Wave Energy Converter Control 论波浪能转换器控制中无功功率与有功功率之比
Pub Date : 2023-11-08 DOI: 10.1109/OJCSYS.2023.3331193
Hafiz Ahsan Said;Demián García-Violini;Nicolás Faedo;John V. Ringwood
Optimal control of wave energy converters (WECs), while converting wave energy into a usable form, such as electricity, may inject (reactive) power into the system at various points in the wave cycle. Though somewhat counter-intuitive, this action usually results in improved overall energy conversion. However, recent experimental results show that, on occasion, reactive power peaks can be significantly in excess of active power levels, leaving device developers with difficult decision in how to rate the power take-off of the system i.e. whether to cater for these high reactive power peaks, or limit power flow to rated (active) levels. The origins of these excessive power peaks are currently poorly understood, creating significant uncertainty in how to deal with them. In this paper, we show that, using both theoretical results and an illustrative simulation case study, under matched controller conditions (impedance-matching optimal condition), for both monochromatic and panchromatic sea-states, that the maximum peak reactive/active power ratio never exceeds unity. However, under mismatched WEC/controller conditions, this peak power ratio can exceed unity, bringing unrealistic demands on the power take-off (PTO) rating. The paper examines the various origins of system/controller mismatch, including modelling error, controller synthesis inaccuracies, and non-ideal PTO behaviour, highlighting the consequences of such errors on reactive power flow levels. This important result points to the need for accurate WEC modeling, while also showing the folly of catering for excessive reactive power peaks.
波浪能转换器(WECs)的优化控制在将波浪能转换成电能等可用形式时,可能会在波浪周期的不同点向系统注入(无功)功率。虽然有些违背直觉,但这种做法通常会改善整体能量转换效果。然而,最近的实验结果表明,有时无功功率峰值会大大超过有功功率水平,这让设备开发人员难以决定如何评定系统的功率输出,即是满足这些高无功功率峰值,还是将功率流限制在额定(有功)水平。目前,人们对这些过高功率峰值的起源知之甚少,因此在如何处理这些峰值方面存在很大的不确定性。在本文中,我们利用理论结果和一个说明性仿真案例研究表明,在匹配控制器条件下(阻抗匹配最佳条件),对于单色和全色海况,最大无功/有功功率峰值比永远不会超过 1。然而,在不匹配的风电机组/控制器条件下,该峰值功率比可能会超过 1,从而对额定功率输出(PTO)提出不切实际的要求。本文探讨了系统/控制器不匹配的各种原因,包括建模错误、控制器合成不准确和非理想的 PTO 行为,并强调了这些错误对无功功率流水平的影响。这一重要结果表明了精确的水力发电建模的必要性,同时也说明了迎合过高的无功功率峰值是愚蠢的。
{"title":"On the Ratio of Reactive to Active Power in Wave Energy Converter Control","authors":"Hafiz Ahsan Said;Demián García-Violini;Nicolás Faedo;John V. Ringwood","doi":"10.1109/OJCSYS.2023.3331193","DOIUrl":"10.1109/OJCSYS.2023.3331193","url":null,"abstract":"Optimal control of wave energy converters (WECs), while converting wave energy into a usable form, such as electricity, may \u0000<italic>inject</i>\u0000 (reactive) power into the system at various points in the wave cycle. Though somewhat counter-intuitive, this action usually results in improved overall energy conversion. However, recent experimental results show that, on occasion, reactive power peaks can be significantly in excess of active power levels, leaving device developers with difficult decision in how to rate the power take-off of the system i.e. whether to cater for these high reactive power peaks, or limit power flow to rated (active) levels. The origins of these excessive power peaks are currently poorly understood, creating significant uncertainty in how to deal with them. In this paper, we show that, using both theoretical results and an illustrative simulation case study, \u0000<italic>under matched controller conditions</i>\u0000 (impedance-matching optimal condition), for both monochromatic and panchromatic sea-states, that the maximum peak reactive/active power ratio \u0000<italic>never exceeds unity</i>\u0000. However, under mismatched WEC/controller conditions, this peak power ratio can exceed unity, bringing unrealistic demands on the power take-off (PTO) rating. The paper examines the various origins of system/controller mismatch, including modelling error, controller synthesis inaccuracies, and non-ideal PTO behaviour, highlighting the consequences of such errors on reactive power flow levels. This important result points to the need for accurate WEC modeling, while also showing the folly of catering for excessive reactive power peaks.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"3 ","pages":"14-31"},"PeriodicalIF":0.0,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10313027","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135560786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vibrational Stabilization of Cluster Synchronization in Oscillator Networks 振子网络中簇同步的振动镇定性
Pub Date : 2023-11-08 DOI: 10.1109/OJCSYS.2023.3331195
Yuzhen Qin;Alberto Maria Nobili;Danielle S. Bassett;Fabio Pasqualetti
Cluster synchronization is of great importance for the normal functioning of numerous technological and natural systems. Deviations from normal cluster synchronization patterns are closely associated with various malfunctions, such as neurological disorders in the brain. Therefore, it is crucial to restore normal system functions by stabilizing the appropriate cluster synchronization patterns. Most existing studies focus on designing controllers based on state measurements to achieve system stabilization. However, in many real-world scenarios, measuring system states in real time, such as neuronal activity in the brain, poses significant challenges, rendering the stabilization of such systems difficult. To overcome this challenge, in this article, we employ an open-loop control strategy, vibrational control, which does not require any state measurements. We establish some sufficient conditions under which vibrational inputs stabilize cluster synchronization. Further, we provide a tractable approach to design vibrational control. Finally, numerical experiments are conducted to demonstrate our theoretical findings.
集群同步对于众多技术系统和自然系统的正常运行具有重要意义。偏离正常的集群同步模式与各种功能障碍密切相关,例如大脑中的神经系统疾病。因此,通过稳定适当的集群同步模式来恢复正常的系统功能至关重要。现有的研究大多集中在设计基于状态测量的控制器来实现系统的稳定。然而,在许多现实场景中,实时测量系统状态(如大脑中的神经元活动)带来了重大挑战,使此类系统难以稳定。为了克服这一挑战,在本文中,我们采用了一种开环控制策略,即振动控制,它不需要任何状态测量。建立了振动输入稳定集群同步的充分条件。此外,我们提供了一种易于处理的方法来设计振动控制。最后,通过数值实验对理论结果进行了验证。
{"title":"Vibrational Stabilization of Cluster Synchronization in Oscillator Networks","authors":"Yuzhen Qin;Alberto Maria Nobili;Danielle S. Bassett;Fabio Pasqualetti","doi":"10.1109/OJCSYS.2023.3331195","DOIUrl":"10.1109/OJCSYS.2023.3331195","url":null,"abstract":"Cluster synchronization is of great importance for the normal functioning of numerous technological and natural systems. Deviations from normal cluster synchronization patterns are closely associated with various malfunctions, such as neurological disorders in the brain. Therefore, it is crucial to restore normal system functions by stabilizing the appropriate cluster synchronization patterns. Most existing studies focus on designing controllers based on state measurements to achieve system stabilization. However, in many real-world scenarios, measuring system states in real time, such as neuronal activity in the brain, poses significant challenges, rendering the stabilization of such systems difficult. To overcome this challenge, in this article, we employ an open-loop control strategy, \u0000<italic>vibrational control</i>\u0000, which does not require any state measurements. We establish some sufficient conditions under which vibrational inputs stabilize cluster synchronization. Further, we provide a tractable approach to design vibrational control. Finally, numerical experiments are conducted to demonstrate our theoretical findings.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"439-453"},"PeriodicalIF":0.0,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10313029","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135559909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Compositional Reinforcement Learning for Discrete-Time Stochastic Control Systems 离散随机控制系统的组合强化学习
Pub Date : 2023-11-01 DOI: 10.1109/OJCSYS.2023.3329394
Abolfazl Lavaei;Mateo Perez;Milad Kazemi;Fabio Somenzi;Sadegh Soudjani;Ashutosh Trivedi;Majid Zamani
We propose a compositional approach to synthesize policies for networks of continuous-space stochastic control systems with unknown dynamics using model-free reinforcement learning (RL). The approach is based on implicitly abstracting each subsystem in the network with a finite Markov decision process with unknown transition probabilities, synthesizing a strategy for each abstract model in an assume-guarantee fashion using RL, and then mapping the results back over the original network with approximate optimality guarantees. We provide lower bounds on the satisfaction probability of the overall network based on those over individual subsystems. A key contribution is to leverage the convergence results for adversarial RL (minimax Q-learning) on finite stochastic arenas to provide control strategies maximizing the probability of satisfaction over the network of continuous-space systems. We consider finite-horizon properties expressed in the syntactically co-safe fragment of linear temporal logic. These properties can readily be converted into automata-based reward functions, providing scalar reward signals suitable for RL. Since such reward functions are often sparse, we supply a potential-based reward shaping technique to accelerate learning by producing dense rewards. The effectiveness of the proposed approaches is demonstrated via two physical benchmarks including regulation of a room temperature network and control of a road traffic network.
我们提出了一种组合方法,利用无模型强化学习(RL)来综合具有未知动力学的连续空间随机控制系统网络的策略。该方法基于用未知转移概率的有限马尔可夫决策过程隐式抽象网络中的每个子系统,使用RL以假设-保证的方式综合每个抽象模型的策略,然后将结果映射回原始网络并提供近似最优性保证。我们在单个子系统的基础上给出了整个网络的满足概率的下界。一个关键的贡献是利用有限随机领域上对抗性RL (minimax Q-learning)的收敛结果来提供最大化连续空间系统网络满足概率的控制策略。我们考虑线性时间逻辑的语法共安全片段中表达的有限视界性质。这些属性可以很容易地转换为基于自动机的奖励函数,提供适合强化学习的标量奖励信号。由于这种奖励函数通常是稀疏的,我们提供了一种基于潜在的奖励塑造技术,通过产生密集的奖励来加速学习。通过两个物理基准,包括室温网络的调节和道路交通网络的控制,证明了所提出方法的有效性。
{"title":"Compositional Reinforcement Learning for Discrete-Time Stochastic Control Systems","authors":"Abolfazl Lavaei;Mateo Perez;Milad Kazemi;Fabio Somenzi;Sadegh Soudjani;Ashutosh Trivedi;Majid Zamani","doi":"10.1109/OJCSYS.2023.3329394","DOIUrl":"10.1109/OJCSYS.2023.3329394","url":null,"abstract":"We propose a compositional approach to synthesize policies for networks of continuous-space stochastic control systems with unknown dynamics using model-free reinforcement learning (RL). The approach is based on \u0000<italic>implicitly</i>\u0000 abstracting each subsystem in the network with a finite Markov decision process with \u0000<italic>unknown</i>\u0000 transition probabilities, synthesizing a strategy for each abstract model in an assume-guarantee fashion using RL, and then mapping the results back over the original network with \u0000<italic>approximate optimality</i>\u0000 guarantees. We provide lower bounds on the satisfaction probability of the overall network based on those over individual subsystems. A key contribution is to leverage the convergence results for adversarial RL (minimax Q-learning) on finite stochastic arenas to provide control strategies maximizing the probability of satisfaction over the network of continuous-space systems. We consider \u0000<italic>finite-horizon</i>\u0000 properties expressed in the syntactically co-safe fragment of linear temporal logic. These properties can readily be converted into automata-based reward functions, providing scalar reward signals suitable for RL. Since such reward functions are often sparse, we supply a potential-based \u0000<italic>reward shaping</i>\u0000 technique to accelerate learning by producing dense rewards. The effectiveness of the proposed approaches is demonstrated via two physical benchmarks including regulation of a room temperature network and control of a road traffic network.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"425-438"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10304199","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135362318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Cluster Synchronization as a Mechanism of Free Recall in Working Memory Networks 集群同步是工作记忆网络中的自由回忆机制
Pub Date : 2023-10-30 DOI: 10.1109/OJCSYS.2023.3328201
Matin Jafarian;David Chávez Huerta;Gianluca Villani;Anders Lansner;Karl H. Johansson
This article studies free recall, i.e., the reactivation of stored memory items, namely patterns, in any order, of a model of working memory. Our free recall model is based on a biologically plausible modular neural network composed of $H$ modules, namely hypercolumns, each of which is a bundle of $M$ minicolumns. The coupling weights and constant bias values of the network are determined by a Hebbian plasticity rule. Using techniques from nonlinear stability theory, we show that cluster synchronization is the central mechanism governing free recall of orthogonally encoded patterns. Particularly, we show that free recall's cluster synchronization is the combination of two main mechanisms: simultaneous activities of minicolumns representing an encoded pattern, i.e., within-pattern synchronization, together with time-divided activities of minicolumns representing different patterns. We characterize the coupling and bias value conditions under which cluster synchronization emerges. We also discuss the role of heterogeneous coupling weights and bias values of minicolumns' dynamics in free recall. Specifically, we compare the behaviour of two $H times 2$ networks with identical and non-identical coupling weights and bias values. For these two networks, we obtain bounds on couplings and bias values under which both encoded patterns are recalled. Our analysis shows that having non-identical couplings and bias values for different patterns increases the possibility of their free recall. Numerical simulations are given to validate the theoretical analysis.
本文研究的是自由回忆,即重新激活工作记忆模型中存储的记忆项(即任意顺序的模式)。我们的自由回忆模型基于一个生物学上可信的模块化神经网络,该网络由 $H$ 模块(即超柱)组成,每个模块都是一束 $M$ 小柱。网络的耦合权重和恒定偏置值由海比可塑性规则决定。利用非线性稳定性理论的技术,我们证明了集群同步是支配正交编码模式自由回忆的核心机制。特别是,我们证明了自由回忆的集群同步是两种主要机制的结合:代表一种编码模式的小柱的同时活动(即模式内同步),以及代表不同模式的小柱的分时活动。我们描述了集群同步出现的耦合和偏置值条件。我们还讨论了小柱动态的异质耦合权重和偏置值在自由回忆中的作用。具体来说,我们比较了两个具有相同和非相同耦合权重和偏置值的 $H times 2$ 网络的行为。对于这两个网络,我们得到了耦合和偏置值的边界,在此边界下,两种编码模式都能被召回。我们的分析表明,不同模式的非相同耦合和偏置值增加了它们被自由调用的可能性。我们给出了数值模拟来验证理论分析。
{"title":"Cluster Synchronization as a Mechanism of Free Recall in Working Memory Networks","authors":"Matin Jafarian;David Chávez Huerta;Gianluca Villani;Anders Lansner;Karl H. Johansson","doi":"10.1109/OJCSYS.2023.3328201","DOIUrl":"10.1109/OJCSYS.2023.3328201","url":null,"abstract":"This article studies free recall, i.e., the reactivation of stored memory items, namely \u0000<italic>patterns</i>\u0000, in any order, of a model of working memory. Our free recall model is based on a biologically plausible modular neural network composed of \u0000<inline-formula><tex-math>$H$</tex-math></inline-formula>\u0000 modules, namely \u0000<italic>hypercolumns</i>\u0000, each of which is a bundle of \u0000<inline-formula><tex-math>$M$</tex-math></inline-formula>\u0000 \u0000<italic>minicolumns</i>\u0000. The coupling weights and constant bias values of the network are determined by a Hebbian plasticity rule. Using techniques from nonlinear stability theory, we show that cluster synchronization is the central mechanism governing free recall of orthogonally encoded patterns. Particularly, we show that free recall's cluster synchronization is the combination of two main mechanisms: simultaneous activities of minicolumns representing an encoded pattern, i.e., within-pattern synchronization, together with time-divided activities of minicolumns representing different patterns. We characterize the coupling and bias value conditions under which cluster synchronization emerges. We also discuss the role of heterogeneous coupling weights and bias values of minicolumns' dynamics in free recall. Specifically, we compare the behaviour of two \u0000<inline-formula><tex-math>$H times 2$</tex-math></inline-formula>\u0000 networks with identical and non-identical coupling weights and bias values. For these two networks, we obtain bounds on couplings and bias values under which both encoded patterns are recalled. Our analysis shows that having non-identical couplings and bias values for different patterns increases the possibility of their free recall. Numerical simulations are given to validate the theoretical analysis.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"454-463"},"PeriodicalIF":0.0,"publicationDate":"2023-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10301556","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135313278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Balancing Agility and Communication: Denser Networks Require Faster Agents 平衡敏捷性与通信:密集的网络需要更快的代理
Pub Date : 2023-10-12 DOI: 10.1109/OJCSYS.2023.3324274
Yu-Mei Huang;Arthur C. B. de Oliveira;Dinesh Murugan;Milad Siami
This article delves into the challenges of ensuring stability (in some sense) and robustness in large-scale second-order consensus networks (SOCNs) and autonomous vehicle platoons in the discrete-time domain. We propose a graph-theoretic methodology for designing a state feedback law for these systems in a discrete-time framework. By analyzing the behavior of the solutions of the networks based on the algebraic properties of the Laplacian matrices of the underlying graphs and on the value of the update cycle (also known as the time step) of each vehicle, we provide a necessary and sufficient condition for the stability of a linear second-order consensus network in the discrete-time domain. We then perform an $mathcal {H}_{2}$-based robustness analysis to demonstrate the relationship between the $mathcal {H}_{2}$-norm of the system, network size, connectivity, and update cycles, providing insights into how these factors impact the convergence and robustness of the system. A key contribution of this work is the development of a formal framework for understanding the link between an $mathcal {H}_{2}$-based performance measure and the restrictions on the update cycle of the vehicles. Specifically, we show that denser networks (i.e., networks with more communication links) might require faster agents (i.e., smaller update cycles) to outperform or achieve the same level of robustness as sparse networks (i.e., networks with fewer communication links) - see Fig. 1. These findings have important implications for the design and implementation of large-scale consensus networks and autonomous vehicle platoons, highlighting the need for a balance between network density and update cycle speed for optimal performance. We finish the article with results from simulations and experiments that illustrate the effectiveness of the proposed framework in predicting the behavior of vehicle platoons, even for more complex agents with nonlinear dynamics, using Quanser's Qlabs and Qcars.
本文深入探讨了在离散时间域中确保大规模二阶共识网络(SOCN)和自主车辆排的稳定性(在某种意义上)和鲁棒性所面临的挑战。我们提出了一种图论方法,用于在离散时间框架下为这些系统设计状态反馈法则。通过分析基于底层图的拉普拉斯矩阵的代数特性和每辆车的更新周期(也称为时间步长)值的网络解的行为,我们提供了离散时域中线性二阶共识网络稳定性的必要条件和充分条件。然后,我们进行了基于 $mathcal {H}_{2}$ 的鲁棒性分析,证明了系统的 $mathcal {H}_{2}$ 矩阵、网络大小、连通性和更新周期之间的关系,从而深入了解了这些因素如何影响系统的收敛性和鲁棒性。这项工作的一个重要贡献是建立了一个正式框架,用于理解基于 $mathcal {H}_{2}$ 的性能指标与车辆更新周期限制之间的联系。具体来说,我们表明,密集网络(即具有更多通信链路的网络)可能需要更快的代理(即更短的更新周期)才能优于稀疏网络(即具有较少通信链路的网络)或达到与之相同的鲁棒性水平--见图 1。这些发现对大规模共识网络和自动驾驶汽车排的设计和实施具有重要意义,强调了在网络密度和更新周期速度之间保持平衡以获得最佳性能的必要性。文章最后,我们介绍了使用 Quanser 的 Qlabs 和 Qcars 进行模拟和实验的结果,这些结果表明了所提出的框架在预测车辆排的行为方面的有效性,甚至对于具有非线性动力学的更复杂的代理也是如此。
{"title":"Balancing Agility and Communication: Denser Networks Require Faster Agents","authors":"Yu-Mei Huang;Arthur C. B. de Oliveira;Dinesh Murugan;Milad Siami","doi":"10.1109/OJCSYS.2023.3324274","DOIUrl":"10.1109/OJCSYS.2023.3324274","url":null,"abstract":"This article delves into the challenges of ensuring stability (in some sense) and robustness in large-scale second-order consensus networks (SOCNs) and autonomous vehicle platoons in the discrete-time domain. We propose a graph-theoretic methodology for designing a state feedback law for these systems in a discrete-time framework. By analyzing the behavior of the solutions of the networks based on the algebraic properties of the Laplacian matrices of the underlying graphs and on the value of the update cycle (also known as the time step) of each vehicle, we provide a necessary and sufficient condition for the stability of a linear second-order consensus network in the discrete-time domain. We then perform an \u0000<inline-formula><tex-math>$mathcal {H}_{2}$</tex-math></inline-formula>\u0000-based robustness analysis to demonstrate the relationship between the \u0000<inline-formula><tex-math>$mathcal {H}_{2}$</tex-math></inline-formula>\u0000-norm of the system, network size, connectivity, and update cycles, providing insights into how these factors impact the convergence and robustness of the system. A key contribution of this work is the development of a formal framework for understanding the link between an \u0000<inline-formula><tex-math>$mathcal {H}_{2}$</tex-math></inline-formula>\u0000-based performance measure and the restrictions on the update cycle of the vehicles. Specifically, we show that denser networks (i.e., networks with more communication links) might require faster agents (i.e., smaller update cycles) to outperform or achieve the same level of robustness as sparse networks (i.e., networks with fewer communication links) - see Fig. 1. These findings have important implications for the design and implementation of large-scale consensus networks and autonomous vehicle platoons, highlighting the need for a balance between network density and update cycle speed for optimal performance. We finish the article with results from simulations and experiments that illustrate the effectiveness of the proposed framework in predicting the behavior of vehicle platoons, even for more complex agents with nonlinear dynamics, using Quanser's Qlabs and Qcars.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"2 ","pages":"464-476"},"PeriodicalIF":0.0,"publicationDate":"2023-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10283896","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136304144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE open journal of control systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1