首页 > 最新文献

Ieee-Caa Journal of Automatica Sinica最新文献

英文 中文
Urban Traffic Control Meets Decision Recommendation System: A Survey and Perspective 城市交通管制与决策建议系统:调查与展望
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-04 DOI: 10.1109/JAS.2024.124659
Qingyuan Ji;Xiaoyue Wen;Junchen Jin;Yongdong Zhu;Yisheng Lv
Urban traffic control is a multifaceted and demanding task that necessitates extensive decision-making to ensure the safety and efficiency of urban transportation systems. Traditional approaches require traffic signal professionals to manually intervene on traffic control devices at the intersection level, utilizing their knowledge and expertise. However, this process is cumbersome, labor-intensive, and cannot be applied on a large network scale. Recent studies have begun to explore the applicability of recommendation system for urban traffic control, which offer increased control efficiency and scalability. Such a decision recommendation system is complex, with various interdependent components, but a systematic literature review has not yet been conducted. In this work, we present an up-to-date survey that elucidates all the detailed components of a recommendation system for urban traffic control, demonstrates the utility and efficacy of such a system in the real world using data and knowledge-driven approaches, and discusses the current challenges and potential future directions of this field.
城市交通管制是一项多方面的艰巨任务,需要进行广泛的决策,以确保城市交通系统的安全和效率。传统方法要求交通信号专业人员利用自己的知识和专长,在交叉路口层面对交通控制设备进行人工干预。然而,这一过程既繁琐又耗费人力,而且无法应用于大型网络规模。最近的研究已开始探索推荐系统在城市交通控制中的适用性,该系统可提高控制效率和可扩展性。这种决策推荐系统非常复杂,包含各种相互依存的组成部分,但目前还没有系统的文献综述。在这项工作中,我们提出了一项最新调查,阐明了城市交通控制推荐系统的所有详细组成部分,利用数据和知识驱动方法展示了这种系统在现实世界中的实用性和有效性,并讨论了该领域当前面临的挑战和潜在的未来发展方向。
{"title":"Urban Traffic Control Meets Decision Recommendation System: A Survey and Perspective","authors":"Qingyuan Ji;Xiaoyue Wen;Junchen Jin;Yongdong Zhu;Yisheng Lv","doi":"10.1109/JAS.2024.124659","DOIUrl":"https://doi.org/10.1109/JAS.2024.124659","url":null,"abstract":"Urban traffic control is a multifaceted and demanding task that necessitates extensive decision-making to ensure the safety and efficiency of urban transportation systems. Traditional approaches require traffic signal professionals to manually intervene on traffic control devices at the intersection level, utilizing their knowledge and expertise. However, this process is cumbersome, labor-intensive, and cannot be applied on a large network scale. Recent studies have begun to explore the applicability of recommendation system for urban traffic control, which offer increased control efficiency and scalability. Such a decision recommendation system is complex, with various interdependent components, but a systematic literature review has not yet been conducted. In this work, we present an up-to-date survey that elucidates all the detailed components of a recommendation system for urban traffic control, demonstrates the utility and efficacy of such a system in the real world using data and knowledge-driven approaches, and discusses the current challenges and potential future directions of this field.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 10","pages":"2043-2058"},"PeriodicalIF":15.3,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142137523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural Network-Based State Estimation for Nonlinear Systems with Denial-of-Service Attack Under Try-Once-Discard Protocol 基于神经网络的拒绝服务攻击非线性系统状态估计(试一试协议下
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-04 DOI: 10.1109/JAS.2023.123690
Xueli Wang;Shangwei Zhao;Ming Yang;Xin Wang;Xiaoming Wu
Dear Editor, This letter deals with state estimation issues of discrete-time non-linear systems subject to denial-of-service (DoS) attacks under the try-once-discard (TOD) protocol. More specifically, to reduce the communication burden, a TOD protocol with novel update rules on protocol weights is designed for scheduling measurement outputs. In addition, unknown nonlinear functions vulnerable to DoS attacks are considered due to the openness and vulnerability of the network. For such systems, the neural networks (NNs) are exploited to estimate the unknown nonlinear system dynamics in the designed Luenberger-like observer. With the help of Lyapunov theory, some sufficient conditions are derived under which the estimation error and the approximation errors of NNs weights are uniformly ultimately bounded (UUB). Finally, the validity of designed observers is demonstrated by a power system example.
亲爱的编辑,这封信讨论了在 "尝试-一次-丢弃"(TOD)协议下受到拒绝服务(DoS)攻击的离散-时间非线性系统的状态估计问题。更具体地说,为了减少通信负担,我们设计了一种具有新颖的协议权重更新规则的 TOD 协议,用于调度测量输出。此外,由于网络的开放性和脆弱性,还考虑了容易受到 DoS 攻击的未知非线性函数。对于此类系统,利用神经网络(NN)来估计所设计的类似卢恩贝格尔观测器的未知非线性系统动态。在 Lyapunov 理论的帮助下,推导出了一些充分条件,在这些条件下,神经网络权重的估计误差和近似误差是均匀最终有界的(UUB)。最后,通过一个电力系统实例证明了所设计的观测器的有效性。
{"title":"Neural Network-Based State Estimation for Nonlinear Systems with Denial-of-Service Attack Under Try-Once-Discard Protocol","authors":"Xueli Wang;Shangwei Zhao;Ming Yang;Xin Wang;Xiaoming Wu","doi":"10.1109/JAS.2023.123690","DOIUrl":"https://doi.org/10.1109/JAS.2023.123690","url":null,"abstract":"Dear Editor, This letter deals with state estimation issues of discrete-time non-linear systems subject to denial-of-service (DoS) attacks under the try-once-discard (TOD) protocol. More specifically, to reduce the communication burden, a TOD protocol with novel update rules on protocol weights is designed for scheduling measurement outputs. In addition, unknown nonlinear functions vulnerable to DoS attacks are considered due to the openness and vulnerability of the network. For such systems, the neural networks (NNs) are exploited to estimate the unknown nonlinear system dynamics in the designed Luenberger-like observer. With the help of Lyapunov theory, some sufficient conditions are derived under which the estimation error and the approximation errors of NNs weights are uniformly ultimately bounded (UUB). Finally, the validity of designed observers is demonstrated by a power system example.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 10","pages":"2182-2184"},"PeriodicalIF":15.3,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10664600","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142137598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable Temporal Dimension Preserved Tensor Completion for Missing Traffic Data Imputation with Orthogonal Initialization 利用正交初始化对缺失交通数据进行可扩展的时空维度保留张量补全
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-04 DOI: 10.1109/JAS.2024.124278
Hong Chen;Mingwei Lin;Jiaqi Liu;Zeshui Xu
Dear Editor, This letter puts forward a novel scalable temporal dimension preserved tensor completion model based on orthogonal initialization for missing traffic data (MTD) imputation. The MTD imputation acts directly on accessing the traffic state, and affects the traffic management. However, it still faces the following challenges: 1) The MTD imputation is usually formulated as matrix completion or tensor completion, which ignores the information across different dimensions; 2) Most of the existing models cannot generalize to traffic datasets of different scales or different missing rates; and 3) The MTD imputation models based on Gaussian random initialization easily leads to gradient explosion or vanishing, so that the training accuracy is not effectively improved. Inspired by these findings, the proposed scalable temporal dimension preserved tensor completion (ST-DPTC) model creatively establishes the following three-fold ideas: a) Incorporating the dimension preserved tensor completion (DPTC) to extract more distinctive traffic structure changes from the low-rank latent factor tensors; b) Adopting a scalable temporal (ST) regularization with first-order difference and second-order difference operators to adapt to different scales of traffic data; and c) Embedding ST regularization into DPTC with orthogonal initialization to perform low-rank latent factor tensor extraction and MTD imputation. Results on real-world traffic datasets with different scales show that our proposed model exceeds the state-of-the-art models in terms of the imputation accuracy.
亲爱的编辑,这封信提出了一种基于正交初始化的新型可扩展时维保留张量补全模型,用于缺失交通数据(MTD)估算。MTD 估算直接作用于访问流量状态,并影响流量管理。然而,它仍面临以下挑战:1)MTD 估算通常被表述为矩阵补全或张量补全,忽略了不同维度的信息;2)现有模型大多无法泛化到不同尺度或不同缺失率的交通数据集;3)基于高斯随机初始化的 MTD 估算模型容易导致梯度爆炸或消失,从而无法有效提高训练精度。受这些发现的启发,所提出的可扩展时维保留张量补全(ST-DPTC)模型创造性地建立了以下三方面的思想:a) 结合维度保留张量补全(DPTC),从低阶潜因子张量中提取更独特的交通结构变化;b) 采用一阶差分和二阶差分算子的可扩展时间(ST)正则化,以适应不同尺度的交通数据;以及 c) 将 ST 正则化嵌入 DPTC,并进行正交初始化,以执行低阶潜因子张量提取和 MTD 估算。在不同规模的真实交通数据集上的结果表明,我们提出的模型在估算准确性方面超过了最先进的模型。
{"title":"Scalable Temporal Dimension Preserved Tensor Completion for Missing Traffic Data Imputation with Orthogonal Initialization","authors":"Hong Chen;Mingwei Lin;Jiaqi Liu;Zeshui Xu","doi":"10.1109/JAS.2024.124278","DOIUrl":"https://doi.org/10.1109/JAS.2024.124278","url":null,"abstract":"Dear Editor, This letter puts forward a novel scalable temporal dimension preserved tensor completion model based on orthogonal initialization for missing traffic data (MTD) imputation. The MTD imputation acts directly on accessing the traffic state, and affects the traffic management. However, it still faces the following challenges: 1) The MTD imputation is usually formulated as matrix completion or tensor completion, which ignores the information across different dimensions; 2) Most of the existing models cannot generalize to traffic datasets of different scales or different missing rates; and 3) The MTD imputation models based on Gaussian random initialization easily leads to gradient explosion or vanishing, so that the training accuracy is not effectively improved. Inspired by these findings, the proposed scalable temporal dimension preserved tensor completion (ST-DPTC) model creatively establishes the following three-fold ideas: a) Incorporating the dimension preserved tensor completion (DPTC) to extract more distinctive traffic structure changes from the low-rank latent factor tensors; b) Adopting a scalable temporal (ST) regularization with first-order difference and second-order difference operators to adapt to different scales of traffic data; and c) Embedding ST regularization into DPTC with orthogonal initialization to perform low-rank latent factor tensor extraction and MTD imputation. Results on real-world traffic datasets with different scales show that our proposed model exceeds the state-of-the-art models in terms of the imputation accuracy.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 10","pages":"2188-2190"},"PeriodicalIF":15.3,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10664599","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142137608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Sequential and Structural Dependencies Between Nucleotides for RNA N6-Methyladenosine Site Identification 学习核苷酸之间的序列和结构依赖性以识别 RNA N6-甲基腺苷位点
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-04 DOI: 10.1109/JAS.2024.124233
Guodong Li;Bowei Zhao;Xiaorui Su;Dongxu Li;Yue Yang;Zhi Zeng;Lun Hu
N6-methyladenosine (m6A) is an important RNA methylation modification involved in regulating diverse biological processes across multiple species. Hence, the identification of m6A modification sites provides valuable insight into the biological mechanisms of complex diseases at the post-transcriptional level. Although a variety of identification algorithms have been proposed recently, most of them capture the features of m6A modification sites by focusing on the sequential dependencies of nucleotides at different positions in RNA sequences, while ignoring the structural dependencies of nucleotides in their three-dimensional structures. To overcome this issue, we propose a cross-species end-to-end deep learning model, namely CR-NSSD, which conduct a cross-domain representation learning process integrating nucleotide structural and sequential dependencies for RNA m6A site identification. Specifically, CR-NSSD first obtains the pre-coded representations of RNA sequences by incorporating the position information into single-nucleotide states with chaos game representation theory. It then constructs a cross-domain reconstruction encoder to learn the sequential and structural dependencies between nucleotides. By minimizing the reconstruction and binary cross-entropy losses, CR-NSSD is trained to complete the task of m6A site identification. Extensive experiments have demonstrated the promising performance of CR-NSSD by comparing it with several state-of-the-art m6A identification algorithms. Moreover, the results of cross-species prediction indicate that the integration of sequential and structural dependencies allows CR-NSSD to capture general features of m6A modification sites among different species, thus improving the accuracy of cross-species identification.
N6-甲基腺苷(m6A)是一种重要的 RNA 甲基化修饰,参与调控多个物种的多种生物过程。因此,对 m6A 修饰位点的鉴定可在转录后水平为复杂疾病的生物学机制提供有价值的见解。虽然近来提出了多种识别算法,但它们大多只关注 RNA 序列中不同位置核苷酸的序列依赖关系,而忽略了核苷酸在其三维结构中的结构依赖关系,因而无法捕捉 m6A 修饰位点的特征。为了克服这一问题,我们提出了一种跨物种端到端深度学习模型,即CR-NSSD,该模型进行了跨域表征学习,将核苷酸结构依赖性和序列依赖性整合在一起,用于RNA m6A位点的识别。具体来说,CR-NSSD 首先利用混沌博弈表示理论将位置信息纳入单核苷酸状态,从而获得 RNA 序列的预编码表示。然后,它构建了一个跨域重构编码器,以学习核苷酸之间的序列和结构依赖关系。通过最小化重构损失和二元交叉熵损失,CR-NSSD 被训练来完成 m6A 位点识别任务。通过与几种最先进的 m6A 识别算法进行比较,大量实验证明了 CR-NSSD 的良好性能。此外,跨物种预测的结果表明,序列和结构依赖性的整合使 CR-NSSD 能够捕捉不同物种 m6A 修饰位点的一般特征,从而提高跨物种鉴定的准确性。
{"title":"Learning Sequential and Structural Dependencies Between Nucleotides for RNA N6-Methyladenosine Site Identification","authors":"Guodong Li;Bowei Zhao;Xiaorui Su;Dongxu Li;Yue Yang;Zhi Zeng;Lun Hu","doi":"10.1109/JAS.2024.124233","DOIUrl":"https://doi.org/10.1109/JAS.2024.124233","url":null,"abstract":"N6-methyladenosine (m6A) is an important RNA methylation modification involved in regulating diverse biological processes across multiple species. Hence, the identification of m6A modification sites provides valuable insight into the biological mechanisms of complex diseases at the post-transcriptional level. Although a variety of identification algorithms have been proposed recently, most of them capture the features of m6A modification sites by focusing on the sequential dependencies of nucleotides at different positions in RNA sequences, while ignoring the structural dependencies of nucleotides in their three-dimensional structures. To overcome this issue, we propose a cross-species end-to-end deep learning model, namely CR-NSSD, which conduct a cross-domain representation learning process integrating nucleotide structural and sequential dependencies for RNA m6A site identification. Specifically, CR-NSSD first obtains the pre-coded representations of RNA sequences by incorporating the position information into single-nucleotide states with chaos game representation theory. It then constructs a cross-domain reconstruction encoder to learn the sequential and structural dependencies between nucleotides. By minimizing the reconstruction and binary cross-entropy losses, CR-NSSD is trained to complete the task of m6A site identification. Extensive experiments have demonstrated the promising performance of CR-NSSD by comparing it with several state-of-the-art m6A identification algorithms. Moreover, the results of cross-species prediction indicate that the integration of sequential and structural dependencies allows CR-NSSD to capture general features of m6A modification sites among different species, thus improving the accuracy of cross-species identification.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 10","pages":"2123-2134"},"PeriodicalIF":15.3,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142137513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fuzzy-Model-Based Finite Frequency Fault Detection Filtering Design for Two-Dimensional Nonlinear Systems 基于模糊模型的二维非线性系统有限频率故障检测滤波设计
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-09-04 DOI: 10.1109/JAS.2024.124452
Meng Wang;Huaicheng Yan;Jianbin Qiu;Wenqiang Ji
This article studies the fault detection filtering design problem for Roesser type two-dimensional (2-D) nonlinear systems described by uncertain 2-D Takagi-Sugeno (T-S) fuzzy models. Firstly, fuzzy Lyapunov functions are constructed and the 2-D Fourier transform is exploited, based on which a finite frequency fault detection filtering design method is proposed such that a residual signal is generated with robustness to external disturbances and sensitivity to faults. It has been shown that the utilization of available frequency spectrum information of faults and disturbances makes the proposed filtering design method more general and less conservative compared with a conventional non-frequency based filtering design approach. Then, with the proposed evaluation function and its threshold, a novel mixed finite frequency $mathcal{H}_{infty}/mathcal{H}_{-}$ fault detection algorithm is developed, based on which the fault can be immediately detected once the evaluation function exceeds the threshold. Finally, it is verified with simulation studies that the proposed method is effective and less conservative than conventional non-frequency and/or common Lyapunov function based filtering design methods.
本文研究了由不确定的二维高木-菅野(T-S)模糊模型描述的 Roesser 型二维(2-D)非线性系统的故障检测滤波设计问题。首先,构建了模糊 Lyapunov 函数,并利用二维傅立叶变换,在此基础上提出了一种有限频率故障检测滤波设计方法,从而产生对外部干扰具有鲁棒性且对故障敏感的残差信号。研究表明,与传统的非基于频率的滤波设计方法相比,利用故障和干扰的可用频谱信息使得所提出的滤波设计方法更具通用性,且不那么保守。然后,利用所提出的评估函数及其阈值,开发了一种新颖的混合有限频率 $mathcal{H}_{infty}/mathcal{H}_{-}$ 故障检测算法,基于该算法,一旦评估函数超过阈值,就能立即检测到故障。最后,通过仿真研究验证了所提出的方法比传统的基于非频率和/或普通 Lyapunov 函数的滤波设计方法更有效、更经济。
{"title":"Fuzzy-Model-Based Finite Frequency Fault Detection Filtering Design for Two-Dimensional Nonlinear Systems","authors":"Meng Wang;Huaicheng Yan;Jianbin Qiu;Wenqiang Ji","doi":"10.1109/JAS.2024.124452","DOIUrl":"https://doi.org/10.1109/JAS.2024.124452","url":null,"abstract":"This article studies the fault detection filtering design problem for Roesser type two-dimensional (2-D) nonlinear systems described by uncertain 2-D Takagi-Sugeno (T-S) fuzzy models. Firstly, fuzzy Lyapunov functions are constructed and the 2-D Fourier transform is exploited, based on which a finite frequency fault detection filtering design method is proposed such that a residual signal is generated with robustness to external disturbances and sensitivity to faults. It has been shown that the utilization of available frequency spectrum information of faults and disturbances makes the proposed filtering design method more general and less conservative compared with a conventional non-frequency based filtering design approach. Then, with the proposed evaluation function and its threshold, a novel mixed finite frequency \u0000<tex>$mathcal{H}_{infty}/mathcal{H}_{-}$</tex>\u0000 fault detection algorithm is developed, based on which the fault can be immediately detected once the evaluation function exceeds the threshold. Finally, it is verified with simulation studies that the proposed method is effective and less conservative than conventional non-frequency and/or common Lyapunov function based filtering design methods.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 10","pages":"2099-2110"},"PeriodicalIF":15.3,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142137530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evolutionary Multi/Many-Objective Optimisation via Bilevel Decomposition 通过双层分解进行多目标/多目标进化优化
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-16 DOI: 10.1109/JAS.2024.124515
Shouyong Jiang;Jinglei Guo;Yong Wang;Shengxiang Yang
Decomposition of a complex multi-objective optimisation problem (MOP) to multiple simple subMOPs, known as M2M for short, is an effective approach to multi-objective optimisation. However, M2M facilitates little communication/collaboration between subMOPs, which limits its use in complex optimisation scenarios. This paper extends the M2M framework to develop a unified algorithm for both multi-objective and many-objective optimisation. Through bilevel decomposition, an MOP is divided into multiple subMOPs at upper level, each of which is further divided into a number of single-objective subproblems at lower level. Neighbouring subMOPs are allowed to share some subproblems so that the knowledge gained from solving one sub-MOP can be transferred to another, and eventually to all the sub-MOPs. The bilevel decomposition is readily combined with some new mating selection and population update strategies, leading to a high-performance algorithm that competes effectively against a number of state-of-the-arts studied in this paper for both multi- and many-objective optimisation. Parameter analysis and component analysis have been also carried out to further justify the proposed algorithm.
将复杂的多目标优化问题(MOP)分解为多个简单的子 MOP(简称 M2M),是一种有效的多目标优化方法。然而,M2M 几乎无法促进子 MOP 之间的交流/协作,这限制了它在复杂优化场景中的应用。本文扩展了 M2M 框架,为多目标和多目标优化开发了一种统一算法。通过双层分解,一个 MOP 在上层被划分为多个子 MOP,每个子 MOP 在下层又被进一步划分为多个单目标子问题。相邻的子 MOP 可以共享一些子问题,这样从解决一个子 MOP 中获得的知识就可以传授给另一个子 MOP,并最终传授给所有的子 MOP。双层分解很容易与一些新的交配选择和种群更新策略相结合,从而产生了一种高性能算法,可以有效地与本文所研究的一些多目标和多目标优化的先进算法竞争。本文还进行了参数分析和成分分析,以进一步证明所提算法的合理性。
{"title":"Evolutionary Multi/Many-Objective Optimisation via Bilevel Decomposition","authors":"Shouyong Jiang;Jinglei Guo;Yong Wang;Shengxiang Yang","doi":"10.1109/JAS.2024.124515","DOIUrl":"https://doi.org/10.1109/JAS.2024.124515","url":null,"abstract":"Decomposition of a complex multi-objective optimisation problem (MOP) to multiple simple subMOPs, known as M2M for short, is an effective approach to multi-objective optimisation. However, M2M facilitates little communication/collaboration between subMOPs, which limits its use in complex optimisation scenarios. This paper extends the M2M framework to develop a unified algorithm for both multi-objective and many-objective optimisation. Through bilevel decomposition, an MOP is divided into multiple subMOPs at upper level, each of which is further divided into a number of single-objective subproblems at lower level. Neighbouring subMOPs are allowed to share some subproblems so that the knowledge gained from solving one sub-MOP can be transferred to another, and eventually to all the sub-MOPs. The bilevel decomposition is readily combined with some new mating selection and population update strategies, leading to a high-performance algorithm that competes effectively against a number of state-of-the-arts studied in this paper for both multi- and many-objective optimisation. Parameter analysis and component analysis have been also carried out to further justify the proposed algorithm.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 9","pages":"1973-1986"},"PeriodicalIF":15.3,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141993916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk-Informed Model-Free Safe Control of Linear Parameter-Varying Systems 线性参数变化系统的风险知情无模型安全控制
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-15 DOI: 10.1109/JAS.2024.124479
Babak Esmaeili;Hamidreza Modares
This paper presents a risk-informed data-driven safe control design approach for a class of stochastic uncertain nonlinear discrete-time systems. The nonlinear system is modeled using linear parameter-varying (LPV) systems. A model-based probabilistic safe controller is first designed to guarantee probabilistic $lambda$-contractivity (i.e., stability and invariance) of the LPV system with respect to a given polyhedral safe set. To obviate the requirement of knowing the LPV system model and to bypass identifying its open-loop model, its closed-loop data-based representation is provided in terms of state and scheduling data as well as a decision variable. It is shown that the variance of the closed-loop system, as well as the probability of safety satisfaction, depends on the decision variable and the noise covariance. A minimum-variance direct data-driven gain-scheduling safe control design approach is presented next by designing the decision variable such that all possible closed-loop system realizations satisfy safety with the highest confidence level. This minimum-variance approach is a control-oriented learning method since it minimizes the variance of the state of the closed-loop system with respect to the safe set, and thus minimizes the risk of safety violation. Unlike the certainty-equivalent approach that results in a risk-neutral control design, the minimum-variance method leads to a risk-averse control design. It is shown that the presented direct risk-averse learning approach requires weaker data richness conditions than existing indirect learning methods based on system identification and can lead to a lower risk of safety violation. Two simulation examples along with an experimental validation on an autonomous vehicle are provided to show the effectiveness of the presented approach.
本文针对一类随机不确定非线性离散时间系统提出了一种风险知情的数据驱动安全控制设计方法。非线性系统采用线性参数变化(LPV)系统建模。首先设计一个基于模型的概率安全控制器,以保证 LPV 系统相对于给定多面体安全集的概率$lambda$-契约性(即稳定性和不变性)。为了省去了解 LPV 系统模型的要求,并绕过识别其开环模型,我们用状态和调度数据以及一个决策变量来提供其基于数据的闭环表示。结果表明,闭环系统的方差以及安全满足概率取决于决策变量和噪声协方差。接下来介绍了一种最小方差直接数据驱动增益调度安全控制设计方法,即通过设计决策变量,使所有可能的闭环系统实现以最高置信度满足安全要求。这种最小方差方法是一种以控制为导向的学习方法,因为它能使闭环系统的状态相对于安全集的方差最小,从而将违反安全的风险降至最低。与确定性等价方法不同的是,确定性等价方法会导致风险中性的控制设计,而最小方差方法则会导致规避风险的控制设计。研究表明,与现有的基于系统识别的间接学习方法相比,本文提出的直接规避风险学习方法所需的数据丰富度条件更弱,而且可以降低违反安全规定的风险。本文提供了两个仿真实例,并在一辆自动驾驶汽车上进行了实验验证,以说明所提出方法的有效性。
{"title":"Risk-Informed Model-Free Safe Control of Linear Parameter-Varying Systems","authors":"Babak Esmaeili;Hamidreza Modares","doi":"10.1109/JAS.2024.124479","DOIUrl":"https://doi.org/10.1109/JAS.2024.124479","url":null,"abstract":"This paper presents a risk-informed data-driven safe control design approach for a class of stochastic uncertain nonlinear discrete-time systems. The nonlinear system is modeled using linear parameter-varying (LPV) systems. A model-based probabilistic safe controller is first designed to guarantee probabilistic \u0000<tex>$lambda$</tex>\u0000-contractivity (i.e., stability and invariance) of the LPV system with respect to a given polyhedral safe set. To obviate the requirement of knowing the LPV system model and to bypass identifying its open-loop model, its closed-loop data-based representation is provided in terms of state and scheduling data as well as a decision variable. It is shown that the variance of the closed-loop system, as well as the probability of safety satisfaction, depends on the decision variable and the noise covariance. A minimum-variance direct data-driven gain-scheduling safe control design approach is presented next by designing the decision variable such that all possible closed-loop system realizations satisfy safety with the highest confidence level. This minimum-variance approach is a control-oriented learning method since it minimizes the variance of the state of the closed-loop system with respect to the safe set, and thus minimizes the risk of safety violation. Unlike the certainty-equivalent approach that results in a risk-neutral control design, the minimum-variance method leads to a risk-averse control design. It is shown that the presented direct risk-averse learning approach requires weaker data richness conditions than existing indirect learning methods based on system identification and can lead to a lower risk of safety violation. Two simulation examples along with an experimental validation on an autonomous vehicle are provided to show the effectiveness of the presented approach.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 9","pages":"1918-1932"},"PeriodicalIF":15.3,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141991489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regional Multi-Agent Cooperative Reinforcement Learning for City-Level Traffic Grid Signal Control 城市级交通网格信号控制的区域多代理合作强化学习
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-15 DOI: 10.1109/JAS.2024.124365
Yisha Li;Ya Zhang;Xinde Li;Changyin Sun
This article studies the effective traffic signal control problem of multiple intersections in a city-level traffic system. A novel regional multi-agent cooperative reinforcement learning algorithm called RegionSTLight is proposed to improve the traffic efficiency. Firstly a regional multi-agent Q-learning framework is proposed, which can equivalently decompose the global Q value of the traffic system into the local values of several regions. Based on the framework and the idea of human-machine cooperation, a dynamic zoning method is designed to divide the traffic network into several strong-coupled regions according to real-time traffic flow densities. In order to achieve better cooperation inside each region, a lightweight spatio-temporal fusion feature extraction network is designed. The experiments in synthetic, real-world and city-level scenarios show that the proposed RegionSTLight converges more quickly, is more stable, and obtains better asymptotic performance compared to state-of-the-art models.
本文研究了城市级交通系统中多个交叉口的有效交通信号控制问题。提出了一种名为 RegionSTLight 的新型区域多代理合作强化学习算法,以提高交通效率。首先提出了一个区域多代理 Q 值学习框架,该框架可将交通系统的全局 Q 值等价分解为多个区域的局部 Q 值。基于该框架和人机合作思想,设计了一种动态分区方法,根据实时交通流密度将交通网络划分为若干强耦合区域。为了在每个区域内实现更好的合作,设计了一个轻量级时空融合特征提取网络。在合成、真实世界和城市级场景中的实验表明,与最先进的模型相比,所提出的 RegionSTLight 收敛更快、更稳定,并获得了更好的渐近性能。
{"title":"Regional Multi-Agent Cooperative Reinforcement Learning for City-Level Traffic Grid Signal Control","authors":"Yisha Li;Ya Zhang;Xinde Li;Changyin Sun","doi":"10.1109/JAS.2024.124365","DOIUrl":"https://doi.org/10.1109/JAS.2024.124365","url":null,"abstract":"This article studies the effective traffic signal control problem of multiple intersections in a city-level traffic system. A novel regional multi-agent cooperative reinforcement learning algorithm called RegionSTLight is proposed to improve the traffic efficiency. Firstly a regional multi-agent Q-learning framework is proposed, which can equivalently decompose the global Q value of the traffic system into the local values of several regions. Based on the framework and the idea of human-machine cooperation, a dynamic zoning method is designed to divide the traffic network into several strong-coupled regions according to real-time traffic flow densities. In order to achieve better cooperation inside each region, a lightweight spatio-temporal fusion feature extraction network is designed. The experiments in synthetic, real-world and city-level scenarios show that the proposed RegionSTLight converges more quickly, is more stable, and obtains better asymptotic performance compared to state-of-the-art models.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 9","pages":"1987-1998"},"PeriodicalIF":15.3,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141991509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Target Controllability of Multi-Layer Networks with High-Dimensional Nodes 具有高维节点的多层网络的目标可控性
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-15 DOI: 10.1109/JAS.2023.124152
Lifu Wang;Zhaofei Li;Ge Guo;Zhi Kong
This paper studies the target controllability of multi-layer complex networked systems, in which the nodes are high-dimensional linear time invariant (LTI) dynamical systems, and the network topology is directed and weighted. The influence of inter-layer couplings on the target controllability of multi-layer networks is discussed. It is found that even if there exists a layer which is not target controllable, the entire multi-layer network can still be target controllable due to the inter-layer couplings. For the multi-layer networks with general structure, a necessary and sufficient condition for target controllability is given by establishing the relationship between uncontrollable subspace and output matrix. By the derived condition, it can be found that the system may be target controllable even if it is not state controllable. On this basis, two corollaries are derived, which clarify the relationship between target controllability, state controllability and output controllability. For the multi-layer networks where the inter-layer couplings are directed chains and directed stars, sufficient conditions for target controllability of networked systems are given, respectively. These conditions are easier to verify than the classic criterion.
本文研究了多层复杂网络系统的目标可控性,其中节点是高维线性时不变(LTI)动态系统,网络拓扑结构是有向和加权的。讨论了层间耦合对多层网络目标可控性的影响。研究发现,由于层间耦合的存在,即使存在不具有目标可控性的层,整个多层网络仍然具有目标可控性。对于一般结构的多层网络,通过建立不可控子空间和输出矩阵之间的关系,给出了目标可控性的必要条件和充分条件。根据推导出的条件,可以发现即使系统不是状态可控的,也可能是目标可控的。在此基础上推导出两个推论,阐明了目标可控性、状态可控性和输出可控性之间的关系。对于层间耦合为有向链和有向星的多层网络,分别给出了网络系统目标可控性的充分条件。这些条件比经典标准更容易验证。
{"title":"Target Controllability of Multi-Layer Networks with High-Dimensional Nodes","authors":"Lifu Wang;Zhaofei Li;Ge Guo;Zhi Kong","doi":"10.1109/JAS.2023.124152","DOIUrl":"https://doi.org/10.1109/JAS.2023.124152","url":null,"abstract":"This paper studies the target controllability of multi-layer complex networked systems, in which the nodes are high-dimensional linear time invariant (LTI) dynamical systems, and the network topology is directed and weighted. The influence of inter-layer couplings on the target controllability of multi-layer networks is discussed. It is found that even if there exists a layer which is not target controllable, the entire multi-layer network can still be target controllable due to the inter-layer couplings. For the multi-layer networks with general structure, a necessary and sufficient condition for target controllability is given by establishing the relationship between uncontrollable subspace and output matrix. By the derived condition, it can be found that the system may be target controllable even if it is not state controllable. On this basis, two corollaries are derived, which clarify the relationship between target controllability, state controllability and output controllability. For the multi-layer networks where the inter-layer couplings are directed chains and directed stars, sufficient conditions for target controllability of networked systems are given, respectively. These conditions are easier to verify than the classic criterion.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 9","pages":"1999-2010"},"PeriodicalIF":15.3,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141991492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Safe Efficient Policy Optimization Algorithm for Unsignalized Intersection Navigation 无信号交叉路口导航的安全高效策略优化算法
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-15 DOI: 10.1109/JAS.2024.124287
Xiaolong Chen;Biao Xu;Manjiang Hu;Yougang Bian;Yang Li;Xin Xu
Unsignalized intersections pose a challenge for autonomous vehicles that must decide how to navigate them safely and efficiently. This paper proposes a reinforcement learning (RL) method for autonomous vehicles to navigate unsignalized intersections safely and efficiently. The method uses a semantic scene representation to handle variable numbers of vehicles and a universal reward function to facilitate stable learning. A collision risk function is designed to penalize unsafe actions and guide the agent to avoid them. A scalable policy optimization algorithm is introduced to improve data efficiency and safety for vehicle learning at intersections. The algorithm employs experience replay to overcome the on-policy limitation of proximal policy optimization and incorporates the collision risk constraint into the policy optimization problem. The proposed safe RL algorithm can balance the trade-off between vehicle traffic safety and policy learning efficiency. Simulated intersection scenarios with different traffic situations are used to test the algorithm and demonstrate its high success rates and low collision rates under different traffic conditions. The algorithm shows the potential of RL for enhancing the safety and reliability of autonomous driving systems at unsignalized intersections.
没有信号灯的交叉路口给自动驾驶车辆带来了挑战,它们必须决定如何安全高效地导航这些交叉路口。本文提出了一种强化学习(RL)方法,用于自动驾驶车辆安全高效地导航无信号交叉路口。该方法使用语义场景表示法来处理车辆数量的变化,并使用通用奖励函数来促进稳定学习。碰撞风险函数用于惩罚不安全行为,并引导驾驶员避免这些行为。该方法引入了一种可扩展的策略优化算法,以提高交叉路口车辆学习的数据效率和安全性。该算法采用经验重放来克服近似策略优化的策略限制,并将碰撞风险约束纳入策略优化问题。所提出的安全 RL 算法可以在车辆交通安全和策略学习效率之间取得平衡。利用不同交通状况的模拟交叉口场景对算法进行了测试,证明了该算法在不同交通状况下的高成功率和低碰撞率。该算法显示了 RL 在提高无信号交叉路口自动驾驶系统的安全性和可靠性方面的潜力。
{"title":"Safe Efficient Policy Optimization Algorithm for Unsignalized Intersection Navigation","authors":"Xiaolong Chen;Biao Xu;Manjiang Hu;Yougang Bian;Yang Li;Xin Xu","doi":"10.1109/JAS.2024.124287","DOIUrl":"https://doi.org/10.1109/JAS.2024.124287","url":null,"abstract":"Unsignalized intersections pose a challenge for autonomous vehicles that must decide how to navigate them safely and efficiently. This paper proposes a reinforcement learning (RL) method for autonomous vehicles to navigate unsignalized intersections safely and efficiently. The method uses a semantic scene representation to handle variable numbers of vehicles and a universal reward function to facilitate stable learning. A collision risk function is designed to penalize unsafe actions and guide the agent to avoid them. A scalable policy optimization algorithm is introduced to improve data efficiency and safety for vehicle learning at intersections. The algorithm employs experience replay to overcome the on-policy limitation of proximal policy optimization and incorporates the collision risk constraint into the policy optimization problem. The proposed safe RL algorithm can balance the trade-off between vehicle traffic safety and policy learning efficiency. Simulated intersection scenarios with different traffic situations are used to test the algorithm and demonstrate its high success rates and low collision rates under different traffic conditions. The algorithm shows the potential of RL for enhancing the safety and reliability of autonomous driving systems at unsignalized intersections.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 9","pages":"2011-2026"},"PeriodicalIF":15.3,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141991473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Ieee-Caa Journal of Automatica Sinica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1