首页 > 最新文献

自主智能系统(英文)最新文献

英文 中文
Fault diagnosis of industrial robot based on dual-module attention convolutional neural network 基于双模注意卷积神经网络的工业机器人故障诊断
Pub Date : 2022-06-01 DOI: 10.1007/s43684-022-00031-5
Kaijie Lu, Chong Chen, Tao Wang, Lianglun Cheng, Jian Qin

Fault diagnosis plays a vital role in assessing the health management of industrial robots and improving maintenance schedules. In recent decades, artificial intelligence-based data-driven approaches have made significant progress in machine fault diagnosis using monitoring data. However, current methods pay less attention to correlations and internal differences in monitoring data, resulting in limited diagnostic performance. In this paper, a data-driven method is proposed for the fault diagnosis of industrial robot reducers, that is, a dual-module attention convolutional neural network (DMA-CNN). This method aims to diagnose the fault state of industrial robot reducer. It establishes two parallel convolutional neural networks with two different attentions to capture the different features related to the fault. Finally, the features are fused to obtain the fault diagnosis results (normal or abnormal). The fault diagnosis effect of the DMA-CNN method and other attention models are compared and analyzed. The effectiveness of the method is verified on a dataset of real industrial robots.

故障诊断在评估工业机器人的健康管理和改进维护计划方面发挥着至关重要的作用。近几十年来,基于人工智能的数据驱动方法在利用监测数据进行机器故障诊断方面取得了重大进展。然而,目前的方法较少关注监测数据的相关性和内部差异,导致诊断性能有限。本文提出了一种用于工业机器人减速器故障诊断的数据驱动方法,即双模块注意力卷积神经网络(DMA-CNN)。该方法旨在诊断工业机器人减速器的故障状态。它建立了两个并行的、具有两种不同注意力的卷积神经网络,以捕捉与故障相关的不同特征。最后,通过融合这些特征得出故障诊断结果(正常或异常)。对 DMA-CNN 方法和其他注意力模型的故障诊断效果进行了比较和分析。在真实工业机器人数据集上验证了该方法的有效性。
{"title":"Fault diagnosis of industrial robot based on dual-module attention convolutional neural network","authors":"Kaijie Lu,&nbsp;Chong Chen,&nbsp;Tao Wang,&nbsp;Lianglun Cheng,&nbsp;Jian Qin","doi":"10.1007/s43684-022-00031-5","DOIUrl":"10.1007/s43684-022-00031-5","url":null,"abstract":"<div><p>Fault diagnosis plays a vital role in assessing the health management of industrial robots and improving maintenance schedules. In recent decades, artificial intelligence-based data-driven approaches have made significant progress in machine fault diagnosis using monitoring data. However, current methods pay less attention to correlations and internal differences in monitoring data, resulting in limited diagnostic performance. In this paper, a data-driven method is proposed for the fault diagnosis of industrial robot reducers, that is, a dual-module attention convolutional neural network (DMA-CNN). This method aims to diagnose the fault state of industrial robot reducer. It establishes two parallel convolutional neural networks with two different attentions to capture the different features related to the fault. Finally, the features are fused to obtain the fault diagnosis results (normal or abnormal). The fault diagnosis effect of the DMA-CNN method and other attention models are compared and analyzed. The effectiveness of the method is verified on a dataset of real industrial robots.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-022-00031-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48044082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An attention enhanced dilated CNN approach for cross-axis industrial robotics fault diagnosis 一种用于十字轴工业机器人故障诊断的注意力增强扩张CNN方法
Pub Date : 2022-05-31 DOI: 10.1007/s43684-022-00030-6
Yuxin Liu, Chong Chen, Tao Wang, Lianglun Cheng

An industrial robot is a complex mechatronics system, whose failure is hard to diagnose based on monitoring data. Previous studies have reported various methods with deep network models to improve the accuracy of fault diagnosis, which can get an accurate prediction model when the amount of data sample is sufficient. However, the failure data is hard to obtain, which leads to the few-shot issue and the bad generalization ability of the model. Therefore, this paper proposes an attention enhanced dilated convolutional neural network (D-CNN) approach for the cross-axis industrial robotics fault diagnosis method. Firstly, key feature extraction and sliding window are adopted to pre-process the monitoring data of industrial robots before D-CNN is introduced to extract data features. And self-attention is used to enhance feature attention capability. Finally, the pre-trained model is used for transfer learning, and a small number of the dataset from another axis of the multi-axis industrial robot are used for fine-tuning experiments. The experimental results show that the proposed method can reach satisfactory fault diagnosis accuracy in both the source domain and target domain.

工业机器人是一个复杂的机电一体化系统,其故障很难根据监测数据进行诊断。以往的研究报道了多种利用深度网络模型提高故障诊断准确率的方法,在数据样本量充足的情况下,可以得到准确的预测模型。然而,故障数据难以获得,这就导致了模型的寥寥无几和泛化能力差的问题。因此,本文提出了一种注意力增强型扩张卷积神经网络(D-CNN)方法,用于跨轴工业机器人故障诊断方法。首先,采用关键特征提取和滑动窗口对工业机器人的监测数据进行预处理,然后引入 D-CNN 提取数据特征。此外,还采用了自注意技术来增强特征注意能力。最后,利用预训练模型进行迁移学习,并利用多轴工业机器人另一轴的少量数据集进行微调实验。实验结果表明,所提出的方法在源域和目标域都能达到令人满意的故障诊断精度。
{"title":"An attention enhanced dilated CNN approach for cross-axis industrial robotics fault diagnosis","authors":"Yuxin Liu,&nbsp;Chong Chen,&nbsp;Tao Wang,&nbsp;Lianglun Cheng","doi":"10.1007/s43684-022-00030-6","DOIUrl":"10.1007/s43684-022-00030-6","url":null,"abstract":"<div><p>An industrial robot is a complex mechatronics system, whose failure is hard to diagnose based on monitoring data. Previous studies have reported various methods with deep network models to improve the accuracy of fault diagnosis, which can get an accurate prediction model when the amount of data sample is sufficient. However, the failure data is hard to obtain, which leads to the few-shot issue and the bad generalization ability of the model. Therefore, this paper proposes an attention enhanced dilated convolutional neural network (D-CNN) approach for the cross-axis industrial robotics fault diagnosis method. Firstly, key feature extraction and sliding window are adopted to pre-process the monitoring data of industrial robots before D-CNN is introduced to extract data features. And self-attention is used to enhance feature attention capability. Finally, the pre-trained model is used for transfer learning, and a small number of the dataset from another axis of the multi-axis industrial robot are used for fine-tuning experiments. The experimental results show that the proposed method can reach satisfactory fault diagnosis accuracy in both the source domain and target domain.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-022-00030-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46725199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed constrained aggregative games of uncertain Euler-Lagrange systems under unbalanced digraphs 不平衡有向图下不确定欧拉-拉格朗日系统的分布约束聚合对策
Pub Date : 2022-05-27 DOI: 10.1007/s43684-022-00027-1
Yanqiong Zhang, Chaoqun Liu, Yu-Ping Tian

In this paper, the constrained Nash equilibrium seeking problem of aggregative games is investigated for uncertain nonlinear Euler-Lagrange (EL) systems under unbalanced digraphs, where the cost function for each agent depends on its own decision variable and the aggregate of all other decisions. By embedding a distributed estimator of the left eigenvector associated with zero eigenvalue of the digraph Laplacian matrix, a dynamic adaptive average consensus protocol is employed to estimate the aggregate function in the unbalanced case. To solve the constrained Nash equilibrium seeking problem, an integrated distributed protocol based on output-constrained nonlinear control and projected dynamics is proposed for uncertain EL players to reach the Nash equilibrium. The convergence analysis is established by using variational inequality technique and Lyapunov stability analysis. Finally, a numerical example in electricity market is provided to validate the effectiveness of the proposed method.

本文研究了不平衡数字图下不确定非线性欧拉-拉格朗日(EL)系统的聚合博弈受限纳什均衡寻求问题,其中每个代理的成本函数取决于其自身的决策变量和所有其他决策的总和。通过嵌入与数图拉普拉奇矩阵零特征值相关的左特征向量的分布式估计器,采用动态自适应平均共识协议来估计不平衡情况下的合计函数。为解决受限纳什均衡寻求问题,提出了一种基于输出受限非线性控制和投影动力学的集成分布式协议,用于不确定的 EL 参与者达到纳什均衡。利用变分不等式技术和 Lyapunov 稳定性分析建立了收敛性分析。最后,以电力市场为例,验证了所提方法的有效性。
{"title":"Distributed constrained aggregative games of uncertain Euler-Lagrange systems under unbalanced digraphs","authors":"Yanqiong Zhang,&nbsp;Chaoqun Liu,&nbsp;Yu-Ping Tian","doi":"10.1007/s43684-022-00027-1","DOIUrl":"10.1007/s43684-022-00027-1","url":null,"abstract":"<div><p>In this paper, the constrained Nash equilibrium seeking problem of aggregative games is investigated for uncertain nonlinear Euler-Lagrange (EL) systems under unbalanced digraphs, where the cost function for each agent depends on its own decision variable and the aggregate of all other decisions. By embedding a distributed estimator of the left eigenvector associated with zero eigenvalue of the digraph Laplacian matrix, a dynamic adaptive average consensus protocol is employed to estimate the aggregate function in the unbalanced case. To solve the constrained Nash equilibrium seeking problem, an integrated distributed protocol based on output-constrained nonlinear control and projected dynamics is proposed for uncertain EL players to reach the Nash equilibrium. The convergence analysis is established by using variational inequality technique and Lyapunov stability analysis. Finally, a numerical example in electricity market is provided to validate the effectiveness of the proposed method.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-022-00027-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48421696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two-stage reward allocation with decay for multi-agent coordinated behavior for sequential cooperative task by using deep reinforcement learning 基于深度强化学习的顺序合作任务多智能体协调行为两阶段奖励衰减分配
Pub Date : 2022-05-27 DOI: 10.1007/s43684-022-00029-z
Yuki Miyashita, Toshiharu Sugawara

We propose a two-stage reward allocation method with decay using an extension of replay memory to adapt this rewarding method for deep reinforcement learning (DRL), to generate coordinated behaviors for tasks that can be completed by executing a few subtasks sequentially by heterogeneous agents. An independent learner in cooperative multi-agent systems needs to learn its policies for effective execution of its own responsible subtask, as well as for coordinated behaviors under a certain coordination structure. Although the reward scheme is an issue for DRL, it is difficult to design it to learn both policies. Our proposed method attempts to generate these different behaviors in multi-agent DRL by dividing the timing of rewards into two stages and varying the ratio between them over time. By introducing the coordinated delivery and execution problem with an expiration time, where a task can be executed sequentially by two heterogeneous agents, we experimentally analyze the effect of using various ratios of the reward division in the two-stage allocations on the generated behaviors. The results demonstrate that the proposed method could improve the overall performance relative to those with the conventional one-time or fixed reward and can establish robust coordinated behavior.

我们利用重放记忆的扩展,提出了一种带衰减的两阶段奖励分配方法,使这种奖励方法适用于深度强化学习(DRL),为异构代理依次执行几个子任务即可完成的任务生成协调行为。合作式多代理系统中的独立学习者需要学习其策略,以有效执行自己负责的子任务,以及在特定协调结构下的协调行为。虽然奖励方案是 DRL 的一个问题,但很难设计出同时学习这两种策略的方案。我们提出的方法试图在多代理 DRL 中生成这些不同的行为,方法是将奖励的时间分为两个阶段,并随时间改变它们之间的比例。我们引入了有过期时间的协调交付和执行问题,在这个问题中,任务可以由两个异构代理依次执行,我们通过实验分析了在两阶段分配中使用不同的奖励划分比例对生成行为的影响。结果表明,与传统的一次性奖励或固定奖励相比,建议的方法可以提高整体性能,并能建立稳健的协调行为。
{"title":"Two-stage reward allocation with decay for multi-agent coordinated behavior for sequential cooperative task by using deep reinforcement learning","authors":"Yuki Miyashita,&nbsp;Toshiharu Sugawara","doi":"10.1007/s43684-022-00029-z","DOIUrl":"10.1007/s43684-022-00029-z","url":null,"abstract":"<div><p>We propose a two-stage reward allocation method with decay using an extension of replay memory to adapt this rewarding method for deep reinforcement learning (DRL), to generate coordinated behaviors for tasks that can be completed by executing a few subtasks sequentially by heterogeneous agents. An independent learner in cooperative multi-agent systems needs to learn its policies for effective execution of its own responsible subtask, as well as for coordinated behaviors under a certain coordination structure. Although the reward scheme is an issue for DRL, it is difficult to design it to learn both policies. Our proposed method attempts to generate these different behaviors in multi-agent DRL by dividing the timing of rewards into two stages and varying the ratio between them over time. By introducing the coordinated delivery and execution problem with an expiration time, where a task can be executed sequentially by two heterogeneous agents, we experimentally analyze the effect of using various ratios of the reward division in the two-stage allocations on the generated behaviors. The results demonstrate that the proposed method could improve the overall performance relative to those with the conventional one-time or fixed reward and can establish robust coordinated behavior.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-022-00029-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45075022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning techniques for robotic and autonomous inspection of mechanical systems and civil infrastructure 机械系统和民用基础设施的机器人和自主检测的机器学习技术
Pub Date : 2022-04-29 DOI: 10.1007/s43684-022-00025-3
Michael O. Macaulay, Mahmood Shafiee

Machine learning and in particular deep learning techniques have demonstrated the most efficacy in training, learning, analyzing, and modelling large complex structured and unstructured datasets. These techniques have recently been commonly deployed in different industries to support robotic and autonomous system (RAS) requirements and applications ranging from planning and navigation to machine vision and robot manipulation in complex environments. This paper reviews the state-of-the-art with regard to RAS technologies (including unmanned marine robot systems, unmanned ground robot systems, climbing and crawler robots, unmanned aerial vehicles, and space robot systems) and their application for the inspection and monitoring of mechanical systems and civil infrastructure. We explore various types of data provided by such systems and the analytical techniques being adopted to process and analyze these data. This paper provides a brief overview of machine learning and deep learning techniques, and more importantly, a classification of the literature which have reported the deployment of such techniques for RAS-based inspection and monitoring of utility pipelines, wind turbines, aircrafts, power lines, pressure vessels, bridges, etc. Our research provides documented information on the use of advanced data-driven technologies in the analysis of critical assets and examines the main challenges to the applications of such technologies in the industry.

机器学习,尤其是深度学习技术,在训练、学习、分析和模拟大型复杂结构化和非结构化数据集方面表现出极大的功效。最近,这些技术已被广泛应用于不同行业,以支持机器人和自主系统(RAS)的需求和应用,包括复杂环境中的规划和导航、机器视觉和机器人操纵等。本文回顾了 RAS 技术(包括无人海洋机器人系统、无人地面机器人系统、爬行和履带机器人、无人飞行器和空间机器人系统)的最新发展,以及它们在机械系统和民用基础设施检测和监控方面的应用。我们将探讨此类系统提供的各类数据,以及处理和分析这些数据所采用的分析技术。本文简要概述了机器学习和深度学习技术,更重要的是对文献进行了分类,这些文献报道了在基于 RAS 的公用事业管道、风力涡轮机、飞机、电力线、压力容器、桥梁等的检测和监控中部署此类技术的情况。我们的研究提供了在关键资产分析中使用先进数据驱动技术的文献信息,并探讨了在行业中应用此类技术所面临的主要挑战。
{"title":"Machine learning techniques for robotic and autonomous inspection of mechanical systems and civil infrastructure","authors":"Michael O. Macaulay,&nbsp;Mahmood Shafiee","doi":"10.1007/s43684-022-00025-3","DOIUrl":"10.1007/s43684-022-00025-3","url":null,"abstract":"<div><p>Machine learning and in particular <i>deep learning</i> techniques have demonstrated the most efficacy in training, learning, analyzing, and modelling large complex structured and unstructured datasets. These techniques have recently been commonly deployed in different industries to support robotic and autonomous system (RAS) requirements and applications ranging from planning and navigation to machine vision and robot manipulation in complex environments. This paper reviews the state-of-the-art with regard to RAS technologies (including unmanned marine robot systems, unmanned ground robot systems, climbing and crawler robots, unmanned aerial vehicles, and space robot systems) and their application for the inspection and monitoring of mechanical systems and civil infrastructure. We explore various types of data provided by such systems and the analytical techniques being adopted to process and analyze these data. This paper provides a brief overview of machine learning and deep learning techniques, and more importantly, a classification of the literature which have reported the deployment of such techniques for RAS-based inspection and monitoring of utility pipelines, wind turbines, aircrafts, power lines, pressure vessels, bridges, etc. Our research provides documented information on the use of advanced data-driven technologies in the analysis of critical assets and examines the main challenges to the applications of such technologies in the industry.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-022-00025-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42816843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nash equilibrium seeking over directed graphs 有向图上的纳什均衡寻求
Pub Date : 2022-04-18 DOI: 10.1007/s43684-022-00026-2
Yutao Tang, Peng Yi, Yanqiong Zhang, Dawei Liu

In this paper, we aim to develop distributed continuous-time algorithms over directed graphs to seek the Nash equilibrium in a noncooperative game. Motivated by the recent consensus-based designs, we present a distributed algorithm with a proportional gain for weight-balanced directed graphs. By further embedding a distributed estimator of the left eigenvector associated with zero eigenvalue of the graph Laplacian, we extend it to the case with arbitrary strongly connected directed graphs having possible unbalanced weights. In both cases, the Nash equilibrium is proven to be exactly reached with an exponential convergence rate. An example is given to illustrate the validity of the theoretical results.

本文旨在开发有向图上的分布式连续时间算法,以寻求非合作博弈中的纳什均衡。受最近基于共识的设计的启发,我们提出了一种对权重平衡有向图具有比例增益的分布式算法。通过进一步嵌入与图拉普拉奇零特征值相关的左特征向量的分布式估计器,我们将其扩展到任意强连接有向图(权重可能不平衡)的情况。在这两种情况下,纳什均衡都能以指数收敛率精确达到。我们举例说明了理论结果的有效性。
{"title":"Nash equilibrium seeking over directed graphs","authors":"Yutao Tang,&nbsp;Peng Yi,&nbsp;Yanqiong Zhang,&nbsp;Dawei Liu","doi":"10.1007/s43684-022-00026-2","DOIUrl":"10.1007/s43684-022-00026-2","url":null,"abstract":"<div><p>In this paper, we aim to develop distributed continuous-time algorithms over directed graphs to seek the Nash equilibrium in a noncooperative game. Motivated by the recent consensus-based designs, we present a distributed algorithm with a proportional gain for weight-balanced directed graphs. By further embedding a distributed estimator of the left eigenvector associated with zero eigenvalue of the graph Laplacian, we extend it to the case with arbitrary strongly connected directed graphs having possible unbalanced weights. In both cases, the Nash equilibrium is proven to be exactly reached with an exponential convergence rate. An example is given to illustrate the validity of the theoretical results.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-022-00026-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43002504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exponentially convergent distributed Nash equilibrium seeking for constrained aggregative games 约束聚合对策的指数收敛分布式纳什均衡寻求
Pub Date : 2022-04-12 DOI: 10.1007/s43684-022-00024-4
Shu Liang, Peng Yi, Yiguang Hong, Kaixiang Peng

Distributed Nash equilibrium seeking of aggregative games is investigated and a continuous-time algorithm is proposed. The algorithm is designed by virtue of projected gradient play dynamics and aggregation tracking dynamics, and is applicable to games with constrained strategy sets and weight-balanced communication graphs. The key feature of our method is that the proposed projected dynamics achieves exponential convergence, whereas such convergence results are only obtained for non-projected dynamics in existing works on distributed optimization and equilibrium seeking. Numerical examples illustrate the effectiveness of our methods.

研究了聚合博弈的分布式纳什均衡寻求,并提出了一种连续时间算法。该算法是根据投影梯度博弈动力学和聚合跟踪动力学设计的,适用于具有受限策略集和权重平衡通信图的博弈。我们的方法的主要特点是所提出的投影动力学实现了指数收敛,而在现有的分布式优化和均衡寻求著作中,只有非投影动力学才能获得这样的收敛结果。数值示例说明了我们方法的有效性。
{"title":"Exponentially convergent distributed Nash equilibrium seeking for constrained aggregative games","authors":"Shu Liang,&nbsp;Peng Yi,&nbsp;Yiguang Hong,&nbsp;Kaixiang Peng","doi":"10.1007/s43684-022-00024-4","DOIUrl":"10.1007/s43684-022-00024-4","url":null,"abstract":"<div><p>Distributed Nash equilibrium seeking of aggregative games is investigated and a continuous-time algorithm is proposed. The algorithm is designed by virtue of projected gradient play dynamics and aggregation tracking dynamics, and is applicable to games with constrained strategy sets and weight-balanced communication graphs. The key feature of our method is that the proposed projected dynamics achieves exponential convergence, whereas such convergence results are only obtained for non-projected dynamics in existing works on distributed optimization and equilibrium seeking. Numerical examples illustrate the effectiveness of our methods.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-022-00024-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49479824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-agent reinforcement learning for cooperative lane changing of connected and autonomous vehicles in mixed traffic 混合交通中联网和自动驾驶车辆合作变道的多代理强化学习
Pub Date : 2022-03-16 DOI: 10.1007/s43684-022-00023-5
Wei Zhou, Dong Chen, Jun Yan, Zhaojian Li, Huilin Yin, Wanchen Ge

Autonomous driving has attracted significant research interests in the past two decades as it offers many potential benefits, including releasing drivers from exhausting driving and mitigating traffic congestion, among others. Despite promising progress, lane-changing remains a great challenge for autonomous vehicles (AV), especially in mixed and dynamic traffic scenarios. Recently, reinforcement learning (RL) has been widely explored for lane-changing decision makings in AVs with encouraging results demonstrated. However, the majority of those studies are focused on a single-vehicle setting, and lane-changing in the context of multiple AVs coexisting with human-driven vehicles (HDVs) have received scarce attention. In this paper, we formulate the lane-changing decision-making of multiple AVs in a mixed-traffic highway environment as a multi-agent reinforcement learning (MARL) problem, where each AV makes lane-changing decisions based on the motions of both neighboring AVs and HDVs. Specifically, a multi-agent advantage actor-critic (MA2C) method is proposed with a novel local reward design and a parameter sharing scheme. In particular, a multi-objective reward function is designed to incorporate fuel efficiency, driving comfort, and the safety of autonomous driving. A comprehensive experimental study is made that our proposed MARL framework consistently outperforms several state-of-the-art benchmarks in terms of efficiency, safety, and driver comfort.

在过去二十年里,自动驾驶吸引了大量研究人员的关注,因为它能带来许多潜在的好处,包括让驾驶员从疲惫的驾驶中解脱出来,缓解交通拥堵等。尽管取得了可喜的进展,但变道仍然是自动驾驶汽车(AV)面临的巨大挑战,尤其是在混合和动态交通场景中。最近,强化学习(RL)被广泛用于自动驾驶汽车的变道决策,并取得了令人鼓舞的成果。然而,这些研究大多集中在单车环境下,而在多辆自动驾驶汽车与人类驾驶汽车(HDV)共存的情况下进行变道决策却很少受到关注。在本文中,我们将混合交通高速公路环境中的多辆自动驾驶汽车变道决策问题表述为多代理强化学习(MARL)问题,其中每辆自动驾驶汽车根据相邻自动驾驶汽车和 HDV 的运动做出变道决策。具体而言,本文提出了一种多代理优势代理批评(MA2C)方法,该方法具有新颖的局部奖励设计和参数共享方案。特别是,设计了一个多目标奖励函数,将燃油效率、驾驶舒适性和自动驾驶的安全性结合在一起。通过全面的实验研究,我们提出的 MARL 框架在效率、安全性和驾驶舒适性方面始终优于多个最先进的基准。
{"title":"Multi-agent reinforcement learning for cooperative lane changing of connected and autonomous vehicles in mixed traffic","authors":"Wei Zhou,&nbsp;Dong Chen,&nbsp;Jun Yan,&nbsp;Zhaojian Li,&nbsp;Huilin Yin,&nbsp;Wanchen Ge","doi":"10.1007/s43684-022-00023-5","DOIUrl":"10.1007/s43684-022-00023-5","url":null,"abstract":"<div><p>Autonomous driving has attracted significant research interests in the past two decades as it offers many potential benefits, including releasing drivers from exhausting driving and mitigating traffic congestion, among others. Despite promising progress, lane-changing remains a great challenge for autonomous vehicles (AV), especially in mixed and dynamic traffic scenarios. Recently, reinforcement learning (RL) has been widely explored for lane-changing decision makings in AVs with encouraging results demonstrated. However, the majority of those studies are focused on a single-vehicle setting, and lane-changing in the context of multiple AVs coexisting with human-driven vehicles (HDVs) have received scarce attention. In this paper, we formulate the lane-changing decision-making of multiple AVs in a mixed-traffic highway environment as a multi-agent reinforcement learning (MARL) problem, where each AV makes lane-changing decisions based on the motions of both neighboring AVs and HDVs. Specifically, a multi-agent advantage actor-critic (MA2C) method is proposed with a novel local reward design and a parameter sharing scheme. In particular, a multi-objective reward function is designed to incorporate fuel efficiency, driving comfort, and the safety of autonomous driving. A comprehensive experimental study is made that our proposed MARL framework consistently outperforms several state-of-the-art benchmarks in terms of efficiency, safety, and driver comfort.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-022-00023-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"52856324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
End-of-Life Decision making in circular economy using generalized colored stochastic Petri nets 广义有色随机Petri网在循环经济中的寿命终止决策
Pub Date : 2022-03-12 DOI: 10.1007/s43684-022-00022-6
Gautier Vanson, Pascale Marangé, Eric Levrat

Circular economy enables to restore product value at the end of life i.e. when no longer used or damaged. Thus, the product life cycle is extended and this economy permits to reduce waste increase and resources rarefaction. There are several revaluation options (reuse, remanufacturing, recycling, …). So, decision makers need to assess these options to determine which is the best decision. Thus, we will present a study about an End-Of-Life (EoL) decision making which aims to facilitate the industrialization of circular economy. For this, it is essential to consider all variables and parameters impacting the decision of the product trajectory. A first part of the work proposes to identify the variables and parameters impacting the decision making. A second part proposes an assessment approach based on a modeling by Generalized Colored Stochastic Petri Net (GCSPN) and on a Monte-Carlo simulation. The approach developed is tested on an industrial example from the literature to analyze the efficiency and effectiveness of the model. This first application showed the feasibility of the approach, and also the limits of the GCSPN modelling.

循环经济能够在产品寿命终止时,即不再使用或损坏时,恢复产品的价值。这样,产品的生命周期就得到了延长,这种经济可以减少废物的增加和资源的稀缺。有几种重估选择(再利用、再制造、再循环......)。因此,决策者需要对这些方案进行评估,以确定哪个是最佳决策。因此,我们将介绍一项关于寿命终结(EoL)决策的研究,旨在促进循环经济的产业化。为此,必须考虑影响产品轨迹决策的所有变量和参数。工作的第一部分建议确定影响决策的变量和参数。第二部分提出了一种基于广义彩色随机 Petri 网(GCSPN)建模和蒙特卡洛模拟的评估方法。该方法在文献中的一个工业实例中进行了测试,以分析模型的效率和有效性。首次应用表明了该方法的可行性,以及 GCSPN 建模的局限性。
{"title":"End-of-Life Decision making in circular economy using generalized colored stochastic Petri nets","authors":"Gautier Vanson,&nbsp;Pascale Marangé,&nbsp;Eric Levrat","doi":"10.1007/s43684-022-00022-6","DOIUrl":"10.1007/s43684-022-00022-6","url":null,"abstract":"<div><p>Circular economy enables to restore product value at the end of life i.e. when no longer used or damaged. Thus, the product life cycle is extended and this economy permits to reduce waste increase and resources rarefaction. There are several revaluation options (reuse, remanufacturing, recycling, …). So, decision makers need to assess these options to determine which is the best decision. Thus, we will present a study about an End-Of-Life (EoL) decision making which aims to facilitate the industrialization of circular economy. For this, it is essential to consider all variables and parameters impacting the decision of the product trajectory. A first part of the work proposes to identify the variables and parameters impacting the decision making. A second part proposes an assessment approach based on a modeling by Generalized Colored Stochastic Petri Net (GCSPN) and on a Monte-Carlo simulation. The approach developed is tested on an industrial example from the literature to analyze the efficiency and effectiveness of the model. This first application showed the feasibility of the approach, and also the limits of the GCSPN modelling.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-022-00022-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43164694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mass estimation method for intelligent vehicles based on fusion of machine learning and vehicle dynamic model 基于机器学习和车辆动态模型融合的智能车辆质量估计方法
Pub Date : 2022-03-11 DOI: 10.1007/s43684-022-00020-8
Zhuoping Yu, Xinchen Hou, Bo Leng, Yuyao Huang

Vehicle mass is an important parameter for motion control of intelligent vehicles, but is hard to directly measure using normal sensors. Therefore, accurate estimation of vehicle mass becomes crucial. In this paper, a vehicle mass estimation method based on fusion of machine learning and vehicle dynamic model is introduced. In machine learning method, a feedforward neural network (FFNN) is used to learn the relationship between vehicle mass and other state parameters, namely longitudinal speed and acceleration, driving or braking torque, and wheel angular speed. In dynamics-based method, recursive least square (RLS) with forgetting factor based on vehicle dynamic model is used to estimate the vehicle mass. According to the reliability of each method under different conditions, these two methods are fused using fuzzy logic. Simulation tests under New European Driving Cycle (NEDC) condition are carried out. The simulation results show that the estimation accuracy of the fusion method is around 97%, and that the fusion method performs better stability and robustness compared with each single method.

车辆质量是智能车辆运动控制的一个重要参数,但很难用普通传感器直接测量。因此,准确估计车辆质量变得至关重要。本文介绍了一种基于机器学习和车辆动态模型融合的车辆质量估计方法。在机器学习方法中,使用前馈神经网络(FFNN)来学习车辆质量与其他状态参数(即纵向速度和加速度、驱动或制动扭矩以及车轮角速度)之间的关系。在基于动力学的方法中,使用基于车辆动力学模型的带遗忘因子的递归最小二乘法(RLS)来估计车辆质量。根据每种方法在不同条件下的可靠性,这两种方法使用模糊逻辑进行了融合。在新欧洲行驶循环(NEDC)条件下进行了模拟测试。仿真结果表明,融合方法的估计精度约为 97%,与每种单一方法相比,融合方法具有更好的稳定性和鲁棒性。
{"title":"Mass estimation method for intelligent vehicles based on fusion of machine learning and vehicle dynamic model","authors":"Zhuoping Yu,&nbsp;Xinchen Hou,&nbsp;Bo Leng,&nbsp;Yuyao Huang","doi":"10.1007/s43684-022-00020-8","DOIUrl":"10.1007/s43684-022-00020-8","url":null,"abstract":"<div><p>Vehicle mass is an important parameter for motion control of intelligent vehicles, but is hard to directly measure using normal sensors. Therefore, accurate estimation of vehicle mass becomes crucial. In this paper, a vehicle mass estimation method based on fusion of machine learning and vehicle dynamic model is introduced. In machine learning method, a feedforward neural network (FFNN) is used to learn the relationship between vehicle mass and other state parameters, namely longitudinal speed and acceleration, driving or braking torque, and wheel angular speed. In dynamics-based method, recursive least square (RLS) with forgetting factor based on vehicle dynamic model is used to estimate the vehicle mass. According to the reliability of each method under different conditions, these two methods are fused using fuzzy logic. Simulation tests under New European Driving Cycle (NEDC) condition are carried out. The simulation results show that the estimation accuracy of the fusion method is around 97%, and that the fusion method performs better stability and robustness compared with each single method.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-022-00020-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47404510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
自主智能系统(英文)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1