Intelligent robots play a crucial role in enhancing efficiency, reducing costs, and improving safety in the logistics industry. However, traditional path planning methods often struggle to adapt to dynamic environments, leading to issues such as collisions and conflicts. This study aims to address the challenges of path planning and control for logistics robots in complex environments.
The proposed method integrates information from different perception modalities to achieve more accurate path planning and obstacle avoidance control, thereby enhancing the autonomy and reliability of logistics robots. First, a 3D convolutional neural network (CNN) learns feature representations of objects in the environment for object recognition. Next, a long short-term memory (LSTM) network models spatio-temporal features to predict the behavior and trajectories of dynamic obstacles, allowing the robot to anticipate future obstacle positions in complex environments and reduce collision risk. Finally, Dijkstra's algorithm is applied for path planning and control decisions to ensure that the robot selects an optimal path across scenarios.
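For the planning step, the sketch below shows grid-based Dijkstra path planning in which cells predicted to be occupied (e.g. by an obstacle-trajectory predictor) are treated as blocked. The grid representation, unit move cost, and function name are illustrative assumptions, not the paper's implementation.

```python
import heapq

def dijkstra_grid(grid, start, goal):
    """Shortest path on a 2D occupancy grid (0 = free, 1 = obstacle).

    Cells predicted to be occupied by dynamic obstacles (e.g. from an
    LSTM trajectory predictor) can be marked as 1 before planning.
    """
    rows, cols = len(grid), len(grid[0])
    dist = {start: 0.0}
    prev = {}
    heap = [(0.0, start)]
    while heap:
        d, node = heapq.heappop(heap)
        if node == goal:
            break
        if d > dist.get(node, float("inf")):
            continue  # stale queue entry
        r, c = node
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                nd = d + 1.0  # uniform move cost between adjacent free cells
                if nd < dist.get((nr, nc), float("inf")):
                    dist[(nr, nc)] = nd
                    prev[(nr, nc)] = node
                    heapq.heappush(heap, (nd, (nr, nc)))
    # Reconstruct the path by walking predecessors back from the goal.
    path, node = [], goal
    while node in prev or node == start:
        path.append(node)
        if node == start:
            return path[::-1]
        node = prev[node]
    return None  # goal unreachable
```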
Experimental results show that the proposed method outperforms traditional approaches in both path planning accuracy and obstacle avoidance performance.
The intelligent path planning and control scheme presented in this paper enhances the practicality of logistics robots in complex environments, thereby promoting efficiency and safety in the logistics industry.
Reinforcement learning has been widely used in robot motion planning. However, for multi-step complex tasks performed by dual-arm robots, reinforcement-learning-based trajectory planning still suffers from a large exploration space, long training times, and an uncontrollable training process. Based on the dual-agent deep deterministic policy gradient (DADDPG) algorithm, this study proposes a motion planning framework constrained by human joint angles, making both what is learned and how it is learned more human-like, and enabling rapid planning of coordinated dual-arm trajectories for complex multi-step tasks.
The proposed framework has two main parts. The first is the modeling of human joint angle constraints: joint angles are computed from human arm motion data measured by an inertial measurement unit (IMU) through a human-robot dual-arm kinematic mapping model, and joint angle range constraints are then extracted from multiple groups of demonstration data and expressed as inequalities. The second is the design of a segmented reward function, in which the human joint angle constraints guide the exploration of the reinforcement learning agents in the form of a step reward. As a result, the exploration space is reduced, training is accelerated, and the learning process becomes controllable to a certain extent.
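A minimal sketch of how demonstrated joint angle ranges could be turned into inequality bounds and a step reward is given below; the bound-extraction rule, margins, reward magnitudes, and function names are illustrative assumptions rather than the authors' implementation.

```python
import numpy as np

def extract_angle_bounds(demo_joint_angles, margin=0.05):
    """Extract per-joint angle ranges from demonstration data.

    demo_joint_angles: array of shape (num_samples, num_joints), e.g. joint
    angles mapped from IMU measurements of the human arm.
    Returns (lower, upper), i.e. the inequality lower <= q <= upper.
    """
    lower = demo_joint_angles.min(axis=0) - margin
    upper = demo_joint_angles.max(axis=0) + margin
    return lower, upper

def constraint_step_reward(q, lower, upper, bonus=0.1, penalty=-0.1):
    """Step reward favoring joint configurations inside the demonstrated range.

    This term would be added to the task reward at every step, shrinking the
    effective exploration space toward human-like configurations.
    """
    inside = np.all((q >= lower) & (q <= upper))
    return bonus if inside else penalty
```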
The effectiveness of the framework was verified in a Gym simulation environment on the Baxter robot's reach-grasp-align task. The results show that, within this framework, human experience significantly guides learning and that the method plans coordinated dual-arm trajectories for multi-step tasks more quickly.
Several studies have shown that coordination among neural ensembles is key to understanding human cognition. A well-charted path is to identify coordination states associated with cognitive functions from spectral changes in EEG or MEG oscillations. A growing number of studies suggest that the tendency to switch between coordination states sculpts the brain's dynamic repertoire and can be indexed by a measure known as metastability. In this article, we characterize perturbations in the metastability of global brain network dynamics following transcranial magnetic stimulation (TMS), which quantify the duration for which information processing is altered, allowing researchers to understand the network effects of brain stimulation, standardize stimulation protocols, and design experimental tasks. We demonstrate the effect empirically using publicly available datasets and use a digital twin (a whole-brain connectome model) to understand the dynamic principles that generate such observations. We observed a significant reduction in metastability, concurrent with an increase in coherence, following single-pulse TMS, reflecting the existence of a window in which neural coordination is altered. The reduction in complexity was validated by an additional measure based on the Lempel-Ziv complexity of microstate-labeled EEG data. Interestingly, higher frequencies in the EEG signal showed faster recovery in metastability than lower frequencies. The digital twin shed light on how the phase resetting introduced by single-pulse TMS in local cortical networks can propagate globally, giving rise to changes in metastability and coherence.
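For reference, metastability is commonly indexed as the temporal standard deviation of the Kuramoto order parameter, with its mean serving as a coherence summary. The sketch below assumes Hilbert-transform phases of already band-pass-filtered channels, which may differ from the authors' exact pipeline.

```python
import numpy as np
from scipy.signal import hilbert

def metastability_index(band_limited_eeg):
    """Metastability as the temporal variability of phase synchrony.

    band_limited_eeg: array of shape (n_channels, n_samples), already
    band-pass filtered in the frequency band of interest.
    Returns (metastability, mean_coherence).
    """
    # Instantaneous phase of each channel via the analytic signal.
    phases = np.angle(hilbert(band_limited_eeg, axis=1))
    # Kuramoto order parameter R(t): degree of phase alignment across channels.
    order_param = np.abs(np.mean(np.exp(1j * phases), axis=0))
    # SD of R(t) indexes metastability; its mean summarizes coherence.
    return order_param.std(), order_param.mean()
```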
Deep learning has significantly advanced text-to-speech (TTS) systems. These neural network-based systems have enhanced speech synthesis quality and are increasingly vital in applications like human-computer interaction. However, conventional TTS models still face challenges: the synthesized speech often lacks naturalness and expressiveness, and slow inference reflects low efficiency and further limits the delivered voice quality. This paper introduces SynthRhythm-TTS (SR-TTS), an optimized Transformer-based architecture designed to enhance synthesized speech. SR-TTS not only improves phonological quality and naturalness but also accelerates speech generation, thereby increasing inference efficiency. SR-TTS consists of an encoder, a rhythm coordinator, and a decoder. In particular, a pre-duration predictor within the rhythm coordinator and a self-attention-based feature predictor work together to enhance the naturalness and articulatory accuracy of speech. In addition, the introduction of causal convolution improves the consistency of the time series. The cross-linguistic capability of SR-TTS is validated by training it on both English and Chinese corpora. Human evaluation shows that SR-TTS outperforms existing techniques in terms of speech quality and naturalness of expression. This technology is particularly suitable for applications that require high-quality natural speech, such as intelligent assistants, speech-synthesized podcasts, and human-computer interaction.
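For the causal convolution mentioned above, a generic PyTorch sketch of a causal 1D convolution is shown below; the layer sizes, class name, and placement are assumptions for illustration rather than details taken from the SR-TTS implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CausalConv1d(nn.Module):
    """1D convolution that only attends to current and past time steps."""

    def __init__(self, channels, kernel_size=3, dilation=1):
        super().__init__()
        # Left padding of (kernel_size - 1) * dilation keeps the layer causal.
        self.left_pad = (kernel_size - 1) * dilation
        self.conv = nn.Conv1d(channels, channels, kernel_size, dilation=dilation)

    def forward(self, x):
        # x: (batch, channels, time), e.g. a sequence of frame-level features.
        x = F.pad(x, (self.left_pad, 0))  # pad only on the left (past) side
        return self.conv(x)

# Example usage: a (batch=2, channels=64, time=100) feature sequence
# keeps its length: CausalConv1d(64)(torch.randn(2, 64, 100)) -> (2, 64, 100).
```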