Pub Date: 2024-10-29 | eCollection Date: 2024-01-01 | DOI: 10.3389/fnbot.2024.1503038
Paloma de la Puente, Markus Vincze, Diego Guffanti, Daniel Galan
{"title":"Editorial: Assistive and service robots for health and home applications (RH3 - Robot Helpers in Health and Home).","authors":"Paloma de la Puente, Markus Vincze, Diego Guffanti, Daniel Galan","doi":"10.3389/fnbot.2024.1503038","DOIUrl":"https://doi.org/10.3389/fnbot.2024.1503038","url":null,"abstract":"","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1503038"},"PeriodicalIF":2.6,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11554614/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142618571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-10-22 | eCollection Date: 2024-01-01 | DOI: 10.3389/fnbot.2024.1488337
Lei Wang, Danping Liu, Jun Wang
Ensuring the representativeness of collected samples is the most critical requirement of water sampling. Unmanned surface vehicles (USVs) have been widely adopted for water sampling, but current USV sampling path planning tends to overemphasize path optimization while neglecting the collection of representative samples. This study proposes a modified A* algorithm that incorporates remote sensing techniques while considering both path length and the representativeness of collected samples. Water quality parameters were first retrieved from satellite remote sensing imagery using a deep belief network model, and the parameter value was incorporated as a coefficient Q in the heuristic function of the A* algorithm. An adjustment coefficient k was then introduced into the coefficient Q to tune the trade-off between sampling representativeness and path length. To evaluate the effectiveness of the algorithm, chlorophyll-a concentration (Chl-a) was employed as the test parameter, with Chaohu Lake as the study area. Results showed that the algorithm was effective in collecting more representative samples under real-world conditions. As the coefficient k increased, the representativeness of the collected samples improved, indicated by the sampled Chl-a closely approximating the overall mean Chl-a and exhibiting a gradient distribution; this improvement came at the cost of increased path length. This study is significant for USV water sampling and water environment protection.
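The abstract does not give the exact cost form, but the idea of folding a water-quality coefficient Q and an adjustment coefficient k into A* can be sketched as follows. The step cost 1 + k·q, where q is a cell's deviation from the regional mean, is an illustrative assumption, not the paper's formula; larger k steers the path through more representative (low-deviation) cells at the price of extra length.

```python
import heapq

def modified_astar(grid_q, start, goal, k=1.0):
    """A* on a grid where each cell carries a water-quality deviation q.

    Step cost is 1 + k * q, so raising k trades path length for sampling
    representativeness (an illustrative stand-in for the paper's Q/k scheme).
    Returns (total cost, path) or None if the goal is unreachable.
    """
    rows, cols = len(grid_q), len(grid_q[0])

    def h(p):
        # Manhattan distance; admissible because every step costs >= 1.
        return abs(p[0] - goal[0]) + abs(p[1] - goal[1])

    open_set = [(h(start), 0.0, start, [start])]
    best_g = {start: 0.0}
    while open_set:
        f, g, node, path = heapq.heappop(open_set)
        if node == goal:
            return g, path
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            r, c = node[0] + dr, node[1] + dc
            if 0 <= r < rows and 0 <= c < cols:
                ng = g + 1.0 + k * grid_q[r][c]
                if ng < best_g.get((r, c), float("inf")):
                    best_g[(r, c)] = ng
                    heapq.heappush(
                        open_set, (ng + h((r, c)), ng, (r, c), path + [(r, c)])
                    )
    return None
```

With k = 0 the planner reduces to plain shortest-path A*; with k > 0 it detours around cells whose parameter value deviates strongly from the mean.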
{"title":"A modified A* algorithm combining remote sensing technique to collect representative samples from unmanned surface vehicles.","authors":"Lei Wang, Danping Liu, Jun Wang","doi":"10.3389/fnbot.2024.1488337","DOIUrl":"10.3389/fnbot.2024.1488337","url":null,"abstract":"<p><p>Ensuring representativeness of collected samples is the most critical requirement of water sampling. Unmanned surface vehicles (USVs) have been widely adopted in water sampling, but current USV sampling path planning tend to overemphasize path optimization, neglecting the representative samples collection. This study proposed a modified A* algorithm that combined remote sensing technique while considering both path length and the representativeness of collected samples. Water quality parameters were initially retrieved using satellite remote sensing imagery and a deep belief network model, with the parameter value incorporated as coefficient <i>Q</i> in the heuristic function of A* algorithm. The adjustment coefficient <i>k</i> was then introduced into the coefficient <i>Q</i> to optimize the trade-off between sampling representativeness and path length. To evaluate the effectiveness of this algorithm, Chlorophyll-a concentration (Chl-a) was employed as the test parameter, with Chaohu Lake as the study area. Results showed that the algorithm was effective in collecting more representative samples in real-world conditions. As the coefficient <i>k</i> increased, the representativeness of collected samples enhanced, indicated by the Chl-a closely approximating the overall mean Chl-a and exhibiting a gradient distribution. This enhancement was also associated with increased path length. 
This study is significant in USV water sampling and water environment protection.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1488337"},"PeriodicalIF":2.6,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11535655/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142582574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-10-21 | eCollection Date: 2024-01-01 | DOI: 10.3389/fnbot.2024.1443177
Libo Ma, Yan Tong
Currently, the application of robotics technology in sports training and competitions is rapidly increasing. Traditional methods mainly rely on image or video data, neglecting the effective utilization of textual information. To address this issue, we propose TL-CStrans Net, a vision robot for table tennis player action recognition driven by a CS-Transformer. This multimodal approach combines the CS-Transformer, CLIP, and transfer learning to effectively integrate visual and textual information. First, we employ the CS-Transformer model as the neural computing backbone; it processes visual information extracted from table tennis game scenes, enabling accurate stroke recognition. Next, we introduce the CLIP model, which bridges computer vision and natural language processing: CLIP jointly learns representations of images and text, aligning the visual and textual modalities. Finally, to reduce training and computational requirements, we apply the pre-trained CS-Transformer and CLIP models to the table tennis stroke recognition task via transfer learning, reusing knowledge they have already acquired from related domains. Experimental results demonstrate the outstanding performance of TL-CStrans Net in table tennis stroke recognition. Our research is significant for promoting the application of multimodal robotics technology in sports and for bridging neural computing, computer vision, and neuroscience.
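The CLIP-style alignment step described above can be sketched as zero-shot matching: an image embedding is compared against the text embedding of each stroke label by cosine similarity, and the closest label wins. The embeddings and labels below are placeholders; in the paper they would come from the pre-trained CS-Transformer and CLIP encoders.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    num = sum(a * b for a, b in zip(u, v))
    den = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return num / den

def classify_stroke(image_embedding, text_embeddings):
    """CLIP-style zero-shot classification: return the stroke label whose
    text embedding is most similar to the image embedding."""
    return max(text_embeddings,
               key=lambda label: cosine(image_embedding, text_embeddings[label]))
```

For example, an image embedding pointing mostly along the "forehand" text direction is classified as a forehand stroke.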
{"title":"TL-CStrans Net: a vision robot for table tennis player action recognition driven via CS-Transformer.","authors":"Libo Ma, Yan Tong","doi":"10.3389/fnbot.2024.1443177","DOIUrl":"10.3389/fnbot.2024.1443177","url":null,"abstract":"<p><p>Currently, the application of robotics technology in sports training and competitions is rapidly increasing. Traditional methods mainly rely on image or video data, neglecting the effective utilization of textual information. To address this issue, we propose: TL-CStrans Net: A vision robot for table tennis player action recognition driven via CS-Transformer. This is a multimodal approach that combines CS-Transformer, CLIP, and transfer learning techniques to effectively integrate visual and textual information. Firstly, we employ the CS-Transformer model as the neural computing backbone. By utilizing the CS-Transformer, we can effectively process visual information extracted from table tennis game scenes, enabling accurate stroke recognition. Then, we introduce the CLIP model, which combines computer vision and natural language processing. CLIP allows us to jointly learn representations of images and text, thereby aligning the visual and textual modalities. Finally, to reduce training and computational requirements, we leverage pre-trained CS-Transformer and CLIP models through transfer learning, which have already acquired knowledge from relevant domains, and apply them to table tennis stroke recognition tasks. Experimental results demonstrate the outstanding performance of TL-CStrans Net in table tennis stroke recognition. 
Our research is of significant importance in promoting the application of multimodal robotics technology in the field of sports and bridging the gap between neural computing, computer vision, and neuroscience.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1443177"},"PeriodicalIF":2.6,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11532032/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142575211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-10-21 | eCollection Date: 2024-01-01 | DOI: 10.3389/fnbot.2024.1508032
[This corrects the article DOI: 10.3389/fnbot.2024.1452019.].
{"title":"Erratum: Swimtrans Net: a multimodal robotic system for swimming action recognition driven via Swin-Transformer.","authors":"","doi":"10.3389/fnbot.2024.1508032","DOIUrl":"https://doi.org/10.3389/fnbot.2024.1508032","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.3389/fnbot.2024.1452019.].</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1508032"},"PeriodicalIF":2.6,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11551536/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142618573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Panoptic segmentation plays a crucial role in enabling robots to comprehend their surroundings, providing fine-grained scene understanding for robots' intelligent tasks. Although existing methods have made some progress, they are prone to failure in areas with weak textures, small objects, and the like. Inspired by biological vision research, we propose a cascaded contour-enhanced panoptic segmentation network, CCPSNet, which attempts to enhance the discriminability of instances through structural knowledge. To acquire the scene structure, a cascade contour detection stream is designed that extracts comprehensive scene contours using a channel-regulation structural perception module and a coarse-to-fine cascade strategy. Furthermore, a contour-guided multi-scale feature enhancement stream is developed to boost the discrimination of small objects and weak textures; it integrates contour information with multi-scale context features through a structure-aware feature modulation module and an inverse aggregation technique. Experimental results show that our method improves accuracy on the Cityscapes (61.2 PQ) and COCO (43.5 PQ) datasets while remaining robust in simulated real-world scenarios that challenge robots, such as dirty cameras and rainy conditions. The proposed network promises to help robots perceive real scenes. In future work, an unsupervised training strategy could be explored to reduce the training cost.
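The idea of contour-guided feature modulation can be illustrated with a toy gating function: feature values are re-weighted by a sigmoid gate driven by the contour response at the same location, so pixels on strong contours are amplified. The sigmoid form and its gain/bias parameters are assumptions standing in for the learned module, not the paper's architecture.

```python
import math

def contour_modulate(features, contours, gain=2.0, bias=-1.0):
    """Toy structure-aware feature modulation.

    Each feature value is multiplied by sigmoid(gain * contour + bias),
    so locations with a strong contour response keep more of their
    feature energy.  gain and bias stand in for learned parameters.
    """
    def gate(c):
        return 1.0 / (1.0 + math.exp(-(gain * c + bias)))
    return [[f * gate(c) for f, c in zip(frow, crow)]
            for frow, crow in zip(features, contours)]
```

In the full network this gating would be applied per channel and per scale; here a single 2-D map suffices to show the effect.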
{"title":"Cascade contour-enhanced panoptic segmentation for robotic vision perception.","authors":"Yue Xu, Runze Liu, Dongchen Zhu, Lili Chen, Xiaolin Zhang, Jiamao Li","doi":"10.3389/fnbot.2024.1489021","DOIUrl":"10.3389/fnbot.2024.1489021","url":null,"abstract":"<p><p>Panoptic segmentation plays a crucial role in enabling robots to comprehend their surroundings, providing fine-grained scene understanding information for robots' intelligent tasks. Although existing methods have made some progress, they are prone to fail in areas with weak textures, small objects, etc. Inspired by biological vision research, we propose a cascaded contour-enhanced panoptic segmentation network called CCPSNet, attempting to enhance the discriminability of instances through structural knowledge. To acquire the scene structure, a cascade contour detection stream is designed, which extracts comprehensive scene contours using channel regulation structural perception module and coarse-to-fine cascade strategy. Furthermore, the contour-guided multi-scale feature enhancement stream is developed to boost the discrimination ability for small objects and weak textures. The stream integrates contour information and multi-scale context features through structural-aware feature modulation module and inverse aggregation technique. Experimental results show that our method improves accuracy on the Cityscapes (61.2 PQ) and COCO (43.5 PQ) datasets while also demonstrating robustness in challenging simulated real-world complex scenarios faced by robots, such as dirty cameras and rainy conditions. The proposed network promises to help the robot perceive the real scene. 
In future work, an unsupervised training strategy for the network could be explored to reduce the training cost.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1489021"},"PeriodicalIF":2.6,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11532083/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142577450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-10-18 | eCollection Date: 2024-01-01 | DOI: 10.3389/fnbot.2024.1477232
Zhiquan Chen, Jiabao Guo, Yishan Liu, Mengqian Tian, Xingsong Wang
In this work, the mechanical principles of external fixation and resistance training for a wrist affected by a distal radius fracture (DRF) are revealed. Based on the biomechanical analysis, two wearable exoskeleton devices are proposed to facilitate DRF rehabilitation. Chronologically, the adjustable fixation device (AFD) provides fixation and limited mobilization of the fractured wrist in the early stage, while functional recovery of the relevant muscles is achieved by the resistance training device (RTD) in the later stage. Based on the designed mechatronic systems of the AFD and RTD, experimental prototypes of the two apparatuses were built. Experiments investigated the actual motion ranges of the AFD and validated the feasibility of monitoring joint angles. Meanwhile, the resistance effects of the RTD were analyzed using surface electromyography (sEMG) signal features; the results demonstrate that training-induced muscle strength enhancement generally increases with external resistance. The exoskeleton devices presented in this work would be beneficial for the active rehabilitation of patients with DRF.
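Two standard sEMG amplitude features commonly used for this kind of muscle-effort analysis are the root-mean-square (RMS) and mean absolute value (MAV) of a signal window. The paper's exact feature set is not specified here; this is a minimal sketch of the common choices.

```python
import math

def semg_features(window):
    """Return (RMS, MAV) amplitude features of one sEMG signal window.

    RMS = sqrt(mean(x^2)); MAV = mean(|x|).  Both grow with contraction
    intensity, which is how rising muscle effort under increasing external
    resistance would show up in the signal.
    """
    n = len(window)
    rms = math.sqrt(sum(x * x for x in window) / n)
    mav = sum(abs(x) for x in window) / n
    return rms, mav
```

In practice these features are computed over short sliding windows (e.g. 100-250 ms) of the band-passed sEMG signal and compared across resistance levels.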
{"title":"Design and analysis of exoskeleton devices for rehabilitation of distal radius fracture.","authors":"Zhiquan Chen, Jiabao Guo, Yishan Liu, Mengqian Tian, Xingsong Wang","doi":"10.3389/fnbot.2024.1477232","DOIUrl":"10.3389/fnbot.2024.1477232","url":null,"abstract":"<p><p>In this work, the mechanical principles of external fixation and resistance training for the wrist affected by a distal radius fracture (DRF) are revealed. Based on the biomechanical analysis, two wearable exoskeleton devices are proposed to facilitate the DRF rehabilitation progress. Chronologically, the adjustable fixation device (AFD) provides fixed protection and limited mobilization of the fractured wrist in the early stage, while the functional recovery of relevant muscles is achieved by the resistance training device (RTD) in the later stage. According to the designed mechatronic systems of AFD and RTD, the experimental prototypes for these two apparatuses are established. By experiments, the actual motion ranges of AFD are investigated, and the feasibility in monitoring joint angles are validated. Meanwhile, the resistant influences of RTD are analyzed based on the surface electromyography (sEMG) signal features, the results demonstrate that the training-induced muscle strength enhancement is generally increased with the increment in external resistance. 
The exoskeleton devices presented in this work would be beneficial for the active rehabilitation of patients with DRF.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1477232"},"PeriodicalIF":2.6,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11527727/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142570965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-10-14 | eCollection Date: 2024-01-01 | DOI: 10.3389/fnbot.2024.1484088
Zixin Huang, Xuesong Tao, Xinyuan Liu
Object detection plays a crucial role in robotic vision, focusing on accurately identifying and localizing objects within images. However, many existing methods encounter limitations, particularly when it comes to effectively implementing a one-to-many matching strategy. To address these challenges, we propose NAN-DETR (Noising Multi-Anchor Detection Transformer), an innovative framework based on DETR (Detection Transformer). NAN-DETR introduces three key improvements to transformer-based object detection: a decoder-based multi-anchor strategy, a centralization noising mechanism, and the integration of Complete Intersection over Union (CIoU) loss. The multi-anchor strategy leverages multiple anchors per object, significantly enhancing detection accuracy by improving the one-to-many matching process. The centralization noising mechanism mitigates conflicts among anchors by injecting controlled noise into the detection boxes, thereby increasing the robustness of the model. Additionally, CIoU loss, which incorporates both aspect ratio and spatial distance in its calculations, results in more precise bounding box predictions compared to the conventional IoU loss. Although NAN-DETR may not drastically improve real-time processing capabilities, its exceptional performance positions it as a highly reliable solution for diverse object detection scenarios.
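The CIoU loss mentioned above has a standard closed form: 1 − IoU, plus a center-distance term normalized by the enclosing box's diagonal, plus an aspect-ratio consistency term. A minimal implementation for axis-aligned (x1, y1, x2, y2) boxes:

```python
import math

def ciou_loss(b1, b2, eps=1e-9):
    """Complete IoU loss between two boxes in (x1, y1, x2, y2) form."""
    # Intersection and union areas.
    ix1, iy1 = max(b1[0], b2[0]), max(b1[1], b2[1])
    ix2, iy2 = min(b1[2], b2[2]), min(b1[3], b2[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    a1 = (b1[2] - b1[0]) * (b1[3] - b1[1])
    a2 = (b2[2] - b2[0]) * (b2[3] - b2[1])
    iou = inter / (a1 + a2 - inter + eps)
    # Squared distance between box centers.
    rho2 = (((b1[0] + b1[2]) - (b2[0] + b2[2])) ** 2
            + ((b1[1] + b1[3]) - (b2[1] + b2[3])) ** 2) / 4.0
    # Squared diagonal of the smallest enclosing box.
    cw = max(b1[2], b2[2]) - min(b1[0], b2[0])
    ch = max(b1[3], b2[3]) - min(b1[1], b2[1])
    c2 = cw * cw + ch * ch + eps
    # Aspect-ratio consistency term v and its trade-off weight alpha.
    v = (4.0 / math.pi ** 2) * (math.atan((b1[2] - b1[0]) / (b1[3] - b1[1]))
                                - math.atan((b2[2] - b2[0]) / (b2[3] - b2[1]))) ** 2
    alpha = v / (1.0 - iou + v + eps)
    return 1.0 - iou + rho2 / c2 + alpha * v
```

Unlike plain IoU loss, the distance and aspect-ratio terms keep the gradient informative even for non-overlapping or badly shaped predictions.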
{"title":"NAN-DETR: noising multi-anchor makes DETR better for object detection.","authors":"Zixin Huang, Xuesong Tao, Xinyuan Liu","doi":"10.3389/fnbot.2024.1484088","DOIUrl":"10.3389/fnbot.2024.1484088","url":null,"abstract":"<p><p>Object detection plays a crucial role in robotic vision, focusing on accurately identifying and localizing objects within images. However, many existing methods encounter limitations, particularly when it comes to effectively implementing a one-to-many matching strategy. To address these challenges, we propose NAN-DETR (Noising Multi-Anchor Detection Transformer), an innovative framework based on DETR (Detection Transformer). NAN-DETR introduces three key improvements to transformer-based object detection: a decoder-based multi-anchor strategy, a centralization noising mechanism, and the integration of Complete Intersection over Union (CIoU) loss. The multi-anchor strategy leverages multiple anchors per object, significantly enhancing detection accuracy by improving the one-to-many matching process. The centralization noising mechanism mitigates conflicts among anchors by injecting controlled noise into the detection boxes, thereby increasing the robustness of the model. Additionally, CIoU loss, which incorporates both aspect ratio and spatial distance in its calculations, results in more precise bounding box predictions compared to the conventional IoU loss. 
Although NAN-DETR may not drastically improve real-time processing capabilities, its exceptional performance positions it as a highly reliable solution for diverse object detection scenarios.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1484088"},"PeriodicalIF":2.6,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11513373/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142521681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-10-14 | eCollection Date: 2024-01-01 | DOI: 10.3389/fnbot.2024.1466571
Xu Liao, Le Li, Chuangxia Huang, Xian Zhao, Shumin Tan
Improving the success rate of autonomous underwater vehicle (AUV) path planning while reducing travel time as much as possible is a challenging and crucial problem for practical AUV applications in complex ocean-current environments. Traditional reinforcement learning algorithms explore the environment insufficiently, and the strategies the agent learns may not generalize well to other environments. To address these challenges, we propose a novel AUV path planning algorithm, the Noisy Dueling Double Deep Q-Network (ND3QN), which generalizes the traditional D3QN algorithm by modifying the reward function and introducing a noisy network. Compared with classical algorithms (e.g., Rapidly-exploring Random Trees Star (RRT*), DQN, and D3QN) in simulation experiments conducted on realistic terrain and ocean currents, the proposed ND3QN algorithm demonstrates a higher success rate of AUV path planning, shorter travel time, and smoother paths.
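Two of ND3QN's ingredients have compact standard forms: the dueling aggregation Q(s, a) = V(s) + A(s, a) − mean(A), and the Double DQN target, in which the online network selects the next action while the target network evaluates it. A sketch of both (the noisy-network exploration layer and the paper's modified reward are omitted):

```python
def dueling_q(value, advantages):
    """Dueling head aggregation: Q(s, a) = V(s) + A(s, a) - mean_a A(s, a)."""
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]

def double_dqn_target(reward, gamma, q_online_next, q_target_next, done):
    """Double DQN bootstrap target.

    The online network's argmax chooses the next action; the target
    network's value for that action is used, reducing the over-estimation
    bias of vanilla DQN.
    """
    if done:
        return reward
    best = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return reward + gamma * q_target_next[best]
```

In ND3QN a NoisyNet layer would replace epsilon-greedy exploration by adding learned parametric noise to the network weights.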
{"title":"Noisy Dueling Double Deep Q-Network algorithm for autonomous underwater vehicle path planning.","authors":"Xu Liao, Le Li, Chuangxia Huang, Xian Zhao, Shumin Tan","doi":"10.3389/fnbot.2024.1466571","DOIUrl":"10.3389/fnbot.2024.1466571","url":null,"abstract":"<p><p>How to improve the success rate of autonomous underwater vehicle (AUV) path planning and reduce travel time as much as possible is a very challenging and crucial problem in the practical applications of AUV in the complex ocean current environment. Traditional reinforcement learning algorithms lack exploration of the environment, and the strategies learned by the agent may not generalize well to other different environments. To address these challenges, we propose a novel AUV path planning algorithm named the Noisy Dueling Double Deep Q-Network (ND3QN) algorithm by modifying the reward function and introducing a noisy network, which generalizes the traditional D3QN algorithm. Compared with the classical algorithm [e.g., Rapidly-exploring Random Trees Star (RRT*), DQN, and D3QN], with simulation experiments conducted in realistic terrain and ocean currents, the proposed ND3QN algorithm demonstrates the outstanding characteristics of a higher success rate of AUV path planning, shorter travel time, and smoother paths.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1466571"},"PeriodicalIF":2.6,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11513341/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142521682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-10-11 | eCollection Date: 2024-01-01 | DOI: 10.3389/fnbot.2024.1453571
Hong LinLin, Lee Sangheang, Song GuanTing
Introduction: Assistive robots and human-robot interaction have become integral parts of sports training. However, existing methods often fail to provide real-time and accurate feedback, and they often lack integration of comprehensive multi-modal data.
Methods: To address these issues, we propose CAM-Vtrans, a Cross-Attention Multi-modal Visual Transformer. By leveraging state-of-the-art techniques such as Vision Transformers (ViT) and models like CLIP, together with cross-attention mechanisms, CAM-Vtrans harnesses visual and textual information to provide athletes with accurate and timely feedback. Through the use of multi-modal robot data, CAM-Vtrans offers valuable assistance, enabling athletes to optimize their performance while minimizing potential injury risks. This approach represents a significant advancement in the field, offering a solution that overcomes the limitations of existing methods and enhances the precision and efficiency of sports training programs.
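The cross-attention mechanism at the core of an architecture like this can be sketched as single-head scaled dot-product attention, with queries from one modality attending over keys/values from the other; the learned projection matrices and multi-head split are omitted for brevity.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def cross_attention(queries, keys, values):
    """Single-head scaled dot-product cross-attention.

    queries come from one modality (e.g. text tokens) and attend over
    keys/values from the other (e.g. visual patches); output is one
    attended vector per query.
    """
    d = len(keys[0])
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in keys]
        w = softmax(scores)
        out.append([sum(wi * v[j] for wi, v in zip(w, values))
                    for j in range(len(values[0]))])
    return out
```

A query aligned with one key pulls the output toward that key's value, which is how textual feedback cues could be grounded in specific visual regions.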
{"title":"CAM-Vtrans: real-time sports training utilizing multi-modal robot data.","authors":"Hong LinLin, Lee Sangheang, Song GuanTing","doi":"10.3389/fnbot.2024.1453571","DOIUrl":"10.3389/fnbot.2024.1453571","url":null,"abstract":"<p><strong>Introduction: </strong>Assistive robots and human-robot interaction have become integral parts of sports training. However, existing methods often fail to provide real-time and accurate feedback, and they often lack integration of comprehensive multi-modal data.</p><p><strong>Methods: </strong>To address these issues, we propose a groundbreaking and innovative approach: CAM-Vtrans-Cross-Attention Multi-modal Visual Transformer. By leveraging the strengths of state-of-the-art techniques such as Visual Transformers (ViT) and models like CLIP, along with cross-attention mechanisms, CAM-Vtrans harnesses the power of visual and textual information to provide athletes with highly accurate and timely feedback. Through the utilization of multi-modal robot data, CAM-Vtrans offers valuable assistance, enabling athletes to optimize their performance while minimizing potential injury risks. 
This novel approach represents a significant advancement in the field, offering an innovative solution to overcome the limitations of existing methods and enhance the precision and efficiency of sports training programs.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1453571"},"PeriodicalIF":2.6,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11502466/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142516399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-10-11 | eCollection Date: 2024-01-01 | DOI: 10.3389/fnbot.2024.1443432
Qi Lu
Introduction: Accurately recognizing and understanding human motion actions presents a key challenge in the development of intelligent sports robots. Traditional methods often encounter significant drawbacks, such as high computational resource requirements and suboptimal real-time performance. To address these limitations, this study proposes a novel approach called Sports-ACtrans Net.
Methods: In this approach, the Swin Transformer processes visual data to extract spatial features, while the Spatio-Temporal Graph Convolutional Network (ST-GCN) models human motion as graphs to handle skeleton data. By combining these outputs, a comprehensive representation of motion actions is created. Reinforcement learning is employed to optimize the action recognition process, framing it as a sequential decision-making problem. Deep Q-learning is utilized to learn the optimal policy, thereby enhancing the robot's ability to accurately recognize and engage in motion.
Results and discussion: Experiments demonstrate significant improvements over state-of-the-art methods. This research advances the fields of neural computation, computer vision, and neuroscience, aiding in the development of intelligent robotic systems capable of understanding and participating in sports activities.
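The spatial step of an ST-GCN layer operates on a symmetrically normalized skeleton adjacency matrix; a minimal sketch of that normalization follows (the temporal convolution, graph partitioning, and learned weights are omitted).

```python
import math

def normalized_adjacency(adj):
    """Symmetrically normalized skeleton adjacency for a spatial graph conv:
    A_hat = D^{-1/2} (A + I) D^{-1/2}, with self-loops added so each joint
    also aggregates its own features."""
    n = len(adj)
    a = [[adj[i][j] + (1 if i == j else 0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a]
    d_inv_sqrt = [1.0 / math.sqrt(d) for d in deg]
    return [[d_inv_sqrt[i] * a[i][j] * d_inv_sqrt[j] for j in range(n)]
            for i in range(n)]
```

One spatial graph-convolution step is then X' = A_hat · X · W, applied per frame before the temporal convolution mixes information across time.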
{"title":"Sports-ACtrans Net: research on multimodal robotic sports action recognition driven via ST-GCN.","authors":"Qi Lu","doi":"10.3389/fnbot.2024.1443432","DOIUrl":"10.3389/fnbot.2024.1443432","url":null,"abstract":"<p><strong>Introduction: </strong>Accurately recognizing and understanding human motion actions presents a key challenge in the development of intelligent sports robots. Traditional methods often encounter significant drawbacks, such as high computational resource requirements and suboptimal real-time performance. To address these limitations, this study proposes a novel approach called Sports-ACtrans Net.</p><p><strong>Methods: </strong>In this approach, the Swin Transformer processes visual data to extract spatial features, while the Spatio-Temporal Graph Convolutional Network (ST-GCN) models human motion as graphs to handle skeleton data. By combining these outputs, a comprehensive representation of motion actions is created. Reinforcement learning is employed to optimize the action recognition process, framing it as a sequential decision-making problem. Deep Q-learning is utilized to learn the optimal policy, thereby enhancing the robot's ability to accurately recognize and engage in motion.</p><p><strong>Results and discussion: </strong>Experiments demonstrate significant improvements over state-of-the-art methods. 
This research advances the fields of neural computation, computer vision, and neuroscience, aiding in the development of intelligent robotic systems capable of understanding and participating in sports activities.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1443432"},"PeriodicalIF":2.6,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11502397/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142498770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}