首页 > 最新文献

Frontiers in Neurorobotics最新文献

英文 中文
Correction: Path planning of industrial robots based on the adaptive field cooperative sampling algorithm. 修正:基于自适应现场协同采样算法的工业机器人路径规划。
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-18 eCollection Date: 2025-01-01 DOI: 10.3389/fnbot.2025.1754834
Yongbo Zhuang, Sha Luo, Qingdang Li, Dianming Chu, Wenjuan Bai, Xintao Liu, Mingyuan Fan, Lv Wei

[This corrects the article DOI: 10.3389/fnbot.2025.1574044.].

[这更正了文章DOI: 10.3389/fnbot.2025.1574044.]。
{"title":"Correction: Path planning of industrial robots based on the adaptive field cooperative sampling algorithm.","authors":"Yongbo Zhuang, Sha Luo, Qingdang Li, Dianming Chu, Wenjuan Bai, Xintao Liu, Mingyuan Fan, Lv Wei","doi":"10.3389/fnbot.2025.1754834","DOIUrl":"https://doi.org/10.3389/fnbot.2025.1754834","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.3389/fnbot.2025.1574044.].</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"19 ","pages":"1754834"},"PeriodicalIF":2.8,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12757013/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145900306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IAP-TransUNet: integration of the attention mechanism and pyramid pooling for medical image segmentation. IAP-TransUNet:将注意力机制与金字塔池相结合用于医学图像分割。
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-01 eCollection Date: 2025-01-01 DOI: 10.3389/fnbot.2025.1706626
Yuxuan Shi, Fang Li, Shuting Zhao, Hongmeng Yu, Xinrong Chen, Quan Liu

Introduction: The combination of CNN and Transformer has attracted much attention for medical image segmentation due to its superior performance at present. However, the segmentation performance is affected by limitations such as the local receptive field and static weights of CNN convolution operations, as well as insufficient information exchange between Transformer local regions.

Methods: To address these issues, an integrated attention mechanism and pyramid pooling network is proposed in this paper. Firstly, an efficient channel attention mechanism is embedded into CNN to extract more comprehensive image features. Then, CBAM_ASPP module is introduced into the bottleneck layer to obtain multi-scale context information. Finally, in order to address the limitations of traditional convolution, depthwise separable convolution is used to achieve a lightweight network.

Results: The experiments based on the Synapse multi organ segmentation dataset and ACDC dataset showed that the proposed IAP-TransUNet achieved Dice similarity coefficients (DSCs) of 78.85% and 90.46%, respectively. Compared with the state-of-the-art method, for the Synapse multi organ segmentation dataset, the Hausdorff distance was reduced by 2.92%. For the ACDC dataset, the segmentation accuracy of the left ventricle, myocardium, and right ventricle was improved by 0.14%, 1.89%, and 0.23%, respectively.

Discussion: The experimental results demonstrate that the proposed network has improved the effectiveness and shows strong performance on both CT and MRI data, which suggests its potential for generalization across different medical imaging modalities.

导语:CNN与Transformer的结合以其优越的性能成为目前医学图像分割的热点。然而,CNN卷积运算的局部接受域和静态权值等限制,以及Transformer局部区域之间信息交换不足,都会影响分割性能。方法:针对这些问题,本文提出了一种集成的注意力机制和金字塔池网络。首先,在CNN中嵌入有效的通道关注机制,提取更全面的图像特征;然后,在瓶颈层引入CBAM_ASPP模块,获取多尺度上下文信息;最后,为了解决传统卷积的局限性,采用深度可分离卷积来实现轻量级网络。结果:基于Synapse多器官分割数据集和ACDC数据集的实验表明,所提出的IAP-TransUNet分别获得了78.85%和90.46%的Dice相似系数(dsc)。与现有方法相比,对于Synapse多器官分割数据集,Hausdorff距离减小了2.92%。对于ACDC数据集,左心室、心肌和右心室的分割精度分别提高了0.14%、1.89%和0.23%。讨论:实验结果表明,所提出的网络提高了有效性,并在CT和MRI数据上表现出很强的性能,这表明它有可能在不同的医学成像模式中推广。
{"title":"IAP-TransUNet: integration of the attention mechanism and pyramid pooling for medical image segmentation.","authors":"Yuxuan Shi, Fang Li, Shuting Zhao, Hongmeng Yu, Xinrong Chen, Quan Liu","doi":"10.3389/fnbot.2025.1706626","DOIUrl":"10.3389/fnbot.2025.1706626","url":null,"abstract":"<p><strong>Introduction: </strong>The combination of CNN and Transformer has attracted much attention for medical image segmentation due to its superior performance at present. However, the segmentation performance is affected by limitations such as the local receptive field and static weights of CNN convolution operations, as well as insufficient information exchange between Transformer local regions.</p><p><strong>Methods: </strong>To address these issues, an integrated attention mechanism and pyramid pooling network is proposed in this paper. Firstly, an efficient channel attention mechanism is embedded into CNN to extract more comprehensive image features. Then, CBAM_ASPP module is introduced into the bottleneck layer to obtain multi-scale context information. Finally, in order to address the limitations of traditional convolution, depthwise separable convolution is used to achieve a lightweight network.</p><p><strong>Results: </strong>The experiments based on the Synapse multi organ segmentation dataset and ACDC dataset showed that the proposed IAP-TransUNet achieved Dice similarity coefficients (DSCs) of 78.85% and 90.46%, respectively. Compared with the state-of-the-art method, for the Synapse multi organ segmentation dataset, the Hausdorff distance was reduced by 2.92%. For the ACDC dataset, the segmentation accuracy of the left ventricle, myocardium, and right ventricle was improved by 0.14%, 1.89%, and 0.23%, respectively.</p><p><strong>Discussion: </strong>The experimental results demonstrate that the proposed network has improved the effectiveness and shows strong performance on both CT and MRI data, which suggests its potential for generalization across different medical imaging modalities.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"19 ","pages":"1706626"},"PeriodicalIF":2.8,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12702935/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145767822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel intelligent physiotherapy robot based on dynamic acupoint recognition method. 一种基于动态穴位识别方法的智能理疗机器人。
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-24 eCollection Date: 2025-01-01 DOI: 10.3389/fnbot.2025.1696824
Yuhan Zhang, Shiyang Sun, Donghui Zhao, Junyou Yang, Shuoyu Wang

Background: Physiotherapy robots offer a feasible and promising solution for achieving safe and efficient treatment. Among these, acupoint recognition is the core component that ensures the precision of physiotherapy robots. Although the research on the acupoint recognition such as hand and ear has been extensive, the accurate location of acupoints on the back of the human body still faces great challenges due to the lack of significant external features.

Methods: This paper designs a two-stage acupoint recognition method, which is achieved through the cooperation of two detection networks. First, a lightweight RTMDet network is used to extract the effective back range from the image, and then the acupoint coordinates are inferred from the extracted back range, reducing the inference consumption caused by invalid information. In addition, the RTMPose network based on the SimCC framework converts the acupoint coordinate regression problem into a classification problem of sub-pixel block subregions on the X and Y axes by performing sub-pixel-level segmentation of images, significantly improving detection speed and accuracy. Meanwhile, the multi-layer feature fusion of CSPNeXt enhances feature extraction capabilities. Then, we designed a physiotherapy interaction interface. Through the three-dimensional coordinates of the acupoints, we independently planned the physiotherapy task path of the physiotherapy robot.

Results: We conducted performance tests on the acupoint recognition system and physiotherapy task planning in the physiotherapy robot system. The experiments have proven our effectiveness, achieving a recall of 90.17% on human datasets, with a detection error of around 5.78 mm. At the same time, it can accurately identify different back postures and achieve an inference speed of 30 FPS on a 4070Ti GPU. Finally, we conducted continuous physiotherapy tasks on multiple acupoints for the user.

Conclusion: The experimental results demonstrate the significant advantages and broad application potential of this method in improving the accuracy and reliability of autonomous acupoint recognition by physiotherapy robots.

背景:物理治疗机器人为实现安全高效的治疗提供了一种可行且有前景的解决方案。其中,穴位识别是保证理疗机器人精准度的核心组件。虽然对手、耳等穴位识别的研究已经非常广泛,但由于缺乏重要的外部特征,人体背部穴位的准确定位仍然面临着很大的挑战。方法:本文设计了一种两阶段的穴位识别方法,该方法是通过两个检测网络的合作来实现的。首先利用轻量级的RTMDet网络从图像中提取有效背距,然后从提取的背距中推断出穴位坐标,减少了无效信息带来的推断消耗。此外,基于SimCC框架的RTMPose网络通过对图像进行亚像素级分割,将穴位坐标回归问题转化为X轴和Y轴上亚像素块子区域的分类问题,显著提高了检测速度和精度。同时,CSPNeXt的多层特征融合增强了特征提取能力。然后,我们设计了一个物理治疗交互界面。通过穴位的三维坐标,我们独立规划了理疗机器人的理疗任务路径。结果:我们对理疗机器人系统中的穴位识别系统和理疗任务规划进行了性能测试。实验证明了我们的有效性,在人类数据集上实现了90.17%的召回率,检测误差约为5.78 mm。同时,它可以准确识别不同的背部姿势,在4070Ti GPU上实现30 FPS的推理速度。最后,我们对用户进行多个穴位的连续物理治疗任务。结论:实验结果表明该方法在提高物理治疗机器人自主穴位识别的准确性和可靠性方面具有显著的优势和广阔的应用潜力。
{"title":"A novel intelligent physiotherapy robot based on dynamic acupoint recognition method.","authors":"Yuhan Zhang, Shiyang Sun, Donghui Zhao, Junyou Yang, Shuoyu Wang","doi":"10.3389/fnbot.2025.1696824","DOIUrl":"10.3389/fnbot.2025.1696824","url":null,"abstract":"<p><strong>Background: </strong>Physiotherapy robots offer a feasible and promising solution for achieving safe and efficient treatment. Among these, acupoint recognition is the core component that ensures the precision of physiotherapy robots. Although the research on the acupoint recognition such as hand and ear has been extensive, the accurate location of acupoints on the back of the human body still faces great challenges due to the lack of significant external features.</p><p><strong>Methods: </strong>This paper designs a two-stage acupoint recognition method, which is achieved through the cooperation of two detection networks. First, a lightweight RTMDet network is used to extract the effective back range from the image, and then the acupoint coordinates are inferred from the extracted back range, reducing the inference consumption caused by invalid information. In addition, the RTMPose network based on the SimCC framework converts the acupoint coordinate regression problem into a classification problem of sub-pixel block subregions on the X and Y axes by performing sub-pixel-level segmentation of images, significantly improving detection speed and accuracy. Meanwhile, the multi-layer feature fusion of CSPNeXt enhances feature extraction capabilities. Then, we designed a physiotherapy interaction interface. Through the three-dimensional coordinates of the acupoints, we independently planned the physiotherapy task path of the physiotherapy robot.</p><p><strong>Results: </strong>We conducted performance tests on the acupoint recognition system and physiotherapy task planning in the physiotherapy robot system. The experiments have proven our effectiveness, achieving a recall of 90.17% on human datasets, with a detection error of around 5.78 mm. At the same time, it can accurately identify different back postures and achieve an inference speed of 30 FPS on a 4070Ti GPU. Finally, we conducted continuous physiotherapy tasks on multiple acupoints for the user.</p><p><strong>Conclusion: </strong>The experimental results demonstrate the significant advantages and broad application potential of this method in improving the accuracy and reliability of autonomous acupoint recognition by physiotherapy robots.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"19 ","pages":"1696824"},"PeriodicalIF":2.8,"publicationDate":"2025-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12682758/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145714114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On collective behavior in C. elegans. 秀丽隐杆线虫的集体行为。
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-17 eCollection Date: 2025-01-01 DOI: 10.3389/fnbot.2025.1689332
Nemanja Antonic, Aymeric Vellinger, Elio Tuci

C. elegans is a model organism in many biological domains, such as genetics, neurophysiology, and behavioral ecology. Despite our relatively deep knowledge of the neuronal, genetic and molecular mechanisms underlying C. elegans communication, we still lack a comprehensive understanding of emergent group-level dynamics. We review the literature on collective behavior of C. elegans by categorizing works in this relatively small research field along three main axes corresponding to primary collective responses: aggregation, swarming, and collective decision-making. Through an analysis of the methods and scientific contributions of these works, we develop a critical perspective that points to important gaps in our understanding of the mechanisms underlaying the emergence of collective responses. We discuss the consequences of the lack of evidence concerning the effect of population density on the emergence of specific group dynamics, and the relatively limited knowledge related to how self-generated pheromones regulate local interactions and contribute to the emergence of group responses. We elaborate on the methodological problems of developing experimental scenarios to disentangle causal relationships between population density, pheromone-based interactions and collective responses. We propose to overcome these limitations with an interdisciplinary approach based on the use of in vivo experiments, mathematical and computer-based models.

秀丽隐杆线虫是许多生物学领域的模式生物,如遗传学、神经生理学和行为生态学。尽管我们对秀丽隐杆线虫通讯的神经元、遗传和分子机制有较深的了解,但我们仍然缺乏对紧急群体水平动力学的全面理解。本文对秀丽隐杆线虫的集体行为进行了综述,并将这一相对较小的研究领域的研究成果按照三个主要的集体反应进行了分类:聚集、蜂群和集体决策。通过对这些作品的方法和科学贡献的分析,我们提出了一个批判性的观点,指出了我们对集体反应出现的机制的理解中的重要差距。我们讨论了缺乏关于种群密度对特定群体动态出现的影响的证据的后果,以及与自我产生的信息素如何调节局部相互作用和促进群体反应出现有关的相对有限的知识。我们详细阐述了开发实验场景的方法问题,以解开人口密度、基于信息素的相互作用和集体反应之间的因果关系。我们建议通过基于体内实验、数学和计算机模型的跨学科方法来克服这些限制。
{"title":"On collective behavior in <i>C. elegans</i>.","authors":"Nemanja Antonic, Aymeric Vellinger, Elio Tuci","doi":"10.3389/fnbot.2025.1689332","DOIUrl":"10.3389/fnbot.2025.1689332","url":null,"abstract":"<p><p><i>C. elegans</i> is a model organism in many biological domains, such as genetics, neurophysiology, and behavioral ecology. Despite our relatively deep knowledge of the neuronal, genetic and molecular mechanisms underlying <i>C. elegans</i> communication, we still lack a comprehensive understanding of emergent group-level dynamics. We review the literature on collective behavior of <i>C. elegans</i> by categorizing works in this relatively small research field along three main axes corresponding to primary collective responses: aggregation, swarming, and collective decision-making. Through an analysis of the methods and scientific contributions of these works, we develop a critical perspective that points to important gaps in our understanding of the mechanisms underlaying the emergence of collective responses. We discuss the consequences of the lack of evidence concerning the effect of population density on the emergence of specific group dynamics, and the relatively limited knowledge related to how self-generated pheromones regulate local interactions and contribute to the emergence of group responses. We elaborate on the methodological problems of developing experimental scenarios to disentangle causal relationships between population density, pheromone-based interactions and collective responses. We propose to overcome these limitations with an interdisciplinary approach based on the use of <i>in vivo</i> experiments, mathematical and computer-based models.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"19 ","pages":"1689332"},"PeriodicalIF":2.8,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12665682/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145660934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Path planning of industrial robots based on the adaptive field cooperative sampling algorithm. 基于自适应现场协同采样算法的工业机器人路径规划。
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-13 eCollection Date: 2025-01-01 DOI: 10.3389/fnbot.2025.1574044
Yongbo Zhuang, Sha Luo, Qingdang Li, Dianming Chu, Wenjuan Bai, Xintao Liu, Mingyuan Fan, Lv Wei

For the low efficiency and poor generalization ability of path planning algorithm of industrial robots, this work proposes an adaptive field co-sampling algorithm (AFCS). Firstly, the environment complexity function is proposed to make full use of environment information and improve its generalization ability of the traditional rapidly random search tree algorithm (RRT) algorithm. Then an optimal sampling strategy is proposed to make the improvement of the efficiency and optimal direction of RRT algorithm. Finally, this article designs a collaborative extension strategy, which introduces the improved artificial potential field algorithm (APF) into the traditional RRT algorithm to determine the new nodes, so as to improve the orientation and expansion efficiency of the algorithm. The proposed AFCS algorithm completes simulation experiments in two environments with different complexity. Compared with the traditional RRT, RRT* and tRRT algorithm, the results show that the AFCS algorithm has achieved great improvement in environmental adaptability, stability and efficiency. At last, ROKAE industrial robot is taken as the object to build a simulation environment for the path planning, which further verifies the practicability of the algorithm.

针对工业机器人路径规划算法效率低、泛化能力差的问题,提出了一种自适应现场协同采样算法(AFCS)。首先,为了充分利用环境信息,提高传统快速随机搜索树算法(RRT)的泛化能力,提出了环境复杂度函数;然后提出了一种最优采样策略,提高了RRT算法的效率和最优方向。最后,本文设计了一种协同扩展策略,在传统的RRT算法中引入改进的人工势场算法(APF)来确定新节点,从而提高了算法的定向和扩展效率。本文提出的AFCS算法在两种不同复杂度的环境下完成了仿真实验。结果表明,与传统的RRT、RRT*和tRRT算法相比,AFCS算法在环境适应性、稳定性和效率上都有了较大的提高。最后以ROKAE工业机器人为对象建立了路径规划仿真环境,进一步验证了算法的实用性。
{"title":"Path planning of industrial robots based on the adaptive field cooperative sampling algorithm.","authors":"Yongbo Zhuang, Sha Luo, Qingdang Li, Dianming Chu, Wenjuan Bai, Xintao Liu, Mingyuan Fan, Lv Wei","doi":"10.3389/fnbot.2025.1574044","DOIUrl":"10.3389/fnbot.2025.1574044","url":null,"abstract":"<p><p>For the low efficiency and poor generalization ability of path planning algorithm of industrial robots, this work proposes an adaptive field co-sampling algorithm (AFCS). Firstly, the environment complexity function is proposed to make full use of environment information and improve its generalization ability of the traditional rapidly random search tree algorithm (RRT) algorithm. Then an optimal sampling strategy is proposed to make the improvement of the efficiency and optimal direction of RRT algorithm. Finally, this article designs a collaborative extension strategy, which introduces the improved artificial potential field algorithm (APF) into the traditional RRT algorithm to determine the new nodes, so as to improve the orientation and expansion efficiency of the algorithm. The proposed AFCS algorithm completes simulation experiments in two environments with different complexity. Compared with the traditional RRT, RRT* and tRRT algorithm, the results show that the AFCS algorithm has achieved great improvement in environmental adaptability, stability and efficiency. At last, ROKAE industrial robot is taken as the object to build a simulation environment for the path planning, which further verifies the practicability of the algorithm.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"19 ","pages":"1574044"},"PeriodicalIF":2.8,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12657479/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145648374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction: RSA-TransUNet: a robust structure-adaptive TransUNet for enhanced road crack segmentation. RSA-TransUNet:一种鲁棒的结构自适应TransUNet,用于增强道路裂缝分割。
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-10 eCollection Date: 2025-01-01 DOI: 10.3389/fnbot.2025.1711642
Liling Hou, Fei Yu, Yaowen Hu, Yang Hu, Ruoli Yang

[This corrects the article DOI: 10.3389/fnbot.2025.1633697.].

[这更正了文章DOI: 10.3389/fnbot.2025.1633697.]。
{"title":"Correction: RSA-TransUNet: a robust structure-adaptive TransUNet for enhanced road crack segmentation.","authors":"Liling Hou, Fei Yu, Yaowen Hu, Yang Hu, Ruoli Yang","doi":"10.3389/fnbot.2025.1711642","DOIUrl":"https://doi.org/10.3389/fnbot.2025.1711642","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.3389/fnbot.2025.1633697.].</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"19 ","pages":"1711642"},"PeriodicalIF":2.8,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12641505/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145603799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effective and efficient self-supervised masked model based on mixed feature training. 基于混合特征训练的有效自监督掩码模型。
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-30 eCollection Date: 2025-01-01 DOI: 10.3389/fnbot.2025.1705970
Qingjiu Kang, Feng Liu, Chunliu Cai

Under the influence of Masked Language Modeling (MLM), Masked Image Modeling (MIM) employs an attention mechanism to perform masked training on images. However, processing a single image requires numerous iterations and substantial computational resources to reconstruct the masked regions, resulting in high computational complexity and significant time costs. To address this issue, we propose an Effective and Efficient self-supervised Masked model based on Mixed feature training (EESMM). First, we stack two images for encoding and input the fused features into the network, which not only reduces computational complexity but also enables the learning of more features. Second, during decoding, we obtain the decoding features corresponding to the original images based on the decoding features of the two input original images and the mixed images, and then construct a corresponding loss function to enhance feature representation. EESMM significantly reduces pre-training time without sacrificing accuracy, achieving 83% accuracy on ImageNet in just 363 h using four V100 GPUs-only one-tenth of the training time required by SimMIM. This validates that the method can substantially accelerate the pre-training process without noticeable performance degradation.

在蒙面语言建模(mask Language Modeling, MLM)的影响下,蒙面图像建模(mask Image Modeling, MIM)采用注意机制对图像进行蒙面训练。然而,处理单幅图像需要大量的迭代和大量的计算资源来重建被掩盖的区域,导致高的计算复杂度和显著的时间成本。为了解决这一问题,我们提出了一种基于混合特征训练(EESMM)的高效自监督屏蔽模型。首先,我们将两幅图像进行叠加编码,并将融合后的特征输入到网络中,这样不仅降低了计算复杂度,而且可以学习到更多的特征。其次,在解码过程中,根据输入的两幅原始图像和混合图像的解码特征,得到与原始图像对应的解码特征,并构造相应的损失函数来增强特征表示。EESMM在不牺牲精度的情况下显著减少了预训练时间,在使用4个V100 gpu的情况下,仅在363小时内就在ImageNet上实现了83%的准确率,仅为SimMIM所需训练时间的十分之一。这验证了该方法可以大大加快预训练过程,而不会出现明显的性能下降。
{"title":"Effective and efficient self-supervised masked model based on mixed feature training.","authors":"Qingjiu Kang, Feng Liu, Chunliu Cai","doi":"10.3389/fnbot.2025.1705970","DOIUrl":"https://doi.org/10.3389/fnbot.2025.1705970","url":null,"abstract":"<p><p>Under the influence of Masked Language Modeling (MLM), Masked Image Modeling (MIM) employs an attention mechanism to perform masked training on images. However, processing a single image requires numerous iterations and substantial computational resources to reconstruct the masked regions, resulting in high computational complexity and significant time costs. To address this issue, we propose an Effective and Efficient self-supervised Masked model based on Mixed feature training (EESMM). First, we stack two images for encoding and input the fused features into the network, which not only reduces computational complexity but also enables the learning of more features. Second, during decoding, we obtain the decoding features corresponding to the original images based on the decoding features of the two input original images and the mixed images, and then construct a corresponding loss function to enhance feature representation. EESMM significantly reduces pre-training time without sacrificing accuracy, achieving 83% accuracy on ImageNet in just 363 h using four V100 GPUs-only one-tenth of the training time required by SimMIM. This validates that the method can substantially accelerate the pre-training process without noticeable performance degradation.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"19 ","pages":"1705970"},"PeriodicalIF":2.8,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12611909/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145540224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A simple robot suggests trunk rotation is essential for emergence of inside leading limb during quadruped galloping turns. 一个简单的机器人表明,躯干旋转对于四足疾驰转弯时内侧前肢的出现是必不可少的。
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-23 eCollection Date: 2025-01-01 DOI: 10.3389/fnbot.2025.1628368
Tomoe Maeta, Shoei Hattori, Takeshi Kano, Akira Fukuhara, Akio Ishiguro

During turning maneuvers in the galloping gait of quadruped animals, a strong relationship exists between the turning direction and the sequence in which the forelimbs make ground contact: the outer forelimb acts as the "trailing limb" while the inner forelimb serves as the "leading limb." However, the control mechanisms underlying this behavior remain largely unclear. Understanding these mechanisms could deepen biological knowledge and assist in developing more agile robots. To address this issue, we hypothesized that decentralized interlimb coordination mechanism and trunk movement are essential for the emergence of an inside leading limb in a galloping turn. To test the hypothesis, we developed a quasi-quadruped robot with simplified wheeled hind limbs and variable trunk roll and yaw angles. For forelimb coordination, we implemented a simple decentralized control based on local load-dependent sensory feedback, utilizing trunk roll inclination and yaw bending as turning methods. Our experimental results confirmed that in addition to the decentralized control from previous studies which reproduces animal locomotion in a straight line, adjusting the trunk roll angle spontaneously generates a ground contact sequence similar to gallop turning in quadruped animals. Furthermore, roll inclination showed a greater influence than yaw bending on differentiating the leading and trailing limbs. This study suggests that physical interactions serve as a universal mechanism of locomotor control in both forward and turning movements of quadrupedal animals.

在四足动物疾驰步态的转弯动作中,前肢与地面接触的顺序与转弯方向有很强的关系:外前肢为“后肢”,内前肢为“前肢”。然而,这种行为背后的控制机制在很大程度上仍不清楚。了解这些机制可以加深生物学知识,并有助于开发更灵活的机器人。为了解决这一问题,我们假设分散的肢间协调机制和躯干运动是策马急转弯中出现内侧前肢的必要条件。为了验证这一假设,我们开发了一种具有简化轮式后肢和可变躯干侧倾和偏航角的准四足机器人。对于前肢协调,我们采用躯干侧倾和偏航弯曲作为转向方法,实现了基于局部负载相关感官反馈的简单分散控制。我们的实验结果证实,除了先前研究中再现动物直线运动的分散控制外,调整躯干滚动角度会自发地产生类似于四足动物的飞奔转弯的地面接触序列。横摇倾角比偏航弯曲对前后肢区分的影响更大。本研究表明,在四足动物的前进和转身运动中,身体相互作用是一种普遍的运动控制机制。
{"title":"A simple robot suggests trunk rotation is essential for emergence of inside leading limb during quadruped galloping turns.","authors":"Tomoe Maeta, Shoei Hattori, Takeshi Kano, Akira Fukuhara, Akio Ishiguro","doi":"10.3389/fnbot.2025.1628368","DOIUrl":"10.3389/fnbot.2025.1628368","url":null,"abstract":"<p><p>During turning maneuvers in the galloping gait of quadruped animals, a strong relationship exists between the turning direction and the sequence in which the forelimbs make ground contact: the outer forelimb acts as the \"trailing limb\" while the inner forelimb serves as the \"leading limb.\" However, the control mechanisms underlying this behavior remain largely unclear. Understanding these mechanisms could deepen biological knowledge and assist in developing more agile robots. To address this issue, we hypothesized that decentralized interlimb coordination mechanism and trunk movement are essential for the emergence of an inside leading limb in a galloping turn. To test the hypothesis, we developed a quasi-quadruped robot with simplified wheeled hind limbs and variable trunk roll and yaw angles. For forelimb coordination, we implemented a simple decentralized control based on local load-dependent sensory feedback, utilizing trunk roll inclination and yaw bending as turning methods. Our experimental results confirmed that in addition to the decentralized control from previous studies which reproduces animal locomotion in a straight line, adjusting the trunk roll angle spontaneously generates a ground contact sequence similar to gallop turning in quadruped animals. Furthermore, roll inclination showed a greater influence than yaw bending on differentiating the leading and trailing limbs. This study suggests that physical interactions serve as a universal mechanism of locomotor control in both forward and turning movements of quadrupedal animals.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"19 ","pages":"1628368"},"PeriodicalIF":2.8,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12590251/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145481800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TSLNet: a hierarchical multi-head attention-enabled two-stream LSTM network for accurate pedestrian tracking and behavior recognition. TSLNet:一个分层的多头注意双流LSTM网络,用于准确的行人跟踪和行为识别。
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-20 eCollection Date: 2025-01-01 DOI: 10.3389/fnbot.2025.1663565
Shouye Lv, Rui He, Xiaofei Cheng, Xiaoting Ma

Accurate pedestrian tracking and behavior recognition are essential for intelligent surveillance, smart transportation, and human-computer interaction systems. This paper introduces TSLNet, a Hierarchical Multi-Head Attention-Enabled Two-Stream LSTM Network, designed to overcome challenges such as environmental variability, high-density crowds, and diverse pedestrian movements in real-world video data. TSLNet combines a Two-Stream Convolutional Neural Network (Two-Stream CNN) with Long Short-Term Memory (LSTM) networks to effectively capture spatial and temporal features. The addition of a Multi-Head Attention mechanism allows the model to focus on relevant features in complex environments, while Hierarchical Classifiers within a Multi-Task Learning framework enable the simultaneous recognition of basic and complex behaviors. Experimental results on multiple public and proprietary datasets demonstrate that TSLNet significantly outperforms existing baseline models, achieving higher Accuracy, Precision, Recall, F1-Score, and Mean Average Precision (mAP) in behavior recognition, as well as superior Multiple Object Tracking Accuracy (MOTA) and ID F1 Score (IDF1) in pedestrian tracking. These improvements highlight TSLNet's effectiveness in enhancing tracking and recognition performance.

准确的行人跟踪和行为识别对于智能监控、智能交通和人机交互系统至关重要。本文介绍了TSLNet,一种分层多头注意力支持的双流LSTM网络,旨在克服现实世界视频数据中的环境可变性、高密度人群和不同行人运动等挑战。TSLNet将两流卷积神经网络(Two-Stream CNN)与长短期记忆(LSTM)网络相结合,有效地捕捉时空特征。增加了多头注意机制,使模型能够专注于复杂环境中的相关特征,而多任务学习框架中的分层分类器可以同时识别基本和复杂的行为。在多个公共和专有数据集上的实验结果表明,TSLNet显著优于现有的基线模型,在行为识别方面具有更高的准确率、精度、召回率、F1-Score和平均精度(mAP),在行人跟踪方面具有更高的多目标跟踪精度(MOTA)和IDF1分数(IDF1)。这些改进突出了TSLNet在提高跟踪和识别性能方面的有效性。
{"title":"TSLNet: a hierarchical multi-head attention-enabled two-stream LSTM network for accurate pedestrian tracking and behavior recognition.","authors":"Shouye Lv, Rui He, Xiaofei Cheng, Xiaoting Ma","doi":"10.3389/fnbot.2025.1663565","DOIUrl":"10.3389/fnbot.2025.1663565","url":null,"abstract":"<p><p>Accurate pedestrian tracking and behavior recognition are essential for intelligent surveillance, smart transportation, and human-computer interaction systems. This paper introduces TSLNet, a Hierarchical Multi-Head Attention-Enabled Two-Stream LSTM Network, designed to overcome challenges such as environmental variability, high-density crowds, and diverse pedestrian movements in real-world video data. TSLNet combines a Two-Stream Convolutional Neural Network (Two-Stream CNN) with Long Short-Term Memory (LSTM) networks to effectively capture spatial and temporal features. The addition of a Multi-Head Attention mechanism allows the model to focus on relevant features in complex environments, while Hierarchical Classifiers within a Multi-Task Learning framework enable the simultaneous recognition of basic and complex behaviors. Experimental results on multiple public and proprietary datasets demonstrate that TSLNet significantly outperforms existing baseline models, achieving higher Accuracy, Precision, Recall, F1-Score, and Mean Average Precision (mAP) in behavior recognition, as well as superior Multiple Object Tracking Accuracy (MOTA) and ID F1 Score (IDF1) in pedestrian tracking. These improvements highlight TSLNet's effectiveness in enhancing tracking and recognition performance.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"19 ","pages":"1663565"},"PeriodicalIF":2.8,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12580384/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145444659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive-expert-weight-based load balance scheme for dynamic routing of MoE. 基于自适应专家权重的MoE动态路由负载均衡方案。
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-14 eCollection Date: 2025-01-01 DOI: 10.3389/fnbot.2025.1590994
Jialin Wen, Xiaojun Li, Junping Yao, Xinyan Kong, Peng Cheng

Load imbalance is a major performance bottleneck in training mixture-of-experts (MoE) models, as unbalanced expert loads can lead to routing collapse. Most existing approaches address this issue by introducing auxiliary loss functions to balance the load; however, the hyperparameters within these loss functions often need to be tuned for different tasks. Furthermore, increasing the number of activated experts tends to exacerbate load imbalance, while fixing the activation count can reduce the model's confidence in handling difficult tasks. To address these challenges, this paper proposes a dynamically balanced routing strategy that employs a threshold-based dynamic routing algorithm. After each routing step, the method adjusts expert weights to influence the load distribution in the subsequent routing. Unlike loss-function-based balancing methods, our approach operates directly at the routing level, avoiding gradient perturbations that could degrade model quality, while dynamically routing to make more efficient use of computational resources. Experiments on Natural Language Understanding (NLU) benchmarks demonstrate that the proposed method achieves accuracy comparable to top-2 routing, while significantly reducing the load standard deviation (e.g., from 12.25 to 1.18 on MNLI). In addition, threshold-based dynamic expert activation reduces model parameters and provides a new perspective for mitigating load imbalance among experts.

负载不平衡是训练混合专家(MoE)模型的主要性能瓶颈,因为不平衡的专家负载可能导致路由崩溃。大多数现有方法通过引入辅助损失函数来平衡负载来解决这个问题;然而,这些损失函数中的超参数通常需要针对不同的任务进行调优。此外,增加激活专家的数量往往会加剧负载不平衡,而固定激活数会降低模型处理困难任务的置信度。为了解决这些问题,本文提出了一种采用基于阈值的动态路由算法的动态平衡路由策略。在每一步路由后,该方法调整专家权重,以影响后续路由中的负载分配。与基于损失函数的平衡方法不同,我们的方法直接在路由级别操作,避免了可能降低模型质量的梯度扰动,同时动态路由以更有效地利用计算资源。在自然语言理解(NLU)基准上的实验表明,所提出的方法达到了与top-2路由相当的精度,同时显著降低了负载标准差(例如,在MNLI上从12.25降至1.18)。此外,基于阈值的专家动态激活减少了模型参数,为缓解专家之间的负载不平衡提供了新的视角。
{"title":"Adaptive-expert-weight-based load balance scheme for dynamic routing of MoE.","authors":"Jialin Wen, Xiaojun Li, Junping Yao, Xinyan Kong, Peng Cheng","doi":"10.3389/fnbot.2025.1590994","DOIUrl":"10.3389/fnbot.2025.1590994","url":null,"abstract":"<p><p>Load imbalance is a major performance bottleneck in training mixture-of-experts (MoE) models, as unbalanced expert loads can lead to routing collapse. Most existing approaches address this issue by introducing auxiliary loss functions to balance the load; however, the hyperparameters within these loss functions often need to be tuned for different tasks. Furthermore, increasing the number of activated experts tends to exacerbate load imbalance, while fixing the activation count can reduce the model's confidence in handling difficult tasks. To address these challenges, this paper proposes a dynamically balanced routing strategy that employs a threshold-based dynamic routing algorithm. After each routing step, the method adjusts expert weights to influence the load distribution in the subsequent routing. Unlike loss-function-based balancing methods, our approach operates directly at the routing level, avoiding gradient perturbations that could degrade model quality, while dynamically routing to make more efficient use of computational resources. Experiments on Natural Language Understanding (NLU) benchmarks demonstrate that the proposed method achieves accuracy comparable to top-2 routing, while significantly reducing the load standard deviation (e.g., from 12.25 to 1.18 on MNLI). In addition, threshold-based dynamic expert activation reduces model parameters and provides a new perspective for mitigating load imbalance among experts.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"19 ","pages":"1590994"},"PeriodicalIF":2.8,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12558867/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145400459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Frontiers in Neurorobotics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1