首页 > 最新文献

Cognitive Robotics最新文献

英文 中文
DSR-YOLO: A lightweight and efficient YOLOv8 model for enhanced pedestrian detection DSR-YOLO:一款轻量级高效的YOLOv8模型,用于增强行人检测
Pub Date : 2025-01-01 DOI: 10.1016/j.cogr.2025.04.001
Mustapha Oussouaddi , Omar Bouazizi , Aimad El mourabit , Zine el Abidine Alaoui Ismaili , Yassine Attaoui , Mohamed Chentouf
This paper presents DSR-YOLO, a pedestrian detection network that addresses critical challenges, such as scale variations and complex backgrounds. Built on the lightweight YOLOv8n architecture, it incorporates DCNv4 modules to enhance the detection rates and reduce missed detections by effectively learning key pedestrian features. A new head component enables detection across various scales, whereas RFB modules improve accuracy for smaller or occluded objects. Additionally, we enhance the initial C2f layers with a modified block that integrates SimAM and DCNv4, minimizing the background noise and sharpening the focus on the relevant features. A second version of the C2f block using SimAM and standard convolutions ensures robust feature extraction in deeper layers with optimized computational efficiency. The WIoUv3 loss function was utilized to reduce the regression loss associated with bounding boxes, further boosting the performance. Evaluated on the CityPersons dataset, DSR-YOLO outperformed YOLOv8n with a 14.9 % increase in mAP@50 and 6.3 % increase in mAP@50:95, while maintaining competitive FLOPS, parameter counts, and inference speed.
本文介绍了DSR-YOLO,这是一种行人检测网络,可解决规模变化和复杂背景等关键挑战。它基于轻量级的YOLOv8n架构,结合了DCNv4模块,通过有效地学习行人的关键特征,提高了检测率,减少了遗漏的检测。新的头部组件可实现各种尺度的检测,而RFB模块可提高较小或遮挡物体的精度。此外,我们使用集成了SimAM和DCNv4的修改块增强了初始C2f层,最大限度地减少了背景噪声并锐化了对相关特征的关注。第二个版本的C2f块使用了SimAM和标准卷积,确保了更深层的鲁棒特征提取,并优化了计算效率。利用WIoUv3损失函数减少了与边界盒相关的回归损失,进一步提高了性能。在CityPersons数据集上进行评估,DSR-YOLO在保持具有竞争力的FLOPS、参数数量和推理速度的同时,在mAP@50和mAP@50:95上分别提高了14.9%和6.3%,优于YOLOv8n。
{"title":"DSR-YOLO: A lightweight and efficient YOLOv8 model for enhanced pedestrian detection","authors":"Mustapha Oussouaddi ,&nbsp;Omar Bouazizi ,&nbsp;Aimad El mourabit ,&nbsp;Zine el Abidine Alaoui Ismaili ,&nbsp;Yassine Attaoui ,&nbsp;Mohamed Chentouf","doi":"10.1016/j.cogr.2025.04.001","DOIUrl":"10.1016/j.cogr.2025.04.001","url":null,"abstract":"<div><div>This paper presents DSR-YOLO, a pedestrian detection network that addresses critical challenges, such as scale variations and complex backgrounds. Built on the lightweight YOLOv8n architecture, it incorporates DCNv4 modules to enhance the detection rates and reduce missed detections by effectively learning key pedestrian features. A new head component enables detection across various scales, whereas RFB modules improve accuracy for smaller or occluded objects. Additionally, we enhance the initial C2f layers with a modified block that integrates SimAM and DCNv4, minimizing the background noise and sharpening the focus on the relevant features. A second version of the C2f block using SimAM and standard convolutions ensures robust feature extraction in deeper layers with optimized computational efficiency. The WIoUv3 loss function was utilized to reduce the regression loss associated with bounding boxes, further boosting the performance. Evaluated on the CityPersons dataset, DSR-YOLO outperformed YOLOv8n with a 14.9 % increase in mAP@50 and 6.3 % increase in mAP@50:95, while maintaining competitive FLOPS, parameter counts, and inference speed.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 152-165"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143844145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attention-assisted dual-branch interactive face super-resolution network 注意辅助双分支交互式人脸超分辨网络
Pub Date : 2025-01-01 DOI: 10.1016/j.cogr.2025.01.001
Xujie Wan , Siyu Xu , Guangwei Gao
We propose a deep learning-based Attention-Assisted Dual-Branch Interactive Network (ADBINet) to improve facial super-resolution by addressing key challenges like inadequate feature extraction and poor multi-scale information handling. ADBINet features a multi-scale encoder-decoder architecture that captures and integrates features across scales, enhancing detail and reconstruction quality. The key to our approach is the Transformer and CNN Interaction Module (TCIM), which includes a Dual Attention Collaboration Module (DACM) for improved local and spatial feature extraction. The Channel Attention Guidance Module (CAGM) refines CNN and Transformer fusion, ensuring precise facial detail restoration. Additionally, the Attention Feature Fusion Unit (AFFM) optimizes multi-scale feature integration. Experimental results demonstrate that ADBINet outperforms existing methods in both quantitative and qualitative facial super-resolution metrics.
我们提出了一种基于深度学习的注意力辅助双分支交互网络(ADBINet),通过解决特征提取不足和多尺度信息处理差等关键挑战来提高面部超分辨率。ADBINet具有多尺度编码器-解码器架构,可捕获和集成跨尺度的功能,增强细节和重建质量。我们方法的关键是Transformer和CNN交互模块(TCIM),其中包括用于改进局部和空间特征提取的双注意协作模块(DACM)。通道注意力引导模块(CAGM)改进了CNN和Transformer融合,确保精确的面部细节恢复。此外,注意特征融合单元(AFFM)优化了多尺度特征集成。实验结果表明,ADBINet在定量和定性面部超分辨指标上都优于现有方法。
{"title":"Attention-assisted dual-branch interactive face super-resolution network","authors":"Xujie Wan ,&nbsp;Siyu Xu ,&nbsp;Guangwei Gao","doi":"10.1016/j.cogr.2025.01.001","DOIUrl":"10.1016/j.cogr.2025.01.001","url":null,"abstract":"<div><div>We propose a deep learning-based Attention-Assisted Dual-Branch Interactive Network (ADBINet) to improve facial super-resolution by addressing key challenges like inadequate feature extraction and poor multi-scale information handling. ADBINet features a multi-scale encoder-decoder architecture that captures and integrates features across scales, enhancing detail and reconstruction quality. The key to our approach is the Transformer and CNN Interaction Module (TCIM), which includes a Dual Attention Collaboration Module (DACM) for improved local and spatial feature extraction. The Channel Attention Guidance Module (CAGM) refines CNN and Transformer fusion, ensuring precise facial detail restoration. Additionally, the Attention Feature Fusion Unit (AFFM) optimizes multi-scale feature integration. Experimental results demonstrate that ADBINet outperforms existing methods in both quantitative and qualitative facial super-resolution metrics.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 77-85"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143143955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep reinforcement learning-based routing framework for bidirectional communication in UAV-UGV networks 基于深度强化学习的无人机- ugv网络双向通信路由框架
Pub Date : 2025-01-01 DOI: 10.1016/j.cogr.2025.06.003
Prabhakar Saxena , Gayatri M. Phade
Unmanned Aerial Vehicles (UAVs) and Unmanned Ground Vehicles (UGVs) performs crucial function in many applications like military operations, disaster management, hazardous operations and surveillance. Efficient bidirectional communication between UAVs and UGVs is necessary for effective coordination and successful task completion. Traditional routing protocols facilitate communication either between UAVs or between UGVs, but not efficiently across both platforms. Moreover traditional routing protocol often fail to adapt dynamically to varying network conditions, such as mobility, interference, and congestion. To overcome these challenges, this paper presents a design, implementation, and optimization of adaptive routing protocol engineered for specific requirements of coordinated network consisting of UAV and UGV. This novel protocol design integrates the Greedy Perimeter Stateless Routing (GPSR) and Deep Reinforcement Learning (DRL) to optimize packet routing based on real-time network states and ensuring obstacle avoidance, enhanced throughput, minimal latency and reduced packet loss. Simulations are conducted in python to evaluate the performance of the proposed protocol. The results shows that the DRL-based routing protocol enables communication between UAVs and UGVs through the shortest and most efficient path. This research contributes to the advancement of AI enabled communication architecture for co-ordinated UAV-UGV networks, for robust and efficient mission-critical operations.
无人驾驶飞行器(uav)和无人地面飞行器(ugv)在军事行动、灾害管理、危险行动和监视等许多应用中发挥着至关重要的作用。无人机和地面机器人之间有效的双向通信是有效协调和顺利完成任务的必要条件。传统的路由协议可以促进无人机之间或ugv之间的通信,但不能有效地跨两个平台。此外,传统的路由协议往往不能动态地适应各种网络条件,如移动性、干扰和拥塞。为了克服这些挑战,本文提出了一种针对无人机和无人驾驶汽车组成的协调网络的特定需求而设计的自适应路由协议的设计、实现和优化。这种新颖的协议设计集成了贪婪周边无状态路由(GPSR)和深度强化学习(DRL),以优化基于实时网络状态的数据包路由,并确保避免障碍,提高吞吐量,最小化延迟和减少数据包丢失。在python中进行了仿真,以评估所提出协议的性能。结果表明,基于drl的路由协议能够使无人机和ugv之间通过最短、最有效的路径进行通信。这项研究有助于推进人工智能支持的通信架构,用于协调无人机- ugv网络,实现强大而高效的关键任务操作。
{"title":"Deep reinforcement learning-based routing framework for bidirectional communication in UAV-UGV networks","authors":"Prabhakar Saxena ,&nbsp;Gayatri M. Phade","doi":"10.1016/j.cogr.2025.06.003","DOIUrl":"10.1016/j.cogr.2025.06.003","url":null,"abstract":"<div><div>Unmanned Aerial Vehicles (UAVs) and Unmanned Ground Vehicles (UGVs) performs crucial function in many applications like military operations, disaster management, hazardous operations and surveillance. Efficient bidirectional communication between UAVs and UGVs is necessary for effective coordination and successful task completion. Traditional routing protocols facilitate communication either between UAVs or between UGVs, but not efficiently across both platforms. Moreover traditional routing protocol often fail to adapt dynamically to varying network conditions, such as mobility, interference, and congestion. To overcome these challenges, this paper presents a design, implementation, and optimization of adaptive routing protocol engineered for specific requirements of coordinated network consisting of UAV and UGV. This novel protocol design integrates the Greedy Perimeter Stateless Routing (GPSR) and Deep Reinforcement Learning (DRL) to optimize packet routing based on real-time network states and ensuring obstacle avoidance, enhanced throughput, minimal latency and reduced packet loss. Simulations are conducted in python to evaluate the performance of the proposed protocol. The results shows that the DRL-based routing protocol enables communication between UAVs and UGVs through the shortest and most efficient path. This research contributes to the advancement of AI enabled communication architecture for co-ordinated UAV-UGV networks, for robust and efficient mission-critical operations.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 249-259"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144611613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rehab-Bot: A home-based lower-extremity rehabilitation robot for muscle recovery Rehab-Bot:一种基于家庭的下肢肌肉康复机器人
Pub Date : 2025-01-01 DOI: 10.1016/j.cogr.2025.02.001
Sandro Mihradi , Edgar Buwana Sutawika , Vani Virdyawan , Rachmat Zulkarnain Goesasi , Masahiro Todoh
This paper presents a proof-of-concept for a lower-extremity rehabilitation device, called Rehab-bot, that would aid patients with lower-limb impairments in continuing their rehabilitation in its required intensity at home after inpatient care. This research focuses on developing the patient‘s muscle training feature using admittance control to generate resistance for isotonic exercise, particularly emphasizing the potential for progressive resistance training. The mechanical structure of the Rehab-bot was inspired by a continuous passive motion machine that can be optimized to be a light and compact device suitable for home-based use. Systems design, development, and experimental evaluation are presented. Experiments were performed with one healthy subject by monitoring two parameters: the forces exerted by leg muscles through a force sensor and the resulting position of the foot support that is actuated by the robot. Results have shown that Rehab-bot can demonstrate lower-limb isotonic exercise by generating a virtual load that can be progressively increased.
本文提出了一种名为Rehab-bot的下肢康复设备的概念验证,该设备将帮助患有下肢损伤的患者在住院治疗后在家中继续进行所需强度的康复。本研究的重点是开发患者的肌肉训练特点,利用导纳控制来产生阻力等张运动,特别强调了进行性阻力训练的潜力。Rehab-bot的机械结构受到连续被动运动机器的启发,可以优化为适合家庭使用的轻便紧凑设备。介绍了系统的设计、开发和实验评估。通过监测两个参数,在一名健康受试者身上进行实验:通过力传感器监测腿部肌肉施加的力,以及由机器人驱动的足部支架的最终位置。结果表明,Rehab-bot可以通过产生可逐步增加的虚拟负荷来演示下肢等张力运动。
{"title":"Rehab-Bot: A home-based lower-extremity rehabilitation robot for muscle recovery","authors":"Sandro Mihradi ,&nbsp;Edgar Buwana Sutawika ,&nbsp;Vani Virdyawan ,&nbsp;Rachmat Zulkarnain Goesasi ,&nbsp;Masahiro Todoh","doi":"10.1016/j.cogr.2025.02.001","DOIUrl":"10.1016/j.cogr.2025.02.001","url":null,"abstract":"<div><div>This paper presents a proof-of-concept for a lower-extremity rehabilitation device, called Rehab-bot, that would aid patients with lower-limb impairments in continuing their rehabilitation in its required intensity at home after inpatient care. This research focuses on developing the patient‘s muscle training feature using admittance control to generate resistance for isotonic exercise, particularly emphasizing the potential for progressive resistance training. The mechanical structure of the Rehab-bot was inspired by a continuous passive motion machine that can be optimized to be a light and compact device suitable for home-based use. Systems design, development, and experimental evaluation are presented. Experiments were performed with one healthy subject by monitoring two parameters: the forces exerted by leg muscles through a force sensor and the resulting position of the foot support that is actuated by the robot. Results have shown that Rehab-bot can demonstrate lower-limb isotonic exercise by generating a virtual load that can be progressively increased.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 114-125"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143601104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Small target drone algorithm in low-altitude complex urban scenarios based on ESMS-YOLOv7 基于ESMS-YOLOv7的低空复杂城市场景小目标无人机算法
Pub Date : 2025-01-01 DOI: 10.1016/j.cogr.2024.11.004
Yuntao Wei, Xiujia Wang, Chunjuan Bo, Zhan Shi
The increasing use and militarization of UAV technology presents significant challenges to nations and societies. Notably, there is a deficit in anti- UAV technologies for civilian use, particularly in complex urban environments at low altitudes. This paper proposes the ESMS-YOLOv7 algorithm, which is specifically engineered to detect small target UAVs in such challenging urban landscapes. The algorithm focuses on the extraction of features from small target UAVs in urban contexts. Enhancements to YOLOv7 include the integration of the ELAN-C module, the SimSPPFCSPC-R module, and the MP-CBAM module, which collectively improve the network's ability to extract features and focus on small target UAVs. Additionally, the SIOU loss function is employed to increase the model's robustness. The effectiveness of the ESMS-YOLOv7 algorithm is validated through its performance on the DUT Anti-UAV dataset, where it exhibits superior capabilities relative to other leading algorithms.
无人机技术的日益使用和军事化给国家和社会带来了重大挑战。值得注意的是,民用反无人机技术存在缺陷,特别是在低空复杂的城市环境中。本文提出了ESMS-YOLOv7算法,该算法专门用于在这种具有挑战性的城市景观中检测小型目标无人机。该算法主要研究城市环境下小型目标无人机的特征提取。YOLOv7的增强包括集成了ELAN-C模块、SimSPPFCSPC-R模块和MP-CBAM模块,它们共同提高了网络提取特征和专注于小型目标无人机的能力。此外,采用SIOU损失函数来提高模型的鲁棒性。ESMS-YOLOv7算法的有效性通过其在DUT反无人机数据集上的性能得到验证,相对于其他领先算法,它表现出更优越的能力。
{"title":"Small target drone algorithm in low-altitude complex urban scenarios based on ESMS-YOLOv7","authors":"Yuntao Wei,&nbsp;Xiujia Wang,&nbsp;Chunjuan Bo,&nbsp;Zhan Shi","doi":"10.1016/j.cogr.2024.11.004","DOIUrl":"10.1016/j.cogr.2024.11.004","url":null,"abstract":"<div><div>The increasing use and militarization of UAV technology presents significant challenges to nations and societies. Notably, there is a deficit in anti- UAV technologies for civilian use, particularly in complex urban environments at low altitudes. This paper proposes the ESMS-YOLOv7 algorithm, which is specifically engineered to detect small target UAVs in such challenging urban landscapes. The algorithm focuses on the extraction of features from small target UAVs in urban contexts. Enhancements to YOLOv7 include the integration of the ELAN-C module, the SimSPPFCSPC-R module, and the MP-CBAM module, which collectively improve the network's ability to extract features and focus on small target UAVs. Additionally, the SIOU loss function is employed to increase the model's robustness. The effectiveness of the ESMS-YOLOv7 algorithm is validated through its performance on the DUT Anti-UAV dataset, where it exhibits superior capabilities relative to other leading algorithms.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 14-25"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143143536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on the analysis and application of technological supply and demand structure based on LDA and BERTopic models 基于LDA和BERTopic模型的技术供需结构分析与应用研究
Pub Date : 2025-01-01 DOI: 10.1016/j.cogr.2025.07.001
Xuan Jin , Wen Zhou , Qinyou Zhu , Weijie Wang , Guoteng Xu
This paper employs text mining techniques, specifically Latent Dirichlet Allocation (LDA) and BERTopic topic models, to conduct an in-depth investigation of the supply and demand structure of regional scientific and technological achievements. The objective is to identify imbalances in supply and demand, thereby providing novel insights for enhancing the efficiency of technology transfer. The research findings indicate that the LDA model outperforms the BERTopic model in this study. Taking Guizhou Province, China, as a case study, the LDA model analysis categorizes the demand side into 16 domains and the supply side into 18 domains, both exhibiting a "long-tail distribution" characteristic. Further analysis reveals a structural imbalance in the supply and demand of scientific and technological achievements in Guizhou Province. For instance, there is a high demand in areas such as mineral extraction and utilization, as well as digital and intelligent applications, accounting for 20.3 % and 14.3 % respectively, yet the supply is insufficient, with only 5.1 % and 3.1 % respectively. Conversely, areas like mechanical processing, and bridge and building construction experience an oversupply, with the supply accounting for 17.9 % and 13.8 % respectively. Addressing the structural imbalance in the supply and demand of scientific and technological achievements, this study proposes development recommendations from three perspectives: policy and management systems, regional collaboration, and ecological construction. The aim is to optimize the supply and demand structure of scientific and technological achievements in Guizhou Province and promote the deep integration of technology and the economy.
本文采用文本挖掘技术,特别是潜狄利克雷分配(Latent Dirichlet Allocation, LDA)和BERTopic主题模型,对区域科技成果的供需结构进行了深入研究。目标是查明供需不平衡,从而为提高技术转让的效率提供新的见解。研究结果表明,LDA模型在本研究中优于BERTopic模型。以中国贵州省为例,LDA模型分析将需求侧划分为16个域,供给侧划分为18个域,均呈现“长尾分布”特征。进一步分析发现,贵州省科技成果供给与需求存在结构性失衡。例如,矿产开采和利用、数字化和智能化应用等领域的需求较高,分别占20.3%和14.3%,但供给不足,分别仅占5.1%和3.1%。相反,机械加工、桥梁和建筑施工等行业供过于求,供过于求的比例分别为17.9%和13.8%。针对科技成果供给与需求的结构性失衡,本文从政策与管理制度、区域协作和生态建设三个方面提出了发展建议。优化贵州省科技成果供需结构,促进科技与经济深度融合。
{"title":"Research on the analysis and application of technological supply and demand structure based on LDA and BERTopic models","authors":"Xuan Jin ,&nbsp;Wen Zhou ,&nbsp;Qinyou Zhu ,&nbsp;Weijie Wang ,&nbsp;Guoteng Xu","doi":"10.1016/j.cogr.2025.07.001","DOIUrl":"10.1016/j.cogr.2025.07.001","url":null,"abstract":"<div><div>This paper employs text mining techniques, specifically Latent Dirichlet Allocation (LDA) and BERTopic topic models, to conduct an in-depth investigation of the supply and demand structure of regional scientific and technological achievements. The objective is to identify imbalances in supply and demand, thereby providing novel insights for enhancing the efficiency of technology transfer. The research findings indicate that the LDA model outperforms the BERTopic model in this study. Taking Guizhou Province, China, as a case study, the LDA model analysis categorizes the demand side into 16 domains and the supply side into 18 domains, both exhibiting a \"long-tail distribution\" characteristic. Further analysis reveals a structural imbalance in the supply and demand of scientific and technological achievements in Guizhou Province. For instance, there is a high demand in areas such as mineral extraction and utilization, as well as digital and intelligent applications, accounting for 20.3 % and 14.3 % respectively, yet the supply is insufficient, with only 5.1 % and 3.1 % respectively. Conversely, areas like mechanical processing, and bridge and building construction experience an oversupply, with the supply accounting for 17.9 % and 13.8 % respectively. Addressing the structural imbalance in the supply and demand of scientific and technological achievements, this study proposes development recommendations from three perspectives: policy and management systems, regional collaboration, and ecological construction. The aim is to optimize the supply and demand structure of scientific and technological achievements in Guizhou Province and promote the deep integration of technology and the economy.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 260-275"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144694867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Innovative strategy and practice of using underwater robot for marine cable inspection and operation and maintenance 水下机器人用于海洋电缆检测与运维的创新策略与实践
Pub Date : 2025-01-01 DOI: 10.1016/j.cogr.2025.06.001
Xiang Liu, Shuntian Xie
This research explores underwater robot applications in marine cable inspection and maintenance with solutions to accuracy, reliability, and efficiency challenges. Current methods using human divers and remotely operated vehicles (ROVs) are expensive, time-consuming, and involve safety hazards. The suggested AI-based robotic system incorporates sensor technology, predictive maintenance, and statistical validation to maximize marine cable inspections. A quantitative research method was employed, surveying data from 400 Marine Engineers and Underwater Robotics Specialists. Statistical analysis, such as reliability analysis, regression model, and hypothesis testing, determined the influence of technology adoption, environmental aspects, and predictive maintenance on inspection accuracy and cost savings. Model fit was confirmed through CFI =0.94, RMSEA =0.047, and SRMR=0.052. Results show that Maintenance Strategy & Cost Reduction (β=0.55,p<0.01) is most influential. The research assures that AI-enhanced underwater robots provide a cost-efficient, guaranteed substitute to conventional approaches, promoting efficiency, safety, and long-term sustainability in marine cable operations.
本研究探讨了水下机器人在海洋电缆检测和维护中的应用,并解决了准确性、可靠性和效率方面的挑战。目前使用人工潜水员和远程操作车辆(rov)的方法既昂贵又耗时,而且存在安全隐患。建议的基于人工智能的机器人系统结合了传感器技术,预测性维护和统计验证,以最大限度地提高海上电缆检查。采用定量研究方法,调查了400名海洋工程师和水下机器人专家的数据。统计分析,如可靠性分析、回归模型和假设检验,确定了技术采用、环境因素和预测性维护对检查准确性和成本节约的影响。通过CFI =0.94, RMSEA =0.047, SRMR=0.052证实模型拟合。结果表明:维修策略&;成本降低(β=0.55,p<0.01)影响最大。该研究确保了人工智能增强的水下机器人为传统方法提供了一种经济高效、有保证的替代品,提高了海洋电缆作业的效率、安全性和长期可持续性。
{"title":"Innovative strategy and practice of using underwater robot for marine cable inspection and operation and maintenance","authors":"Xiang Liu,&nbsp;Shuntian Xie","doi":"10.1016/j.cogr.2025.06.001","DOIUrl":"10.1016/j.cogr.2025.06.001","url":null,"abstract":"<div><div>This research explores underwater robot applications in marine cable inspection and maintenance with solutions to accuracy, reliability, and efficiency challenges. Current methods using human divers and remotely operated vehicles (ROVs) are expensive, time-consuming, and involve safety hazards. The suggested AI-based robotic system incorporates sensor technology, predictive maintenance, and statistical validation to maximize marine cable inspections. A quantitative research method was employed, surveying data from 400 Marine Engineers and Underwater Robotics Specialists. Statistical analysis, such as reliability analysis, regression model, and hypothesis testing, determined the influence of technology adoption, environmental aspects, and predictive maintenance on inspection accuracy and cost savings. Model fit was confirmed through CFI <span><math><mrow><mo>=</mo><mn>0.94</mn></mrow></math></span>, RMSEA <span><math><mrow><mo>=</mo><mn>0.047</mn></mrow></math></span>, and <span><math><mrow><mi>SRMR</mi><mo>=</mo><mn>0.052</mn></mrow></math></span>. Results show that Maintenance Strategy &amp; Cost Reduction <span><math><mrow><mo>(</mo><mi>β</mi><mo>=</mo><mn>0.55</mn><mo>,</mo><mrow><mi>p</mi></mrow><mo>&lt;</mo><mn>0.01</mn><mo>)</mo></mrow></math></span> is most influential. The research assures that AI-enhanced underwater robots provide a cost-efficient, guaranteed substitute to conventional approaches, promoting efficiency, safety, and long-term sustainability in marine cable operations.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 226-239"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144570295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FB-YOLOv8s: A fire detection algorithm based on YOLOv8s FB-YOLOv8s:一种基于YOLOv8s的火灾探测算法
Pub Date : 2025-01-01 DOI: 10.1016/j.cogr.2025.06.002
Yuhang Liu, Chunjuan Bo, Chong Feng
The significance of fire detection lies in protecting public safety and safeguarding the lives and property of people. However, there exist some problems in traditional detection algorithms of fire, such as low accuracy, high miss rate, and low detection rate of small targets. To effectively solve these issues, a fire detection algorithm based on YOLOv8s is introduced in this paper, called FB-YOLOv8s. First, the FasterNet lightweight network is introduced into the YOLOv8s network, merging the FasterNet Block structure of FasterNet with the original C2f modules to reduce the number of model parameters. Second, the Bi-directional Feature Pyramid Network (BiFPN) is incorporated to replace the Path Aggregation Network (PANet) in the neck network to enhance the model’s feature fusion capability. Finally, we adopt the WIoUv3 loss function to optimize the training process and improve detection accuracy. The experimental results demonstrate that compared to the original algorithm, the mAP0.5 of FB-YOLOv8s increases by 2.0 %, and the number of parameters decreases by 25.23 %. This method has better detection performance for fire targets.
火灾探测的意义在于保护公共安全,保障人民生命财产安全。然而,传统的火力检测算法存在精度低、脱靶率高、对小目标的检测率低等问题。为了有效解决这些问题,本文介绍了一种基于YOLOv8s的火灾探测算法FB-YOLOv8s。首先,在YOLOv8s网络中引入FasterNet轻量级网络,将FasterNet的FasterNet Block结构与原有的C2f模块合并,减少模型参数的数量。其次,引入双向特征金字塔网络(BiFPN)取代颈部网络中的路径聚合网络(PANet),增强模型的特征融合能力;最后,采用WIoUv3损失函数优化训练过程,提高检测精度。实验结果表明,与原算法相比,FB-YOLOv8s的mAP0.5提高了2.0%,参数个数减少了25.23%。该方法对火力目标具有较好的探测性能。
{"title":"FB-YOLOv8s: A fire detection algorithm based on YOLOv8s","authors":"Yuhang Liu,&nbsp;Chunjuan Bo,&nbsp;Chong Feng","doi":"10.1016/j.cogr.2025.06.002","DOIUrl":"10.1016/j.cogr.2025.06.002","url":null,"abstract":"<div><div>The significance of fire detection lies in protecting public safety and safeguarding the lives and property of people. However, there exist some problems in traditional detection algorithms of fire, such as low accuracy, high miss rate, and low detection rate of small targets. To effectively solve these issues, a fire detection algorithm based on YOLOv8s is introduced in this paper, called FB-YOLOv8s. First, the FasterNet lightweight network is introduced into the YOLOv8s network, merging the FasterNet Block structure of FasterNet with the original C2f modules to reduce the number of model parameters. Second, the Bi-directional Feature Pyramid Network (BiFPN) is incorporated to replace the Path Aggregation Network (PANet) in the neck network to enhance the model’s feature fusion capability. Finally, we adopt the WIoUv3 loss function to optimize the training process and improve detection accuracy. The experimental results demonstrate that compared to the original algorithm, the mAP<span><math><msub><mrow></mrow><mrow><mn>0.5</mn></mrow></msub></math></span> of FB-YOLOv8s increases by 2.0 %, and the number of parameters decreases by 25.23 %. This method has better detection performance for fire targets.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 240-248"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144588764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LFEN: A language feature enhanced network for scene text recognition LFEN:用于场景文本识别的语言特征增强网络
Pub Date : 2025-01-01 DOI: 10.1016/j.cogr.2025.08.001
Hui Chen , Runming Jiang , Fang Hu , Min Chen , Yin Zhang
In the context of natural scenes, traditional text recognition methods exhibit limitations when confronted with the substantial differences in characters and context among diverse languages. To address this challenge, we propose an approach LFEN for text recognition and correction in natural scenes. By directly embedding language features into the text recognition model, we effectively address the issue of accuracy in scene text recognition, reducing the potential risk of error accumulation compared to traditional language recognition-text recognition serial connections. Through a detailed analysis of global and local language features, this paper successfully achieves more accurate differentiation between languages with similar characters, thereby enhancing text recognition accuracy. Furthermore, by incorporating the intrinsic semantic relationships of text content, this paper employs a sequence-to-sequence (Seq2Seq) model based on convolutional neural networks for text correction. Through the integration of language information, different feature embeddings, and global residual connections, the paper provides a robust solution for text correction in scene text recognition. Compared to the baselines, the experimental results demonstrate that LFEN achieves superior performance in most evaluation metrics. Specifically, LFEN has around 2% in recall improved to BERT. This research contributes substantial support to the advancement of natural scene text recognition and correction.
在自然场景语境下,面对不同语言之间字符和语境的巨大差异,传统的文本识别方法表现出局限性。为了解决这一挑战,我们提出了一种用于自然场景文本识别和校正的LFEN方法。通过将语言特征直接嵌入到文本识别模型中,我们有效地解决了场景文本识别的准确性问题,与传统的语言识别-文本识别串行连接相比,减少了潜在的错误积累风险。通过对全局语言和局部语言特征的详细分析,本文成功地实现了对具有相似字符的语言更准确的区分,从而提高了文本识别的准确率。此外,通过结合文本内容的内在语义关系,本文采用基于卷积神经网络的序列到序列(Seq2Seq)模型进行文本校正。本文通过整合语言信息、不同特征嵌入和全局残差连接,为场景文本识别中的文本校正提供了鲁棒性解决方案。与基线相比,实验结果表明LFEN在大多数评估指标上都取得了优异的性能。具体来说,LFEN的召回率提高到了BERT的2%左右。本研究为自然场景文本识别与校正的发展提供了有力的支持。
{"title":"LFEN: A language feature enhanced network for scene text recognition","authors":"Hui Chen ,&nbsp;Runming Jiang ,&nbsp;Fang Hu ,&nbsp;Min Chen ,&nbsp;Yin Zhang","doi":"10.1016/j.cogr.2025.08.001","DOIUrl":"10.1016/j.cogr.2025.08.001","url":null,"abstract":"<div><div>In the context of natural scenes, traditional text recognition methods exhibit limitations when confronted with the substantial differences in characters and context among diverse languages. To address this challenge, we propose an approach LFEN for text recognition and correction in natural scenes. By directly embedding language features into the text recognition model, we effectively address the issue of accuracy in scene text recognition, reducing the potential risk of error accumulation compared to traditional language recognition-text recognition serial connections. Through a detailed analysis of global and local language features, this paper successfully achieves more accurate differentiation between languages with similar characters, thereby enhancing text recognition accuracy. Furthermore, by incorporating the intrinsic semantic relationships of text content, this paper employs a sequence-to-sequence (Seq2Seq) model based on convolutional neural networks for text correction. Through the integration of language information, different feature embeddings, and global residual connections, the paper provides a robust solution for text correction in scene text recognition. Compared to the baselines, the experimental results demonstrate that LFEN achieves superior performance in most evaluation metrics. Specifically, LFEN has around 2% in recall improved to BERT. This research contributes substantial support to the advancement of natural scene text recognition and correction.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 276-285"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144880314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PCCNN: A CNN classification model integrating EEG time-frequency features for stroke classification PCCNN:一种集成脑电时频特征的CNN分类模型,用于脑卒中分类
Pub Date : 2025-01-01 DOI: 10.1016/j.cogr.2025.05.002
Teng Wang , Fenglian Li , Jia Yang , Wenhui Jia , Fengyun Hu
Stroke classification is crucial for timely diagnosis and treatment, as it helps differentiate between hemorrhagic and ischemic strokes, which require distinct clinical interventions. This paper proposes a stroke classification method using multi-channel electroencephalography (EEG) data. Unlike single-channel data or simple multi-channel concatenation, our method processes EEG data as a channel matrix, significantly improving classification performance. We employ two complementary feature extraction techniques: discrete wavelet transform (DWT) and empirical mode decomposition (EMD). DWT extracts multi-scale wavelet coefficients from stroke-related frequency bands, while EMD decomposes EEG signals into intrinsic mode functions (IMFs), representing narrowband oscillation components. To enhance feature quality, we propose a hybrid selection method that integrates four metrics—information entropy, power spectral density (PSD) distance, statistical significance, and maximum information coefficient (MIC)—to comprehensively evaluate IMFs. This method accounts for both the intrinsic information content of EEG signals and the inter-class differences between hemorrhagic and ischemic stroke subjects. Furthermore, this paper designs a pyramid cascade convolutional neural network (PCCNN) model with multi-branch independent learning and hierarchical fusion. Each DWT and EMD feature is processed by an independent one-dimensional convolutional neural networks (1D-CNN) branch for targeted extraction. A pyramid fusion mechanism integrates branch outputs into a fused feature vector, enabling the feature interaction through a top-level fusion CNN. Experimental results demonstrate that the proposed method, which integrates channel matrix processing, high-quality DWT and EMD feature selection, and multi-branch feature fusion, significantly outperforms single-feature methods. The fusion feature achieves a classification accuracy of 99.48 %, effectively distinguishing EEG data of hemorrhagic and ischemic stroke.
中风分类对于及时诊断和治疗至关重要,因为它有助于区分出血性和缺血性中风,这需要不同的临床干预措施。提出了一种基于多通道脑电图数据的脑卒中分类方法。与单通道数据或简单的多通道拼接不同,我们的方法将脑电数据作为通道矩阵处理,显著提高了分类性能。我们采用了两种互补的特征提取技术:离散小波变换(DWT)和经验模态分解(EMD)。DWT从脑卒中相关频带提取多尺度小波系数,EMD将脑电信号分解为表征窄带振荡分量的内禀模态函数(IMFs)。为了提高特征质量,我们提出了一种综合信息熵、功率谱密度(PSD)距离、统计显著性和最大信息系数(MIC)四个指标的混合选择方法来综合评价imf。该方法既考虑了脑电图信号的固有信息量,又考虑了出血性脑卒中与缺血性脑卒中受试者的类间差异。在此基础上,设计了一种具有多分支独立学习和层次融合的金字塔级联卷积神经网络模型。每个DWT和EMD特征由一个独立的一维卷积神经网络(1D-CNN)分支进行处理,进行有针对性的提取。金字塔融合机制将分支输出整合为融合的特征向量,通过顶层融合CNN实现特征交互。实验结果表明,该方法集成了信道矩阵处理、高质量DWT和EMD特征选择以及多分支特征融合,显著优于单特征方法。该融合特征分类准确率达99.48%,可有效区分出血性脑卒中和缺血性脑卒中。
{"title":"PCCNN: A CNN classification model integrating EEG time-frequency features for stroke classification","authors":"Teng Wang ,&nbsp;Fenglian Li ,&nbsp;Jia Yang ,&nbsp;Wenhui Jia ,&nbsp;Fengyun Hu","doi":"10.1016/j.cogr.2025.05.002","DOIUrl":"10.1016/j.cogr.2025.05.002","url":null,"abstract":"<div><div>Stroke classification is crucial for timely diagnosis and treatment, as it helps differentiate between hemorrhagic and ischemic strokes, which require distinct clinical interventions. This paper proposes a stroke classification method using multi-channel electroencephalography (EEG) data. Unlike single-channel data or simple multi-channel concatenation, our method processes EEG data as a channel matrix, significantly improving classification performance. We employ two complementary feature extraction techniques: discrete wavelet transform (DWT) and empirical mode decomposition (EMD). DWT extracts multi-scale wavelet coefficients from stroke-related frequency bands, while EMD decomposes EEG signals into intrinsic mode functions (IMFs), representing narrowband oscillation components. To enhance feature quality, we propose a hybrid selection method that integrates four metrics—information entropy, power spectral density (PSD) distance, statistical significance, and maximum information coefficient (MIC)—to comprehensively evaluate IMFs. This method accounts for both the intrinsic information content of EEG signals and the inter-class differences between hemorrhagic and ischemic stroke subjects. Furthermore, this paper designs a pyramid cascade convolutional neural network (PCCNN) model with multi-branch independent learning and hierarchical fusion. Each DWT and EMD feature is processed by an independent one-dimensional convolutional neural networks (1D-CNN) branch for targeted extraction. A pyramid fusion mechanism integrates branch outputs into a fused feature vector, enabling the feature interaction through a top-level fusion CNN. Experimental results demonstrate that the proposed method, which integrates channel matrix processing, high-quality DWT and EMD feature selection, and multi-branch feature fusion, significantly outperforms single-feature methods. The fusion feature achieves a classification accuracy of 99.48 %, effectively distinguishing EEG data of hemorrhagic and ischemic stroke.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 211-225"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144189322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Cognitive Robotics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1