首页 > 最新文献

alexandria engineering journal最新文献

英文 中文
Indoor mobile robot localization system based on ORB-SLAM3 and multi-sensor fusion 基于ORB-SLAM3和多传感器融合的室内移动机器人定位系统
IF 6.8 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Pub Date : 2026-01-21 DOI: 10.1016/j.aej.2026.01.029
Siyong Fu, Qinghua Zhao, Qiuxiang Tao, Hesheng Liu, Qing Wang, Danjuan Liu
Indoor localization is a fundamental capability for autonomous mobile robots operating in complex indoor environments, where visual degradation, sensor noise, and rotational motion often lead to accumulated drift. This paper presents MFL-SLAM, a practical multi-sensor fusion localization system that extends the ORB-SLAM3 framework by explicitly integrating wheel odometry with visual–inertial SLAM. Unlike conventional visual–inertial approaches, MFL-SLAM employs an Extended Kalman Filter (EKF) to tightly fuse Inertial Measurement Unit (IMU) and wheel odometry, effectively compensating for vibration-induced inertial drift and wheel slippage during rotational motion. The EKF fusion output is then incorporated as a prior in a nonlinear optimization back-end together with RGB-D visual constraints, enabling accurate and globally consistent pose estimation. Extensive experiments demonstrate that MFL-SLAM achieves a 47.3 % reduction in relative pose error compared to ORB-SLAM3 and reduces the average localization error to 0.29 m, outperforming ORB-SLAM2 and LIO-SAM across small- and large-scale indoor environments. These results indicate that the proposed fusion strategy provides a robust and deployable solution for reliable indoor mobile robot localization.
室内定位是自主移动机器人在复杂室内环境中工作的基本能力,在复杂的室内环境中,视觉退化、传感器噪声和旋转运动经常导致累积漂移。MFL-SLAM是一种实用的多传感器融合定位系统,它扩展了ORB-SLAM3框架,显式地将车轮里程测量与视觉惯性SLAM相结合。与传统的视觉惯性方法不同,MFL-SLAM采用扩展卡尔曼滤波(EKF)将惯性测量单元(IMU)和车轮里程计紧密融合,有效补偿旋转运动中振动引起的惯性漂移和车轮滑移。然后将EKF融合输出与RGB-D视觉约束一起作为非线性优化后端的先验,实现准确和全局一致的姿态估计。大量实验表明,与ORB-SLAM3相比,MFL-SLAM的相对位姿误差降低了47.3% %,平均定位误差降低到0.29 m,在小型和大型室内环境中都优于ORB-SLAM2和LIO-SAM。这些结果表明,该融合策略为室内移动机器人的可靠定位提供了一种鲁棒性和可部署的解决方案。
{"title":"Indoor mobile robot localization system based on ORB-SLAM3 and multi-sensor fusion","authors":"Siyong Fu,&nbsp;Qinghua Zhao,&nbsp;Qiuxiang Tao,&nbsp;Hesheng Liu,&nbsp;Qing Wang,&nbsp;Danjuan Liu","doi":"10.1016/j.aej.2026.01.029","DOIUrl":"10.1016/j.aej.2026.01.029","url":null,"abstract":"<div><div>Indoor localization is a fundamental capability for autonomous mobile robots operating in complex indoor environments, where visual degradation, sensor noise, and rotational motion often lead to accumulated drift. This paper presents MFL-SLAM, a practical multi-sensor fusion localization system that extends the ORB-SLAM3 framework by explicitly integrating wheel odometry with visual–inertial SLAM. Unlike conventional visual–inertial approaches, MFL-SLAM employs an Extended Kalman Filter (EKF) to tightly fuse Inertial Measurement Unit (IMU) and wheel odometry, effectively compensating for vibration-induced inertial drift and wheel slippage during rotational motion. The EKF fusion output is then incorporated as a prior in a nonlinear optimization back-end together with RGB-D visual constraints, enabling accurate and globally consistent pose estimation. Extensive experiments demonstrate that MFL-SLAM achieves a 47.3 % reduction in relative pose error compared to ORB-SLAM3 and reduces the average localization error to 0.29 m, outperforming ORB-SLAM2 and LIO-SAM across small- and large-scale indoor environments. These results indicate that the proposed fusion strategy provides a robust and deployable solution for reliable indoor mobile robot localization.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"137 ","pages":"Pages 194-205"},"PeriodicalIF":6.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Topological approach with decision making based on nano beta and its application 基于纳米β的拓扑决策方法及其应用
IF 6.8 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Pub Date : 2026-01-21 DOI: 10.1016/j.aej.2026.01.020
S.A. Alblowi , Hala Alzumi , A.N. Al Qarni , M. El Sayed , M.A. El Safty
The concepts of nano beta are introduced in this article along with their relationships. We also analyze some of their properties. Based on this concept, we introduce the topological approach to decision-making and its application in the medical field, which uses topological tools like nano beta. These mathematical tools are used in medicine to improve diagnostic accuracy and aid in the development of treatment plans. In a study on the diagnosis of chronic kidney disease, the authors show that this strategy is beneficial. Based on the results, the topological application of nano beta offers a trustworthy and accurate way to make medical decisions. This paper employs the concept of attribute and basis elimination in nano beta topology to identify the principal factors contributing to chronic kidney disease. Diabetes and high blood pressure were found to be the main things that put people at risk for CKD. This risk can be prevented by taking healthy food and proper medical care. To help physicians determine whether a patient has chronic kidney disease, we developed an algorithm. Healthcare providers can use this approach to accurately diagnose illnesses, develop effective treatment strategies, and assist patients in recovering. These can also act as a starting point for the creation of sophisticated nano systems. Finally, we describe a medical method that helps people with chronic kidney disease to determine the underlying cause of their illness.
本文介绍了纳米β的概念以及它们之间的关系。我们还分析了它们的一些性质。基于这一概念,我们介绍了拓扑决策方法及其在医学领域的应用,该方法使用了纳米β等拓扑工具。这些数学工具在医学上用于提高诊断的准确性和帮助制定治疗计划。在一项关于慢性肾脏疾病诊断的研究中,作者表明这种策略是有益的。基于这些结果,纳米β的拓扑应用为医疗决策提供了一种可靠而准确的方法。本文采用纳米β拓扑中的属性和基消除概念来识别导致慢性肾脏疾病的主要因素。糖尿病和高血压被发现是使人们面临慢性肾病风险的主要因素。这种风险可以通过健康饮食和适当的医疗护理来预防。为了帮助医生确定患者是否患有慢性肾脏疾病,我们开发了一种算法。医疗保健提供者可以使用这种方法来准确诊断疾病,制定有效的治疗策略,并帮助患者康复。这些也可以作为创建复杂纳米系统的起点。最后,我们描述了一种医学方法,帮助人们与慢性肾脏疾病,以确定其疾病的根本原因。
{"title":"Topological approach with decision making based on nano beta and its application","authors":"S.A. Alblowi ,&nbsp;Hala Alzumi ,&nbsp;A.N. Al Qarni ,&nbsp;M. El Sayed ,&nbsp;M.A. El Safty","doi":"10.1016/j.aej.2026.01.020","DOIUrl":"10.1016/j.aej.2026.01.020","url":null,"abstract":"<div><div>The concepts of nano beta are introduced in this article along with their relationships. We also analyze some of their properties. Based on this concept, we introduce the topological approach to decision-making and its application in the medical field, which uses topological tools like nano beta. These mathematical tools are used in medicine to improve diagnostic accuracy and aid in the development of treatment plans. In a study on the diagnosis of chronic kidney disease, the authors show that this strategy is beneficial. Based on the results, the topological application of nano beta offers a trustworthy and accurate way to make medical decisions. This paper employs the concept of attribute and basis elimination in nano beta topology to identify the principal factors contributing to chronic kidney disease. Diabetes and high blood pressure were found to be the main things that put people at risk for CKD. This risk can be prevented by taking healthy food and proper medical care. To help physicians determine whether a patient has chronic kidney disease, we developed an algorithm. Healthcare providers can use this approach to accurately diagnose illnesses, develop effective treatment strategies, and assist patients in recovering. These can also act as a starting point for the creation of sophisticated nano systems. Finally, we describe a medical method that helps people with chronic kidney disease to determine the underlying cause of their illness.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"137 ","pages":"Pages 218-224"},"PeriodicalIF":6.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-driven optimization techniques for smart sustainable manufacturing in Industry 5.0 ecosystem: A comprehensive review 工业5.0生态系统中智能可持续制造的人工智能驱动优化技术综述
IF 6.8 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Pub Date : 2026-01-20 DOI: 10.1016/j.aej.2026.01.016
Sita Rani , Ramesh Karnati , Vivek Patel , M.K. Ranganathaswamy , Prakhar Tomar , Aman Kataria , Amrindra Pal
The integration of Artificial Intelligence (AI) driven optimization techniques is transforming smart manufacturing in the industry 5.0 landscape leading to sustainable industrial processes. This review comprehensively explores AI-driven optimization methods that enhance efficiency, resilience, and sustainability in modern manufacturing ecosystems. It highlights the role of various AI - based algorithms in optimizing production processes, energy consumption, and supply chains. Along with this, it also presents the significance of AI-driven manufacturing in improving secure production by facilitating real-time monitoring, anomaly detection, and predictive maintenance. In this work, the authors also examine how AI contributes to human-centric manufacturing, addressing challenges such as resource utilization, waste reduction, and adaptive decision-making. Key advancements, limitations, and future research directions are analyzed to provide a holistic view of AI’s transformative potential. The findings underscore the necessity of AI-driven optimization for achieving sustainable, efficient, and flexible manufacturing processes in Industry 5.0. This work serves as a significant reference for researchers, industry professionals, and policymakers seeking to leverage AI for sustainable industrial advancements. This paper presents the comprehensive synthesis of AI-driven optimization techniques represented for the emerging Industry 5.0 model, prioritizing smart sustainable manufacturing. Unlike prior reviews, it systematically compares traditional and AI-based approaches, highlights the transformative synergy of advanced technologies like AI, IoT, digital twins, and blockchain for real-time, human-centric manufacturing, and details hybrid optimization methods integrating AI algorithms. This review uniquely maps the integration of these innovations with sustainability, adaptability, and mass personalization, presenting a roadmap to help industries employ intelligent, data-driven, and eco-friendly optimization solutions for future-ready manufacturing.
人工智能(AI)驱动的优化技术的集成正在改变工业5.0中的智能制造,从而实现可持续的工业过程。本文全面探讨了人工智能驱动的优化方法,以提高现代制造业生态系统的效率、弹性和可持续性。它强调了各种基于人工智能的算法在优化生产过程、能源消耗和供应链中的作用。与此同时,它还通过促进实时监控、异常检测和预测性维护,展示了人工智能驱动制造在提高安全生产方面的重要性。在这项工作中,作者还研究了人工智能如何为以人为中心的制造做出贡献,应对资源利用、减少浪费和适应性决策等挑战。分析了人工智能的主要进展、局限性和未来的研究方向,以提供对人工智能变革潜力的整体看法。研究结果强调了人工智能驱动的优化对于实现工业5.0中可持续、高效和灵活的制造流程的必要性。这项工作为寻求利用人工智能实现可持续工业发展的研究人员、行业专业人士和政策制定者提供了重要参考。本文介绍了以新兴工业5.0模型为代表的人工智能驱动优化技术的综合综合,优先考虑智能可持续制造。与之前的评论不同,它系统地比较了传统方法和基于人工智能的方法,强调了人工智能、物联网、数字孪生和区块链等先进技术在实时、以人为中心的制造中的变革协同作用,并详细介绍了集成人工智能算法的混合优化方法。这篇综述独特地描绘了这些创新与可持续性、适应性和大规模个性化的整合,提出了一个路线图,帮助行业采用智能、数据驱动和生态友好的优化解决方案,为未来的制造业做好准备。
{"title":"AI-driven optimization techniques for smart sustainable manufacturing in Industry 5.0 ecosystem: A comprehensive review","authors":"Sita Rani ,&nbsp;Ramesh Karnati ,&nbsp;Vivek Patel ,&nbsp;M.K. Ranganathaswamy ,&nbsp;Prakhar Tomar ,&nbsp;Aman Kataria ,&nbsp;Amrindra Pal","doi":"10.1016/j.aej.2026.01.016","DOIUrl":"10.1016/j.aej.2026.01.016","url":null,"abstract":"<div><div>The integration of Artificial Intelligence (AI) driven optimization techniques is transforming smart manufacturing in the industry 5.0 landscape leading to sustainable industrial processes. This review comprehensively explores AI-driven optimization methods that enhance efficiency, resilience, and sustainability in modern manufacturing ecosystems. It highlights the role of various AI - based algorithms in optimizing production processes, energy consumption, and supply chains. Along with this, it also presents the significance of AI-driven manufacturing in improving secure production by facilitating real-time monitoring, anomaly detection, and predictive maintenance. In this work, the authors also examine how AI contributes to human-centric manufacturing, addressing challenges such as resource utilization, waste reduction, and adaptive decision-making. Key advancements, limitations, and future research directions are analyzed to provide a holistic view of AI’s transformative potential. The findings underscore the necessity of AI-driven optimization for achieving sustainable, efficient, and flexible manufacturing processes in Industry 5.0. This work serves as a significant reference for researchers, industry professionals, and policymakers seeking to leverage AI for sustainable industrial advancements. This paper presents the comprehensive synthesis of AI-driven optimization techniques represented for the emerging Industry 5.0 model, prioritizing smart sustainable manufacturing. Unlike prior reviews, it systematically compares traditional and AI-based approaches, highlights the transformative synergy of advanced technologies like AI, IoT, digital twins, and blockchain for real-time, human-centric manufacturing, and details hybrid optimization methods integrating AI algorithms. This review uniquely maps the integration of these innovations with sustainability, adaptability, and mass personalization, presenting a roadmap to help industries employ intelligent, data-driven, and eco-friendly optimization solutions for future-ready manufacturing.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"137 ","pages":"Pages 133-158"},"PeriodicalIF":6.8,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of cycloid motor output characteristics based on fluid simulation 基于流体仿真的摆线电机输出特性分析
IF 6.8 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Pub Date : 2026-01-20 DOI: 10.1016/j.aej.2026.01.019
Jiaxing Lu , Lingrong Kong , Yu Wang , Jiong Li
In the field of drilling engineering, the rubber screw motor adopts the cycloidal principle for drilling, and the rubber bushing will carbonize and fail when the drilling at 180°C and above. As a positive displacement motor, the cycloidal motor has high temperature and corrosion resistance, low speed, and high torque. It is used in traditional mechanical engineering fields, such as the hydraulic cycloidal motor of the hoist, while the hydraulic cycloidal motor applied in the drilling fluid field has been blank. Therefore, the cycloidal motor in drilling fluid medium is proposed in this paper, which is different from the traditional mechanical cycloidal motor in oil medium and applied in the field of high-temperature drilling engineering. Based on the clearance flow theory and energy conservation law, the mathematical model of fluid leakage of the cycloidal motor was calculated. Through Matlab programming, the cloud diagram of fluid pressure drop, clearance, and viscosity on the leakage of the cycloidal motor was calculated numerically. The characteristic curve of motor numerical calculation was compared and verified by experimental data of the OMT 160 cycloidal motor produced by Danfoss Company. A theoretical basis for selecting clearance of machining and manufacturing all-metal cycloidal motor were provided.
在钻井工程领域,橡胶螺杆电机采用摆线原理进行钻井,在180℃及以上钻井时,橡胶衬套会碳化失效。摆线电机作为一种正排量电机,具有耐高温、耐腐蚀、低转速、高转矩等特点。它主要用于传统的机械工程领域,如提升机的液压摆线马达,而液压摆线马达在钻井液领域的应用一直是空白。因此,本文提出了钻井液介质中的摆线电机,区别于传统的油介质中的机械式摆线电机,将其应用于高温钻井工程领域。基于间隙流动理论和能量守恒定律,计算了摆线电机流体泄漏的数学模型。通过Matlab编程,数值计算了摆线电机泄漏时流体压降、间隙和粘度的云图。用丹佛斯公司生产的omt160摆线电机的实验数据对电机数值计算的特性曲线进行了比较和验证。为加工间隙的选择和全金属摆线电机的制造提供了理论依据。
{"title":"Analysis of cycloid motor output characteristics based on fluid simulation","authors":"Jiaxing Lu ,&nbsp;Lingrong Kong ,&nbsp;Yu Wang ,&nbsp;Jiong Li","doi":"10.1016/j.aej.2026.01.019","DOIUrl":"10.1016/j.aej.2026.01.019","url":null,"abstract":"<div><div>In the field of drilling engineering, the rubber screw motor adopts the cycloidal principle for drilling, and the rubber bushing will carbonize and fail when the drilling at 180°C and above. As a positive displacement motor, the cycloidal motor has high temperature and corrosion resistance, low speed, and high torque. It is used in traditional mechanical engineering fields, such as the hydraulic cycloidal motor of the hoist, while the hydraulic cycloidal motor applied in the drilling fluid field has been blank. Therefore, the cycloidal motor in drilling fluid medium is proposed in this paper, which is different from the traditional mechanical cycloidal motor in oil medium and applied in the field of high-temperature drilling engineering. Based on the clearance flow theory and energy conservation law, the mathematical model of fluid leakage of the cycloidal motor was calculated. Through Matlab programming, the cloud diagram of fluid pressure drop, clearance, and viscosity on the leakage of the cycloidal motor was calculated numerically. The characteristic curve of motor numerical calculation was compared and verified by experimental data of the OMT 160 cycloidal motor produced by Danfoss Company. A theoretical basis for selecting clearance of machining and manufacturing all-metal cycloidal motor were provided.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"137 ","pages":"Pages 159-170"},"PeriodicalIF":6.8,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vote3D-AD: Unsupervised point cloud anomaly localization via varied defect synthesis and differentiable vote-clustering Vote3D-AD:基于变化缺陷综合和可微分投票聚类的无监督点云异常定位
IF 6.8 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Pub Date : 2026-01-20 DOI: 10.1016/j.aej.2026.01.024
Dinh-Cuong Hoang , Phan Xuan Tan , Anh-Nhat Nguyen , Ta Huu Anh Duong , Tuan-Minh Huynh , Minh-Anh Nguyen , Duc-Manh Nguyen , Minh-Duc Cao , Duc-Huy Ngo , Minh-Quang Vu , Van-Duc Vu , Van-Thiep Nguyen , Thu-Uyen Nguyen , Khanh-Toan Phan , Van-Hiep Duong
Existing three-dimensional (3D) anomaly detection approaches typically rely on reconstruction, external memory banks, or fixed-radius clustering and often fail to generalize to noisy, irregularly sampled industrial scans or to capture the full diversity of real defects. We present Vote3D-AD, a single-pass framework that trains only on defect-free data and addresses these gaps with two principal contributions. First, we introduce Varied Defect Synthesis (VDS), a saliency-guided pseudo-anomaly generator that produces diverse, physically plausible defects (bulges, dents, holes, cracks, and surface roughness) together with sensor-level degradations to narrow the synthetic-to-real gap. Second, we develop a vote-and-cluster architecture in which local geometric representations predict learned, scale-aware votes that encode both spatial and boundary cues, and a differentiable soft-assignment clustering module aggregates these votes into coherent anomaly regions without relying on fixed-radius grouping or external memory structures. We evaluated our method on the synthetic Anomaly-ShapeNet benchmark and a new industrial dataset using three metrics: point-level Area Under the Receiver Operating Characteristic curve (AUROC), object-level Area Under the Precision-Recall curve (AUPR), and F1-Score. On average across both benchmarks, our method improves point-level AUROC by 6.7%, AUPR by 10.1% and F1 by 11.2%, and improves object-level AUROC by 5.3%, AUPR by 3.8% and F1 by 5.4% over the strongest baseline, while maintaining inference speeds above 9 frames per second (FPS).
现有的三维(3D)异常检测方法通常依赖于重建,外部存储库或固定半径聚类,并且通常无法推广到有噪声,不规则采样的工业扫描或捕获真实缺陷的全部多样性。我们提出了Vote3D-AD,这是一个单通道框架,仅在无缺陷数据上进行训练,并通过两个主要贡献来解决这些差距。首先,我们引入了可变缺陷合成(VDS),这是一种显著性引导的伪异常发生器,可以产生各种物理上合理的缺陷(凸起、凹痕、孔洞、裂缝和表面粗糙度),以及传感器级的退化,以缩小合成到真实的差距。其次,我们开发了一种投票和聚类架构,其中局部几何表征预测学习的、规模感知的投票,这些投票编码空间和边界线索,可微分的软分配聚类模块将这些投票聚集到连贯的异常区域,而不依赖于固定半径分组或外部记忆结构。我们在综合的Anomaly-ShapeNet基准和一个新的工业数据集上评估了我们的方法,使用了三个指标:点级的接收者工作特征曲线下面积(AUROC)、对象级的精确召回曲线下面积(AUPR)和F1-Score。在两个基准测试中,我们的方法平均将点级AUROC提高了6.7%,AUPR提高了10.1%,F1提高了11.2%,并将对象级AUROC提高了5.3%,AUPR提高了3.8%,F1提高了5.4%,同时将推理速度保持在每秒9帧(FPS)以上。
{"title":"Vote3D-AD: Unsupervised point cloud anomaly localization via varied defect synthesis and differentiable vote-clustering","authors":"Dinh-Cuong Hoang ,&nbsp;Phan Xuan Tan ,&nbsp;Anh-Nhat Nguyen ,&nbsp;Ta Huu Anh Duong ,&nbsp;Tuan-Minh Huynh ,&nbsp;Minh-Anh Nguyen ,&nbsp;Duc-Manh Nguyen ,&nbsp;Minh-Duc Cao ,&nbsp;Duc-Huy Ngo ,&nbsp;Minh-Quang Vu ,&nbsp;Van-Duc Vu ,&nbsp;Van-Thiep Nguyen ,&nbsp;Thu-Uyen Nguyen ,&nbsp;Khanh-Toan Phan ,&nbsp;Van-Hiep Duong","doi":"10.1016/j.aej.2026.01.024","DOIUrl":"10.1016/j.aej.2026.01.024","url":null,"abstract":"<div><div>Existing three-dimensional (3D) anomaly detection approaches typically rely on reconstruction, external memory banks, or fixed-radius clustering and often fail to generalize to noisy, irregularly sampled industrial scans or to capture the full diversity of real defects. We present Vote3D-AD, a single-pass framework that trains only on defect-free data and addresses these gaps with two principal contributions. First, we introduce Varied Defect Synthesis (VDS), a saliency-guided pseudo-anomaly generator that produces diverse, physically plausible defects (bulges, dents, holes, cracks, and surface roughness) together with sensor-level degradations to narrow the synthetic-to-real gap. Second, we develop a vote-and-cluster architecture in which local geometric representations predict learned, scale-aware votes that encode both spatial and boundary cues, and a differentiable soft-assignment clustering module aggregates these votes into coherent anomaly regions without relying on fixed-radius grouping or external memory structures. We evaluated our method on the synthetic Anomaly-ShapeNet benchmark and a new industrial dataset using three metrics: point-level Area Under the Receiver Operating Characteristic curve (AUROC), object-level Area Under the Precision-Recall curve (AUPR), and F1-Score. On average across both benchmarks, our method improves point-level AUROC by 6.7%, AUPR by 10.1% and F1 by 11.2%, and improves object-level AUROC by 5.3%, AUPR by 3.8% and F1 by 5.4% over the strongest baseline, while maintaining inference speeds above 9 frames per second (FPS).</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"137 ","pages":"Pages 171-193"},"PeriodicalIF":6.8,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DGGAT: Dual-branch gated graph attention transformer for high-accuracy semantic segmentation DGGAT:用于高精度语义分割的双分支门控图注意力转换器
IF 6.8 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Pub Date : 2026-01-20 DOI: 10.1016/j.aej.2026.01.027
Jie He
Semantic segmentation requires both global context and fine-grained details, yet CNNs struggle with long-range dependencies and Transformers can under-represent low-level structure and be computationally heavy. We propose DGGAT, a dual-branch gated graph attention Transformer: a global branch models long-range context, while a graph-embedded local branch groups semantically related pixels into nodes and applies gated graph attention to sharpen boundaries and small objects. An Attention-Based Feature Selection Fusion Module (ASFM) fuses global and local features to suppress redundancy and balance detail with context. On Cityscapes, DGGAT reaches 84.7% mIoU with a ResNet-101 backbone (82.8% with ResNet-50), and on ADE20K it attains 48.2% mIoU, validating both accuracy and efficiency. These results demonstrate that DGGAT effectively integrates global semantics with fine-detail representation.
语义分割需要全局上下文和细粒度的细节,但cnn在长期依赖关系方面存在问题,而transformer可能无法充分表示低级结构,并且计算量很大。我们提出了一种双分支门控图注意力转换器DGGAT:一个全局分支建模远程上下文,而一个图嵌入的局部分支将语义相关的像素分组到节点中,并应用门控图注意力来锐利边界和小对象。基于注意力的特征选择融合模块(ASFM)融合全局和局部特征以抑制冗余并平衡细节与上下文。在cityscape上,DGGAT在ResNet-101骨干网下达到84.7%的mIoU(在ResNet-50骨干网下达到82.8%),在ADE20K上达到48.2%的mIoU,验证了准确性和效率。这些结果表明,DGGAT有效地将全局语义与细节表示相结合。
{"title":"DGGAT: Dual-branch gated graph attention transformer for high-accuracy semantic segmentation","authors":"Jie He","doi":"10.1016/j.aej.2026.01.027","DOIUrl":"10.1016/j.aej.2026.01.027","url":null,"abstract":"<div><div>Semantic segmentation requires both global context and fine-grained details, yet CNNs struggle with long-range dependencies and Transformers can under-represent low-level structure and be computationally heavy. We propose DGGAT, a dual-branch gated graph attention Transformer: a global branch models long-range context, while a graph-embedded local branch groups semantically related pixels into nodes and applies gated graph attention to sharpen boundaries and small objects. An Attention-Based Feature Selection Fusion Module (ASFM) fuses global and local features to suppress redundancy and balance detail with context. On Cityscapes, DGGAT reaches 84.7% mIoU with a ResNet-101 backbone (82.8% with ResNet-50), and on ADE20K it attains 48.2% mIoU, validating both accuracy and efficiency. These results demonstrate that DGGAT effectively integrates global semantics with fine-detail representation.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"137 ","pages":"Pages 123-132"},"PeriodicalIF":6.8,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From words to proverbs: Evaluating LLMs’ linguistic and cultural competence in Saudi dialects with Absher 从单词到谚语:用Absher评估法学硕士在沙特方言中的语言和文化能力
IF 6.8 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Pub Date : 2026-01-19 DOI: 10.1016/j.aej.2025.12.066
Renad Al-Monef , Hassan Alhuzali , Nora Alturayeif , Ashwag Alasmari
As large language models (LLMs) become increasingly central to Arabic NLP applications, their effectiveness in linguistically diverse settings, particularly regions with rich dialectal variation such as Saudi Arabia, remains underexplored. Existing evaluation paradigms tend to prioritize high-resource languages or Modern Standard Arabic (MSA), overlooking regional linguistic and cultural specificities. This leads to performance limitations and cultural biases in real-world deployments. To address this gap, we introduce Absher, the first comprehensive and fine-grained benchmark designed to assess the understanding of LLMs regarding Saudi dialects and their embedded cultural nuances. Absher consists of over 18,000 multiple choice questions derived from a curated dataset of dialectal words, phrases, and proverbs sourced from five major Saudi regions. The benchmark spans six task categories: Meaning, True/False, Fill-in-the-Blank, Contextual Usage, Cultural Interpretation, and Location Recognition, enabling multifaceted evaluation across both linguistic and cultural dimensions. We perform zero-shot evaluations on six state-of-the-art open LLMs: ALLaM, LLaMA, Jais, Mistral, Qwen, and AceGPT. Our results reveal substantial performance variability across dialects and question types. Qwen achieved the highest overall accuracy, excelling in word-level questions (63%), while ALLaM outperformed others in the interpretation of proverbs (48% accuracy). All models struggled with content from underrepresented dialects, particularly Southern and Eastern variants, and with context-free True/False questions, highlighting weaknesses in dialect grounding and binary reasoning. These findings demonstrate the need for dialect-aware training and culturally aligned evaluation. We position Absher as a critical step toward more equitable and effective LLMs development for real-world Arabic applications.
随着大型语言模型(llm)在阿拉伯语NLP应用中变得越来越重要,它们在语言多样性环境中的有效性仍未得到充分探索,特别是在方言差异丰富的地区,如沙特阿拉伯。现有的评价范式往往优先考虑资源丰富的语言或现代标准阿拉伯语(MSA),忽视了区域语言和文化的特殊性。这导致了实际部署中的性能限制和文化偏差。为了解决这一差距,我们介绍了Absher,这是第一个全面而细致的基准,旨在评估法学硕士对沙特方言及其嵌入的文化细微差别的理解。Absher由18000多个选择题组成,这些选择题来自沙特五个主要地区的方言单词、短语和谚语的精心整理的数据集。该基准涵盖了六个任务类别:意义、真假、填空、语境用法、文化解释和位置识别,从而实现了跨语言和文化维度的多方面评估。我们对六个最先进的开放式llm进行零射击评估:ALLaM, LLaMA, Jais, Mistral, Qwen和AceGPT。我们的研究结果揭示了方言和问题类型之间的实质性表现差异。Qwen达到了最高的整体准确率,在单词水平的问题上表现出色(63%),而ALLaM在谚语的解释上表现出色(48%)。所有模型都难以处理未被充分代表的方言内容,尤其是南方和东方方言,以及与上下文无关的真/假问题,突出了方言基础和二元推理的弱点。这些发现表明,需要进行方言意识培训和与文化相一致的评估。我们将Absher定位为朝着更加公平和有效的法学硕士开发现实世界阿拉伯语应用的关键一步。
{"title":"From words to proverbs: Evaluating LLMs’ linguistic and cultural competence in Saudi dialects with Absher","authors":"Renad Al-Monef ,&nbsp;Hassan Alhuzali ,&nbsp;Nora Alturayeif ,&nbsp;Ashwag Alasmari","doi":"10.1016/j.aej.2025.12.066","DOIUrl":"10.1016/j.aej.2025.12.066","url":null,"abstract":"<div><div>As large language models (LLMs) become increasingly central to Arabic NLP applications, their effectiveness in linguistically diverse settings, particularly regions with rich dialectal variation such as Saudi Arabia, remains underexplored. Existing evaluation paradigms tend to prioritize high-resource languages or Modern Standard Arabic (MSA), overlooking regional linguistic and cultural specificities. This leads to performance limitations and cultural biases in real-world deployments. To address this gap, we introduce <span>Absher</span>, the first comprehensive and fine-grained benchmark designed to assess the understanding of LLMs regarding Saudi dialects and their embedded cultural nuances. <span>Absher</span> consists of over 18,000 multiple choice questions derived from a curated dataset of dialectal words, phrases, and proverbs sourced from five major Saudi regions. The benchmark spans six task categories: Meaning, True/False, Fill-in-the-Blank, Contextual Usage, Cultural Interpretation, and Location Recognition, enabling multifaceted evaluation across both linguistic and cultural dimensions. We perform zero-shot evaluations on six state-of-the-art open LLMs: ALLaM, LLaMA, Jais, Mistral, Qwen, and AceGPT. Our results reveal substantial performance variability across dialects and question types. Qwen achieved the highest overall accuracy, excelling in word-level questions (63%), while ALLaM outperformed others in the interpretation of proverbs (48% accuracy). All models struggled with content from underrepresented dialects, particularly Southern and Eastern variants, and with context-free True/False questions, highlighting weaknesses in dialect grounding and binary reasoning. These findings demonstrate the need for dialect-aware training and culturally aligned evaluation. We position <span>Absher</span> as a critical step toward more equitable and effective LLMs development for real-world Arabic applications.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"137 ","pages":"Pages 25-41"},"PeriodicalIF":6.8,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145993580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rotationally symmetric resonator-based metamaterial for wideband EMI shielding and blood dielectric property sensing applications 基于旋转对称谐振器的超材料,用于宽带电磁干扰屏蔽和血介电特性传感应用
IF 6.8 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Pub Date : 2026-01-19 DOI: 10.1016/j.aej.2026.01.021
Abdullah Al Mahfazur Rahman , Mohamad A. Alawad , Md. Moniruzzaman , Yazeed Alkhrijah , Badariah Bais , Abdulmajeed M. Alenezi , Mohammad Tariqul Islam
This paper presents a rotationally symmetric metamaterial (MTM) designed for electromagnetic interference (EMI) shielding and blood dielectric sensing applications. The geometry of the MTM unit cell (9.6mm× 9.6mm×1.6mm) is optimized through CST simulation. The array of unit cells ensures the S21 resonance at 5.961 GHz, with a broader bandwidth of 4.28 GHz (71.80 %) spanning from 3.75 to 8.03 GHz for the optimized dimensions of various segments of the rotationally symmetric unit cell. Utilizing field distribution, surface current, and effective parameter responses, the resonance phenomena are analyzed. The array structure of the MTM achieves a peak shielding effectiveness of 39.78 dB within the C-band while maintaining angular stability. Additionally, it performs nonlinear sensing responses, establishing a high-frequency deviation ranging from 4.037 to 4.230 GHz and demonstrating a high sensitivity of 4.44 %, which enables it to detect variations in blood dielectric properties. For sensing analysis, samples are replicated in a laboratory to accurately imitate blood dielectric properties. The performance of the designed MTM is validated by prototype measurements, which align well with the simulations. The findings confirm the design's effectiveness for EMI shielding in microwave communication and its potential for blood dielectric sensing in biomedical applications.
本文提出了一种旋转对称超材料(MTM),用于电磁干扰屏蔽和血介质传感。通过CST仿真优化了MTM单元格(9.6 mmx 9.6mm×1.6mm)的几何形状。单元格阵列保证了S21在5.961 GHz的共振,并且由于旋转对称单元格各段的尺寸优化,其带宽为4.28 GHz(71.80%),跨越3.75至8.03 GHz。利用场分布、表面电流和有效参数响应分析了共振现象。该MTM阵列结构在保持角稳定性的情况下,在c波段的峰值屏蔽效率为39.78 dB。此外,它执行非线性传感响应,建立了从4.037到4.230 GHz的高频偏差,并显示出4.44%的高灵敏度,这使得它能够检测血液介电特性的变化。对于传感分析,样品在实验室中复制,以准确地模拟血液的介电特性。通过样机测量验证了所设计MTM的性能,结果与仿真结果吻合较好。研究结果证实了该设计在微波通信中的EMI屏蔽效果及其在生物医学应用中的血液电介质传感潜力。
{"title":"Rotationally symmetric resonator-based metamaterial for wideband EMI shielding and blood dielectric property sensing applications","authors":"Abdullah Al Mahfazur Rahman ,&nbsp;Mohamad A. Alawad ,&nbsp;Md. Moniruzzaman ,&nbsp;Yazeed Alkhrijah ,&nbsp;Badariah Bais ,&nbsp;Abdulmajeed M. Alenezi ,&nbsp;Mohammad Tariqul Islam","doi":"10.1016/j.aej.2026.01.021","DOIUrl":"10.1016/j.aej.2026.01.021","url":null,"abstract":"<div><div>This paper presents a rotationally symmetric metamaterial (MTM) designed for electromagnetic interference (EMI) shielding and blood dielectric sensing applications. The geometry of the MTM unit cell (<span><math><mrow><mn>9.6</mn><mi>mm</mi><mo>×</mo></mrow></math></span> <span><math><mrow><mn>9.6</mn><mi>mm</mi><mo>×</mo><mn>1.6</mn><mi>mm</mi></mrow></math></span>) is optimized through CST simulation. The array of unit cells ensures the S<sub>21</sub> resonance at 5.961 GHz, with a broader bandwidth of 4.28 GHz (71.80 %) spanning from 3.75 to 8.03 GHz for the optimized dimensions of various segments of the rotationally symmetric unit cell. Utilizing field distribution, surface current, and effective parameter responses, the resonance phenomena are analyzed. The array structure of the MTM achieves a peak shielding effectiveness of 39.78 dB within the C-band while maintaining angular stability. Additionally, it performs nonlinear sensing responses, establishing a high-frequency deviation ranging from 4.037 to 4.230 GHz and demonstrating a high sensitivity of 4.44 %, which enables it to detect variations in blood dielectric properties. For sensing analysis, samples are replicated in a laboratory to accurately imitate blood dielectric properties. The performance of the designed MTM is validated by prototype measurements, which align well with the simulations. The findings confirm the design's effectiveness for EMI shielding in microwave communication and its potential for blood dielectric sensing in biomedical applications.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"137 ","pages":"Pages 101-122"},"PeriodicalIF":6.8,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A power network anomaly alarm denoising method based on a hybrid LSTM-attention model 基于lstm -注意力混合模型的电网异常报警去噪方法
IF 6.8 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Pub Date : 2026-01-19 DOI: 10.1016/j.aej.2026.01.006
Mingfei Zeng, Yuting Lian
Power grid fault alarms play a crucial role in minimizing system damage and service disruptions. However, existing deep learning approaches frequently neglect the topological structure and physical characteristics of power system data, leading to suboptimal fault identification performance. To address this limitation, this paper proposes PhysLSTM-Attn, a novel physics-informed deep learning method for power network anomaly alarm denoising. The model incorporates Kirchhoff’s Laws directly within the feature embedding layer, ensuring that learned representations adhere to fundamental circuit conservation principles. A topology-aware bidirectional LSTM encoder captures both temporal dependencies and spatial relationships by integrating graph-Laplacian-based positional encodings into its gating mechanism. In addition, an electrical-distance-enhanced multi-head attention mechanism computes attention weights based on electrical coupling strength rather than semantic similarity, providing a more accurate reflection of device interactions. A multi-hop graph convolutional network further models cascading fault propagation across multiple electrical distances, while a confidence calibration module supplies reliability estimates to support decision-making. Comprehensive experiments on the East China Power Grid Alarm Dataset and the IEEE 118-Node Extended Dataset demonstrate accuracy improvements of 8.32 % and 7.43 % over the LSTM-Attention baseline, respectively.
电网故障报警在减少系统损害和服务中断方面起着至关重要的作用。然而,现有的深度学习方法往往忽略了电力系统数据的拓扑结构和物理特性,导致故障识别性能欠佳。为了解决这一限制,本文提出了一种新的基于物理的深度学习方法PhysLSTM-Attn,用于电网异常报警去噪。该模型将Kirchhoff定律直接纳入特征嵌入层,确保学习到的表示符合基本的电路守恒原则。拓扑感知的双向LSTM编码器通过将基于图拉普拉斯的位置编码集成到其门控机制中来捕获时间依赖关系和空间关系。此外,电距离增强的多头注意机制基于电耦合强度而不是语义相似性来计算注意权重,从而更准确地反映设备交互。多跳图卷积网络进一步模拟跨多个电距离的级联故障传播,而置信度校准模块提供可靠性估计以支持决策。在华东电网报警数据集和IEEE 118节点扩展数据集上的综合实验表明,与LSTM-Attention基线相比,准确率分别提高了8.32 %和7.43 %。
{"title":"A power network anomaly alarm denoising method based on a hybrid LSTM-attention model","authors":"Mingfei Zeng,&nbsp;Yuting Lian","doi":"10.1016/j.aej.2026.01.006","DOIUrl":"10.1016/j.aej.2026.01.006","url":null,"abstract":"<div><div>Power grid fault alarms play a crucial role in minimizing system damage and service disruptions. However, existing deep learning approaches frequently neglect the topological structure and physical characteristics of power system data, leading to suboptimal fault identification performance. To address this limitation, this paper proposes PhysLSTM-Attn, a novel physics-informed deep learning method for power network anomaly alarm denoising. The model incorporates Kirchhoff’s Laws directly within the feature embedding layer, ensuring that learned representations adhere to fundamental circuit conservation principles. A topology-aware bidirectional LSTM encoder captures both temporal dependencies and spatial relationships by integrating graph-Laplacian-based positional encodings into its gating mechanism. In addition, an electrical-distance-enhanced multi-head attention mechanism computes attention weights based on electrical coupling strength rather than semantic similarity, providing a more accurate reflection of device interactions. A multi-hop graph convolutional network further models cascading fault propagation across multiple electrical distances, while a confidence calibration module supplies reliability estimates to support decision-making. Comprehensive experiments on the East China Power Grid Alarm Dataset and the IEEE 118-Node Extended Dataset demonstrate accuracy improvements of 8.32 % and 7.43 % over the LSTM-Attention baseline, respectively.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"137 ","pages":"Pages 76-88"},"PeriodicalIF":6.8,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ET-YOLO:A study on a malaria pathogen detection model based on YOLO11 ET-YOLO:基于YOLO11的疟疾病原检测模型研究
IF 6.8 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Pub Date : 2026-01-19 DOI: 10.1016/j.aej.2026.01.025
Yong Lu , Chenxu Wang , Xuze Gu , Xiuqin Pan , Yijin Gang
In order to effectively prevent the global spread of malaria, classical deep learning models have been applied to malaria detection. However, these models generally suffer from low accuracy. In order to address the identified limitations, an Efficient Target-Oriented YOLO model (ET-YOLO) is proposed in this thesis. To address the limited discriminability of C3k2 in malaria microscopy images, we redesigned it into C3k2fECA, which integrates efficient channel attention and a refined fusion pathway to emphasize parasite-related regions. We further developed C3k2fTR, leveraging Transformer-based global context modeling to remedy the loss of contextual cues and improve robustness under complex backgrounds. In addition, a lightweight ConvNeXt variant, CNeB (ConvNeXt Block), was incorporated to effectively reduce model parameters while maintaining strong representational capacity. The experimental results of the improved model on two different datasets demonstrate the effectiveness of the improved model, specifically achieving [email protected] of 86.2% and 77.9% on two different datasets, both of which outperform other traditional YOLO models, while the number of parameters is reduced by about 7.2% compared to the reference model. A balance has been achieved between detection accuracy and computational resource utilization, providing a practical technical solution for malaria control in resource-constrained regions.
为了有效防止疟疾的全球传播,经典的深度学习模型被应用于疟疾检测。然而,这些模型通常存在精度低的问题。为了解决上述局限性,本文提出了一种高效目标导向的YOLO模型(ET-YOLO)。为了解决疟疾显微镜图像中C3k2的局限性,我们将其重新设计为C3k2fECA,它集成了有效的通道关注和精细的融合途径,以强调寄生虫相关区域。我们进一步开发了C3k2fTR,利用基于transformer的全局上下文建模来弥补上下文线索的丢失并提高复杂背景下的鲁棒性。此外,引入了轻量级的ConvNeXt变体CNeB (ConvNeXt Block),以有效地减少模型参数,同时保持强大的表示能力。改进模型在两个不同数据集上的实验结果证明了改进模型的有效性,[email protected]在两个不同数据集上的准确率分别达到了86.2%和77.9%,均优于其他传统的YOLO模型,而参数数量比参考模型减少了约7.2%。在检测精度和计算资源利用之间取得了平衡,为资源受限地区的疟疾控制提供了实用的技术解决方案。
{"title":"ET-YOLO:A study on a malaria pathogen detection model based on YOLO11","authors":"Yong Lu ,&nbsp;Chenxu Wang ,&nbsp;Xuze Gu ,&nbsp;Xiuqin Pan ,&nbsp;Yijin Gang","doi":"10.1016/j.aej.2026.01.025","DOIUrl":"10.1016/j.aej.2026.01.025","url":null,"abstract":"<div><div>In order to effectively prevent the global spread of malaria, classical deep learning models have been applied to malaria detection. However, these models generally suffer from low accuracy. In order to address the identified limitations, an Efficient Target-Oriented YOLO model (ET-YOLO) is proposed in this thesis. To address the limited discriminability of C3k2 in malaria microscopy images, we redesigned it into C3k2fECA, which integrates efficient channel attention and a refined fusion pathway to emphasize parasite-related regions. We further developed C3k2fTR, leveraging Transformer-based global context modeling to remedy the loss of contextual cues and improve robustness under complex backgrounds. In addition, a lightweight ConvNeXt variant, CNeB (ConvNeXt Block), was incorporated to effectively reduce model parameters while maintaining strong representational capacity. The experimental results of the improved model on two different datasets demonstrate the effectiveness of the improved model, specifically achieving [email protected] of 86.2% and 77.9% on two different datasets, both of which outperform other traditional YOLO models, while the number of parameters is reduced by about 7.2% compared to the reference model. A balance has been achieved between detection accuracy and computational resource utilization, providing a practical technical solution for malaria control in resource-constrained regions.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"137 ","pages":"Pages 14-24"},"PeriodicalIF":6.8,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145993579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
alexandria engineering journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1