首页 > 最新文献

Engineering Applications of Artificial Intelligence最新文献

英文 中文
Seeing the unseen: Semantic segmentation and uncertainty quantification for delamination detection in building facades 看到看不见的:建筑立面分层检测的语义分割和不确定性量化
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-04-01 Epub Date: 2026-02-11 DOI: 10.1016/j.engappai.2026.114129
Yuebo Meng , Guotong Yin , Songtao Ye , Qiaoqiao Wang , Guanghui Liu , Xiaohan Li , Xiaojiao Geng
Accurate detection of delamination in building facades is critical for prolonging service life and ensuring structural safety. Current inspection methodologies heavily rely on manual interpretation, lacking efficiency and intelligent robustness. While infrared thermography provides a non-destructive means for detecting subsurface delamination, its accuracy is often compromised by low thermal contrast under uncontrolled conditions and the absence of uncertainty quantification in deep learning models. To address these limitations, this paper proposes TIHSNet, a novel delamination detection framework based on semantic segmentation and uncertainty quantification. Specifically, a physics-informed thermal gradient attention module is introduced to emphasize thermodynamically meaningful gradients and enable accurate delamination boundary delineation. Subsequently, a dual output mechanism is proposed to simultaneously generate prediction and uncertainty maps, enabling quantitative assessment of predictive reliability and identification of regions requiring expert review. To further enhance spatial localization, visible light images are integrated to capture tile boundary information and support spatial classification of delamination. Experiments were conducted on a self constructed dataset comprising 2102 infrared thermography and visible light images collected from reinforced concrete and brick masonry walls. The results demonstrate that TIHSNet achieves a precision of 96.1%, surpassing traditional thresholding methods with a 27.9% gain, and further outperforming existing deep learning approaches by 10.5%. The uncertainty quantification results further validate the model’s robustness and its ability to support reliable decision making in real world inspection scenarios.
准确检测建筑外立面的脱层对延长使用寿命和确保结构安全至关重要。目前的检测方法严重依赖人工解释,缺乏效率和智能鲁棒性。虽然红外热成像为探测地下分层提供了一种非破坏性的手段,但在不受控制的条件下,其准确性往往受到低热对比度和深度学习模型中缺乏不确定性量化的影响。为了解决这些问题,本文提出了一种基于语义分割和不确定性量化的分层检测框架TIHSNet。具体来说,引入了一个物理通知的热梯度注意模块,以强调热力学上有意义的梯度,并实现准确的分层边界划定。随后,提出了一种双输出机制来同时生成预测和不确定性图,从而能够定量评估预测可靠性并识别需要专家审查的区域。为了进一步增强空间定位能力,我们将可见光图像整合在一起,捕捉瓷砖边界信息,支持分层的空间分类。实验以自建的2102张钢筋混凝土和砖砌体墙体的红外热像图和可见光图像为数据集。结果表明,TIHSNet达到了96.1%的精度,比传统的阈值方法提高了27.9%,并进一步比现有的深度学习方法提高了10.5%。不确定性量化结果进一步验证了模型的鲁棒性及其在实际检验场景中支持可靠决策的能力。
{"title":"Seeing the unseen: Semantic segmentation and uncertainty quantification for delamination detection in building facades","authors":"Yuebo Meng ,&nbsp;Guotong Yin ,&nbsp;Songtao Ye ,&nbsp;Qiaoqiao Wang ,&nbsp;Guanghui Liu ,&nbsp;Xiaohan Li ,&nbsp;Xiaojiao Geng","doi":"10.1016/j.engappai.2026.114129","DOIUrl":"10.1016/j.engappai.2026.114129","url":null,"abstract":"<div><div>Accurate detection of delamination in building facades is critical for prolonging service life and ensuring structural safety. Current inspection methodologies heavily rely on manual interpretation, lacking efficiency and intelligent robustness. While infrared thermography provides a non-destructive means for detecting subsurface delamination, its accuracy is often compromised by low thermal contrast under uncontrolled conditions and the absence of uncertainty quantification in deep learning models. To address these limitations, this paper proposes TIHSNet, a novel delamination detection framework based on semantic segmentation and uncertainty quantification. Specifically, a physics-informed thermal gradient attention module is introduced to emphasize thermodynamically meaningful gradients and enable accurate delamination boundary delineation. Subsequently, a dual output mechanism is proposed to simultaneously generate prediction and uncertainty maps, enabling quantitative assessment of predictive reliability and identification of regions requiring expert review. To further enhance spatial localization, visible light images are integrated to capture tile boundary information and support spatial classification of delamination. Experiments were conducted on a self constructed dataset comprising 2102 infrared thermography and visible light images collected from reinforced concrete and brick masonry walls. The results demonstrate that TIHSNet achieves a precision of 96.1%, surpassing traditional thresholding methods with a 27.9% gain, and further outperforming existing deep learning approaches by 10.5%. The uncertainty quantification results further validate the model’s robustness and its ability to support reliable decision making in real world inspection scenarios.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114129"},"PeriodicalIF":8.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent control framework for Unmanned Aerial Vehicle autonomous docking based on Linear Active Disturbance Rejection Control and improved Particle Swarm Optimization 基于线性自抗扰和改进粒子群优化的无人机自主对接智能控制框架
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-04-01 Epub Date: 2026-02-11 DOI: 10.1016/j.engappai.2026.114070
Mingzhi Shao , Xin Liu , Wenchao Cui , Chengmeng Sun , Haiwen Yuan
Autonomous aerial docking of Unmanned Aerial Vehicle (UAV) is essential for aerial refueling, payload replacement, and cooperative operations, yet existing methods often exhibit low docking accuracy, weak disturbance rejection, and empirical parameter tuning. To overcome these limitations, this study proposes an intelligent control framework that integrates Linear Active Disturbance Rejection Control (LADRC) with an Improved Particle Swarm Optimization (IPSO) algorithm. First, a six degree of freedom dynamic model of the UAV and cone sleeve system is developed, incorporating wind disturbance, turbulence, and parameter perturbations. Second, the LADRC method realizes decoupled control of altitude, lateral, and velocity channels, ensuring robust dynamic compensation. Third, the IPSO algorithm, an Artificial Intelligence (AI) based optimization approach, is employed to adaptively tune the controller bandwidth and observer gains. This AI enhanced parameter learning process improves the generalization capability of LADRC under varying flight conditions. Simulation and scaled flight experiments demonstrate that the proposed AI driven LADRC achieves stable docking under fifty percent perturbations, with a trajectory root mean square error of 0.04 m and a relative velocity error of 0.03 m per second. Compared with conventional controllers, the tracking error is reduced by up to 38 percent. These results confirm that combining LADRC with AI based optimization offers a robust and precise solution for UAV autonomous aerial docking in complex and uncertain environments.
无人机的自主空中对接对于空中加油、载荷替换和协同作战至关重要,但现有的对接方法往往存在对接精度低、抗干扰能力弱、经验参数可调等问题。为了克服这些限制,本研究提出了一种集成线性自抗扰控制(LADRC)和改进粒子群优化(IPSO)算法的智能控制框架。首先,建立了考虑风扰动、湍流和参数扰动的六自由度无人机与锥套系统动力学模型;其次,LADRC方法实现了高度通道、横向通道和速度通道的解耦控制,保证了鲁棒动态补偿。第三,采用基于人工智能(AI)的优化方法IPSO算法自适应调节控制器带宽和观测器增益。这种人工智能增强的参数学习过程提高了LADRC在不同飞行条件下的泛化能力。仿真和比例飞行实验表明,人工智能驱动的LADRC在50%扰动下实现了稳定对接,轨迹均方根误差为0.04 m / s,相对速度误差为0.03 m / s。与传统控制器相比,跟踪误差降低了38%。这些结果证实,将LADRC与基于人工智能的优化相结合,为复杂和不确定环境下的无人机自主空中对接提供了鲁棒和精确的解决方案。
{"title":"Intelligent control framework for Unmanned Aerial Vehicle autonomous docking based on Linear Active Disturbance Rejection Control and improved Particle Swarm Optimization","authors":"Mingzhi Shao ,&nbsp;Xin Liu ,&nbsp;Wenchao Cui ,&nbsp;Chengmeng Sun ,&nbsp;Haiwen Yuan","doi":"10.1016/j.engappai.2026.114070","DOIUrl":"10.1016/j.engappai.2026.114070","url":null,"abstract":"<div><div>Autonomous aerial docking of Unmanned Aerial Vehicle (UAV) is essential for aerial refueling, payload replacement, and cooperative operations, yet existing methods often exhibit low docking accuracy, weak disturbance rejection, and empirical parameter tuning. To overcome these limitations, this study proposes an intelligent control framework that integrates Linear Active Disturbance Rejection Control (LADRC) with an Improved Particle Swarm Optimization (IPSO) algorithm. First, a six degree of freedom dynamic model of the UAV and cone sleeve system is developed, incorporating wind disturbance, turbulence, and parameter perturbations. Second, the LADRC method realizes decoupled control of altitude, lateral, and velocity channels, ensuring robust dynamic compensation. Third, the IPSO algorithm, an Artificial Intelligence (AI) based optimization approach, is employed to adaptively tune the controller bandwidth and observer gains. This AI enhanced parameter learning process improves the generalization capability of LADRC under varying flight conditions. Simulation and scaled flight experiments demonstrate that the proposed AI driven LADRC achieves stable docking under fifty percent perturbations, with a trajectory root mean square error of 0.04 m and a relative velocity error of 0.03 m per second. Compared with conventional controllers, the tracking error is reduced by up to 38 percent. These results confirm that combining LADRC with AI based optimization offers a robust and precise solution for UAV autonomous aerial docking in complex and uncertain environments.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114070"},"PeriodicalIF":8.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gated Memory-Guided Multi-scale spatio–temporal–spectral feature fusion network for unsupervised Internet of Things time series anomaly detection 门控记忆引导多尺度时空光谱特征融合网络无监督物联网时间序列异常检测
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-04-01 Epub Date: 2026-02-12 DOI: 10.1016/j.engappai.2026.114104
Peng You, Peng Chen, Xi Li, Ang Bian
Internet of Things (IoT) devices generate huge time series data during operation, crucial for device monitoring, fault prediction, and system security. However, these data often contain noise interference and exhibit complex spatio–temporal characteristics, posing significant challenges to anomaly detection. To address these challenges, this paper proposes an unsupervised anomaly detection model Gated Memory-guided Multi-scale spatio–temporal–spectral feature fusion network (GMMnet). GMMnet firstly leverages Positional Multi-scale Temporal-convolution and Multi-scale Spatio-spectral Self-attention to efficiently learn the temporal and spatio-spectral features of time series data, with an adaptive threshold filtering employed to mitigate high-frequency noise interference. By introducing Gated Memory-guided Fusion, GMMnet can accurately fuse the normal spatio–temporal–spectral features within the data, effectively guiding the model training process and significantly enhancing its generalization capability. Additionally, a Radial Basis Functions based Enhanced Reconstruction module is proposed to further improve GMMnet’s capability in detecting subtle anomalies. Extensive experiments on five publicly available IoT time series datasets demonstrate that the proposed method outperformed existing thirteen state-of-the-art baselines on nine metrics, with an average F1 score improvement of 17.72%.
物联网(IoT)设备在运行过程中会产生大量的时间序列数据,这对设备监控、故障预测和系统安全至关重要。然而,这些数据往往包含噪声干扰,并表现出复杂的时空特征,给异常检测带来了重大挑战。为了解决这些问题,本文提出了一种门控记忆引导多尺度时空光谱特征融合网络(GMMnet)的无监督异常检测模型。GMMnet首先利用位置多尺度时间卷积和多尺度空间光谱自关注来有效学习时间序列数据的时间和空间光谱特征,并采用自适应阈值滤波来减轻高频噪声干扰。通过引入门控记忆引导融合(Gated Memory-guided Fusion), GMMnet可以准确融合数据中正常的时空谱特征,有效指导模型训练过程,显著增强模型的泛化能力。此外,提出了基于径向基函数的增强重构模块,进一步提高了GMMnet检测细微异常的能力。在5个公开可用的物联网时间序列数据集上进行的大量实验表明,所提出的方法在9个指标上优于现有的13个最先进的基线,平均F1分数提高了17.72%。
{"title":"Gated Memory-Guided Multi-scale spatio–temporal–spectral feature fusion network for unsupervised Internet of Things time series anomaly detection","authors":"Peng You,&nbsp;Peng Chen,&nbsp;Xi Li,&nbsp;Ang Bian","doi":"10.1016/j.engappai.2026.114104","DOIUrl":"10.1016/j.engappai.2026.114104","url":null,"abstract":"<div><div>Internet of Things (IoT) devices generate huge time series data during operation, crucial for device monitoring, fault prediction, and system security. However, these data often contain noise interference and exhibit complex spatio–temporal characteristics, posing significant challenges to anomaly detection. To address these challenges, this paper proposes an unsupervised anomaly detection model <strong>G</strong>ated <strong>M</strong>emory-guided <strong>M</strong>ulti-scale spatio–temporal–spectral feature fusion <strong>net</strong>work (<strong>GMMnet</strong>). GMMnet firstly leverages Positional Multi-scale Temporal-convolution and Multi-scale Spatio-spectral Self-attention to efficiently learn the temporal and spatio-spectral features of time series data, with an adaptive threshold filtering employed to mitigate high-frequency noise interference. By introducing Gated Memory-guided Fusion, GMMnet can accurately fuse the normal spatio–temporal–spectral features within the data, effectively guiding the model training process and significantly enhancing its generalization capability. Additionally, a Radial Basis Functions based Enhanced Reconstruction module is proposed to further improve GMMnet’s capability in detecting subtle anomalies. Extensive experiments on five publicly available IoT time series datasets demonstrate that the proposed method outperformed existing thirteen state-of-the-art baselines on nine metrics, with an average F1 score improvement of 17.72%.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114104"},"PeriodicalIF":8.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting stress in two-phase random materials and super-resolution method for stress images by embedding physical information 两相随机材料应力预测及嵌入物理信息的应力图像超分辨方法
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-04-01 Epub Date: 2026-02-07 DOI: 10.1016/j.engappai.2026.114087
Tengfei Xing , Xiaodan Ren , Jie Li
Stress analysis is essential in material design. In materials with complex microstructures, such as two-phase random materials (TRMs), failure is typically associated with stress concentration at phase interfaces that govern mechanical performance. Existing Image Super-Resolution (ISR) methods are mainly data-driven and rely on supervised learning, where the achievable magnification of stress images is tightly constrained by the resolution of the training dataset. This limits their applicability for TRMs, where high-resolution stress information at phase interfaces is essential but often unavailable. In practical engineering, limited pixel resolution in microstructure images further constrains stress image clarity and hinders the observation of stress concentration regions. To address this research gap, we propose a stress prediction framework tailored for TRMs that combines microstructural information with physics-based constraints. First, the framework employs a Multiple Compositions U-net (MC U-net) to predict stress in low-resolution material microstructures. By incorporating phase interface information, the MC U-net effectively reduces prediction errors at phase interfaces. Secondly, a Mixed Physics-Informed Neural Network (MPINN)-based stress ISR method (SRMPINN) is introduced. Unlike conventional ISR methods, SRMPINN leverages physical constraints to achieve stress image super-resolution without requiring paired high-resolution training images, enabling stress images to be generated at substantially high magnification factors, including non-integer scales, with magnification ratios not restricted by the training dataset. Finally, transfer learning is applied to perform stress analysis on TRMs with different loading states and anisotropy. The results demonstrate that the proposed framework achieves high accuracy, generalization, and flexibility, particularly in resolving stress concentrations at phase interfaces.
应力分析在材料设计中是必不可少的。在具有复杂微观结构的材料中,如两相随机材料(TRMs),失效通常与控制力学性能的相界面应力集中有关。现有的图像超分辨率(ISR)方法主要是数据驱动的,依赖于监督学习,其中应力图像的可实现放大程度受到训练数据集分辨率的严格限制。这限制了它们对trm的适用性,在trm中,相位界面的高分辨率应力信息是必不可少的,但通常不可用。在实际工程中,微观结构图像像素分辨率的限制进一步限制了应力图像的清晰度,阻碍了对应力集中区域的观察。为了解决这一研究缺口,我们提出了一种针对trm的应力预测框架,该框架将微观结构信息与基于物理的约束相结合。首先,该框架采用多组分U-net (MC U-net)来预测低分辨率材料微结构中的应力。MC - U-net通过引入相界面信息,有效地降低了相界面处的预测误差。其次,介绍了一种基于混合物理信息神经网络的应力ISR方法(SRMPINN)。与传统的ISR方法不同,SRMPINN利用物理约束来实现应力图像的超分辨率,而不需要配对的高分辨率训练图像,使应力图像能够以非常高的放大因子生成,包括非整数尺度,放大倍率不受训练数据集的限制。最后,应用迁移学习对具有不同加载状态和各向异性的trm进行应力分析。结果表明,该框架具有较高的精度、通用性和灵活性,特别是在解决相界面应力集中方面。
{"title":"Predicting stress in two-phase random materials and super-resolution method for stress images by embedding physical information","authors":"Tengfei Xing ,&nbsp;Xiaodan Ren ,&nbsp;Jie Li","doi":"10.1016/j.engappai.2026.114087","DOIUrl":"10.1016/j.engappai.2026.114087","url":null,"abstract":"<div><div>Stress analysis is essential in material design. In materials with complex microstructures, such as two-phase random materials (TRMs), failure is typically associated with stress concentration at phase interfaces that govern mechanical performance. Existing Image Super-Resolution (ISR) methods are mainly data-driven and rely on supervised learning, where the achievable magnification of stress images is tightly constrained by the resolution of the training dataset. This limits their applicability for TRMs, where high-resolution stress information at phase interfaces is essential but often unavailable. In practical engineering, limited pixel resolution in microstructure images further constrains stress image clarity and hinders the observation of stress concentration regions. To address this research gap, we propose a stress prediction framework tailored for TRMs that combines microstructural information with physics-based constraints. First, the framework employs a Multiple Compositions U-net (MC U-net) to predict stress in low-resolution material microstructures. By incorporating phase interface information, the MC U-net effectively reduces prediction errors at phase interfaces. Secondly, a Mixed Physics-Informed Neural Network (MPINN)-based stress ISR method (SRMPINN) is introduced. Unlike conventional ISR methods, SRMPINN leverages physical constraints to achieve stress image super-resolution without requiring paired high-resolution training images, enabling stress images to be generated at substantially high magnification factors, including non-integer scales, with magnification ratios not restricted by the training dataset. Finally, transfer learning is applied to perform stress analysis on TRMs with different loading states and anisotropy. The results demonstrate that the proposed framework achieves high accuracy, generalization, and flexibility, particularly in resolving stress concentrations at phase interfaces.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114087"},"PeriodicalIF":8.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced recognition of low-discernibility Railway Sleeper Serial Numbers via dual-stage adaptive image enhancement and position prior-guided detection 基于双级自适应图像增强和位置先验引导检测的铁路轨枕序列号识别方法
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-04-01 Epub Date: 2026-02-10 DOI: 10.1016/j.engappai.2026.114110
Peng Shi , Jingjing Guo , Lu Deng , Yingkai Liu , Lizhi Long , Shaopeng Xu
Automatic recognition of Railway Sleeper Serial Numbers (RSSNs) is essential for traceability, quality management, and lifecycle maintenance of railway infrastructure. In practice, embossed serial numbers on concrete surfaces exhibit extremely low discernibility with minimal height variations (<1 mm) and negligible color differentiation. Traditional Red-Green-Blue-based (RGB-based) image enhancement and Optical Character Recognition (OCR) methods face a fundamental limitation: they cannot directly capture the three-dimensional geometric features distinguishing embossed characters from their surroundings. To address this challenge, this study proposes an integrated framework based on line laser height imaging and position prior-guided detection with three key innovations: (1) a cascaded processing framework leverages geometric height information to overcome RGB-based method limitations; (2) a Dual-stage Adaptive Image Enhancement (DAIE) strategy converts 16 binary-digit (bit) height images into optimized 8-bit visualizations by systematically selecting optimal methods: modulo truncation for global structure and Minimum-Maximum (Min-Max) normalization for local detail enhancement; and (3) a Position Prior-guided Spatial Attention (PPSA) Feature Pyramid Network (FPN) integrates statistically-derived position priors to enhance small target detection. Comprehensive validation on 2234 images demonstrates superior performance: 98.2% F1-score and 99.38% recognition accuracy at 27 Frames Per Second (FPS), achieving 2.4% improvement over state-of-the-art methods. Ablation experiments confirm the individual contributions of the PPSA module (4.0%), the Small Target Enhancement (STE) module (1.4%), and the DAIE strategy (3.08%). Field testing in a prefabricated factory validates industrial applicability, providing a scalable technical framework and valuable reference for low-discernibility embossed industrial character recognition. Code is publicly available at https://github.com/shipeng38/RSSN-recognition.
铁路轨枕序列号的自动识别对铁路基础设施的可追溯性、质量管理和生命周期维护至关重要。在实践中,混凝土表面上的浮雕序列号表现出极低的可辨性,高度变化最小(1毫米),颜色差异可以忽略不计。传统的基于红-绿-蓝(rgb)的图像增强和光学字符识别(OCR)方法面临着一个根本性的局限性:它们不能直接捕获将浮雕字符与周围环境区分开来的三维几何特征。为了解决这一挑战,本研究提出了一个基于线激光高度成像和位置先验制导检测的集成框架,其中有三个关键创新:(1)级联处理框架利用几何高度信息克服基于rgb方法的局限性;(2)双阶段自适应图像增强(DAIE)策略通过系统选择最优方法将16位二进制(bit)高度图像转换为优化的8位可视化图像:模截断用于全局结构,最小-最大(Min-Max)归一化用于局部细节增强;(3)位置先验引导的空间注意(PPSA)特征金字塔网络(FPN)集成了统计导出的位置先验,增强了小目标检测。对2234张图像的综合验证显示了卓越的性能:在27帧每秒(FPS)下,f1得分为98.2%,识别准确率为99.38%,比最先进的方法提高了2.4%。烧蚀实验证实了PPSA模块(4.0%)、小目标增强(STE)模块(1.4%)和DAIE策略(3.08%)的个人贡献。在预制工厂的现场测试验证了工业适用性,为低可辨度压花工业字符识别提供了可扩展的技术框架和有价值的参考。代码可在https://github.com/shipeng38/RSSN-recognition上公开获取。
{"title":"Enhanced recognition of low-discernibility Railway Sleeper Serial Numbers via dual-stage adaptive image enhancement and position prior-guided detection","authors":"Peng Shi ,&nbsp;Jingjing Guo ,&nbsp;Lu Deng ,&nbsp;Yingkai Liu ,&nbsp;Lizhi Long ,&nbsp;Shaopeng Xu","doi":"10.1016/j.engappai.2026.114110","DOIUrl":"10.1016/j.engappai.2026.114110","url":null,"abstract":"<div><div>Automatic recognition of Railway Sleeper Serial Numbers (RSSNs) is essential for traceability, quality management, and lifecycle maintenance of railway infrastructure. In practice, embossed serial numbers on concrete surfaces exhibit extremely low discernibility with minimal height variations (&lt;1 mm) and negligible color differentiation. Traditional Red-Green-Blue-based (RGB-based) image enhancement and Optical Character Recognition (OCR) methods face a fundamental limitation: they cannot directly capture the three-dimensional geometric features distinguishing embossed characters from their surroundings. To address this challenge, this study proposes an integrated framework based on line laser height imaging and position prior-guided detection with three key innovations: (1) a cascaded processing framework leverages geometric height information to overcome RGB-based method limitations; (2) a Dual-stage Adaptive Image Enhancement (DAIE) strategy converts 16 binary-digit (bit) height images into optimized 8-bit visualizations by systematically selecting optimal methods: modulo truncation for global structure and Minimum-Maximum (Min-Max) normalization for local detail enhancement; and (3) a Position Prior-guided Spatial Attention (PPSA) Feature Pyramid Network (FPN) integrates statistically-derived position priors to enhance small target detection. Comprehensive validation on 2234 images demonstrates superior performance: 98.2% F1-score and 99.38% recognition accuracy at 27 Frames Per Second (FPS), achieving 2.4% improvement over state-of-the-art methods. Ablation experiments confirm the individual contributions of the PPSA module (4.0%), the Small Target Enhancement (STE) module (1.4%), and the DAIE strategy (3.08%). Field testing in a prefabricated factory validates industrial applicability, providing a scalable technical framework and valuable reference for low-discernibility embossed industrial character recognition. Code is publicly available at <span><span>https://github.com/shipeng38/RSSN-recognition</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114110"},"PeriodicalIF":8.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Few-shot transfer learning for laser welding prediction 激光焊接预测的少射次迁移学习
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-04-01 Epub Date: 2026-02-11 DOI: 10.1016/j.engappai.2026.113983
Luchen Wu , Shijie Wu , Hongxin Hu , Hao Sun , Shuang Ma , Zhenya Wang
Laser wire filling welding is a key joining technique in the manufacturing of aluminum battery packs for new energy vehicles, yet predictive modeling of melt pool geometry remains limited by scarce experimental data. A two-stage transfer learning framework is developed by integrating multiphysics numerical simulation, data augmentation, and Bayesian neural network (BNN). High-fidelity multiphysics simulations within the experimental process window are generated to expand parameter space coverage and to provide physics-informed data for model pretraining. Limited experimental samples are augmented using a Wasserstein Generative Adversarial Network with gradient penalty applied to laser power and wire feed speed. Gaussian perturbations on travel speed are introduced to represent measurement uncertainties. A shallow BNN is pretrained on simulated samples and fine-tuned on the augmented experimental dataset using physics-consistent regularization and partial layer-freezing strategies. The augmentation strategy is evaluated through leave-one-out cross-validation on eight experimental samples, and generalization is examined using a separate test under previously unobserved travel-speed conditions. After inverse normalization, the framework achieves root mean square errors of 0.027 mm for melt pool depth and 0.025 mm for width, with coefficients of determination of 0.788 and 0.741, respectively. Uncertainty-aware quantitative analysis based on Sobol sensitivity indices and reliability assessment is conducted after model validation to characterize dominant parameter influences and to identify high-confidence process windows under limited data conditions. The proposed framework provides a general simulation-informed and uncertainty-aware learning strategy for manufacturing processes with severely limited experimental data.
激光填丝焊接是新能源汽车铝电池组的关键连接技术,但由于实验数据的缺乏,熔池几何形状的预测建模仍然受到限制。通过集成多物理场数值模拟、数据增强和贝叶斯神经网络(BNN),构建了一个两阶段迁移学习框架。在实验过程窗口内生成高保真的多物理场模拟,以扩大参数空间覆盖范围,并为模型预训练提供物理信息数据。使用Wasserstein生成对抗网络增强有限的实验样本,并对激光功率和送丝速度施加梯度惩罚。引入高斯摄动对行进速度的影响来表示测量的不确定性。浅层BNN在模拟样本上进行预训练,并使用物理一致正则化和部分层冻结策略对增强实验数据集进行微调。通过对8个实验样本的留一交叉验证来评估增强策略,并在先前未观察到的行驶速度条件下使用单独的测试来检验泛化。经反归一化后,框架对熔池深度和宽度的均方根误差分别为0.027 mm和0.025 mm,决定系数分别为0.788和0.741。在模型验证后,进行基于Sobol敏感性指标和可靠性评估的不确定性感知定量分析,以表征主导参数的影响,并在有限数据条件下识别高置信度的过程窗口。该框架为实验数据严重受限的制造过程提供了一种通用的仿真信息和不确定性感知学习策略。
{"title":"Few-shot transfer learning for laser welding prediction","authors":"Luchen Wu ,&nbsp;Shijie Wu ,&nbsp;Hongxin Hu ,&nbsp;Hao Sun ,&nbsp;Shuang Ma ,&nbsp;Zhenya Wang","doi":"10.1016/j.engappai.2026.113983","DOIUrl":"10.1016/j.engappai.2026.113983","url":null,"abstract":"<div><div>Laser wire filling welding is a key joining technique in the manufacturing of aluminum battery packs for new energy vehicles, yet predictive modeling of melt pool geometry remains limited by scarce experimental data. A two-stage transfer learning framework is developed by integrating multiphysics numerical simulation, data augmentation, and Bayesian neural network (BNN). High-fidelity multiphysics simulations within the experimental process window are generated to expand parameter space coverage and to provide physics-informed data for model pretraining. Limited experimental samples are augmented using a Wasserstein Generative Adversarial Network with gradient penalty applied to laser power and wire feed speed. Gaussian perturbations on travel speed are introduced to represent measurement uncertainties. A shallow BNN is pretrained on simulated samples and fine-tuned on the augmented experimental dataset using physics-consistent regularization and partial layer-freezing strategies. The augmentation strategy is evaluated through leave-one-out cross-validation on eight experimental samples, and generalization is examined using a separate test under previously unobserved travel-speed conditions. After inverse normalization, the framework achieves root mean square errors of 0.027 mm for melt pool depth and 0.025 mm for width, with coefficients of determination of 0.788 and 0.741, respectively. Uncertainty-aware quantitative analysis based on Sobol sensitivity indices and reliability assessment is conducted after model validation to characterize dominant parameter influences and to identify high-confidence process windows under limited data conditions. The proposed framework provides a general simulation-informed and uncertainty-aware learning strategy for manufacturing processes with severely limited experimental data.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 113983"},"PeriodicalIF":8.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hesitant fuzzy three-way decision-making for large-scale data based on a new distance measure and behavioral theory 基于新距离测度和行为理论的大数据犹豫模糊三向决策
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-04-01 Epub Date: 2026-02-09 DOI: 10.1016/j.engappai.2026.114105
Jing Li , Haidong Zhang , Zhuoma Dawa , Yanping He
In highly uncertain real-world environments, making robust decisions amid incomplete information and the cognitive biases of decision-makers remain critical challenges in project management and other complex system decision-making-related domains. Although the three-way decision-making (3WD) method based on hesitant fuzzy (HF) environments provides an effective approach for managing uncertainty, the current research still has shortcomings in several key areas. On the one hand, the distance formulas that are commonly used for computing hesitant fuzzy elements (HFEs) generally suffer from insufficient sensitivity to the information captured by the score function and weak discriminative power. On the other hand, the process of determining loss functions is subjective and fails to consider the behavioral psychological factors of decision-makers, making it difficult to reflect the cognitive characteristics of humans during actual decision-making processes. These issues collectively limit the adaptability and practicality of the existing decision-making methods in complex real-world environments. To address the aforementioned issues, this study is aimed at constructing an HF 3WD framework that possesses both cognitive rationality and computational robustness. To this end, the core contributions of this work are as follows. First, a novel HF distance measure is developed, significantly improving the ability to distinguish fuzzy information differences. Second, a novel o-dominance relation is introduced, and the conditional probability is calculated using a data-driven approach, eliminating the reliance on expert scoring and thereby improving the objectivity and accuracy of the conditional probability. Finally, an objective loss function is established, effectively capturing the decision-maker’s nonlinear value perceptions and comparative psychology in gain and loss scenarios. Furthermore, comparative experiments and parameter analyses are conducted in big data scenarios to validate the fact that the proposed method outperforms the existing methods in terms of classification accuracy and decision stability, demonstrating superior effectiveness and robustness. We believe that by simulating human judgments made under uncertainty, this method opens up new avenues for implementing artificial intelligence-based decision-making systems in high-risk scenarios.
在高度不确定的现实世界环境中,在不完整的信息和决策者的认知偏差中做出稳健的决策仍然是项目管理和其他复杂系统决策相关领域的关键挑战。基于犹豫模糊(HF)环境的三向决策(3WD)方法为管理不确定性提供了一种有效的方法,但目前的研究在几个关键领域仍存在不足。一方面,通常用于计算犹豫模糊元素(hfe)的距离公式对分数函数捕获的信息灵敏度不足,判别能力较弱。另一方面,损失函数的确定过程是主观的,没有考虑决策者的行为心理因素,难以反映人在实际决策过程中的认知特征。这些问题共同限制了现有决策方法在复杂现实环境中的适应性和实用性。为了解决上述问题,本研究旨在构建一个既具有认知合理性又具有计算鲁棒性的高频3WD框架。为此,本工作的核心贡献如下:首先,提出了一种新的高频距离测度,显著提高了模糊信息差异的识别能力。其次,引入了一种新的o-优势关系,采用数据驱动的方法计算条件概率,消除了对专家评分的依赖,提高了条件概率的客观性和准确性;最后,建立了目标损失函数,有效捕捉了决策者在得失情景下的非线性价值感知和比较心理。并在大数据场景下进行对比实验和参数分析,验证了所提方法在分类精度和决策稳定性方面优于现有方法,显示出优越的有效性和鲁棒性。我们相信,通过模拟人类在不确定性下做出的判断,该方法为在高风险场景中实施基于人工智能的决策系统开辟了新的途径。
{"title":"Hesitant fuzzy three-way decision-making for large-scale data based on a new distance measure and behavioral theory","authors":"Jing Li ,&nbsp;Haidong Zhang ,&nbsp;Zhuoma Dawa ,&nbsp;Yanping He","doi":"10.1016/j.engappai.2026.114105","DOIUrl":"10.1016/j.engappai.2026.114105","url":null,"abstract":"<div><div>In highly uncertain real-world environments, making robust decisions amid incomplete information and the cognitive biases of decision-makers remain critical challenges in project management and other complex system decision-making-related domains. Although the three-way decision-making (3WD) method based on hesitant fuzzy (HF) environments provides an effective approach for managing uncertainty, the current research still has shortcomings in several key areas. On the one hand, the distance formulas that are commonly used for computing hesitant fuzzy elements (HFEs) generally suffer from insufficient sensitivity to the information captured by the score function and weak discriminative power. On the other hand, the process of determining loss functions is subjective and fails to consider the behavioral psychological factors of decision-makers, making it difficult to reflect the cognitive characteristics of humans during actual decision-making processes. These issues collectively limit the adaptability and practicality of the existing decision-making methods in complex real-world environments. To address the aforementioned issues, this study is aimed at constructing an HF 3WD framework that possesses both cognitive rationality and computational robustness. To this end, the core contributions of this work are as follows. First, a novel HF distance measure is developed, significantly improving the ability to distinguish fuzzy information differences. Second, a novel <span><math><mi>o</mi></math></span>-dominance relation is introduced, and the conditional probability is calculated using a data-driven approach, eliminating the reliance on expert scoring and thereby improving the objectivity and accuracy of the conditional probability. Finally, an objective loss function is established, effectively capturing the decision-maker’s nonlinear value perceptions and comparative psychology in gain and loss scenarios. Furthermore, comparative experiments and parameter analyses are conducted in big data scenarios to validate the fact that the proposed method outperforms the existing methods in terms of classification accuracy and decision stability, demonstrating superior effectiveness and robustness. We believe that by simulating human judgments made under uncertainty, this method opens up new avenues for implementing artificial intelligence-based decision-making systems in high-risk scenarios.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114105"},"PeriodicalIF":8.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessment and evaluation of machine learning algorithms for recommendation system of E-commerce based on bipolar fuzzy- method based on the removal effects of criteria-elimination and choice translating Reality-I approach 基于标准消除和选择翻译现实- i方法去除效果的双极模糊方法的电子商务推荐系统机器学习算法的评估与评价
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-04-01 Epub Date: 2026-02-09 DOI: 10.1016/j.engappai.2026.114107
Muhammad Arsal, Ubaid ur Rehman, Abaid ur Rehman Virk
E-commerce websites are increasingly depending on machine learning (ML) algorithms to improve their recommendation systems to deliver personalized user experiences and enhance customer satisfaction. ML models provide useful solutions, but choosing the best ML model is still difficult, especially for e-commerce platforms' recommendation systems. This selection challenge involves dual aspects (bipolarity), which means that both the positive and negative aspects of the evaluation criteria for ML models must be taken into consideration. It is a multi-criteria decision-making (MCDM) problem with uncertainty. However, previous research has failed to consider the bipolar character of these model selection criteria. Finding the right weights for the evaluation criteria also becomes important in MCDM, particularly when handling the bipolarity and uncertainty present in actual e-commerce situations. To overcome this limitation, we introduce a novel bipolar fuzzy (BF)-method based on the removal effects of criteria (MEREC)-elimination and choice translating reality (ELECTRE-I) methodology that combines the method based on the removal effects of criteria (MEREC) method's objective weighting capability with the elimination and choice translating reality (ELECTRE-I) technique's decision-ranking power. The methodology is used in a real-world case study that focuses on choosing the best ML model for an e-commerce platform's recommendation system. The proposed structure provides a strong decision-support tool that tackles the difficulties associated with choosing an ML algorithm and captures the dual nature of evaluation criteria. Comparative findings show how well the BF-MEREC-ELECTRE-I method works to provide data-driven, interpretable, and useful recommendations for e-commerce applications.
电子商务网站越来越依赖于机器学习(ML)算法来改进他们的推荐系统,以提供个性化的用户体验并提高客户满意度。ML模型提供了有用的解决方案,但选择最好的ML模型仍然很困难,特别是对于电子商务平台的推荐系统。这种选择挑战涉及双重方面(双极性),这意味着必须考虑ML模型评估标准的积极和消极方面。这是一个具有不确定性的多准则决策问题。然而,以往的研究未能考虑到这些模型选择标准的双极性特征。在MCDM中,为评估标准找到正确的权重也变得很重要,特别是在处理实际电子商务情况中存在的两极化和不确定性时。为了克服这一局限性,我们引入了一种新的基于标准去除效果的双极模糊(BF)方法(MEREC)-消除和选择翻译现实(electrei)方法,该方法将基于标准去除效果(MEREC)方法的客观加权能力与基于消除和选择翻译现实(electrei)技术的决策排序能力相结合。该方法被用于一个现实世界的案例研究,重点是为电子商务平台的推荐系统选择最佳的机器学习模型。所提出的结构提供了一个强大的决策支持工具,解决了与选择ML算法相关的困难,并捕获了评估标准的双重性质。对比结果显示BF-MEREC-ELECTRE-I方法在为电子商务应用程序提供数据驱动的、可解释的和有用的建议方面是多么有效。
{"title":"Assessment and evaluation of machine learning algorithms for recommendation system of E-commerce based on bipolar fuzzy- method based on the removal effects of criteria-elimination and choice translating Reality-I approach","authors":"Muhammad Arsal,&nbsp;Ubaid ur Rehman,&nbsp;Abaid ur Rehman Virk","doi":"10.1016/j.engappai.2026.114107","DOIUrl":"10.1016/j.engappai.2026.114107","url":null,"abstract":"<div><div>E-commerce websites are increasingly depending on machine learning (ML) algorithms to improve their recommendation systems to deliver personalized user experiences and enhance customer satisfaction. ML models provide useful solutions, but choosing the best ML model is still difficult, especially for e-commerce platforms' recommendation systems. This selection challenge involves dual aspects (bipolarity), which means that both the positive and negative aspects of the evaluation criteria for ML models must be taken into consideration. It is a multi-criteria decision-making (MCDM) problem with uncertainty. However, previous research has failed to consider the bipolar character of these model selection criteria. Finding the right weights for the evaluation criteria also becomes important in MCDM, particularly when handling the bipolarity and uncertainty present in actual e-commerce situations. To overcome this limitation, we introduce a novel bipolar fuzzy (BF)-method based on the removal effects of criteria (MEREC)-elimination and choice translating reality (ELECTRE-I) methodology that combines the method based on the removal effects of criteria (MEREC) method's objective weighting capability with the elimination and choice translating reality (ELECTRE-I) technique's decision-ranking power. The methodology is used in a real-world case study that focuses on choosing the best ML model for an e-commerce platform's recommendation system. The proposed structure provides a strong decision-support tool that tackles the difficulties associated with choosing an ML algorithm and captures the dual nature of evaluation criteria. Comparative findings show how well the BF-MEREC-ELECTRE-I method works to provide data-driven, interpretable, and useful recommendations for e-commerce applications.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114107"},"PeriodicalIF":8.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Biologically inspired vision fusion: Central-peripheral synergy for medical image classification 生物学启发的视觉融合:医学图像分类的中枢-外周协同作用
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-04-01 Epub Date: 2026-02-06 DOI: 10.1016/j.engappai.2026.114026
Rui Lu , Long Yu , Shengwei Tian , Yukun Xiao
The synergy between foveal and peripheral processing is fundamental to the efficiency of biological vision. While hybrid Convolutional Neural Network (CNN)-Transformer architectures aim to capture both local and global features, they often rely on static, predefined structures that struggle to dynamically align information and adaptively allocate computational resources, ultimately limiting their performance. To address this limitation, we introduce the Central-Peripheral Vision Transformer (CPVT), a novel architecture that explicitly and hierarchically mimics this biological dichotomy. CPVT employs fine-grained, convolutionally modulated attention in its shallow layers to emulate foveal vision, while seamlessly transitioning to a coarse-grained, global attention mechanism in deeper layers to emulate peripheral vision. This design is enhanced by two specialized Feed-Forward Networks that facilitate synergistic information interaction. Rigorously validated on diverse medical imaging benchmarks, CPVT achieves state-of-the-art performance, attaining classification accuracies of 87.98% on the International Skin Imaging Collaboration (ISIC) 2018 challenge dataset and 90.41% on the Kvasir dataset. These results demonstrate that an adaptive, hierarchical integration of biological vision principles can significantly enhance machine perception for medical image analysis.
中央凹和外周处理之间的协同作用是生物视觉效率的基础。虽然混合卷积神经网络(CNN)-Transformer架构旨在捕获局部和全局特征,但它们通常依赖于静态的预定义结构,这些结构难以动态对齐信息并自适应地分配计算资源,最终限制了它们的性能。为了解决这一限制,我们引入了中央-周边视觉转换器(CPVT),这是一种明确地、分层地模仿这种生物二分法的新架构。CPVT在其浅层中采用细粒度、卷积调制的注意力来模拟中央凹视觉,而在更深层中无缝过渡到粗粒度、全局注意力机制来模拟周边视觉。该设计通过两个专门的前馈网络来增强,以促进协同信息交互。经过各种医学成像基准的严格验证,CPVT达到了最先进的性能,在国际皮肤成像协作(ISIC) 2018挑战数据集上的分类准确率为87.98%,在Kvasir数据集上的分类准确率为90.41%。这些结果表明,生物视觉原理的自适应分层集成可以显著增强医学图像分析的机器感知。
{"title":"Biologically inspired vision fusion: Central-peripheral synergy for medical image classification","authors":"Rui Lu ,&nbsp;Long Yu ,&nbsp;Shengwei Tian ,&nbsp;Yukun Xiao","doi":"10.1016/j.engappai.2026.114026","DOIUrl":"10.1016/j.engappai.2026.114026","url":null,"abstract":"<div><div>The synergy between foveal and peripheral processing is fundamental to the efficiency of biological vision. While hybrid Convolutional Neural Network (CNN)-Transformer architectures aim to capture both local and global features, they often rely on static, predefined structures that struggle to dynamically align information and adaptively allocate computational resources, ultimately limiting their performance. To address this limitation, we introduce the Central-Peripheral Vision Transformer (CPVT), a novel architecture that explicitly and hierarchically mimics this biological dichotomy. CPVT employs fine-grained, convolutionally modulated attention in its shallow layers to emulate foveal vision, while seamlessly transitioning to a coarse-grained, global attention mechanism in deeper layers to emulate peripheral vision. This design is enhanced by two specialized Feed-Forward Networks that facilitate synergistic information interaction. Rigorously validated on diverse medical imaging benchmarks, CPVT achieves state-of-the-art performance, attaining classification accuracies of 87.98% on the International Skin Imaging Collaboration (ISIC) 2018 challenge dataset and 90.41% on the Kvasir dataset. These results demonstrate that an adaptive, hierarchical integration of biological vision principles can significantly enhance machine perception for medical image analysis.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114026"},"PeriodicalIF":8.0,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A sustainable multi-objective economic production quantity model with learning–forgetting and green technology investment 具有学习-遗忘和绿色技术投资的可持续多目标经济产量模型
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-03-15 Epub Date: 2026-02-05 DOI: 10.1016/j.engappai.2026.114064
Tanmay Halder, Bijoy Krishna Debnath
The economic production quantity (EPQ) model enables production managers to determine optimal production lot sizes. Traditional EPQ models often overlook practical factors such as workforce learning and forgetting (LaF), imperfect production processes, and carbon emissions, which are increasingly critical for economically and environmentally conscious decision making. This study develops a multi-objective EPQ (MOEPQ) model that integrates LaF dynamics and green technology (GT) investment within a sustainable manufacturing framework. The model incorporates both non-rework and rework cycles, capturing realistic production behavior affected by labor efficiency variations and environmentally conscious investment decisions. The problem is formulated as a multi-objective optimization under a carbon cap-and-trade policy, aiming to minimize total cost and carbon emissions. To solve the problem, the non-dominated sorting genetic algorithm II (NSGA-II) and the strength pareto evolutionary algorithm 2 (SPEA2) are implemented with control parameters tuned using response surface methodology (RSM) to ensure convergence accuracy and solution diversity. Pareto fronts are generated, and statistical evaluation includes normality assessment using the Shapiro–Wilk test, followed by performance comparison through the Wilcoxon signed-rank test. The best solution is selected using the multi-criteria decision analysis (MCDA) approach through the technique for order preference by similarity to an ideal solution (TOPSIS). Numerical experiments demonstrate that integrating LaF and GT significantly reduces total cost and carbon emissions. Sensitivity analysis identifies the key economic and emission related parameters, offering practical insights for managerial decision making.
经济生产数量(EPQ)模型使生产管理者能够确定最优的生产批量。传统的EPQ模型往往忽略了劳动力学习和遗忘(LaF)、不完善的生产过程和碳排放等实际因素,而这些因素对经济和环境意识的决策越来越重要。本研究开发了一个多目标EPQ (MOEPQ)模型,该模型在可持续制造框架内整合了LaF动力学和绿色技术(GT)投资。该模型结合了非返工和返工周期,捕捉了受劳动效率变化和环境意识投资决策影响的现实生产行为。将该问题表述为碳限额与交易政策下的多目标优化问题,其目标是使总成本和碳排放量最小化。为了解决这一问题,采用非支配排序遗传算法II (NSGA-II)和强度pareto进化算法2 (SPEA2),并利用响应面方法(RSM)调整控制参数,以保证收敛精度和解的多样性。生成Pareto front,统计评价包括使用Shapiro-Wilk检验进行正态性评价,然后通过Wilcoxon sign -rank检验进行性能比较。使用多准则决策分析(MCDA)方法,通过与理想解的相似性排序偏好技术(TOPSIS)来选择最佳解。数值实验表明,将LaF和GT相结合可以显著降低总成本和碳排放。敏感性分析确定了关键的经济和排放相关参数,为管理决策提供了实用的见解。
{"title":"A sustainable multi-objective economic production quantity model with learning–forgetting and green technology investment","authors":"Tanmay Halder,&nbsp;Bijoy Krishna Debnath","doi":"10.1016/j.engappai.2026.114064","DOIUrl":"10.1016/j.engappai.2026.114064","url":null,"abstract":"<div><div>The economic production quantity (EPQ) model enables production managers to determine optimal production lot sizes. Traditional EPQ models often overlook practical factors such as workforce learning and forgetting (LaF), imperfect production processes, and carbon emissions, which are increasingly critical for economically and environmentally conscious decision making. This study develops a multi-objective EPQ (MOEPQ) model that integrates LaF dynamics and green technology (GT) investment within a sustainable manufacturing framework. The model incorporates both non-rework and rework cycles, capturing realistic production behavior affected by labor efficiency variations and environmentally conscious investment decisions. The problem is formulated as a multi-objective optimization under a carbon cap-and-trade policy, aiming to minimize total cost and carbon emissions. To solve the problem, the non-dominated sorting genetic algorithm II (NSGA-II) and the strength pareto evolutionary algorithm 2 (SPEA2) are implemented with control parameters tuned using response surface methodology (RSM) to ensure convergence accuracy and solution diversity. Pareto fronts are generated, and statistical evaluation includes normality assessment using the Shapiro–Wilk test, followed by performance comparison through the Wilcoxon signed-rank test. The best solution is selected using the multi-criteria decision analysis (MCDA) approach through the technique for order preference by similarity to an ideal solution (TOPSIS). Numerical experiments demonstrate that integrating LaF and GT significantly reduces total cost and carbon emissions. Sensitivity analysis identifies the key economic and emission related parameters, offering practical insights for managerial decision making.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"168 ","pages":"Article 114064"},"PeriodicalIF":8.0,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Engineering Applications of Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1