首页 > 最新文献

Engineering Applications of Artificial Intelligence最新文献

英文 中文
Hybrid Inception-Transformer model for signals classification: The case of electrical faults in power transformers 信号分类的混合启变模型:以电力变压器电气故障为例
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-05 DOI: 10.1016/j.engappai.2026.114093
Elías Herrero Jaraba , Eduardo Martínez Carrasco , Anibal Antonio Prada Hurtado , María Teresa Villen Martínez , Guillermo Rios Gómez , David Hernando Polo , Julio David Buldain Pérez
This paper presents a hybrid deep learning model for fault detection in power transformers, addressing the limitations of conventional protection schemes under transient operating conditions. The proposed model, TransInception, integrates InceptionTime for efficient feature extraction in multivariate time series and Gated Transformer for capturing dependencies between variables. The architecture is modified by replacing the original gating mechanism with a linear double-layer output and removing a bottleneck layer responsible for handling temporal dependencies. The dataset used for training and testing was generated in a real-time digital simulation (RTDS) environment, consisting of an external grid, a delta-wye transformer, and a dynamic load. After training, the hybrid deep learning model was validated in a test grid specifically designed for this stage, where a parallel transformer configuration was implemented. This validation allowed for the evaluation of its performance in classifying internal, external, and no-fault conditions, as well as assessing cases of current transformer saturation. Additionally, sympathetic inrush conditions were studied to analyse the model’s response to interactions between power transformers. As future work, efforts will focus on improving the model’s adaptability to transient conditions and optimising its computational efficiency for deployment in substation protection systems.
本文提出了一种用于电力变压器故障检测的混合深度学习模型,解决了传统保护方案在暂态运行条件下的局限性。所提出的模型TransInception集成了用于在多变量时间序列中有效提取特征的InceptionTime和用于捕获变量之间依赖关系的门控变压器。通过将原有的门控机制替换为线性双层输出,并删除负责处理时间依赖性的瓶颈层,对体系结构进行了修改。用于训练和测试的数据集是在实时数字仿真(RTDS)环境中生成的,包括外部网格、三角向变压器和动态负载。训练后,混合深度学习模型在专门为该阶段设计的测试网格中进行验证,其中实现了并联变压器配置。该验证允许评估其在分类内部,外部和无故障条件下的性能,以及评估电流互感器饱和的情况。此外,研究了交感涌流条件,以分析模型对电力变压器相互作用的响应。在未来的工作中,将致力于提高模型对暂态条件的适应性,优化其在变电站保护系统中的部署计算效率。
{"title":"Hybrid Inception-Transformer model for signals classification: The case of electrical faults in power transformers","authors":"Elías Herrero Jaraba ,&nbsp;Eduardo Martínez Carrasco ,&nbsp;Anibal Antonio Prada Hurtado ,&nbsp;María Teresa Villen Martínez ,&nbsp;Guillermo Rios Gómez ,&nbsp;David Hernando Polo ,&nbsp;Julio David Buldain Pérez","doi":"10.1016/j.engappai.2026.114093","DOIUrl":"10.1016/j.engappai.2026.114093","url":null,"abstract":"<div><div>This paper presents a hybrid deep learning model for fault detection in power transformers, addressing the limitations of conventional protection schemes under transient operating conditions. The proposed model, TransInception, integrates InceptionTime for efficient feature extraction in multivariate time series and Gated Transformer for capturing dependencies between variables. The architecture is modified by replacing the original gating mechanism with a linear double-layer output and removing a bottleneck layer responsible for handling temporal dependencies. The dataset used for training and testing was generated in a real-time digital simulation (RTDS) environment, consisting of an external grid, a delta-wye transformer, and a dynamic load. After training, the hybrid deep learning model was validated in a test grid specifically designed for this stage, where a parallel transformer configuration was implemented. This validation allowed for the evaluation of its performance in classifying internal, external, and no-fault conditions, as well as assessing cases of current transformer saturation. Additionally, sympathetic inrush conditions were studied to analyse the model’s response to interactions between power transformers. As future work, efforts will focus on improving the model’s adaptability to transient conditions and optimising its computational efficiency for deployment in substation protection systems.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"168 ","pages":"Article 114093"},"PeriodicalIF":8.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical Region-Context Attention for image captioning 图像字幕的分层区域-上下文关注
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-05 DOI: 10.1016/j.engappai.2026.114014
Mohammad Alamgir Hossain , ZhongFu Ye , Md. Bipul Hossen , Md. Atiqur Rahman , Md Shohidul Islam , Md. Ibrahim Abdullah
Image captioning is a challenging task that requires a deep understanding of both visual and linguistic modalities to generate accurate and meaningful descriptions. Traditional methods often struggle to effectively integrate object-level and global scene features, leading to limited contextual awareness in generated captions. To address this, we propose a novel Hierarchical Region-Context Attention for Image Captioning framework that combines a Region-Context Attention Network for multi-scale visual feature fusion with a Hierarchical Attention-Based context encoding mechanism for refined representation learning. The Region-Context and Hierarchical Attention module extracts object-level features using Faster Region-based Convolutional Neural Network and global context features from Residual Networks, integrating them through a multi-head attention mechanism. This fusion enables localized object representations to be enriched with scene-level semantics. The fused visual features are further refined using a hierarchical attention-based approach, which employs both spatial and channel-wise attention to emphasize semantically relevant information across regions and dimensions. The decoder is implemented using a hierarchical Long Short-Term Memory network that generates captions in an autoregressive manner, leveraging the hierarchical attention-based refined features to guide each word prediction. This structure enables the model to maintain temporal coherence while dynamically attending to informative visual content. We evaluate our model on the Microsoft Common Objects in Context 2014 dataset, achieving a Bilingual Evaluation Understudy score of 40.0 and a Consensus-based Image Description Evaluation score of 132.5, surpassing state-of-the-art models. Results indicate that the model effectively captures object details and context, producing more coherent and accurate captions. The code for this project is publicly available at https://github.com/alamgirustc/HRcAIC.
图像字幕是一项具有挑战性的任务,需要对视觉和语言模式有深刻的理解,才能生成准确而有意义的描述。传统方法往往难以有效地整合对象级和全局场景特征,导致生成的字幕上下文意识有限。为了解决这个问题,我们提出了一种新的分层区域-上下文注意图像字幕框架,该框架结合了用于多尺度视觉特征融合的区域-上下文注意网络和用于精细表征学习的基于分层注意的上下文编码机制。区域-上下文和分层注意模块使用Faster基于区域的卷积神经网络和残差网络的全局上下文特征提取对象级特征,并通过多头注意机制将它们整合起来。这种融合使本地化的对象表示能够丰富场景级语义。融合的视觉特征使用基于分层注意力的方法进一步细化,该方法采用空间和通道智能注意力来强调跨区域和维度的语义相关信息。解码器使用分层长短期记忆网络实现,该网络以自回归的方式生成字幕,利用分层的基于注意力的精细特征来指导每个单词的预测。这种结构使模型能够在动态关注信息视觉内容的同时保持时间一致性。我们在Microsoft Common Objects in Context 2014数据集上评估了我们的模型,获得了40.0的双语评估替补分数和132.5的基于共识的图像描述评估分数,超过了最先进的模型。结果表明,该模型有效地捕获了目标细节和上下文,生成了更连贯和准确的标题。这个项目的代码可以在https://github.com/alamgirustc/HRcAIC上公开获得。
{"title":"Hierarchical Region-Context Attention for image captioning","authors":"Mohammad Alamgir Hossain ,&nbsp;ZhongFu Ye ,&nbsp;Md. Bipul Hossen ,&nbsp;Md. Atiqur Rahman ,&nbsp;Md Shohidul Islam ,&nbsp;Md. Ibrahim Abdullah","doi":"10.1016/j.engappai.2026.114014","DOIUrl":"10.1016/j.engappai.2026.114014","url":null,"abstract":"<div><div>Image captioning is a challenging task that requires a deep understanding of both visual and linguistic modalities to generate accurate and meaningful descriptions. Traditional methods often struggle to effectively integrate object-level and global scene features, leading to limited contextual awareness in generated captions. To address this, we propose a novel Hierarchical Region-Context Attention for Image Captioning framework that combines a Region-Context Attention Network for multi-scale visual feature fusion with a Hierarchical Attention-Based context encoding mechanism for refined representation learning. The Region-Context and Hierarchical Attention module extracts object-level features using Faster Region-based Convolutional Neural Network and global context features from Residual Networks, integrating them through a multi-head attention mechanism. This fusion enables localized object representations to be enriched with scene-level semantics. The fused visual features are further refined using a hierarchical attention-based approach, which employs both spatial and channel-wise attention to emphasize semantically relevant information across regions and dimensions. The decoder is implemented using a hierarchical Long Short-Term Memory network that generates captions in an autoregressive manner, leveraging the hierarchical attention-based refined features to guide each word prediction. This structure enables the model to maintain temporal coherence while dynamically attending to informative visual content. We evaluate our model on the Microsoft Common Objects in Context 2014 dataset, achieving a Bilingual Evaluation Understudy score of 40.0 and a Consensus-based Image Description Evaluation score of 132.5, surpassing state-of-the-art models. Results indicate that the model effectively captures object details and context, producing more coherent and accurate captions. The code for this project is publicly available at <span><span>https://github.com/alamgirustc/HRcAIC</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"168 ","pages":"Article 114014"},"PeriodicalIF":8.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advances in You Only Look Once (YOLO) algorithms for lane and object detection in autonomous vehicles 用于自动驾驶车辆车道和目标检测的YOLO算法的进展
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-05 DOI: 10.1016/j.engappai.2026.113893
Busuyi Omodaratan , Ali Jamali , Timothy Wiley , Ziad Al-Saadi , Rammohan Mallipeddi , Ehsan Asadi , Hoshyar Asadi , Rasoul Sadeghian , Sina Sareh , Hamid Khayyam
Ensuring the safety and efficiency of Autonomous Vehicles (AVs) necessitates highly accurate perception, especially for lane detection and lane-change manoeuvres. Among object detection frameworks, “You Only Look Once” (YOLO) algorithms have emerged as prominent contenders due to their rapid inference and commendable accuracy. However, the broad spectrum of YOLO variants and their applications in complex, real-world environments remain insufficiently mapped, necessitating a more integrative and critical perspective than what is typically offered by surveys. This comprehensive review synthesizes theoretical foundations, architectural innovations, and empirical evaluations of YOLO-based algorithms in AV-related tasks. It not only highlights key findings—such as the notable gains in real-time detection and adaptability to a range of driving conditions—but also explicitly identifies persistent gaps and limitations. These include difficulties in detecting subtle or degraded lane markings, handling unpredictable environmental factors like adverse weather and varied lighting, mitigating adversarial perturbations, and scaling effectively across diverse datasets and geographic regions. By critically examining these vulnerabilities, we illuminate the opportunities for refining YOLO's training paradigms, optimizing model architectures, incorporating sensor fusion, and fostering universally applicable datasets. The implications of addressing these gaps extend beyond mere technical refinements. Proactively tackling YOLO's current challenges can expedite the realization of safer, more robust, and globally adaptable AV navigation systems. In doing so, this review provides clear, actionable insights for researchers, engineers, and policymakers, guiding them toward strategic innovations that will strengthen AV perception and contribute to more reliable, future-ready transportation solutions.
确保自动驾驶汽车(av)的安全和效率需要高度准确的感知,特别是在车道检测和变道操作方面。在目标检测框架中,“你只看一次”(YOLO)算法由于其快速推理和令人称赞的准确性而成为突出的竞争者。然而,广泛的YOLO变体及其在复杂的现实环境中的应用仍然没有得到充分的映射,因此需要比调查通常提供的更综合和批判性的观点。本综述综合了基于yolo的自动驾驶相关任务算法的理论基础、架构创新和实证评估。它不仅突出了关键的发现——比如在实时检测和对一系列驾驶条件的适应性方面的显著进步——而且明确指出了持续存在的差距和局限性。这些困难包括检测细微或退化的车道标记,处理不可预测的环境因素,如恶劣天气和不同的照明,减轻对抗性扰动,以及在不同数据集和地理区域之间有效缩放。通过严格检查这些漏洞,我们阐明了改进YOLO训练范例、优化模型架构、整合传感器融合和培养普遍适用的数据集的机会。解决这些差距的意义超出了单纯的技术改进。积极应对YOLO目前面临的挑战,可以加快实现更安全、更强大、更适应全球的自动驾驶导航系统。在此过程中,本综述为研究人员、工程师和政策制定者提供了清晰、可行的见解,指导他们进行战略创新,以增强自动驾驶的认知,并为更可靠、面向未来的交通解决方案做出贡献。
{"title":"Advances in You Only Look Once (YOLO) algorithms for lane and object detection in autonomous vehicles","authors":"Busuyi Omodaratan ,&nbsp;Ali Jamali ,&nbsp;Timothy Wiley ,&nbsp;Ziad Al-Saadi ,&nbsp;Rammohan Mallipeddi ,&nbsp;Ehsan Asadi ,&nbsp;Hoshyar Asadi ,&nbsp;Rasoul Sadeghian ,&nbsp;Sina Sareh ,&nbsp;Hamid Khayyam","doi":"10.1016/j.engappai.2026.113893","DOIUrl":"10.1016/j.engappai.2026.113893","url":null,"abstract":"<div><div>Ensuring the safety and efficiency of Autonomous Vehicles (AVs) necessitates highly accurate perception, especially for lane detection and lane-change manoeuvres. Among object detection frameworks, “You Only Look Once” (YOLO) algorithms have emerged as prominent contenders due to their rapid inference and commendable accuracy. However, the broad spectrum of YOLO variants and their applications in complex, real-world environments remain insufficiently mapped, necessitating a more integrative and critical perspective than what is typically offered by surveys. This comprehensive review synthesizes theoretical foundations, architectural innovations, and empirical evaluations of YOLO-based algorithms in AV-related tasks. It not only highlights key findings—such as the notable gains in real-time detection and adaptability to a range of driving conditions—but also explicitly identifies persistent gaps and limitations. These include difficulties in detecting subtle or degraded lane markings, handling unpredictable environmental factors like adverse weather and varied lighting, mitigating adversarial perturbations, and scaling effectively across diverse datasets and geographic regions. By critically examining these vulnerabilities, we illuminate the opportunities for refining YOLO's training paradigms, optimizing model architectures, incorporating sensor fusion, and fostering universally applicable datasets. The implications of addressing these gaps extend beyond mere technical refinements. Proactively tackling YOLO's current challenges can expedite the realization of safer, more robust, and globally adaptable AV navigation systems. In doing so, this review provides clear, actionable insights for researchers, engineers, and policymakers, guiding them toward strategic innovations that will strengthen AV perception and contribute to more reliable, future-ready transportation solutions.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"168 ","pages":"Article 113893"},"PeriodicalIF":8.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-shape enhancement pyramid network for real-time semantic segmentation 多形状增强金字塔网络实时语义分割
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-05 DOI: 10.1016/j.engappai.2026.114059
Zhiyong Chen , Zhao Yang , Peng Liu , Feng Wang , Yamei Dou
Based on powerful convolutional neural networks (CNNs) and complex model structures, semantic segmentation achieves good segmentation accuracy, but its slow inference speed limits its use in practical applications, such as autonomous driving and medical diagnosis. Thus, real-time semantic segmentation receives increasing attention. However, most existing real-time semantic segmentation methods improve inference speed while significantly sacrificing segmentation precision. Striking a well balance between inference speed and precision remains a major issue in real-time semantic segmentation. To address this issue, we propose a real-time semantic segmentation network, the Multi-Shape Enhancement Pyramid Network (MSEPNet). First, we propose an efficient spatial inverted residual (ESIR) module to effectively extract multi-scale spatial information. Next, to capture multi-scale semantic information while maintaining efficient inference speed, we introduce an efficient contextual residual (ECR) module. Finally, we present the multi-shape enhancement pyramid (MSEP) module to capture multi-scale and multi-shape contextual information. The proposed MSEPNet achieves competitive results on street scene datasets. Specifically, with only 1.04 million (1.04M) parameters, it achieves the accuracy of 76.7% and 72.5% mean Intersection over Union (mIoU) with the speed of 144.4 and 108.9 Frames Per Second (FPS) on Cityscapes and Cambridge-driving Labeled Video Database (CamVid) test sets, respectively. Furthermore, we conduct additional experiments on the Stanford Background dataset to verify the robustness of MSEPNet in diverse real-world environments, demonstrating its generalization ability beyond standard benchmarks.
基于强大的卷积神经网络(cnn)和复杂的模型结构,语义分割实现了良好的分割精度,但其缓慢的推理速度限制了其在自动驾驶和医疗诊断等实际应用中的应用。因此,实时语义分割越来越受到人们的关注。然而,现有的大多数实时语义分割方法在提高推理速度的同时,显著牺牲了分割精度。在实时语义分割中,如何在推理速度和精度之间取得良好的平衡仍然是一个重要的问题。为了解决这一问题,我们提出了一种实时语义分割网络——多形状增强金字塔网络(MSEPNet)。首先,我们提出了一种高效的空间逆残差(ESIR)模块来有效地提取多尺度空间信息。其次,为了在保持高效推理速度的同时捕获多尺度语义信息,我们引入了高效的上下文残差(ECR)模块。最后,我们提出了多形状增强金字塔(MSEP)模块来捕获多尺度和多形状的上下文信息。所提出的MSEPNet在街景数据集上取得了具有竞争力的结果。具体而言,该方法仅使用104万个(1.04万个)参数,在城市景观和剑桥驾驶标记视频数据库(CamVid)测试集上分别以144.4帧/秒和108.9帧/秒的速度实现了76.7%和72.5%的平均交叉路口(mIoU)准确率。此外,我们在斯坦福背景数据集上进行了额外的实验,以验证MSEPNet在不同现实环境中的鲁棒性,证明其超越标准基准的泛化能力。
{"title":"Multi-shape enhancement pyramid network for real-time semantic segmentation","authors":"Zhiyong Chen ,&nbsp;Zhao Yang ,&nbsp;Peng Liu ,&nbsp;Feng Wang ,&nbsp;Yamei Dou","doi":"10.1016/j.engappai.2026.114059","DOIUrl":"10.1016/j.engappai.2026.114059","url":null,"abstract":"<div><div>Based on powerful convolutional neural networks (CNNs) and complex model structures, semantic segmentation achieves good segmentation accuracy, but its slow inference speed limits its use in practical applications, such as autonomous driving and medical diagnosis. Thus, real-time semantic segmentation receives increasing attention. However, most existing real-time semantic segmentation methods improve inference speed while significantly sacrificing segmentation precision. Striking a well balance between inference speed and precision remains a major issue in real-time semantic segmentation. To address this issue, we propose a real-time semantic segmentation network, the Multi-Shape Enhancement Pyramid Network (MSEPNet). First, we propose an efficient spatial inverted residual (ESIR) module to effectively extract multi-scale spatial information. Next, to capture multi-scale semantic information while maintaining efficient inference speed, we introduce an efficient contextual residual (ECR) module. Finally, we present the multi-shape enhancement pyramid (MSEP) module to capture multi-scale and multi-shape contextual information. The proposed MSEPNet achieves competitive results on street scene datasets. Specifically, with only 1.04 million (1.04M) parameters, it achieves the accuracy of 76.7% and 72.5% mean Intersection over Union (mIoU) with the speed of 144.4 and 108.9 Frames Per Second (FPS) on Cityscapes and Cambridge-driving Labeled Video Database (CamVid) test sets, respectively. Furthermore, we conduct additional experiments on the Stanford Background dataset to verify the robustness of MSEPNet in diverse real-world environments, demonstrating its generalization ability beyond standard benchmarks.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"168 ","pages":"Article 114059"},"PeriodicalIF":8.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient deep learning-based prediction of floor response spectra for nuclear power plants using a multi-head attention-based convolutional bidirectional long short-term memory network 基于多头注意的卷积双向长短期记忆网络的高效深度学习核电厂楼板响应谱预测
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-05 DOI: 10.1016/j.engappai.2026.114067
Fan Yang, Zhi Zheng, Xiaolan Pan, Zhongyao Lin, Pengkun Zhang
The seismic safety assessment of nuclear power plants (NPPs) fundamentally relies on accurate floor response spectra (FRS). Conventional generation methods, such as nonlinear time history analysis (NLTHA), are computationally prohibitive, while direct spectra-to-spectra methods struggle with structural nonlinearity and single ground-motion input. To overcome these limitations, this study proposes a novel deep learning framework, the multi-head attention-based convolutional bidirectional long short-term memory network (MAC-BiLSTM), for efficient and accurate FRS prediction in NPPs. This architecture strategically integrates convolutional neural networks (CNNs) for local feature extraction, bidirectional long short-term memory (BiLSTM) for modeling long-term temporal dependencies, and a multi-head attention (MHA) mechanism for dynamically weighting critical spectral features. A comprehensive FRS dataset is generated via NLTHA using a lumped-mass stick model of an NPP, explicitly incorporating structural uncertainties, and subjected to a suite of near-fault ground motions. Results demonstrate that the MAC-BiLSTM model achieves competitive accuracy across critical NPPs nodes while offering a computational speedup of approximately 120 times compared to conventional NLTHA. Sensitivity analyses further confirm the model's robustness across varying seismic intensities, frequency contents, and ground motion durations. Comparative studies against a suite of baseline machine learning models highlight the superiority of MAC-BiLSTM and underscore the critical role of the MHA mechanism in capturing short-period-dominated FRS characteristics unique to NPP structures. This work provides a powerful data-driven tool for the rapid seismic performance assessment and safety design of acceleration-sensitive nonstructural components in NPPs.
核电厂的地震安全性评估从根本上依赖于准确的楼板反应谱。传统的生成方法,如非线性时程分析(NLTHA),在计算上是禁止的,而直接的光谱到光谱方法与结构非线性和单一地面运动输入作斗争。为了克服这些限制,本研究提出了一种新的深度学习框架,即基于多头注意的卷积双向长短期记忆网络(MAC-BiLSTM),用于高效准确地预测NPPs中的FRS。该架构战略性地集成了卷积神经网络(cnn)用于局部特征提取,双向长短期记忆(BiLSTM)用于长期时间依赖性建模,以及多头注意(MHA)机制用于动态加权关键光谱特征。综合的FRS数据集通过NLTHA生成,使用核电厂的集中质量棒模型,明确纳入结构不确定性,并受到一系列近断层地面运动的影响。结果表明,与传统NLTHA相比,MAC-BiLSTM模型在关键核电站节点上实现了具有竞争力的精度,同时提供了大约120倍的计算速度。敏感性分析进一步证实了该模型在不同地震强度、频率内容和地面运动持续时间下的稳健性。与一系列基线机器学习模型的比较研究强调了MAC-BiLSTM的优越性,并强调了MHA机制在捕获NPP结构特有的短期主导FRS特征方面的关键作用。这项工作为核电站加速度敏感非结构部件的快速抗震性能评估和安全设计提供了强大的数据驱动工具。
{"title":"Efficient deep learning-based prediction of floor response spectra for nuclear power plants using a multi-head attention-based convolutional bidirectional long short-term memory network","authors":"Fan Yang,&nbsp;Zhi Zheng,&nbsp;Xiaolan Pan,&nbsp;Zhongyao Lin,&nbsp;Pengkun Zhang","doi":"10.1016/j.engappai.2026.114067","DOIUrl":"10.1016/j.engappai.2026.114067","url":null,"abstract":"<div><div>The seismic safety assessment of nuclear power plants (NPPs) fundamentally relies on accurate floor response spectra (FRS). Conventional generation methods, such as nonlinear time history analysis (NLTHA), are computationally prohibitive, while direct spectra-to-spectra methods struggle with structural nonlinearity and single ground-motion input. To overcome these limitations, this study proposes a novel deep learning framework, the multi-head attention-based convolutional bidirectional long short-term memory network (MAC-BiLSTM), for efficient and accurate FRS prediction in NPPs. This architecture strategically integrates convolutional neural networks (CNNs) for local feature extraction, bidirectional long short-term memory (BiLSTM) for modeling long-term temporal dependencies, and a multi-head attention (MHA) mechanism for dynamically weighting critical spectral features. A comprehensive FRS dataset is generated via NLTHA using a lumped-mass stick model of an NPP, explicitly incorporating structural uncertainties, and subjected to a suite of near-fault ground motions. Results demonstrate that the MAC-BiLSTM model achieves competitive accuracy across critical NPPs nodes while offering a computational speedup of approximately 120 times compared to conventional NLTHA. Sensitivity analyses further confirm the model's robustness across varying seismic intensities, frequency contents, and ground motion durations. Comparative studies against a suite of baseline machine learning models highlight the superiority of MAC-BiLSTM and underscore the critical role of the MHA mechanism in capturing short-period-dominated FRS characteristics unique to NPP structures. This work provides a powerful data-driven tool for the rapid seismic performance assessment and safety design of acceleration-sensitive nonstructural components in NPPs.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"168 ","pages":"Article 114067"},"PeriodicalIF":8.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel fault diagnosis method for wind turbine gearbox based on data-driven incremental broad network 基于数据驱动的增量广义网络的风电齿轮箱故障诊断方法
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-05 DOI: 10.1016/j.engappai.2026.114050
Yuchen He, Xi Li, Lijuan Qian, JiaWei Lu
Wind power, as a crucial form of clean energy, has its reliability seriously affected by faults in the gearbox of wind power generation system. To address this challenge, a novel spatiotemporal incremental broad learning network and pseudo-label driven adaptation (SIBN-PLDA) is proposed in this paper for the fault diagnosis of wind turbine gearboxes during multi-condition non-stationary process. Firstly, a temporal convolutional generalized mapping structure integrating spatiotemporal features is constructed within SIBN-PLDA to enhance the representation capability for non-stationary signals. In addition, the SIBN-PLDA model exhibits extendibility in order to adapt to newly introduced fault samples and fault classes. Then, under new working conditions, a pseudo label quality constraint mechanism cooperated by global and local classifiers is used to dynamically evaluate the pseudo label and adjust the distribution alignment strategy, so as to effectively improve the cross-working condition migration performance. Finally, a sample confidence ranking approach with a center loss function is designed to enable accurate identification of unknown fault classes. The performance of the proposed method is further validated on two industrial wind power gearbox datasets where the experimental results show that the proposed method is superior to the existing mainstream methods in terms of diagnosis accuracy, cross-condition robustness, unknown fault adaptability and online update capacity, revealing better practical application potential. Specifically, SIBN-PLDA achieves an average overall accuracy of 94.41% and 93.56% on the SEU and WT gearbox datasets respectively, with the average accuracy of unknown fault class identification reaching 82.25% and 80.97% on the two datasets, which is 7.22% and 5.92% higher than the second-best method respectively.
风电作为清洁能源的重要形式,风力发电系统齿轮箱故障严重影响其可靠性。为了解决这一问题,本文提出了一种新的时空增量广义学习网络和伪标签驱动自适应(SIBN-PLDA)用于风电齿轮箱多工况非平稳过程故障诊断。首先,在SIBN-PLDA中构建了一个融合时空特征的时间卷积广义映射结构,增强了对非平稳信号的表示能力;此外,SIBN-PLDA模型具有可扩展性,可以适应新引入的故障样本和故障类别。然后,在新的工况下,采用全局分类器和局部分类器协同的伪标签质量约束机制,对伪标签进行动态评估,调整分布对齐策略,有效提高跨工况迁移性能。最后,设计了一种带有中心损失函数的样本置信度排序方法,以实现对未知故障类别的准确识别。在两个工业风电齿轮箱数据集上进一步验证了所提出方法的性能,实验结果表明,所提出方法在诊断精度、交叉条件鲁棒性、未知故障自适应能力和在线更新能力等方面均优于现有主流方法,显示出更好的实际应用潜力。其中,SIBN-PLDA在SEU和WT齿轮箱数据集上的平均总体准确率分别为94.41%和93.56%,其中未知故障类别识别的平均准确率分别达到82.25%和80.97%,分别比次优方法提高了7.22%和5.92%。
{"title":"A novel fault diagnosis method for wind turbine gearbox based on data-driven incremental broad network","authors":"Yuchen He,&nbsp;Xi Li,&nbsp;Lijuan Qian,&nbsp;JiaWei Lu","doi":"10.1016/j.engappai.2026.114050","DOIUrl":"10.1016/j.engappai.2026.114050","url":null,"abstract":"<div><div>Wind power, as a crucial form of clean energy, has its reliability seriously affected by faults in the gearbox of wind power generation system. To address this challenge, a novel spatiotemporal incremental broad learning network and pseudo-label driven adaptation (SIBN-PLDA) is proposed in this paper for the fault diagnosis of wind turbine gearboxes during multi-condition non-stationary process. Firstly, a temporal convolutional generalized mapping structure integrating spatiotemporal features is constructed within SIBN-PLDA to enhance the representation capability for non-stationary signals. In addition, the SIBN-PLDA model exhibits extendibility in order to adapt to newly introduced fault samples and fault classes. Then, under new working conditions, a pseudo label quality constraint mechanism cooperated by global and local classifiers is used to dynamically evaluate the pseudo label and adjust the distribution alignment strategy, so as to effectively improve the cross-working condition migration performance. Finally, a sample confidence ranking approach with a center loss function is designed to enable accurate identification of unknown fault classes. The performance of the proposed method is further validated on two industrial wind power gearbox datasets where the experimental results show that the proposed method is superior to the existing mainstream methods in terms of diagnosis accuracy, cross-condition robustness, unknown fault adaptability and online update capacity, revealing better practical application potential. Specifically, SIBN-PLDA achieves an average overall accuracy of 94.41% and 93.56% on the SEU and WT gearbox datasets respectively, with the average accuracy of unknown fault class identification reaching 82.25% and 80.97% on the two datasets, which is 7.22% and 5.92% higher than the second-best method respectively.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"168 ","pages":"Article 114050"},"PeriodicalIF":8.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A reparameterized hierarchical feature learning network designed for accurate sizing and localization of steel surface defects 一种用于钢表面缺陷精确定位的重参数化分层特征学习网络
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-05 DOI: 10.1016/j.engappai.2026.114101
Ronggang Ge , Yue Wang , Yonggeng Wei
Accurately measuring the size and spatial distribution of surface defects on steel products is essential for ensuring product quality. However, existing detection methods exhibit notable limitations in achieving high-precision measurements of defect dimensions and spatial localization. To address this issue, this study proposes two new modules: the reparameterized multi-scale receptive field module (RMRFM) and the hierarchical enhanced feature propagation network (HEFPN). By integrating the strengths of both modules, we further develop a unified detection architecture, termed You Only Look Once with Reparameterized Hierarchical Feature Learning (YOLO-RH). RMRFM significantly improves the model's measurement accuracy of defect size and spatial distribution without sacrificing detection speed through a reparameterized multi branch feature extraction strategy Meanwhile, HEFPN introduces a cross-layer feature interaction mechanism that effectively preserves shallow-layer texture information during feature extraction, providing essential support for the accurate measurement of defect attributes. Extensive experiments conducted on the NEU-DET dataset demonstrate that both RMRFM and HEFPN yield strong individual performance, while their combination in YOLO-RH achieves AP50, AP50:95, AR, APS, and ARS scores of 71.9%, 38.4%, 55.1%, 49.1%, and 61.1%, respectively, which were 3.7%, 1.9%, 5.2%, 11.8%, and 12.6% higher than baseline and consistently outperformed other state-of-the-art methods. Furthermore, generalization experiments on GC10-DET and PV-Multi datasets confirm the robustness of YOLO-RH across different materials and defect types. Finally, a steel defect measurement platform based on YOLO-RH is developed to validate its practical feasibility, offering a viable solution for intelligent defect measurement and automated sorting in real-world industrial environments.
准确测量钢制品表面缺陷的大小和空间分布,是保证产品质量的关键。然而,现有的检测方法在实现缺陷尺寸的高精度测量和空间定位方面存在明显的局限性。为了解决这一问题,本研究提出了两个新的模块:重参数化多尺度感受野模块(RMRFM)和分层增强特征传播网络(HEFPN)。通过整合这两个模块的优势,我们进一步开发了一个统一的检测架构,称为You Only Look Once with Reparameterized Hierarchical Feature Learning (YOLO-RH)。RMRFM通过重新参数化的多分支特征提取策略,在不牺牲检测速度的前提下,显著提高了模型对缺陷尺寸和空间分布的测量精度。同时,HEFPN引入了一种跨层特征交互机制,在特征提取过程中有效保留了浅层纹理信息,为缺陷属性的准确测量提供了必要的支持。在nue - det数据集上进行的大量实验表明,RMRFM和HEFPN都产生了很强的个体性能,而它们在ylo - rh中的组合分别达到了71.9%,38.4%,55.1%,49.1%和61.1%的AP50, AP50:95, AR, APS和ARS得分,分别比基线高3.7%,1.9%,5.2%,11.8%和12.6%,并且始终优于其他最先进的方法。此外,在GC10-DET和PV-Multi数据集上的推广实验证实了YOLO-RH对不同材料和缺陷类型的鲁棒性。最后,开发了基于YOLO-RH的钢缺陷测量平台,验证了其实际可行性,为现实工业环境下的智能缺陷测量和自动化分选提供了可行的解决方案。
{"title":"A reparameterized hierarchical feature learning network designed for accurate sizing and localization of steel surface defects","authors":"Ronggang Ge ,&nbsp;Yue Wang ,&nbsp;Yonggeng Wei","doi":"10.1016/j.engappai.2026.114101","DOIUrl":"10.1016/j.engappai.2026.114101","url":null,"abstract":"<div><div>Accurately measuring the size and spatial distribution of surface defects on steel products is essential for ensuring product quality. However, existing detection methods exhibit notable limitations in achieving high-precision measurements of defect dimensions and spatial localization. To address this issue, this study proposes two new modules: the reparameterized multi-scale receptive field module (RMRFM) and the hierarchical enhanced feature propagation network (HEFPN). By integrating the strengths of both modules, we further develop a unified detection architecture, termed You Only Look Once with Reparameterized Hierarchical Feature Learning (YOLO-RH). RMRFM significantly improves the model's measurement accuracy of defect size and spatial distribution without sacrificing detection speed through a reparameterized multi branch feature extraction strategy Meanwhile, HEFPN introduces a cross-layer feature interaction mechanism that effectively preserves shallow-layer texture information during feature extraction, providing essential support for the accurate measurement of defect attributes. Extensive experiments conducted on the NEU-DET dataset demonstrate that both RMRFM and HEFPN yield strong individual performance, while their combination in YOLO-RH achieves AP<sub>50</sub>, AP<sub>50:95</sub>, AR, AP<sub>S</sub>, and AR<sub>S</sub> scores of 71.9%, 38.4%, 55.1%, 49.1%, and 61.1%, respectively, which were 3.7%, 1.9%, 5.2%, 11.8%, and 12.6% higher than baseline and consistently outperformed other state-of-the-art methods. Furthermore, generalization experiments on GC10-DET and PV-Multi datasets confirm the robustness of YOLO-RH across different materials and defect types. Finally, a steel defect measurement platform based on YOLO-RH is developed to validate its practical feasibility, offering a viable solution for intelligent defect measurement and automated sorting in real-world industrial environments.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"168 ","pages":"Article 114101"},"PeriodicalIF":8.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Symmetry-based authority allocation for enhanced multi-agent decision-making 基于对称的增强多智能体决策权限分配
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-05 DOI: 10.1016/j.engappai.2026.113949
Zhi Fang, Li Huang
Ensuring effective multi-agent decision-making (MAD) is crucial for tackling complex, high-stakes challenges in fields such as finance, healthcare, and defense, where collaborative, adaptive solutions are essential for navigating dynamic and interconnected environments. However, current methods still struggle with effective role distribution and system coordination, leading to issues like premature opinion convergence and suboptimal decision efficiency. These challenges limit the practical adoption of MAD systems in real-world applications. This paper introduces a novel multi-agent decision-making system based on symmetric authority distribution (SDMADS). By incorporating a dynamic delegation mechanism and a dynamic consensus strategy, our system enhances fairness and flexibility in decision-making. Additionally, we propose the Hierarchical Direct Preference Optimization (HDPO) algorithm to optimize agent behaviors across multiple levels. Experimental results demonstrate that our system significantly improves decision quality, increases the adoption of diverse opinions, reduces bias, and outperforms existing approaches in terms of decision efficiency and adaptability.
确保有效的多智能体决策(MAD)对于应对金融、医疗保健和国防等领域的复杂、高风险挑战至关重要,在这些领域,协作、自适应解决方案对于导航动态和相互关联的环境至关重要。然而,目前的方法仍然难以实现有效的角色分配和系统协调,导致意见过早收敛和决策效率次优等问题。这些挑战限制了MAD系统在实际应用中的实际应用。介绍了一种新的基于对称权限分配的多智能体决策系统。通过引入动态授权机制和动态共识策略,我们的制度增强了决策的公平性和灵活性。此外,我们提出了层次直接偏好优化(HDPO)算法来优化多层智能体的行为。实验结果表明,我们的系统显著提高了决策质量,增加了对不同意见的采纳,减少了偏见,在决策效率和适应性方面优于现有方法。
{"title":"Symmetry-based authority allocation for enhanced multi-agent decision-making","authors":"Zhi Fang,&nbsp;Li Huang","doi":"10.1016/j.engappai.2026.113949","DOIUrl":"10.1016/j.engappai.2026.113949","url":null,"abstract":"<div><div>Ensuring effective multi-agent decision-making (MAD) is crucial for tackling complex, high-stakes challenges in fields such as finance, healthcare, and defense, where collaborative, adaptive solutions are essential for navigating dynamic and interconnected environments. However, current methods still struggle with effective role distribution and system coordination, leading to issues like premature opinion convergence and suboptimal decision efficiency. These challenges limit the practical adoption of MAD systems in real-world applications. This paper introduces a novel multi-agent decision-making system based on symmetric authority distribution (SDMADS). By incorporating a dynamic delegation mechanism and a dynamic consensus strategy, our system enhances fairness and flexibility in decision-making. Additionally, we propose the Hierarchical Direct Preference Optimization (HDPO) algorithm to optimize agent behaviors across multiple levels. Experimental results demonstrate that our system significantly improves decision quality, increases the adoption of diverse opinions, reduces bias, and outperforms existing approaches in terms of decision efficiency and adaptability.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"168 ","pages":"Article 113949"},"PeriodicalIF":8.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantum computing for capacity checks of reinforced concrete sections 钢筋混凝土截面承载力验算的量子计算
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-05 DOI: 10.1016/j.engappai.2026.113970
Salvatore Sessa, Luciano Rosati
We explore the application of quantum-enhanced machine learning techniques for the structural analysis of reinforced concrete sections subjected to combined axial force and biaxial bending. A quantum support vector machine framework is developed and employed to approximate the limit state surface that defines the ultimate capacity of Reinforced Concrete cross-sections. Several quantum kernel architectures, namely Fidelity, Mercer-inspired, ZZ-Feature Map, and HE2, are implemented and tested on two representative geometries: a symmetric rectangular section and an asymmetric L-shaped one. Kernel performance is evaluated in terms of classification accuracy, generalization behavior, and decision boundary coherence, showing strengths and limitations of each quantum feature map. Analyses have shown that the Fidelity, Mercer Static, and single-layer HE2 kernels provide the most robust and geometrically sound decision surfaces, thus achieving high classification accuracy and stability. On the contrary, deeper or highly expressive circuits exhibit overfitting and reduced generalization. Despite the limited accuracy observed for some kernel configurations, the results prove that quantum kernels can effectively approximate capacity domains, particularly in symmetric cases. Hence, quantum-enhanced models represent a promising direction for efficient structural verification, even if the technology is still in its early stages.
我们探索了量子增强机器学习技术在钢筋混凝土截面受轴向力和双轴弯曲的结构分析中的应用。提出了一种量子支持向量机框架,并将其用于逼近钢筋混凝土截面极限承载力的极限状态面。几种量子内核架构,即Fidelity, Mercer-inspired, ZZ-Feature Map和HE2,在两种代表性几何形状上实现和测试:对称矩形截面和不对称l形截面。从分类精度、泛化行为和决策边界相干性方面对核性能进行了评估,显示了每个量子特征映射的优势和局限性。分析表明,Fidelity、Mercer Static和单层HE2核提供了最鲁棒和几何上最健全的决策面,从而实现了高分类精度和稳定性。相反,深度或高表达电路表现出过度拟合和降低泛化。尽管对某些核配置观察到的精度有限,但结果证明量子核可以有效地近似容量域,特别是在对称情况下。因此,量子增强模型代表了有效结构验证的一个有前途的方向,即使该技术仍处于早期阶段。
{"title":"Quantum computing for capacity checks of reinforced concrete sections","authors":"Salvatore Sessa,&nbsp;Luciano Rosati","doi":"10.1016/j.engappai.2026.113970","DOIUrl":"10.1016/j.engappai.2026.113970","url":null,"abstract":"<div><div>We explore the application of quantum-enhanced machine learning techniques for the structural analysis of reinforced concrete sections subjected to combined axial force and biaxial bending. A quantum support vector machine framework is developed and employed to approximate the limit state surface that defines the ultimate capacity of Reinforced Concrete cross-sections. Several quantum kernel architectures, namely Fidelity, Mercer-inspired, ZZ-Feature Map, and HE2, are implemented and tested on two representative geometries: a symmetric rectangular section and an asymmetric L-shaped one. Kernel performance is evaluated in terms of classification accuracy, generalization behavior, and decision boundary coherence, showing strengths and limitations of each quantum feature map. Analyses have shown that the Fidelity, Mercer Static, and single-layer HE2 kernels provide the most robust and geometrically sound decision surfaces, thus achieving high classification accuracy and stability. On the contrary, deeper or highly expressive circuits exhibit overfitting and reduced generalization. Despite the limited accuracy observed for some kernel configurations, the results prove that quantum kernels can effectively approximate capacity domains, particularly in symmetric cases. Hence, quantum-enhanced models represent a promising direction for efficient structural verification, even if the technology is still in its early stages.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"168 ","pages":"Article 113970"},"PeriodicalIF":8.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards neural-symbolic grammatical inference for endangered languages using integrating graph neural networks and instruction-tuned language models 基于图神经网络和指令调优语言模型的濒危语言神经符号语法推理研究
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-05 DOI: 10.1016/j.engappai.2026.114011
Manu Singh , Neha Gupta , Shiva Tyagi , Ashima Rani , Vinod Kumar , Surbhi Sharma
The rapid disappearance of endangered languages presents a critical challenge to cultural preservation and linguistic diversity. Traditional grammatical induction techniques face the challenge of having limited annotated data to work with on such language. To address these issues, this study proposes a Neural-Symbolic Artificial Intelligence (AI) based Grammatical Inference framework. It integrates Graph Neural Networks (GNNs) with instruction-tuned language models for AI-driven grammar induction in low-resource languages. This framework uses Few-shot Relational Graph Convolutional Networks (FS-R-GCN) to convert grammatically erroneous texts into relation graphs. Then, the Instruction-Tuned Bidirectional Encoder Representations from Transformers (BERT) AI model processes these graph representations alongside the original erroneous sentences, generating grammatically accurate alternatives. This Instruction-Tuned Language model assists by identifying grammatical inconsistencies and providing structural guidance to the BERT model. Experimental results on the AlexEBall/Endangered_Languages_Capstone_Proj_1 dataset show that the proposed method improves performance significantly with 99.01 % accuracy, 98.70 % precision, and 98.00 % F1-score compared to the existing methods.
濒危语言的迅速消失对文化保护和语言多样性提出了严峻的挑战。传统的语法归纳技术面临着在这种语言上使用有限的注释数据的挑战。为了解决这些问题,本研究提出了一个基于神经符号人工智能(AI)的语法推理框架。它将图神经网络(gnn)与指令调整的语言模型集成在一起,用于人工智能驱动的低资源语言语法归纳。该框架使用少射关系图卷积网络(FS-R-GCN)将语法错误的文本转换为关系图。然后,来自变形金刚的指令调谐双向编码器表示(BERT)人工智能模型将这些图形表示与原始错误句子一起处理,生成语法准确的替代方案。这个指令调优语言模型通过识别语法不一致和为BERT模型提供结构指导来帮助。在AlexEBall/ endangered ered_languages_capstone_proj_1数据集上的实验结果表明,与现有方法相比,该方法的准确率为99.01%,精密度为98.70%,F1-score为98.00%,显著提高了性能。
{"title":"Towards neural-symbolic grammatical inference for endangered languages using integrating graph neural networks and instruction-tuned language models","authors":"Manu Singh ,&nbsp;Neha Gupta ,&nbsp;Shiva Tyagi ,&nbsp;Ashima Rani ,&nbsp;Vinod Kumar ,&nbsp;Surbhi Sharma","doi":"10.1016/j.engappai.2026.114011","DOIUrl":"10.1016/j.engappai.2026.114011","url":null,"abstract":"<div><div>The rapid disappearance of endangered languages presents a critical challenge to cultural preservation and linguistic diversity. Traditional grammatical induction techniques face the challenge of having limited annotated data to work with on such language. To address these issues, this study proposes a Neural-Symbolic Artificial Intelligence (AI) based Grammatical Inference framework. It integrates Graph Neural Networks (GNNs) with instruction-tuned language models for AI-driven grammar induction in low-resource languages. This framework uses Few-shot Relational Graph Convolutional Networks (FS-R-GCN) to convert grammatically erroneous texts into relation graphs. Then, the Instruction-Tuned Bidirectional Encoder Representations from Transformers (BERT) AI model processes these graph representations alongside the original erroneous sentences, generating grammatically accurate alternatives. This Instruction-Tuned Language model assists by identifying grammatical inconsistencies and providing structural guidance to the BERT model. Experimental results on the AlexEBall/Endangered_Languages_Capstone_Proj_1 dataset show that the proposed method improves performance significantly with 99.01 % accuracy, 98.70 % precision, and 98.00 % F1-score compared to the existing methods.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"168 ","pages":"Article 114011"},"PeriodicalIF":8.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Engineering Applications of Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1