首页 > 最新文献

International Journal of Intelligent Systems最新文献

英文 中文
Meta-Explainers: A Unified Ensemble Approach for Multifaceted XAI 元解释器:面向多面XAI的统一集成方法
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-26 DOI: 10.1155/int/4841666
Marilyn Bello, Rosalís Amador, María-Matilde García, Rafael Bello, Óscar Cordón, Francisco Herrera

Artificial intelligence (AI) systems are increasingly adopted in high-stakes domains such as healthcare and finance, so the demand for transparency and interpretability has grown substantially. EXplainable AI (XAI) methods have emerged to address this challenge, but individual techniques often offer limited, fragmented insights. This paper introduces Meta-explainers, a novel ensemble-based XAI framework that integrates multiple explanation types—specifically relevance-based and counterfactual methods—into unified, multifaceted and complementary meta-explanations. Inspired by meta-classification principles, our approach structures the explanation process into five stages: generation, grouping, evaluation, aggregation, and visualization. Each stage is designed to preserve the unique strengths of individual XAI techniques while enhancing their interpretability and coherence when combined. Experimental results on both image (MNIST) and tabular (Breast Cancer) datasets show that Meta-explainers consistently outperform individual and state-of-the-art ensemble explanation methods in terms of explanation quality, as measured by established metrics. This work paves the way toward more holistic and user-centered AI explainability with a flexible methodology that can be extended to incorporate additional explanation paradigms.

人工智能(AI)系统越来越多地应用于医疗保健和金融等高风险领域,因此对透明度和可解释性的需求大幅增长。可解释的人工智能(XAI)方法已经出现,以应对这一挑战,但单个技术通常提供有限的,碎片化的见解。本文介绍了元解释器,这是一种新颖的基于集成的XAI框架,它将多种解释类型(特别是基于关联和反事实的方法)集成到统一的、多方面的和互补的元解释中。受元分类原理的启发,我们的方法将解释过程分为五个阶段:生成、分组、评估、聚合和可视化。每个阶段的设计都是为了保留单个XAI技术的独特优势,同时增强它们在组合时的可解释性和一致性。在图像(MNIST)和表格(乳腺癌)数据集上的实验结果表明,就解释质量而言,元解释器始终优于个体和最先进的集成解释方法。这项工作为更全面和以用户为中心的人工智能可解释性铺平了道路,它采用了一种灵活的方法,可以扩展到包含其他解释范式。
{"title":"Meta-Explainers: A Unified Ensemble Approach for Multifaceted XAI","authors":"Marilyn Bello,&nbsp;Rosalís Amador,&nbsp;María-Matilde García,&nbsp;Rafael Bello,&nbsp;Óscar Cordón,&nbsp;Francisco Herrera","doi":"10.1155/int/4841666","DOIUrl":"https://doi.org/10.1155/int/4841666","url":null,"abstract":"<p>Artificial intelligence (AI) systems are increasingly adopted in high-stakes domains such as healthcare and finance, so the demand for transparency and interpretability has grown substantially. EXplainable AI (XAI) methods have emerged to address this challenge, but individual techniques often offer limited, fragmented insights. This paper introduces Meta-explainers, a novel ensemble-based XAI framework that integrates multiple explanation types—specifically relevance-based and counterfactual methods—into unified, multifaceted and complementary meta-explanations. Inspired by meta-classification principles, our approach structures the explanation process into five stages: generation, grouping, evaluation, aggregation, and visualization. Each stage is designed to preserve the unique strengths of individual XAI techniques while enhancing their interpretability and coherence when combined. Experimental results on both image (MNIST) and tabular (Breast Cancer) datasets show that Meta-explainers consistently outperform individual and state-of-the-art ensemble explanation methods in terms of explanation quality, as measured by established metrics. This work paves the way toward more holistic and user-centered AI explainability with a flexible methodology that can be extended to incorporate additional explanation paradigms.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/4841666","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145626361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Ancient Chinese Word Segmentation With Knowledge-Enhanced Prompting for Large Language Models 基于知识增强提示的大型语言模型古汉语分词改进
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-26 DOI: 10.1155/int/9612240
Meng-Tian Tang, Cheng-Gang Mi

This paper introduces a cost-effective prompt optimization strategy for ancient Chinese word segmentation using large language models, aiming to mitigate the substantial computational resources and training expenses of fine-tuning. We developed two knowledge-enhanced frameworks, a General Knowledge Prompt framework and a Domain-Specific Knowledge Prompt framework, and evaluated their effectiveness across various ancient Chinese corpora using seven mainstream LLMs, including ERNIE Bot, Qwen, SparkDesk, DeepSeek, ChatGPT, Gemini, and Copilot. Our findings confirm that both prompt frameworks enhance the segmentation capability of LLMs to varying extents, with the Domain-Specific Knowledge Prompt framework yielding the most significant improvements. Notably, the DeepSeek model achieves 94.01% F1 score (94.24% precision, 93.79% recall) on the test set, while the Qwen model demonstrates a remarkable 15.73% increase in the F1 score with the Domain-Specific Knowledge Prompt framework. Our ablation studies indicate that the entries Rules and Examples are the most crucial to the success of prompt frameworks, effectively addressing the challenges of rule inconsistency and insufficient annotated data.

本文提出了一种基于大型语言模型的古汉语分词快速优化策略,旨在减少大量的计算资源和训练费用。我们开发了两个知识增强框架,一个是通用知识提示框架,一个是特定领域知识提示框架,并使用七个主流llm(包括ERNIE Bot、Qwen、SparkDesk、DeepSeek、ChatGPT、Gemini和Copilot)评估了它们在各种古代汉语语料库中的有效性。我们的研究结果证实,这两种提示框架都在不同程度上增强了法学硕士的分割能力,其中领域特定知识提示框架的改进最为显著。值得注意的是,DeepSeek模型在测试集上获得了94.01%的F1分数(准确率94.24%,召回率93.79%),而Qwen模型在特定领域知识提示框架下的F1分数提高了15.73%。我们的消融研究表明,条目规则和示例是提示框架成功的最关键,有效地解决了规则不一致和注释数据不足的挑战。
{"title":"Improving Ancient Chinese Word Segmentation With Knowledge-Enhanced Prompting for Large Language Models","authors":"Meng-Tian Tang,&nbsp;Cheng-Gang Mi","doi":"10.1155/int/9612240","DOIUrl":"https://doi.org/10.1155/int/9612240","url":null,"abstract":"<p>This paper introduces a cost-effective prompt optimization strategy for ancient Chinese word segmentation using large language models, aiming to mitigate the substantial computational resources and training expenses of fine-tuning. We developed two knowledge-enhanced frameworks, a General Knowledge Prompt framework and a Domain-Specific Knowledge Prompt framework, and evaluated their effectiveness across various ancient Chinese corpora using seven mainstream LLMs, including ERNIE Bot, Qwen, SparkDesk, DeepSeek, ChatGPT, Gemini, and Copilot. Our findings confirm that both prompt frameworks enhance the segmentation capability of LLMs to varying extents, with the Domain-Specific Knowledge Prompt framework yielding the most significant improvements. Notably, the DeepSeek model achieves 94.01% <i>F</i>1 score (94.24% precision, 93.79% recall) on the test set, while the Qwen model demonstrates a remarkable 15.73% increase in the <i>F</i>1 score with the Domain-Specific Knowledge Prompt framework. Our ablation studies indicate that the entries Rules and Examples are the most crucial to the success of prompt frameworks, effectively addressing the challenges of rule inconsistency and insufficient annotated data.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/9612240","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145626360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Method of Extractive Text Summarization Using Document Semantic Graph With Node Ranking 基于节点排序的文档语义图提取文本摘要方法
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-21 DOI: 10.1155/int/5530784
Zhenhao Li, Miao Liu, Wenbin Chen, Ligang Zheng

With the rise of neural networks and pre-trained models such as BERT, abstractive text summarization techniques have received widespread attention. Nevertheless, traditional extractive text summarization methods still hold substantial research value due to their low computational cost, interpretability, and robustness. In algorithms like TextRank and its variants, graph nodes are typically constructed based on surface-level lexical features. These graphs often fail to incorporate many contextual relationships, such as coreference relationships among nodes, resulting in fragmented representations of key concepts. For edge construction, a sliding window of size T is commonly used to connect word nodes within the window. However, these methods often fall short in modeling the rich contextual dependencies embedded in the document. Several recent studies have demonstrated that semantic graphs can effectively improve the accuracy of text summarization. In this paper, we construct a more interpretable semantic graph from syntax trees and propose a novel unsupervised algorithm based on the personalized PageRank algorithm for summary extraction. We utilize tree transformation methods to enrich word-level information for graph construction, define node-merging rules to reduce graph complexity, use coreference chains to merge coreferring entities across sentences for enriching contextual links, and introduce the concept of Meta Node sets to capture thematic relationships that are not fully represented by syntactic dependencies or coreference chains alone. By clustering semantically related words, Meta Nodes enhance the graph’s ability to reflect deeper contextual coherence across the document. Compared with previous TextRank-based methods, our improvement yields significant ROUGE score boosts on the CNN-DM dataset. While the method was developed and evaluated using English-language datasets, its underlying design is language agnostic and can be adapted to other languages with suitable linguistic tools.

随着神经网络和BERT等预训练模型的兴起,抽象文本摘要技术受到了广泛的关注。然而,传统的提取文本摘要方法由于其计算成本低、可解释性强、鲁棒性好等优点,仍然具有很大的研究价值。在像TextRank及其变体这样的算法中,图节点通常是基于表面级词法特征构造的。这些图通常不能包含许多上下文关系,例如节点之间的共引用关系,从而导致关键概念的碎片化表示。对于边缘构造,通常使用大小为T的滑动窗口来连接窗口内的词节点。然而,这些方法在对嵌入在文档中的丰富的上下文依赖关系进行建模方面常常存在不足。最近的一些研究表明,语义图可以有效地提高文本摘要的准确性。在本文中,我们从语法树构造了一个更可解释的语义图,并提出了一种新的基于个性化PageRank算法的无监督摘要提取算法。我们利用树转换方法来丰富词级信息以构建图,定义节点合并规则以降低图的复杂性,使用共引用链来合并句子间的共引用实体以丰富上下文链接,并引入元节点集的概念来捕获不能完全由句法依赖或共引用链单独表示的主题关系。通过聚类语义相关的词,元节点增强了图在整个文档中反映更深层次上下文一致性的能力。与之前基于texrank的方法相比,我们的改进在CNN-DM数据集上产生了显著的ROUGE分数提升。虽然该方法是使用英语数据集开发和评估的,但其基本设计是语言不可知的,可以通过合适的语言工具适应其他语言。
{"title":"A Method of Extractive Text Summarization Using Document Semantic Graph With Node Ranking","authors":"Zhenhao Li,&nbsp;Miao Liu,&nbsp;Wenbin Chen,&nbsp;Ligang Zheng","doi":"10.1155/int/5530784","DOIUrl":"https://doi.org/10.1155/int/5530784","url":null,"abstract":"<p>With the rise of neural networks and pre-trained models such as BERT, abstractive text summarization techniques have received widespread attention. Nevertheless, traditional extractive text summarization methods still hold substantial research value due to their low computational cost, interpretability, and robustness. In algorithms like TextRank and its variants, graph nodes are typically constructed based on surface-level lexical features. These graphs often fail to incorporate many contextual relationships, such as coreference relationships among nodes, resulting in fragmented representations of key concepts. For edge construction, a sliding window of size T is commonly used to connect word nodes within the window. However, these methods often fall short in modeling the rich contextual dependencies embedded in the document. Several recent studies have demonstrated that semantic graphs can effectively improve the accuracy of text summarization. In this paper, we construct a more interpretable semantic graph from syntax trees and propose a novel unsupervised algorithm based on the personalized PageRank algorithm for summary extraction. We utilize tree transformation methods to enrich word-level information for graph construction, define node-merging rules to reduce graph complexity, use coreference chains to merge coreferring entities across sentences for enriching contextual links, and introduce the concept of Meta Node sets to capture thematic relationships that are not fully represented by syntactic dependencies or coreference chains alone. By clustering semantically related words, Meta Nodes enhance the graph’s ability to reflect deeper contextual coherence across the document. Compared with previous TextRank-based methods, our improvement yields significant ROUGE score boosts on the CNN-DM dataset. While the method was developed and evaluated using English-language datasets, its underlying design is language agnostic and can be adapted to other languages with suitable linguistic tools.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/5530784","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145572552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pragmatic Brain Tumor Imaging Classification Using Federated Learning 使用联邦学习的实用脑肿瘤成像分类
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-20 DOI: 10.1155/int/8817677
Jun Wen, Long Liu, Xiaoli Li, Xiusheng Li, Hang Mao

Brain tumors account for approximately 2.5% of cancer-related deaths. Accurate classification of brain tumor types is essential for timely diagnosis and enhancing survival rates. Convolutional neural networks (CNNs) have demonstrated state-of-the-art performance in computer-aided diagnosis of brain tumors; however, the quality and availability of medical data significantly influence this process. Medical data must adhere to stringent privacy regulations, such as the General Data Protection Regulation (GDPR) in the European Union and the Health Insurance Portability and Accountability Act (HIPAA) in the United States. Federated learning (FL) enables the sharing of only model update parameters during collaborative training on locally stored data. However, these parameters may inadvertently enable reconstruction of the original data. Furthermore, medical data often exhibit nonindependent and nonidentically distributed (non-IID) characteristics, impeding model training performance. To address these challenges, this paper proposes a scheme that partitions confidential data into multiple segments during FL training, ensuring that only a subset exceeding a predefined threshold can reconstruct the data. The proposed scheme guarantees enhanced security, distributed control, and fault tolerance. In addition, this paper introduces a Conditional Mutual Information (CMI) regularizer to mitigate variability in model predictions. By minimizing the Kullback–Leibler (KL) divergence between local and global feature distributions, the CMI regularizer substantially enhances performance and convergence stability. Extensive experiments conducted on the Figshare dataset with varying α-values for data distributions validate the efficacy of the proposed model. Compared to FedAvg, FedProx, and FedDyn at α = 0.3, as well as the central model, the proposed model achieves a top-1 accuracy of 92.94% on the Figshare dataset, surpassing FedProx, FedAvg, and FedDyn by 2.42%, 2.82%, and 3.53%, respectively. Federated IID achieves performance comparable to that of the central model, further demonstrating its viability for practical applications.

脑肿瘤约占癌症相关死亡人数的2.5%。准确的脑肿瘤类型分类对于及时诊断和提高生存率至关重要。卷积神经网络(cnn)在脑肿瘤的计算机辅助诊断中表现出了最先进的性能;然而,医疗数据的质量和可用性对这一进程有重大影响。医疗数据必须遵守严格的隐私法规,例如欧盟的《通用数据保护条例》(GDPR)和美国的《健康保险流通与责任法案》(HIPAA)。联邦学习(FL)允许在对本地存储的数据进行协作训练期间仅共享模型更新参数。然而,这些参数可能无意中启用原始数据的重建。此外,医疗数据往往表现出非独立和非同分布(non-IID)的特征,阻碍了模型训练的性能。为了解决这些挑战,本文提出了一种方案,该方案在FL训练期间将机密数据划分为多个部分,确保只有超过预定义阈值的子集才能重建数据。该方案保证了增强的安全性、分布式控制和容错性。此外,本文还引入了条件互信息(CMI)正则化器来减轻模型预测中的可变性。通过最小化局部和全局特征分布之间的Kullback-Leibler (KL)散度,CMI正则化器大大提高了性能和收敛稳定性。在Figshare数据集上进行的大量实验验证了该模型的有效性,该数据集具有不同的数据分布α-值。与α = 0.3时的fedag、FedProx和FedDyn以及中心模型相比,该模型在Figshare数据集上达到了92.94%的top-1精度,分别比FedProx、fedprog和FedDyn高2.42%、2.82%和3.53%。Federated IID实现了与中心模型相当的性能,进一步证明了其在实际应用中的可行性。
{"title":"Pragmatic Brain Tumor Imaging Classification Using Federated Learning","authors":"Jun Wen,&nbsp;Long Liu,&nbsp;Xiaoli Li,&nbsp;Xiusheng Li,&nbsp;Hang Mao","doi":"10.1155/int/8817677","DOIUrl":"https://doi.org/10.1155/int/8817677","url":null,"abstract":"<p>Brain tumors account for approximately 2.5% of cancer-related deaths. Accurate classification of brain tumor types is essential for timely diagnosis and enhancing survival rates. Convolutional neural networks (CNNs) have demonstrated state-of-the-art performance in computer-aided diagnosis of brain tumors; however, the quality and availability of medical data significantly influence this process. Medical data must adhere to stringent privacy regulations, such as the General Data Protection Regulation (GDPR) in the European Union and the Health Insurance Portability and Accountability Act (HIPAA) in the United States. Federated learning (FL) enables the sharing of only model update parameters during collaborative training on locally stored data. However, these parameters may inadvertently enable reconstruction of the original data. Furthermore, medical data often exhibit nonindependent and nonidentically distributed (non-IID) characteristics, impeding model training performance. To address these challenges, this paper proposes a scheme that partitions confidential data into multiple segments during FL training, ensuring that only a subset exceeding a predefined threshold can reconstruct the data. The proposed scheme guarantees enhanced security, distributed control, and fault tolerance. In addition, this paper introduces a Conditional Mutual Information (CMI) regularizer to mitigate variability in model predictions. By minimizing the Kullback–Leibler (KL) divergence between local and global feature distributions, the CMI regularizer substantially enhances performance and convergence stability. Extensive experiments conducted on the Figshare dataset with varying α-values for data distributions validate the efficacy of the proposed model. Compared to FedAvg, FedProx, and FedDyn at <i>α</i> = 0.3, as well as the central model, the proposed model achieves a top-1 accuracy of 92.94% on the Figshare dataset, surpassing FedProx, FedAvg, and FedDyn by 2.42%, 2.82%, and 3.53%, respectively. Federated IID achieves performance comparable to that of the central model, further demonstrating its viability for practical applications.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/8817677","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145572353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Robust Watermarking Method for Hyperspectral Images Based on Hybrid Attention Mechanism 基于混合注意机制的高光谱图像鲁棒水印方法
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-19 DOI: 10.1155/int/8844705
De Li, Zhewei Zhang, Xuanyou Li, Xun Jin, Yanwei Wang

Because of the copyright issues of hyperspectral images continue to rise, in this paper, we propose to use a neural network–based watermarking model to protect the copyright. By applying normalization-based attention module (NAM) to deep dispersed watermarking with synchronization and fusion (DWSF), a NDWSF model is proposed for robust hyperspectral image watermarking. It consists of encoding, decoding, discrimination, and attack modules. The encoding and decoding modules are used for embedding and extracting watermarks. Discrimination module is proposed for improving the quality of watermarked image. The discrimination module and the encoding module are in an adversarial relationship to motivate the encoder to generate watermarks with stronger invisibility. Attack module is employed between embedding and extraction to improve robustness against compression and noise and geometric attacks. In order to more effectively utilize image features for watermarking, a kind of hybrid attention mechanism is employed in embedding and extraction by adding NAM. Experimental results show that the loss convergence and stability in training is improved. The peak signal-to-noise ratio of the proposed method is 48.08 dB, higher than other methods about 2.5 dB. The bit error rate of the proposed method is less than 2.5% for various hybrid attacks, showing good robustness.

由于高光谱图像的版权问题不断增加,本文提出了一种基于神经网络的水印模型来保护高光谱图像的版权。将基于归一化的注意力模块(NAM)应用于深度分散同步融合水印(DWSF),提出了一种鲁棒高光谱图像同步融合水印模型。它由编码、解码、鉴别和攻击四个模块组成。编码和解码模块用于水印的嵌入和提取。为了提高水印图像的质量,提出了识别模块。识别模块和编码模块是一种对抗关系,以激励编码器产生不可见性更强的水印。在嵌入和提取之间采用攻击模块,提高了对压缩、噪声和几何攻击的鲁棒性。为了更有效地利用图像特征进行水印,在嵌入和提取中采用了一种混合注意机制,加入了NAM。实验结果表明,该方法提高了训练中的损失收敛性和稳定性。该方法的峰值信噪比为48.08 dB,比其他方法高约2.5 dB。该方法对各种混合攻击的误码率均小于2.5%,具有较好的鲁棒性。
{"title":"A Robust Watermarking Method for Hyperspectral Images Based on Hybrid Attention Mechanism","authors":"De Li,&nbsp;Zhewei Zhang,&nbsp;Xuanyou Li,&nbsp;Xun Jin,&nbsp;Yanwei Wang","doi":"10.1155/int/8844705","DOIUrl":"https://doi.org/10.1155/int/8844705","url":null,"abstract":"<p>Because of the copyright issues of hyperspectral images continue to rise, in this paper, we propose to use a neural network–based watermarking model to protect the copyright. By applying normalization-based attention module (NAM) to deep dispersed watermarking with synchronization and fusion (DWSF), a NDWSF model is proposed for robust hyperspectral image watermarking. It consists of encoding, decoding, discrimination, and attack modules. The encoding and decoding modules are used for embedding and extracting watermarks. Discrimination module is proposed for improving the quality of watermarked image. The discrimination module and the encoding module are in an adversarial relationship to motivate the encoder to generate watermarks with stronger invisibility. Attack module is employed between embedding and extraction to improve robustness against compression and noise and geometric attacks. In order to more effectively utilize image features for watermarking, a kind of hybrid attention mechanism is employed in embedding and extraction by adding NAM. Experimental results show that the loss convergence and stability in training is improved. The peak signal-to-noise ratio of the proposed method is 48.08 dB, higher than other methods about 2.5 dB. The bit error rate of the proposed method is less than 2.5% for various hybrid attacks, showing good robustness.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/8844705","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145581097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimization Enabled Online Tiger-Claw Fuzzy Region With Clustering Based Neovascularization Segmentation and Classification Using YOLO-V5 From Retinal Fundus Images 基于YOLO-V5优化的基于聚类的在线虎爪模糊区域视网膜眼底图像新生血管分割与分类
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-17 DOI: 10.1155/int/6119924
M. Kathiravan, Ashwini A., Balasubramaniam S., T. D. Subha, Gururama Senthilvel P., Sivakumar T. A.

The pathological development of abnormal blood vessels results in neovascularization as a major vision-threatening condition of diabetic retinopathy. The main factor behind pathological vessel growth results from retinal capillary depletion of oxygen that causes abnormal vascular development patterns. Early detection of these fundus image abnormalities requires precision because it enables ophthalmologists to provide effective treatment and make proper diagnoses. A multiple-step image processing system treats this problem. A fusion-based contrast enhancement method begins the process of enhancing diabetic retinopathy fundus image brightness and contrast. After the initial process, the system applies detail weighted histogram equalization to the green channel for better structural detail visualization. In the second stage, the proposed online tiger-claw algorithm segments abnormal neovascularization from normal blood vessels. Next, the combination of fuzzy zone-based clustering with optimization and classifier thresholding performs local identification along with highlight generation for neovascularized areas. Neovascularization detection makes use of a YOLOv5 neural network in the third stage through feature extraction and classification operations. A refined segmentation process occurs with the application of multistage gray wolf optimization. The proposed algorithm underwent testing through its application to the public datasets STARE, DRIVE, MESSIDOR, and DIARETDB1. Experimental tests indicate that the neovascularization region marking performed with 98.19% sensitivity and 96.56% specificity while reaching 99.27% accuracy. The proposed approach demonstrates 97.03% accuracy and 98.94% sensitivity, together with 97.17% specificity in neovascularization detection.

异常血管的病理发展导致新生血管形成是糖尿病视网膜病变的主要视力威胁条件。病理血管生长背后的主要因素是视网膜毛细血管缺氧,导致血管发育模式异常。早期发现这些眼底图像异常需要精确,因为它使眼科医生能够提供有效的治疗和做出正确的诊断。多步图像处理系统解决了这个问题。一种基于融合的对比度增强方法开始了增强糖尿病视网膜病变眼底图像亮度和对比度的过程。经过初始处理后,系统对绿色通道进行细节加权直方图均衡化,以获得更好的结构细节可视化效果。第二阶段,提出的在线虎爪算法将异常新生血管从正常血管中分割出来。接下来,基于模糊区域的聚类与优化和分类器阈值相结合,对新血管化区域进行局部识别和高光生成。新生血管检测在第三阶段通过特征提取和分类操作使用YOLOv5神经网络。应用多阶段灰狼优化,实现了精细的分割过程。本文提出的算法通过对公共数据集STARE、DRIVE、MESSIDOR和DIARETDB1的应用进行了测试。实验结果表明,新血管区标记的灵敏度为98.19%,特异性为96.56%,准确率为99.27%。该方法在新生血管检测中准确率为97.03%,灵敏度为98.94%,特异性为97.17%。
{"title":"Optimization Enabled Online Tiger-Claw Fuzzy Region With Clustering Based Neovascularization Segmentation and Classification Using YOLO-V5 From Retinal Fundus Images","authors":"M. Kathiravan,&nbsp;Ashwini A.,&nbsp;Balasubramaniam S.,&nbsp;T. D. Subha,&nbsp;Gururama Senthilvel P.,&nbsp;Sivakumar T. A.","doi":"10.1155/int/6119924","DOIUrl":"https://doi.org/10.1155/int/6119924","url":null,"abstract":"<p>The pathological development of abnormal blood vessels results in neovascularization as a major vision-threatening condition of diabetic retinopathy. The main factor behind pathological vessel growth results from retinal capillary depletion of oxygen that causes abnormal vascular development patterns. Early detection of these fundus image abnormalities requires precision because it enables ophthalmologists to provide effective treatment and make proper diagnoses. A multiple-step image processing system treats this problem. A fusion-based contrast enhancement method begins the process of enhancing diabetic retinopathy fundus image brightness and contrast. After the initial process, the system applies detail weighted histogram equalization to the green channel for better structural detail visualization. In the second stage, the proposed online tiger-claw algorithm segments abnormal neovascularization from normal blood vessels. Next, the combination of fuzzy zone-based clustering with optimization and classifier thresholding performs local identification along with highlight generation for neovascularized areas. Neovascularization detection makes use of a YOLOv5 neural network in the third stage through feature extraction and classification operations. A refined segmentation process occurs with the application of multistage gray wolf optimization. The proposed algorithm underwent testing through its application to the public datasets STARE, DRIVE, MESSIDOR, and DIARETDB1. Experimental tests indicate that the neovascularization region marking performed with 98.19% sensitivity and 96.56% specificity while reaching 99.27% accuracy. The proposed approach demonstrates 97.03% accuracy and 98.94% sensitivity, together with 97.17% specificity in neovascularization detection.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/6119924","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145572459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Energy-Aware Regression in Spiking Neural Networks for Autonomous Driving: A Comparative Study With Convolutional Networks 用于自动驾驶的脉冲神经网络能量感知回归:与卷积网络的比较研究
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-14 DOI: 10.1155/int/4879993
Fernando Sevilla Martínez, Jordi Casas-Roma, Laia Subirats, Raúl Parada

As autonomous driving (AD) systems grow more complex, their rising computational demands pose significant energy and sustainability challenges. This paper investigates spiking neural networks (SNNs) as low-power alternatives to convolutional neural networks (CNNs) for regression tasks in AD. We introduce a membrane-potential (Vmem) decoding framework that converts binary spike trains into continuous outputs and propose the energy-to-error ratio (EER), a unified metric combining prediction error with energy consumption. Three CNN architectures (PilotNet, LaksNet, and MiniNet) and their corresponding SNN variants are trained and evaluated using delta, latency, and rate encoding across varied parameter settings, with energy use and emissions logged. Delta-encoded SNNs achieve the highest EER, substantial energy savings with minimal performance loss, whereas CNNs, despite slightly better MSE, incur 10–20 × higher energy costs. Rate encoding underperforms, and latency encoding, though improving relative error, demands excessive energy. Parameter tuning (threshold θ, temporal dynamics (S), membrane time constant (τ), and gain G) directly influences eco-efficiency. All experiments run on standard GPUs, showing SNNs can surpass CNNs in eco-efficiency without specialized hardware. Paired statistical tests confirm that only delta-encoded SNNs achieve significant EER improvements. This work presents a practical, energy-aware evaluation framework for neural architectures, establishing EER as a critical metric for sustainable machine learning in intelligent transport and beyond.

随着自动驾驶(AD)系统变得越来越复杂,其不断增长的计算需求带来了重大的能源和可持续性挑战。本文研究了尖峰神经网络(SNNs)作为卷积神经网络(cnn)的低功耗替代品,用于AD中的回归任务。我们引入了一个膜电位(Vmem)解码框架,将二进制尖峰串转换为连续输出,并提出了能量误差率(EER),这是一个结合预测误差和能量消耗的统一度量。三种CNN架构(PilotNet, LaksNet和MiniNet)及其相应的SNN变体使用delta,延迟和速率编码在不同参数设置下进行训练和评估,并记录能源使用和排放。delta编码snn实现了最高的EER,以最小的性能损失节省了大量的能源,而cnn尽管MSE稍好,但会产生10-20倍的能源成本。速率编码性能不佳,延迟编码虽然可以改善相对误差,但需要过多的能量。参数调整(阈值θ、时间动态(S)、膜时间常数(τ)和增益G)直接影响生态效率。所有实验都在标准gpu上运行,表明SNNs在没有专门硬件的情况下可以超越cnn的生态效率。配对统计检验证实,只有delta编码的snn实现了显著的EER改进。这项工作提出了一个实用的、能源意识的神经架构评估框架,将EER作为智能交通及其他领域可持续机器学习的关键指标。
{"title":"Energy-Aware Regression in Spiking Neural Networks for Autonomous Driving: A Comparative Study With Convolutional Networks","authors":"Fernando Sevilla Martínez,&nbsp;Jordi Casas-Roma,&nbsp;Laia Subirats,&nbsp;Raúl Parada","doi":"10.1155/int/4879993","DOIUrl":"https://doi.org/10.1155/int/4879993","url":null,"abstract":"<p>As autonomous driving (AD) systems grow more complex, their rising computational demands pose significant energy and sustainability challenges. This paper investigates spiking neural networks (SNNs) as low-power alternatives to convolutional neural networks (CNNs) for regression tasks in AD. We introduce a membrane-potential (<i>V</i><sub>mem</sub>) decoding framework that converts binary spike trains into continuous outputs and propose the energy-to-error ratio (EER), a unified metric combining prediction error with energy consumption. Three CNN architectures (PilotNet, LaksNet, and MiniNet) and their corresponding SNN variants are trained and evaluated using delta, latency, and rate encoding across varied parameter settings, with energy use and emissions logged. Delta-encoded SNNs achieve the highest EER, substantial energy savings with minimal performance loss, whereas CNNs, despite slightly better MSE, incur 10–20 × higher energy costs. Rate encoding underperforms, and latency encoding, though improving relative error, demands excessive energy. Parameter tuning (threshold <i>θ</i>, temporal dynamics (<i>S</i>), membrane time constant (<i>τ</i>), and gain <i>G</i>) directly influences eco-efficiency. All experiments run on standard GPUs, showing SNNs can surpass CNNs in eco-efficiency without specialized hardware. Paired statistical tests confirm that only delta-encoded SNNs achieve significant EER improvements. This work presents a practical, energy-aware evaluation framework for neural architectures, establishing EER as a critical metric for sustainable machine learning in intelligent transport and beyond.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/4879993","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145522267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge Distillation in Federated Learning: A Survey on Long Lasting Challenges and New Solutions 联邦学习中的知识提炼:长期挑战与新解决方案综述
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-13 DOI: 10.1155/int/7406934
Laiqiao Qin, Tianqing Zhu, Wanlei Zhou, Philip S. Yu

Federated learning (FL) is a distributed and privacy-preserving machine learning paradigm that coordinates multiple clients to train a model while keeping the raw data localized. However, this traditional FL poses some challenges, including privacy risks, data heterogeneity, communication bottlenecks, and system heterogeneity issues. To tackle these challenges, knowledge distillation (KD) has been widely applied in FL since 2020. KD is a validated and efficacious model compression and enhancement algorithm. The core concept of KD involves facilitating knowledge transfer between models by exchanging logits at intermediate or output layers. These properties make KD an excellent solution for the long-lasting challenges in FL. Up to now, there have been few reviews that summarize and analyze the current trend and methods for how KD can be applied in FL efficiently. This article aims to provide a comprehensive survey of KD-based FL, focusing on addressing the above challenges. First, we provide an overview of KD-based FL, including its motivation, basics, taxonomy, and a comparison with traditional FL and where KD should execute. We also analyze the critical factors in KD-based FL in the Appendix, including teachers, knowledge, data, and methods. We discuss how KD can address the challenges in FL, including privacy protection, data heterogeneity, communication efficiency, and personalization. Finally, we discuss the challenges facing KD-based FL algorithms and future research directions. We hope this survey can provide insights and guidance for researchers and practitioners in the FL area.

联邦学习(FL)是一种分布式和保护隐私的机器学习范例,它协调多个客户端来训练模型,同时保持原始数据的本地化。然而,这种传统的FL带来了一些挑战,包括隐私风险、数据异构、通信瓶颈和系统异构问题。为了应对这些挑战,自2020年以来,知识蒸馏(KD)在FL中得到了广泛应用。KD是一种经过验证的有效的模型压缩和增强算法。KD的核心概念包括通过在中间层或输出层交换逻辑来促进模型之间的知识转移。这些特性使KD成为FL长期挑战的极好解决方案。到目前为止,很少有综述总结和分析KD如何有效应用于FL的当前趋势和方法。本文旨在提供基于kd的FL的全面调查,重点是解决上述挑战。首先,我们概述了基于KD的FL,包括其动机,基础知识,分类法,以及与传统FL的比较,以及KD应该在哪里执行。我们还在附录中分析了基于kd的外语教学的关键因素,包括教师、知识、数据和方法。我们讨论了KD如何解决FL中的挑战,包括隐私保护、数据异构、通信效率和个性化。最后,我们讨论了基于kd的FL算法面临的挑战和未来的研究方向。我们希望这项调查能够为FL领域的研究人员和从业者提供见解和指导。
{"title":"Knowledge Distillation in Federated Learning: A Survey on Long Lasting Challenges and New Solutions","authors":"Laiqiao Qin,&nbsp;Tianqing Zhu,&nbsp;Wanlei Zhou,&nbsp;Philip S. Yu","doi":"10.1155/int/7406934","DOIUrl":"https://doi.org/10.1155/int/7406934","url":null,"abstract":"<p>Federated learning (FL) is a distributed and privacy-preserving machine learning paradigm that coordinates multiple clients to train a model while keeping the raw data localized. However, this traditional FL poses some challenges, including privacy risks, data heterogeneity, communication bottlenecks, and system heterogeneity issues. To tackle these challenges, knowledge distillation (KD) has been widely applied in FL since 2020. KD is a validated and efficacious model compression and enhancement algorithm. The core concept of KD involves facilitating knowledge transfer between models by exchanging logits at intermediate or output layers. These properties make KD an excellent solution for the long-lasting challenges in FL. Up to now, there have been few reviews that summarize and analyze the current trend and methods for how KD can be applied in FL efficiently. This article aims to provide a comprehensive survey of KD-based FL, focusing on addressing the above challenges. First, we provide an overview of KD-based FL, including its motivation, basics, taxonomy, and a comparison with traditional FL and where KD should execute. We also analyze the critical factors in KD-based FL in the Appendix, including teachers, knowledge, data, and methods. We discuss how KD can address the challenges in FL, including privacy protection, data heterogeneity, communication efficiency, and personalization. Finally, we discuss the challenges facing KD-based FL algorithms and future research directions. We hope this survey can provide insights and guidance for researchers and practitioners in the FL area.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/7406934","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Noncontact Fault Diagnosis of Electrical Equipment Using Modified Multiscale Two-Dimensional Color Distribution Entropy and Thermal Imaging 基于改进多尺度二维颜色分布熵和热成像的电气设备非接触故障诊断
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-12 DOI: 10.1155/int/4805844
Shun Wang, Yolanda Vidal, Francesc Pozo

Effective health monitoring of electrical equipment is critical for industrial reliability. Although infrared thermal imaging offers a powerful noncontact diagnostic method, accurately interpreting its complex and often noisy thermal patterns remains a significant challenge. Entropy-based analysis is well suited for quantifying this complexity, but its application to images has been limited. Existing two-dimensional entropy methods are not only less developed than their one-dimensional counterparts but also typically require converting thermal images to grayscale, which discards vital diagnostic information from color channels. To overcome these limitations, this study introduces the modified multiscale two-dimensional color distribution entropy (MMCDEn2D). This novel method directly integrates the attributes of the RGB, preserving a richer feature set for analysis. The effectiveness of the proposed method is demonstrated first through synthetic signals, showing low sensitivity to image size and high computational efficiency. The study further extends the application of entropy-based analysis to noncontact health monitoring scenarios, implementing MMCDEn2D for thermal image-based fault diagnosis of induction motors and power transformers. The method achieves a diagnostic accuracy that exceeds 95%, significantly outperforming traditional approaches. Crucially, it demonstrates superior robustness in challenging scenarios, improving accuracy by 2%–5% under high-noise conditions and with small sample sizes. These results establish MMCDEn2D as a highly effective and reliable tool to advance noncontact fault diagnosis in critical electrical equipment.

有效的电气设备健康监测对工业可靠性至关重要。尽管红外热成像提供了一种强大的非接触式诊断方法,但准确解释其复杂且经常嘈杂的热模式仍然是一个重大挑战。基于熵的分析非常适合量化这种复杂性,但它在图像上的应用受到限制。现有的二维熵方法不仅不如一维熵方法发达,而且通常需要将热图像转换为灰度,从而丢弃了来自颜色通道的重要诊断信息。为了克服这些局限性,本研究引入了改进的多尺度二维颜色分布熵(MMCDEn2D)。该方法直接集成了RGB的属性,为分析保留了更丰富的特征集。首先通过合成信号验证了该方法的有效性,该方法对图像大小的敏感性低,计算效率高。本研究将基于熵的分析方法进一步扩展到非接触健康监测场景,实现了基于MMCDEn2D的感应电机和电力变压器热图像故障诊断。该方法的诊断准确率超过95%,显著优于传统方法。至关重要的是,它在具有挑战性的场景中表现出了卓越的鲁棒性,在高噪声条件下和小样本量下,准确率提高了2%-5%。这些结果表明,MMCDEn2D是一种非常有效和可靠的工具,可以推进关键电气设备的非接触故障诊断。
{"title":"Noncontact Fault Diagnosis of Electrical Equipment Using Modified Multiscale Two-Dimensional Color Distribution Entropy and Thermal Imaging","authors":"Shun Wang,&nbsp;Yolanda Vidal,&nbsp;Francesc Pozo","doi":"10.1155/int/4805844","DOIUrl":"https://doi.org/10.1155/int/4805844","url":null,"abstract":"<p>Effective health monitoring of electrical equipment is critical for industrial reliability. Although infrared thermal imaging offers a powerful noncontact diagnostic method, accurately interpreting its complex and often noisy thermal patterns remains a significant challenge. Entropy-based analysis is well suited for quantifying this complexity, but its application to images has been limited. Existing two-dimensional entropy methods are not only less developed than their one-dimensional counterparts but also typically require converting thermal images to grayscale, which discards vital diagnostic information from color channels. To overcome these limitations, this study introduces the modified multiscale two-dimensional color distribution entropy (MMCDEn<sub>2D</sub>). This novel method directly integrates the attributes of the RGB, preserving a richer feature set for analysis. The effectiveness of the proposed method is demonstrated first through synthetic signals, showing low sensitivity to image size and high computational efficiency. The study further extends the application of entropy-based analysis to noncontact health monitoring scenarios, implementing MMCDEn<sub>2D</sub> for thermal image-based fault diagnosis of induction motors and power transformers. The method achieves a diagnostic accuracy that exceeds 95%, significantly outperforming traditional approaches. Crucially, it demonstrates superior robustness in challenging scenarios, improving accuracy by 2%–5% under high-noise conditions and with small sample sizes. These results establish MMCDEn<sub>2D</sub> as a highly effective and reliable tool to advance noncontact fault diagnosis in critical electrical equipment.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/4805844","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DiffG-MTL: A Dynamic Multidiffusion Graph Network for Multitask Traffic Accident Prediction 多任务交通事故预测的动态多扩散图网络
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-11 DOI: 10.1155/int/8995422
Nana Bu, Zongtao Duan, Wen Dang

Traffic accident prediction serves as a cornerstone of intelligent transportation systems, enabling proactive city-wide control strategies and public safety interventions. Effective models must capture the evolving spatiotemporal propagation of risk while addressing heterogeneous data distributions across urban regions. Current approaches face significant limitations: fixed graph topologies fail to represent nonstationary accident patterns, while uniform task weighting leads to optimization bias toward data-rich areas, ultimately constraining adaptability in adjacency construction and multihop spatial reasoning. To address these challenges, we propose a dynamic multidiffusion graph network with multitask learning (DiffG-MTL) for city-scale accident prediction. Specifically, a dynamic diffusion adjacency generation (DDAG) module constructs time-varying, diffusion-based adjacency matrices through multiple propagation pathways. A multiscale graph structure learning (MGSL) module captures multihop spatial relationships and temporal cues, while effectively highlighting anomalous traffic behaviors. To alleviate regional data imbalance, we introduce a dynamic multitask learning objective that adaptively redistributes learning focus using recall-aware weighting and task-level normalization. Comprehensive evaluations on six widely used datasets demonstrate that DiffG-MTL consistently outperforms state-of-the-art baselines across multiple evaluation metrics. Additional experiments validate its robustness and effectiveness in modeling complex spatiotemporal accident patterns.

交通事故预测是智能交通系统的基石,可以实现城市范围内的主动控制策略和公共安全干预。有效的模型必须捕捉风险的时空传播,同时处理城市区域间的异构数据分布。目前的方法面临着明显的局限性:固定的图拓扑不能表示非平稳的事故模式,而统一的任务加权导致优化偏向于数据丰富的区域,最终限制了邻接构建和多跳空间推理的适应性。为了解决这些挑战,我们提出了一个具有多任务学习的动态多扩散图网络(DiffG-MTL)用于城市规模的事故预测。具体来说,动态扩散邻接生成(DDAG)模块通过多种传播途径构建时变的、基于扩散的邻接矩阵。多尺度图结构学习(MGSL)模块捕获多跳空间关系和时间线索,同时有效地突出异常交通行为。为了缓解区域数据不平衡,我们引入了一个动态多任务学习目标,该目标使用回忆感知加权和任务级归一化自适应地重新分配学习焦点。对六个广泛使用的数据集的综合评估表明,DiffG-MTL在多个评估指标上始终优于最先进的基线。实验验证了该方法对复杂时空事故模式建模的鲁棒性和有效性。
{"title":"DiffG-MTL: A Dynamic Multidiffusion Graph Network for Multitask Traffic Accident Prediction","authors":"Nana Bu,&nbsp;Zongtao Duan,&nbsp;Wen Dang","doi":"10.1155/int/8995422","DOIUrl":"https://doi.org/10.1155/int/8995422","url":null,"abstract":"<p>Traffic accident prediction serves as a cornerstone of intelligent transportation systems, enabling proactive city-wide control strategies and public safety interventions. Effective models must capture the evolving spatiotemporal propagation of risk while addressing heterogeneous data distributions across urban regions. Current approaches face significant limitations: fixed graph topologies fail to represent nonstationary accident patterns, while uniform task weighting leads to optimization bias toward data-rich areas, ultimately constraining adaptability in adjacency construction and multihop spatial reasoning. To address these challenges, we propose a dynamic multidiffusion graph network with multitask learning (DiffG-MTL) for city-scale accident prediction. Specifically, a dynamic diffusion adjacency generation (DDAG) module constructs time-varying, diffusion-based adjacency matrices through multiple propagation pathways. A multiscale graph structure learning (MGSL) module captures multihop spatial relationships and temporal cues, while effectively highlighting anomalous traffic behaviors. To alleviate regional data imbalance, we introduce a dynamic multitask learning objective that adaptively redistributes learning focus using recall-aware weighting and task-level normalization. Comprehensive evaluations on six widely used datasets demonstrate that DiffG-MTL consistently outperforms state-of-the-art baselines across multiple evaluation metrics. Additional experiments validate its robustness and effectiveness in modeling complex spatiotemporal accident patterns.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/8995422","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Intelligent Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1