首页 > 最新文献

Artificial Intelligence Review最新文献

英文 中文
Emerging computational intelligence based techniques for lung cancer diagnosis and classification on chest CT scan images: a comprehensive survey 基于新兴计算智能的胸部CT扫描图像肺癌诊断和分类技术:一项综合调查
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10462-025-11374-9
Pankaj Kumari, Lavika Goel

Worldwide lung cancer is a significant reason for death resulting from cancer with early diagnosis crucial for enhancing patient results. This comprehensive survey looks at the most recent developments in methods for detecting lung cancer by using chest CT scan images. The study describes a broad variety of approaches includes methods for machine learning such random forests support vector machines logistic regression and k-nearest neighbors in addition to deep learning frameworks such as variational autoencoders recurrent neural networks convolutional neural networks and generative adversarial networks. Additionally the survey explores hybrid models that combine deep learning and machine learning with nature-inspired optimization techniques to enhance performance. All the techniques discussed in this paper mainly focus on the diagnosis of NSCLC i.e. non-small cell lung cancer as it is more prevalent. The paper also reviews multiple advanced techniques used in diagnosis of lung cancer, including 3D-CNN i.e. Convolutional Neural Networks, multimodal logistic regression models and Cyclic Variational Autoencoders. It highlights key publicly available datasets frequently used in this research area such as LIDC-IDRI (Lung Image Database Consortium and Image Database Resource Initiative), LUNA16 (Lung Nodule Analysis 2016), the Kaggle lung cancer dataset, NSCLC Radiogenomics and the NIH (National Institutes of Health) chest X-ray database. This survey provides a detailed comparison of each technique, describing their advantages, limitations, and reported performance metrics, especially in terms of classification accuracy. Transfer learning with Vision Transformer achieves the highest accuracy of 94.6%, while 3D Convolutional Neural Network (3D -CNN) achieves an accuracy of 93.7%, both of which are showcasing highest performance on applicable datasets. Furthermore, the research demonstrates the potential of emerging techniques like federated learning and explainable AI in addressing challenges pertaining to data privacy and model interpretability. This survey paper reviews several techniques and finds that deep learning is the most extensively researched area in lung cancer diagnosis. This approach is not only widely used but also exhibits notable success in identifying and categorizing lung cancer with a high degree of accuracy.

在世界范围内,肺癌是导致癌症死亡的一个重要原因,早期诊断对于提高患者的治疗效果至关重要。这项综合调查着眼于使用胸部CT扫描图像检测肺癌方法的最新进展。该研究描述了各种各样的方法,包括机器学习方法,如随机森林,支持向量机,逻辑回归和k近邻,以及深度学习框架,如变分自编码器,循环神经网络,卷积神经网络和生成对抗网络。此外,该调查还探讨了将深度学习和机器学习与自然优化技术相结合的混合模型,以提高性能。本文讨论的所有技术主要集中在非小细胞肺癌的诊断,即非小细胞肺癌,因为它更普遍。本文还回顾了用于肺癌诊断的多种先进技术,包括3D-CNN即卷积神经网络,多模态逻辑回归模型和循环变分自编码器。它突出了该研究领域经常使用的关键公开可用数据集,如LIDC-IDRI(肺图像数据库联盟和图像数据库资源倡议),LUNA16(肺结节分析2016),Kaggle肺癌数据集,NSCLC放射基因组学和NIH(美国国立卫生研究院)胸部x射线数据库。本调查提供了每种技术的详细比较,描述了它们的优点、局限性和报告的性能指标,特别是在分类准确性方面。使用Vision Transformer的迁移学习达到了94.6%的最高准确率,而3D卷积神经网络(3D -CNN)达到了93.7%的准确率,两者都在适用的数据集上展示了最高的性能。此外,该研究还展示了联邦学习和可解释人工智能等新兴技术在解决与数据隐私和模型可解释性相关的挑战方面的潜力。本文综述了几种技术,发现深度学习是肺癌诊断中研究最广泛的领域。这种方法不仅被广泛使用,而且在肺癌的识别和分类方面也取得了显著的成功,准确率很高。
{"title":"Emerging computational intelligence based techniques for lung cancer diagnosis and classification on chest CT scan images: a comprehensive survey","authors":"Pankaj Kumari,&nbsp;Lavika Goel","doi":"10.1007/s10462-025-11374-9","DOIUrl":"10.1007/s10462-025-11374-9","url":null,"abstract":"<div><p>Worldwide lung cancer is a significant reason for death resulting from cancer with early diagnosis crucial for enhancing patient results. This comprehensive survey looks at the most recent developments in methods for detecting lung cancer by using chest CT scan images. The study describes a broad variety of approaches includes methods for machine learning such random forests support vector machines logistic regression and k-nearest neighbors in addition to deep learning frameworks such as variational autoencoders recurrent neural networks convolutional neural networks and generative adversarial networks. Additionally the survey explores hybrid models that combine deep learning and machine learning with nature-inspired optimization techniques to enhance performance. All the techniques discussed in this paper mainly focus on the diagnosis of NSCLC i.e. non-small cell lung cancer as it is more prevalent. The paper also reviews multiple advanced techniques used in diagnosis of lung cancer, including 3D-CNN i.e. Convolutional Neural Networks, multimodal logistic regression models and Cyclic Variational Autoencoders. It highlights key publicly available datasets frequently used in this research area such as LIDC-IDRI (Lung Image Database Consortium and Image Database Resource Initiative), LUNA16 (Lung Nodule Analysis 2016), the Kaggle lung cancer dataset, NSCLC Radiogenomics and the NIH (National Institutes of Health) chest X-ray database. This survey provides a detailed comparison of each technique, describing their advantages, limitations, and reported performance metrics, especially in terms of classification accuracy. Transfer learning with Vision Transformer achieves the highest accuracy of 94.6%, while 3D Convolutional Neural Network (3D -CNN) achieves an accuracy of 93.7%, both of which are showcasing highest performance on applicable datasets. Furthermore, the research demonstrates the potential of emerging techniques like federated learning and explainable AI in addressing challenges pertaining to data privacy and model interpretability. This survey paper reviews several techniques and finds that deep learning is the most extensively researched area in lung cancer diagnosis. This approach is not only widely used but also exhibits notable success in identifying and categorizing lung cancer with a high degree of accuracy.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11374-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145909007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Farthest better or nearest worse optimizer: a novel metaheuristic algorithm 最优或最劣优化器:一种新的元启发式算法
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10462-025-11443-z
Ahmad Taheri, Keyvan RahimiZadeh, Jan Baumbach, Amin Beheshti, Olga Zolotareva, Mohammed Azmi Al-Betar, Seyedali Mirjalili, Amir H. Gandomi

A novel metaheuristic optimization algorithm, namely the Farthest better or Nearest worse Optimizer (FNO) algorithm, is proposed in this paper. The idea behind the FNO algorithm is derived from the qualities and distances between agents’ positions in a search space. The process of searching in the FNO includes two phases. During the first phase of the FNO, it jumps over the nearest regions with lower potential to avoid local optima. In the second phase, the algorithm tries to explore the farthest positions with higher potential to reach or explore the global optimum. These operations aim to enhance population diversity and provide the FNO with opportunities to discover high-quality regions while avoiding low-quality regions. A structural component within FNO, called Dynamic Focus Strategy (DFS), is also presented for controlling the exploration ratio. The DFS applies a random vector as a coefficient to shrink the area around the farthest better positions throughout the search process. Several experimental studies have been conducted on well-known benchmark suites, comprising 45 benchmarks, to assess the efficacy of the FNO algorithm. Additionally, five engineering problems were used to evaluate the practical applicability of the proposed FNO algorithm. The Wilcoxon test, as a well-known non-parametric statistical test, is conducted to fairly compare results. The findings indicate that the FNO algorithm performs competitively against other state-of-the-art population-based metaheuristic algorithms on the tested problems.

提出了一种新的元启发式优化算法,即最优或最劣优化算法(FNO)。FNO算法背后的思想来源于搜索空间中代理位置之间的质量和距离。FNO中的搜索过程包括两个阶段。在FNO的第一阶段,它跳过最近的低电位区域,以避免局部最优。在第二阶段,算法试图探索具有更高潜力的最远位置,以达到或探索全局最优。这些行动的目的是增加人口多样性,并为FNO提供机会,发现优质地区,同时避开劣质地区。本文还提出了FNO中的一个结构组件,即动态聚焦策略(DFS),用于控制探测比。在整个搜索过程中,DFS应用一个随机向量作为系数来缩小最远的最佳位置周围的面积。为了评估FNO算法的有效性,已经在知名的基准测试套件上进行了一些实验研究,其中包括45个基准测试。此外,用5个工程问题来评价所提出的FNO算法的实际适用性。Wilcoxon检验是一种众所周知的非参数统计检验,用于公平比较结果。研究结果表明,在测试问题上,FNO算法与其他最先进的基于种群的元启发式算法相比具有竞争力。
{"title":"Farthest better or nearest worse optimizer: a novel metaheuristic algorithm","authors":"Ahmad Taheri,&nbsp;Keyvan RahimiZadeh,&nbsp;Jan Baumbach,&nbsp;Amin Beheshti,&nbsp;Olga Zolotareva,&nbsp;Mohammed Azmi Al-Betar,&nbsp;Seyedali Mirjalili,&nbsp;Amir H. Gandomi","doi":"10.1007/s10462-025-11443-z","DOIUrl":"10.1007/s10462-025-11443-z","url":null,"abstract":"<div><p>A novel metaheuristic optimization algorithm, namely the Farthest better or Nearest worse Optimizer (FNO) algorithm, is proposed in this paper. The idea behind the FNO algorithm is derived from the qualities and distances between agents’ positions in a search space. The process of searching in the FNO includes two phases. During the first phase of the FNO, it jumps over the nearest regions with lower potential to avoid local optima. In the second phase, the algorithm tries to explore the farthest positions with higher potential to reach or explore the global optimum. These operations aim to enhance population diversity and provide the FNO with opportunities to discover high-quality regions while avoiding low-quality regions. A structural component within FNO, called Dynamic Focus Strategy (DFS), is also presented for controlling the exploration ratio. The DFS applies a random vector as a coefficient to shrink the area around the farthest better positions throughout the search process. Several experimental studies have been conducted on well-known benchmark suites, comprising 45 benchmarks, to assess the efficacy of the FNO algorithm. Additionally, five engineering problems were used to evaluate the practical applicability of the proposed FNO algorithm. The Wilcoxon test, as a well-known non-parametric statistical test, is conducted to fairly compare results. The findings indicate that the FNO algorithm performs competitively against other state-of-the-art population-based metaheuristic algorithms on the tested problems.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11443-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-agent generalized cooperative optimization scheduling for multi-energy complementarity in microgrids 微电网多能互补的多智能体广义协同优化调度
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10462-025-11354-z
Na Xu, Chaoxu Mu, Ke Wang, Liang Ma, Zhaoyang Liu

In this paper, we study a collaborative optimization scheduling approach for high-proportion renewable energy smart microgrids to achieve multi-energy management in a distributed execution framework with centralized training. First, we construct a multi-agent distributed microgrid optimization model for this optimization problem based on different types of renewable energy sources, energy storage, power exchange with the upper grid, and time-of-use electricity prices. Then, multiple long-term optimization objectives are designed to transform the cooperative optimization scheduling problem into a multi-agent multi-objective optimization problem, addressing the challenges of dynamic optimization. To enhance the correlation of policy sampling, we propose a novel multi-objective generalized normal distribution optimization (MGNDO) algorithm. By updating the covariance matrix, the policy correlations between different agents are better captured, resulting in more cooperative action sequences. Compared to traditional action sampling methods, this approach can better accommodate complex dynamic constraints and multi-objective requirements. Finally, a smart distribution network connected to three microgrids is taken as an example to realize the cooperative optimal scheduling problem by using the proposed algorithm, MADDPG algorithm and PSO algorithm, respectively. Operational cost and new energy consumption are compared separately to further illustrate the effectiveness of the proposed approach.

本文研究了高比例可再生能源智能微电网协同优化调度方法,以实现分布式执行框架下集中训练的多能源管理。首先,针对该优化问题,构建了基于不同类型可再生能源、储能、与上网交换电力和分时电价的多智能体分布式微电网优化模型。然后,设计多个长期优化目标,将协同优化调度问题转化为多智能体多目标优化问题,解决了动态优化的挑战;为了增强策略抽样的相关性,提出了一种新的多目标广义正态分布优化算法。通过更新协方差矩阵,可以更好地捕获不同agent之间的策略相关性,从而产生更具协作性的动作序列。与传统的动作采样方法相比,该方法能更好地适应复杂的动态约束和多目标要求。最后,以连接三个微电网的智能配电网为例,分别采用本文提出的算法、madpg算法和PSO算法实现协同最优调度问题。分别比较了运行成本和新能源消耗,进一步说明了所提方法的有效性。
{"title":"Multi-agent generalized cooperative optimization scheduling for multi-energy complementarity in microgrids","authors":"Na Xu,&nbsp;Chaoxu Mu,&nbsp;Ke Wang,&nbsp;Liang Ma,&nbsp;Zhaoyang Liu","doi":"10.1007/s10462-025-11354-z","DOIUrl":"10.1007/s10462-025-11354-z","url":null,"abstract":"<div><p>In this paper, we study a collaborative optimization scheduling approach for high-proportion renewable energy smart microgrids to achieve multi-energy management in a distributed execution framework with centralized training. First, we construct a multi-agent distributed microgrid optimization model for this optimization problem based on different types of renewable energy sources, energy storage, power exchange with the upper grid, and time-of-use electricity prices. Then, multiple long-term optimization objectives are designed to transform the cooperative optimization scheduling problem into a multi-agent multi-objective optimization problem, addressing the challenges of dynamic optimization. To enhance the correlation of policy sampling, we propose a novel multi-objective generalized normal distribution optimization (MGNDO) algorithm. By updating the covariance matrix, the policy correlations between different agents are better captured, resulting in more cooperative action sequences. Compared to traditional action sampling methods, this approach can better accommodate complex dynamic constraints and multi-objective requirements. Finally, a smart distribution network connected to three microgrids is taken as an example to realize the cooperative optimal scheduling problem by using the proposed algorithm, MADDPG algorithm and PSO algorithm, respectively. Operational cost and new energy consumption are compared separately to further illustrate the effectiveness of the proposed approach.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11354-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145909003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A survey on large language models driven meta-optimizers for automated intelligent optimization 面向自动化智能优化的大型语言模型驱动元优化器综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-06 DOI: 10.1007/s10462-025-11470-w
Yan Zheng, Lida Zhang, Kaiwen Li, Rui Wang, Wenhua Li, Tao Zhang, Qingfu Zhang, Yaochu Jin

This review systematically summarizes the research progress of Large Language Models (LLMs) as meta-optimizers in the field of automated intelligent optimization algorithm design, aiming to establish a unified framework for this emerging research direction. The study first defines the core paradigm and research scope of LLM-driven meta-optimization, explaining how it overcomes the limitations of traditional optimization algorithms in terms of parameter sensitivity and cross-domain generalization. It then systematically summarizes representative methodological frameworks and typical methods for key stages such as algorithm auto-generation, dynamic selection, parameter configuration, and mutation control using LLMs, revealing the potential for synergies between generative AI and gradient-free optimization heuristics. Additionally, this paper integrates relevant performance evaluation metrics and benchmarking problems, providing practical references for researchers. Finally, the paper discusses the main challenges in this field and outlines future research directions, emphasizing the core role of LLMs as meta-optimizers in driving paradigm shifts toward algorithm design automation and intelligence.

本文系统总结了大语言模型作为元优化器在自动化智能优化算法设计领域的研究进展,旨在为这一新兴的研究方向建立一个统一的框架。本研究首先定义了llm驱动元优化的核心范式和研究范围,解释了它如何克服传统优化算法在参数敏感性和跨域泛化方面的局限性。然后系统地总结了具有代表性的方法框架和关键阶段的典型方法,如算法自动生成、动态选择、参数配置和使用llm的突变控制,揭示了生成人工智能和无梯度优化启发式之间协同作用的潜力。此外,本文还整合了相关绩效评估指标和标杆问题,为研究人员提供了实践参考。最后,本文讨论了该领域的主要挑战,并概述了未来的研究方向,强调法学硕士作为元优化器在推动范式转向算法设计自动化和智能方面的核心作用。
{"title":"A survey on large language models driven meta-optimizers for automated intelligent optimization","authors":"Yan Zheng,&nbsp;Lida Zhang,&nbsp;Kaiwen Li,&nbsp;Rui Wang,&nbsp;Wenhua Li,&nbsp;Tao Zhang,&nbsp;Qingfu Zhang,&nbsp;Yaochu Jin","doi":"10.1007/s10462-025-11470-w","DOIUrl":"10.1007/s10462-025-11470-w","url":null,"abstract":"<div>\u0000 \u0000 <p>This review systematically summarizes the research progress of Large Language Models (LLMs) as meta-optimizers in the field of automated intelligent optimization algorithm design, aiming to establish a unified framework for this emerging research direction. The study first defines the core paradigm and research scope of LLM-driven meta-optimization, explaining how it overcomes the limitations of traditional optimization algorithms in terms of parameter sensitivity and cross-domain generalization. It then systematically summarizes representative methodological frameworks and typical methods for key stages such as algorithm auto-generation, dynamic selection, parameter configuration, and mutation control using LLMs, revealing the potential for synergies between generative AI and gradient-free optimization heuristics. Additionally, this paper integrates relevant performance evaluation metrics and benchmarking problems, providing practical references for researchers. Finally, the paper discusses the main challenges in this field and outlines future research directions, emphasizing the core role of LLMs as meta-optimizers in driving paradigm shifts toward algorithm design automation and intelligence.</p>\u0000 </div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11470-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From language to action: a review of large language models as autonomous agents and tool users 从语言到行动:作为自主代理和工具用户的大型语言模型的回顾
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-06 DOI: 10.1007/s10462-025-11471-9
Sadia Sultana Chowa, Riasad Alvi, Subhey Sadi Rahman, Md Abdur Rahman, Mohaimenul Azam Khan Raiaan, Md Rafiqul Islam, Mukhtar Hussain, Sami Azam

The pursuit of human-level artificial intelligence (AI) has significantly advanced the development of autonomous agents and Large Language Models (LLMs). LLMs are now widely utilized as decision-making agents for their ability to interpret instructions, manage sequential tasks, and adapt through feedback. This review examines recent developments in employing LLMs as autonomous agents and tool users and comprises seven research questions. We only used the papers published between 2023 and 2025 in conferences of the A* and A-ranked and Q1 journals. A structured analysis of the LLM agents’ architectural design principles, dividing their applications into single-agent and multi-agent systems, and strategies for integrating external tools is presented. In addition, the cognitive mechanisms of LLMs, including reasoning, planning, and memory, and the impact of prompting methods and fine-tuning procedures on agent performance are also investigated. Furthermore, we have evaluated current benchmarks and assessment protocols and provided an analysis of 68 publicly available datasets to assess the performance of LLM-based agents in various tasks. In conducting this review, we have identified critical findings on verifiable reasoning of LLMs, the capacity for self-improvement, and the personalization of LLM-based agents. Finally, we have discussed ten future research directions to overcome these gaps.

对人类水平人工智能(AI)的追求极大地推动了自主代理和大型语言模型(llm)的发展。法学硕士现在被广泛用作决策代理,因为他们有能力解释指令,管理连续的任务,并通过反馈进行适应。这篇综述考察了最近在使用法学硕士作为自主代理和工具用户方面的发展,包括七个研究问题。我们只使用了2023 - 2025年在A*、A级和Q1期刊会议上发表的论文。结构化地分析了LLM智能体的体系结构设计原则,将其应用划分为单智能体和多智能体系统,并提出了集成外部工具的策略。此外,还研究了llm的认知机制,包括推理、计划和记忆,以及提示方法和微调程序对代理性能的影响。此外,我们评估了当前的基准和评估协议,并提供了68个公开可用数据集的分析,以评估基于llm的代理在各种任务中的性能。在进行这项审查时,我们已经确定了法学硕士的可验证推理,自我完善的能力和基于法学硕士的代理人的个性化的关键发现。最后,我们讨论了十个未来的研究方向,以克服这些差距。
{"title":"From language to action: a review of large language models as autonomous agents and tool users","authors":"Sadia Sultana Chowa,&nbsp;Riasad Alvi,&nbsp;Subhey Sadi Rahman,&nbsp;Md Abdur Rahman,&nbsp;Mohaimenul Azam Khan Raiaan,&nbsp;Md Rafiqul Islam,&nbsp;Mukhtar Hussain,&nbsp;Sami Azam","doi":"10.1007/s10462-025-11471-9","DOIUrl":"10.1007/s10462-025-11471-9","url":null,"abstract":"<div>\u0000 \u0000 <p>The pursuit of human-level artificial intelligence (AI) has significantly advanced the development of autonomous agents and Large Language Models (LLMs). LLMs are now widely utilized as decision-making agents for their ability to interpret instructions, manage sequential tasks, and adapt through feedback. This review examines recent developments in employing LLMs as autonomous agents and tool users and comprises seven research questions. We only used the papers published between 2023 and 2025 in conferences of the A* and A-ranked and Q1 journals. A structured analysis of the LLM agents’ architectural design principles, dividing their applications into single-agent and multi-agent systems, and strategies for integrating external tools is presented. In addition, the cognitive mechanisms of LLMs, including reasoning, planning, and memory, and the impact of prompting methods and fine-tuning procedures on agent performance are also investigated. Furthermore, we have evaluated current benchmarks and assessment protocols and provided an analysis of 68 publicly available datasets to assess the performance of LLM-based agents in various tasks. In conducting this review, we have identified critical findings on verifiable reasoning of LLMs, the capacity for self-improvement, and the personalization of LLM-based agents. Finally, we have discussed ten future research directions to overcome these gaps.</p>\u0000 </div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11471-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Aigc-driven human-machine intelligence in ITS: technologies, applications, evaluation framework, challenges, and future directions 智能交通系统中由ai驱动的人机智能:技术、应用、评估框架、挑战和未来方向
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-05 DOI: 10.1007/s10462-025-11467-5
Doreen Sebastian Sarwatt, Frank Kulwa, Adamu Gaston Philipo, Angela-Aida Karugila Runyoro, Huansheng Ning, Jianguo Ding

This paper explores the integration of Artificial Intelligence Generated Content (AIGC), a rapidly evolving branch of generative AI, with Human-Machine intelligence (HMI) to enhance the functionality of Intelligent Transportation Systems (ITS). As transportation systems grow increasingly complex, adaptive decision-making becomes essential for interpreting vast streams of real-time data from vehicles, infrastructure, and users. AIGC plays a transformative role in optimizing traffic flow through dynamic routing and real-time traffic management, while human intelligence ensures these systems remain responsive to evolving real-world conditions. For safety, AIGC is used to simulate complex driving scenarios for autonomous vehicle training and detect traffic anomalies, with human oversight providing contextual decisions in ambiguous situations. For sustainability, AIGC supports data-driven strategies to reduce emissions and energy use, while human expertise ensures alignment with ethical and environmental goals. This synergy enhances real-time decision-making, improving both accuracy and adaptability across ITS scenarios. The paper presents a comprehensive review of core and supporting AIGC technologies and their applications across key ITS domains. Case studies and initiatives from industry leaders demonstrate practical implementations of AIGC-driven HMI collaboration. To guide future deployments, we propose a conceptual five-layer evaluation framework for assessing AIGC-HMI systems, encompassing functional performance, human interaction, explainability, ethical compliance, and robustness. We also address challenges such as legacy system integration, data privacy, model bias, and scalability. The paper concludes by outlining future research directions, emphasizing the need for scalable, interpretable, and ethically aligned AIGC models. This work contributes to the development of intelligent, adaptive, and trustworthy transportation systems.

本文探讨了人工智能生成内容(AIGC)与人机智能(HMI)的集成,这是生成式人工智能的一个快速发展的分支,以增强智能交通系统(ITS)的功能。随着交通系统变得越来越复杂,适应性决策对于解释来自车辆、基础设施和用户的大量实时数据流变得至关重要。AIGC通过动态路由和实时交通管理在优化交通流量方面发挥着变革性作用,而人工智能则确保这些系统保持对不断变化的现实世界条件的响应。为了安全起见,AIGC用于模拟自动驾驶车辆训练的复杂驾驶场景,并检测交通异常,在模糊情况下,人工监督提供上下文决策。在可持续发展方面,AIGC支持以数据为导向的战略,以减少排放和能源使用,而人类专业知识则确保与道德和环境目标保持一致。这种协同作用增强了实时决策,提高了ITS场景的准确性和适应性。本文介绍了核心和支持AIGC技术及其在关键ITS领域的应用。来自行业领导者的案例研究和计划演示了aigc驱动的HMI协作的实际实现。为了指导未来的部署,我们提出了一个概念性的五层评估框架,用于评估AIGC-HMI系统,包括功能性能、人机交互、可解释性、道德遵从性和鲁棒性。我们还解决了遗留系统集成、数据隐私、模型偏差和可扩展性等挑战。论文最后概述了未来的研究方向,强调需要可扩展、可解释和符合伦理的AIGC模型。这项工作有助于智能、适应性和可信赖的交通系统的发展。
{"title":"Aigc-driven human-machine intelligence in ITS: technologies, applications, evaluation framework, challenges, and future directions","authors":"Doreen Sebastian Sarwatt,&nbsp;Frank Kulwa,&nbsp;Adamu Gaston Philipo,&nbsp;Angela-Aida Karugila Runyoro,&nbsp;Huansheng Ning,&nbsp;Jianguo Ding","doi":"10.1007/s10462-025-11467-5","DOIUrl":"10.1007/s10462-025-11467-5","url":null,"abstract":"<div><p>This paper explores the integration of Artificial Intelligence Generated Content (AIGC), a rapidly evolving branch of generative AI, with Human-Machine intelligence (HMI) to enhance the functionality of Intelligent Transportation Systems (ITS). As transportation systems grow increasingly complex, adaptive decision-making becomes essential for interpreting vast streams of real-time data from vehicles, infrastructure, and users. AIGC plays a transformative role in optimizing traffic flow through dynamic routing and real-time traffic management, while human intelligence ensures these systems remain responsive to evolving real-world conditions. For safety, AIGC is used to simulate complex driving scenarios for autonomous vehicle training and detect traffic anomalies, with human oversight providing contextual decisions in ambiguous situations. For sustainability, AIGC supports data-driven strategies to reduce emissions and energy use, while human expertise ensures alignment with ethical and environmental goals. This synergy enhances real-time decision-making, improving both accuracy and adaptability across ITS scenarios. The paper presents a comprehensive review of core and supporting AIGC technologies and their applications across key ITS domains. Case studies and initiatives from industry leaders demonstrate practical implementations of AIGC-driven HMI collaboration. To guide future deployments, we propose a conceptual five-layer evaluation framework for assessing AIGC-HMI systems, encompassing functional performance, human interaction, explainability, ethical compliance, and robustness. We also address challenges such as legacy system integration, data privacy, model bias, and scalability. The paper concludes by outlining future research directions, emphasizing the need for scalable, interpretable, and ethically aligned AIGC models. This work contributes to the development of intelligent, adaptive, and trustworthy transportation systems.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11467-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Watermarking techniques for large language models: a survey 大型语言模型的水印技术综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-05 DOI: 10.1007/s10462-025-11474-6
Yuqing Liang, Jiancheng Xiao, Wensheng Gan, Philip S. Yu

With the rapid advancement and extensive application of artificial intelligence technology, large language models (LLMs) are extensively used to enhance production, creativity, learning, and work efficiency across various domains. However, the abuse of LLMs also poses potential harm to human society, such as intellectual property rights issues, academic misconduct, false content, and hallucinations. Relevant research has proposed the use of LLM watermarking to achieve IP protection for LLMs and traceability of multimedia data output by LLMs. To our knowledge, this is the first thorough review that investigates and analyzes LLM watermarking technology in detail. This review begins by recounting the history of traditional watermarking technology, then analyzes the current state of LLM watermarking research, and thoroughly examines the inheritance and relevance of these techniques. By analyzing their inheritance and relevance, this review can provide research with ideas for applying traditional digital watermarking techniques to LLM watermarking, to promote the cross-integration and innovation of watermarking technology. In addition, this review examines the pros and cons of LLM watermarking. Considering the current multimodal development trend of LLMs, it provides a detailed analysis of emerging multimodal LLM watermarking, such as visual and audio data, to offer more reference ideas for relevant research. This review delves into the challenges and future prospects of current watermarking technologies, offering valuable insights for future LLM watermarking research and applications.

随着人工智能技术的快速发展和广泛应用,大型语言模型(llm)被广泛用于提高各个领域的生产、创造、学习和工作效率。然而,法学硕士的滥用也给人类社会带来了潜在的危害,如知识产权问题、学术不端、虚假内容、幻觉等。相关研究提出利用LLM水印实现LLM的IP保护和LLM多媒体数据输出的可追溯性。据我们所知,这是第一次详细调查和分析LLM水印技术的全面审查。本文首先回顾了传统水印技术的发展历史,然后分析了LLM水印研究的现状,并对这些技术的继承性和相关性进行了深入的研究。通过分析二者的继承性和相关性,为将传统数字水印技术应用于LLM水印提供研究思路,促进水印技术的交叉融合和创新。此外,本文还分析了LLM水印技术的优缺点。考虑到当前LLM的多模态发展趋势,对新兴的多模态LLM水印,如视觉和音频数据进行了详细的分析,为相关研究提供更多的参考思路。本文综述了当前水印技术面临的挑战和未来的发展前景,为今后法学硕士水印技术的研究和应用提供了有价值的见解。
{"title":"Watermarking techniques for large language models: a survey","authors":"Yuqing Liang,&nbsp;Jiancheng Xiao,&nbsp;Wensheng Gan,&nbsp;Philip S. Yu","doi":"10.1007/s10462-025-11474-6","DOIUrl":"10.1007/s10462-025-11474-6","url":null,"abstract":"<div><p>With the rapid advancement and extensive application of artificial intelligence technology, large language models (LLMs) are extensively used to enhance production, creativity, learning, and work efficiency across various domains. However, the abuse of LLMs also poses potential harm to human society, such as intellectual property rights issues, academic misconduct, false content, and hallucinations. Relevant research has proposed the use of LLM watermarking to achieve IP protection for LLMs and traceability of multimedia data output by LLMs. To our knowledge, this is the first thorough review that investigates and analyzes LLM watermarking technology in detail. This review begins by recounting the history of traditional watermarking technology, then analyzes the current state of LLM watermarking research, and thoroughly examines the inheritance and relevance of these techniques. By analyzing their inheritance and relevance, this review can provide research with ideas for applying traditional digital watermarking techniques to LLM watermarking, to promote the cross-integration and innovation of watermarking technology. In addition, this review examines the pros and cons of LLM watermarking. Considering the current multimodal development trend of LLMs, it provides a detailed analysis of emerging multimodal LLM watermarking, such as visual and audio data, to offer more reference ideas for relevant research. This review delves into the challenges and future prospects of current watermarking technologies, offering valuable insights for future LLM watermarking research and applications.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11474-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A systematic review of metaheuristic based feature selection strategies for cyber-attack detection in the IIoT 基于元启发式特征选择策略的工业物联网网络攻击检测系统综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-05 DOI: 10.1007/s10462-025-11473-7
Georg Thamer Francis, Alireza Souri, Nihat Inanç

The Industrial Internet of Things (IIoT) has been spreading across all fields of applicable environments, one of the most crucial ones being the industrial environment. IIoT is the integration of the internet within various industrial fields such as smart manufacturing, supply chain optimization, and predictive maintenance. These applications require two key features to be efficient, that is, real-time processing and connection security and reliability. From this standpoint arise the studies of cyberattack detection in industrial environments. Many methodologies have approached this field using traditional or hybridized machine learning or deep learning algorithms. In this review paper, we explore 36 of the most recent cyber-attack detection systems using metaheuristic models, mainly metaheuristic feature selection (MFS) algorithms. Additionally, we also explore hybrid models of metaheuristics and machine learning or deep learning models that are used to increase the accuracy of the models on various benchmark datasets. Our SLR separates the MFS utilized in this field into four main types, including Swarm Intelligence (SI), Evolutionary Algorithms (EA), Physics-Based (PHY), and Human-Behavior-Inspired (HBI). Our findings showed that SI-MFS dominates the field, with 25/36 case studies proposing it, while EA was proposed in 3/36 and PHY and HBI were each proposed in 2/36. We also demonstrate the most effective methodologies, such as FS-ID, MFS-D, and Novel Hybrid MFS. We also outline potential open challenges and gaps that require resolution.

工业物联网(IIoT)已经扩展到所有适用环境的领域,其中最关键的是工业环境。工业物联网是互联网在智能制造、供应链优化和预测性维护等各个工业领域的集成。这些应用程序需要两个关键特性才能高效,即实时处理和连接的安全性和可靠性。从这个角度出发,出现了工业环境中网络攻击检测的研究。许多方法使用传统或混合机器学习或深度学习算法来接近该领域。在这篇综述文章中,我们使用元启发式模型,主要是元启发式特征选择(MFS)算法,探索了36个最新的网络攻击检测系统。此外,我们还探索了元启发式和机器学习或深度学习模型的混合模型,用于提高模型在各种基准数据集上的准确性。我们的SLR将该领域使用的MFS分为四种主要类型,包括群体智能(SI),进化算法(EA),基于物理(PHY)和人类行为启发(HBI)。我们的研究结果显示SI-MFS占据主导地位,有25/36的案例研究提出了SI-MFS,而EA在3/36中被提出,PHY和HBI分别在2/36中被提出。我们还展示了最有效的方法,如FS-ID, MFS- d和新型混合MFS。我们还概述了需要解决的潜在挑战和差距。
{"title":"A systematic review of metaheuristic based feature selection strategies for cyber-attack detection in the IIoT","authors":"Georg Thamer Francis,&nbsp;Alireza Souri,&nbsp;Nihat Inanç","doi":"10.1007/s10462-025-11473-7","DOIUrl":"10.1007/s10462-025-11473-7","url":null,"abstract":"<div><p>The Industrial Internet of Things (IIoT) has been spreading across all fields of applicable environments, one of the most crucial ones being the industrial environment. IIoT is the integration of the internet within various industrial fields such as smart manufacturing, supply chain optimization, and predictive maintenance. These applications require two key features to be efficient, that is, real-time processing and connection security and reliability. From this standpoint arise the studies of cyberattack detection in industrial environments. Many methodologies have approached this field using traditional or hybridized machine learning or deep learning algorithms. In this review paper, we explore 36 of the most recent cyber-attack detection systems using metaheuristic models, mainly metaheuristic feature selection (MFS) algorithms. Additionally, we also explore hybrid models of metaheuristics and machine learning or deep learning models that are used to increase the accuracy of the models on various benchmark datasets. Our SLR separates the MFS utilized in this field into four main types, including Swarm Intelligence (SI), Evolutionary Algorithms (EA), Physics-Based (PHY), and Human-Behavior-Inspired (HBI). Our findings showed that SI-MFS dominates the field, with 25/36 case studies proposing it, while EA was proposed in 3/36 and PHY and HBI were each proposed in 2/36. We also demonstrate the most effective methodologies, such as FS-ID, MFS-D, and Novel Hybrid MFS. We also outline potential open challenges and gaps that require resolution.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11473-7.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hallucination to truth: a review of fact-checking and factuality evaluation in large language models 对真相的幻觉:对大型语言模型中事实核查和事实评估的回顾
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-03 DOI: 10.1007/s10462-025-11454-w
Subhey Sadi Rahman, Md. Adnanul Islam, Md. Mahbub Alam, Musarrat Zeba, Md. Abdur Rahman, Sadia Sultana Chowa, Mohaimenul Azam Khan Raiaan, Sami Azam

Large language models (LLMs) are trained on vast and diverse internet corpora that often include inaccurate or misleading content. Consequently, LLMs can generate misinformation, making robust fact-checking essential. This review systematically analyzes how LLM-generated content is evaluated for factual accuracy by exploring key challenges such as hallucinations, dataset limitations, and the reliability of evaluation metrics. The review emphasizes the need for strong fact-checking frameworks that integrate advanced prompting strategies, domain-specific fine-tuning, and retrieval-augmented generation (RAG) methods. It proposes five research questions that guide the analysis of the recent literature from 2020 to 2025, focusing on evaluation methods and mitigation techniques. Instruction tuning, multi-agent reasoning, and RAG frameworks for external knowledge access are also reviewed. The key findings demonstrate the limitations of current metrics, the importance of validated external evidence, and the improvement of factual consistency through domain-specific customization. The review underscores the importance of building more accurate, understandable, and context-aware fact-checking. These insights contribute to the advancement of research toward more trustworthy models.

大型语言模型(llm)是在大量多样的互联网语料库上训练的,这些语料库通常包含不准确或误导性的内容。因此,法学硕士可能会产生错误信息,这使得强有力的事实核查变得至关重要。这篇综述系统地分析了法学硕士生成的内容是如何通过探索关键挑战(如幻觉、数据集限制和评估指标的可靠性)来评估事实准确性的。该综述强调需要强大的事实检查框架,该框架集成了先进的提示策略、特定领域的微调和检索增强生成(RAG)方法。它提出了五个研究问题,以指导对2020年至2025年近期文献的分析,重点是评估方法和缓解技术。指令调优、多智能体推理和外部知识访问的RAG框架也进行了回顾。关键的发现表明了当前度量的局限性、经过验证的外部证据的重要性,以及通过特定于领域的定制对事实一致性的改进。审查强调了建立更准确、可理解和上下文敏感的事实核查的重要性。这些见解有助于研究朝着更可靠的模型发展。
{"title":"Hallucination to truth: a review of fact-checking and factuality evaluation in large language models","authors":"Subhey Sadi Rahman,&nbsp;Md. Adnanul Islam,&nbsp;Md. Mahbub Alam,&nbsp;Musarrat Zeba,&nbsp;Md. Abdur Rahman,&nbsp;Sadia Sultana Chowa,&nbsp;Mohaimenul Azam Khan Raiaan,&nbsp;Sami Azam","doi":"10.1007/s10462-025-11454-w","DOIUrl":"10.1007/s10462-025-11454-w","url":null,"abstract":"<div><p>Large language models (LLMs) are trained on vast and diverse internet corpora that often include inaccurate or misleading content. Consequently, LLMs can generate misinformation, making robust fact-checking essential. This review systematically analyzes how LLM-generated content is evaluated for factual accuracy by exploring key challenges such as hallucinations, dataset limitations, and the reliability of evaluation metrics. The review emphasizes the need for strong fact-checking frameworks that integrate advanced prompting strategies, domain-specific fine-tuning, and retrieval-augmented generation (RAG) methods. It proposes five research questions that guide the analysis of the recent literature from 2020 to 2025, focusing on evaluation methods and mitigation techniques. Instruction tuning, multi-agent reasoning, and RAG frameworks for external knowledge access are also reviewed. The key findings demonstrate the limitations of current metrics, the importance of validated external evidence, and the improvement of factual consistency through domain-specific customization. The review underscores the importance of building more accurate, understandable, and context-aware fact-checking. These insights contribute to the advancement of research toward more trustworthy models.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11454-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review of artificial intelligence techniques for anomaly detection in smart grid 智能电网异常检测的人工智能技术综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-03 DOI: 10.1007/s10462-025-11429-x
Md Al Amin Sarker, Irrai Anbu Jayaraj, Bharanidharan Shanmugam, Sami Azam, Suresh Thennadil

In the era of smart grids (SGs), as more interconnected energy sources and renewable sources are used, it is becoming increasingly important to have robust and accurate advanced anomaly detection methods. Due to the complexity of modern power systems, anomalies need to be detected more efficiently. This study provides a comprehensive overview of integrating renewable energy sources into SGs and the increasing importance of robust anomaly detection methods in ensuring grid security and reliability. Addressing four key research areas, we explore the current trends in applying machine learning techniques to SG anomaly detection research, identifying anomalies such as electricity theft, cyber-attacks, power system disturbances, and abnormal consumption patterns. We systematically evaluate the utilization of different machine learning models, including supervised, unsupervised, semi-supervised, and reinforcement learning, to detect each anomaly within SG environments. Furthermore, we assess the effectiveness of the anomaly detection algorithms and discuss the potential for further research, emphasizing the need for multidisciplinary collaboration and continuous development to overcome challenges and adapt to evolving grid dynamics and cyber threats. The findings of this study suggest that machine learning significantly contributes to ensuring the resilience and efficiency of SGs in the face of evolving challenges.

在智能电网(SGs)时代,随着越来越多的互联能源和可再生能源的使用,拥有强大而准确的先进异常检测方法变得越来越重要。由于现代电力系统的复杂性,需要更有效地检测异常。本研究全面概述了将可再生能源整合到SGs中,以及鲁棒异常检测方法在确保电网安全和可靠性方面日益重要的意义。针对四个关键研究领域,我们探讨了将机器学习技术应用于SG异常检测研究的当前趋势,识别诸如电力盗窃、网络攻击、电力系统干扰和异常消费模式等异常。我们系统地评估了不同机器学习模型的使用情况,包括监督、无监督、半监督和强化学习,以检测SG环境中的每种异常。此外,我们评估了异常检测算法的有效性,并讨论了进一步研究的潜力,强调需要多学科合作和持续发展,以克服挑战并适应不断变化的网格动态和网络威胁。本研究的结果表明,机器学习对确保SGs在面对不断变化的挑战时的弹性和效率有重要贡献。
{"title":"A review of artificial intelligence techniques for anomaly detection in smart grid","authors":"Md Al Amin Sarker,&nbsp;Irrai Anbu Jayaraj,&nbsp;Bharanidharan Shanmugam,&nbsp;Sami Azam,&nbsp;Suresh Thennadil","doi":"10.1007/s10462-025-11429-x","DOIUrl":"10.1007/s10462-025-11429-x","url":null,"abstract":"<div><p>In the era of smart grids (SGs), as more interconnected energy sources and renewable sources are used, it is becoming increasingly important to have robust and accurate advanced anomaly detection methods. Due to the complexity of modern power systems, anomalies need to be detected more efficiently. This study provides a comprehensive overview of integrating renewable energy sources into SGs and the increasing importance of robust anomaly detection methods in ensuring grid security and reliability. Addressing four key research areas, we explore the current trends in applying machine learning techniques to SG anomaly detection research, identifying anomalies such as electricity theft, cyber-attacks, power system disturbances, and abnormal consumption patterns. We systematically evaluate the utilization of different machine learning models, including supervised, unsupervised, semi-supervised, and reinforcement learning, to detect each anomaly within SG environments. Furthermore, we assess the effectiveness of the anomaly detection algorithms and discuss the potential for further research, emphasizing the need for multidisciplinary collaboration and continuous development to overcome challenges and adapt to evolving grid dynamics and cyber threats. The findings of this study suggest that machine learning significantly contributes to ensuring the resilience and efficiency of SGs in the face of evolving challenges.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11429-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence Review
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1