首页 > 最新文献

Artificial Intelligence Review最新文献

英文 中文
A systematic review of metaheuristic based feature selection strategies for cyber-attack detection in the IIoT 基于元启发式特征选择策略的工业物联网网络攻击检测系统综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-05 DOI: 10.1007/s10462-025-11473-7
Georg Thamer Francis, Alireza Souri, Nihat Inanç

The Industrial Internet of Things (IIoT) has been spreading across all fields of applicable environments, one of the most crucial ones being the industrial environment. IIoT is the integration of the internet within various industrial fields such as smart manufacturing, supply chain optimization, and predictive maintenance. These applications require two key features to be efficient, that is, real-time processing and connection security and reliability. From this standpoint arise the studies of cyberattack detection in industrial environments. Many methodologies have approached this field using traditional or hybridized machine learning or deep learning algorithms. In this review paper, we explore 36 of the most recent cyber-attack detection systems using metaheuristic models, mainly metaheuristic feature selection (MFS) algorithms. Additionally, we also explore hybrid models of metaheuristics and machine learning or deep learning models that are used to increase the accuracy of the models on various benchmark datasets. Our SLR separates the MFS utilized in this field into four main types, including Swarm Intelligence (SI), Evolutionary Algorithms (EA), Physics-Based (PHY), and Human-Behavior-Inspired (HBI). Our findings showed that SI-MFS dominates the field, with 25/36 case studies proposing it, while EA was proposed in 3/36 and PHY and HBI were each proposed in 2/36. We also demonstrate the most effective methodologies, such as FS-ID, MFS-D, and Novel Hybrid MFS. We also outline potential open challenges and gaps that require resolution.

工业物联网(IIoT)已经扩展到所有适用环境的领域,其中最关键的是工业环境。工业物联网是互联网在智能制造、供应链优化和预测性维护等各个工业领域的集成。这些应用程序需要两个关键特性才能高效,即实时处理和连接的安全性和可靠性。从这个角度出发,出现了工业环境中网络攻击检测的研究。许多方法使用传统或混合机器学习或深度学习算法来接近该领域。在这篇综述文章中,我们使用元启发式模型,主要是元启发式特征选择(MFS)算法,探索了36个最新的网络攻击检测系统。此外,我们还探索了元启发式和机器学习或深度学习模型的混合模型,用于提高模型在各种基准数据集上的准确性。我们的SLR将该领域使用的MFS分为四种主要类型,包括群体智能(SI),进化算法(EA),基于物理(PHY)和人类行为启发(HBI)。我们的研究结果显示SI-MFS占据主导地位,有25/36的案例研究提出了SI-MFS,而EA在3/36中被提出,PHY和HBI分别在2/36中被提出。我们还展示了最有效的方法,如FS-ID, MFS- d和新型混合MFS。我们还概述了需要解决的潜在挑战和差距。
{"title":"A systematic review of metaheuristic based feature selection strategies for cyber-attack detection in the IIoT","authors":"Georg Thamer Francis,&nbsp;Alireza Souri,&nbsp;Nihat Inanç","doi":"10.1007/s10462-025-11473-7","DOIUrl":"10.1007/s10462-025-11473-7","url":null,"abstract":"<div><p>The Industrial Internet of Things (IIoT) has been spreading across all fields of applicable environments, one of the most crucial ones being the industrial environment. IIoT is the integration of the internet within various industrial fields such as smart manufacturing, supply chain optimization, and predictive maintenance. These applications require two key features to be efficient, that is, real-time processing and connection security and reliability. From this standpoint arise the studies of cyberattack detection in industrial environments. Many methodologies have approached this field using traditional or hybridized machine learning or deep learning algorithms. In this review paper, we explore 36 of the most recent cyber-attack detection systems using metaheuristic models, mainly metaheuristic feature selection (MFS) algorithms. Additionally, we also explore hybrid models of metaheuristics and machine learning or deep learning models that are used to increase the accuracy of the models on various benchmark datasets. Our SLR separates the MFS utilized in this field into four main types, including Swarm Intelligence (SI), Evolutionary Algorithms (EA), Physics-Based (PHY), and Human-Behavior-Inspired (HBI). Our findings showed that SI-MFS dominates the field, with 25/36 case studies proposing it, while EA was proposed in 3/36 and PHY and HBI were each proposed in 2/36. We also demonstrate the most effective methodologies, such as FS-ID, MFS-D, and Novel Hybrid MFS. We also outline potential open challenges and gaps that require resolution.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11473-7.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hallucination to truth: a review of fact-checking and factuality evaluation in large language models 对真相的幻觉:对大型语言模型中事实核查和事实评估的回顾
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-03 DOI: 10.1007/s10462-025-11454-w
Subhey Sadi Rahman, Md. Adnanul Islam, Md. Mahbub Alam, Musarrat Zeba, Md. Abdur Rahman, Sadia Sultana Chowa, Mohaimenul Azam Khan Raiaan, Sami Azam

Large language models (LLMs) are trained on vast and diverse internet corpora that often include inaccurate or misleading content. Consequently, LLMs can generate misinformation, making robust fact-checking essential. This review systematically analyzes how LLM-generated content is evaluated for factual accuracy by exploring key challenges such as hallucinations, dataset limitations, and the reliability of evaluation metrics. The review emphasizes the need for strong fact-checking frameworks that integrate advanced prompting strategies, domain-specific fine-tuning, and retrieval-augmented generation (RAG) methods. It proposes five research questions that guide the analysis of the recent literature from 2020 to 2025, focusing on evaluation methods and mitigation techniques. Instruction tuning, multi-agent reasoning, and RAG frameworks for external knowledge access are also reviewed. The key findings demonstrate the limitations of current metrics, the importance of validated external evidence, and the improvement of factual consistency through domain-specific customization. The review underscores the importance of building more accurate, understandable, and context-aware fact-checking. These insights contribute to the advancement of research toward more trustworthy models.

大型语言模型(llm)是在大量多样的互联网语料库上训练的,这些语料库通常包含不准确或误导性的内容。因此,法学硕士可能会产生错误信息,这使得强有力的事实核查变得至关重要。这篇综述系统地分析了法学硕士生成的内容是如何通过探索关键挑战(如幻觉、数据集限制和评估指标的可靠性)来评估事实准确性的。该综述强调需要强大的事实检查框架,该框架集成了先进的提示策略、特定领域的微调和检索增强生成(RAG)方法。它提出了五个研究问题,以指导对2020年至2025年近期文献的分析,重点是评估方法和缓解技术。指令调优、多智能体推理和外部知识访问的RAG框架也进行了回顾。关键的发现表明了当前度量的局限性、经过验证的外部证据的重要性,以及通过特定于领域的定制对事实一致性的改进。审查强调了建立更准确、可理解和上下文敏感的事实核查的重要性。这些见解有助于研究朝着更可靠的模型发展。
{"title":"Hallucination to truth: a review of fact-checking and factuality evaluation in large language models","authors":"Subhey Sadi Rahman,&nbsp;Md. Adnanul Islam,&nbsp;Md. Mahbub Alam,&nbsp;Musarrat Zeba,&nbsp;Md. Abdur Rahman,&nbsp;Sadia Sultana Chowa,&nbsp;Mohaimenul Azam Khan Raiaan,&nbsp;Sami Azam","doi":"10.1007/s10462-025-11454-w","DOIUrl":"10.1007/s10462-025-11454-w","url":null,"abstract":"<div><p>Large language models (LLMs) are trained on vast and diverse internet corpora that often include inaccurate or misleading content. Consequently, LLMs can generate misinformation, making robust fact-checking essential. This review systematically analyzes how LLM-generated content is evaluated for factual accuracy by exploring key challenges such as hallucinations, dataset limitations, and the reliability of evaluation metrics. The review emphasizes the need for strong fact-checking frameworks that integrate advanced prompting strategies, domain-specific fine-tuning, and retrieval-augmented generation (RAG) methods. It proposes five research questions that guide the analysis of the recent literature from 2020 to 2025, focusing on evaluation methods and mitigation techniques. Instruction tuning, multi-agent reasoning, and RAG frameworks for external knowledge access are also reviewed. The key findings demonstrate the limitations of current metrics, the importance of validated external evidence, and the improvement of factual consistency through domain-specific customization. The review underscores the importance of building more accurate, understandable, and context-aware fact-checking. These insights contribute to the advancement of research toward more trustworthy models.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11454-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review of artificial intelligence techniques for anomaly detection in smart grid 智能电网异常检测的人工智能技术综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-03 DOI: 10.1007/s10462-025-11429-x
Md Al Amin Sarker, Irrai Anbu Jayaraj, Bharanidharan Shanmugam, Sami Azam, Suresh Thennadil

In the era of smart grids (SGs), as more interconnected energy sources and renewable sources are used, it is becoming increasingly important to have robust and accurate advanced anomaly detection methods. Due to the complexity of modern power systems, anomalies need to be detected more efficiently. This study provides a comprehensive overview of integrating renewable energy sources into SGs and the increasing importance of robust anomaly detection methods in ensuring grid security and reliability. Addressing four key research areas, we explore the current trends in applying machine learning techniques to SG anomaly detection research, identifying anomalies such as electricity theft, cyber-attacks, power system disturbances, and abnormal consumption patterns. We systematically evaluate the utilization of different machine learning models, including supervised, unsupervised, semi-supervised, and reinforcement learning, to detect each anomaly within SG environments. Furthermore, we assess the effectiveness of the anomaly detection algorithms and discuss the potential for further research, emphasizing the need for multidisciplinary collaboration and continuous development to overcome challenges and adapt to evolving grid dynamics and cyber threats. The findings of this study suggest that machine learning significantly contributes to ensuring the resilience and efficiency of SGs in the face of evolving challenges.

在智能电网(SGs)时代,随着越来越多的互联能源和可再生能源的使用,拥有强大而准确的先进异常检测方法变得越来越重要。由于现代电力系统的复杂性,需要更有效地检测异常。本研究全面概述了将可再生能源整合到SGs中,以及鲁棒异常检测方法在确保电网安全和可靠性方面日益重要的意义。针对四个关键研究领域,我们探讨了将机器学习技术应用于SG异常检测研究的当前趋势,识别诸如电力盗窃、网络攻击、电力系统干扰和异常消费模式等异常。我们系统地评估了不同机器学习模型的使用情况,包括监督、无监督、半监督和强化学习,以检测SG环境中的每种异常。此外,我们评估了异常检测算法的有效性,并讨论了进一步研究的潜力,强调需要多学科合作和持续发展,以克服挑战并适应不断变化的网格动态和网络威胁。本研究的结果表明,机器学习对确保SGs在面对不断变化的挑战时的弹性和效率有重要贡献。
{"title":"A review of artificial intelligence techniques for anomaly detection in smart grid","authors":"Md Al Amin Sarker,&nbsp;Irrai Anbu Jayaraj,&nbsp;Bharanidharan Shanmugam,&nbsp;Sami Azam,&nbsp;Suresh Thennadil","doi":"10.1007/s10462-025-11429-x","DOIUrl":"10.1007/s10462-025-11429-x","url":null,"abstract":"<div><p>In the era of smart grids (SGs), as more interconnected energy sources and renewable sources are used, it is becoming increasingly important to have robust and accurate advanced anomaly detection methods. Due to the complexity of modern power systems, anomalies need to be detected more efficiently. This study provides a comprehensive overview of integrating renewable energy sources into SGs and the increasing importance of robust anomaly detection methods in ensuring grid security and reliability. Addressing four key research areas, we explore the current trends in applying machine learning techniques to SG anomaly detection research, identifying anomalies such as electricity theft, cyber-attacks, power system disturbances, and abnormal consumption patterns. We systematically evaluate the utilization of different machine learning models, including supervised, unsupervised, semi-supervised, and reinforcement learning, to detect each anomaly within SG environments. Furthermore, we assess the effectiveness of the anomaly detection algorithms and discuss the potential for further research, emphasizing the need for multidisciplinary collaboration and continuous development to overcome challenges and adapt to evolving grid dynamics and cyber threats. The findings of this study suggest that machine learning significantly contributes to ensuring the resilience and efficiency of SGs in the face of evolving challenges.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11429-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial neural networks fighting real neural decline: a systematic review of AI in Alzheimer's research. 人工神经网络对抗真正的神经衰退:阿尔茨海默病研究中的人工智能系统综述。
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-01 Epub Date: 2026-02-02 DOI: 10.1007/s10462-025-11484-4
Farzana Sharmin Mou, Tanvir Ahmed, Md Nazmul Huda, Asoke K Nandi

Alzheimer's disease (AD) is a major global health challenge, with Artificial Intelligence (AI) increasingly recognized as a transformative tool for early detection, disease progression modeling, and therapeutic discovery. This systematic review, conducted in accordance with PRISMA guidelines, analyzed 156 peer-reviewed studies published between 2010 and 2024, identified from four major databases (Scopus, PubMed, Web of Science, IEEE Xplore). A particular emphasis was placed on multimodal approaches that integrate neuroimaging, genetics, biomarkers, and clinical data to improve accuracy and translational value. To organize this fragmented field, we introduce a novel Layered Framework that categorizes AI applications into four domains: Early Detection, Disease Progression Modeling, Therapeutic Discovery, and Real-World Integration. In addition, we applied ARIMA-based forecasting to project research trajectories through 2030, which revealed generative models and transformer architectures as the fastest-growing and most promising methodologies. The review highlights substantial advances in early detection and multimodal fusion, particularly through deep learning, while also identifying persistent challenges such as limited model generalizability, ethical concerns, and underexplored clinical implementation. Addressing these barriers will require multi-cohort validation, interpretable AI, and equity-driven model development. By consolidating evidence and forecasting future directions, this review provides a roadmap for accelerating precision-driven innovations in Alzheimer's care.

阿尔茨海默病(AD)是一项重大的全球健康挑战,人工智能(AI)越来越被认为是早期发现、疾病进展建模和治疗发现的变革性工具。根据PRISMA指南进行的系统评价,分析了2010年至2024年间发表的156项同行评议研究,这些研究来自四个主要数据库(Scopus, PubMed, Web of Science, IEEE Xplore)。特别强调的是整合神经影像学、遗传学、生物标志物和临床数据的多模式方法,以提高准确性和转化价值。为了组织这个碎片化的领域,我们引入了一个新的分层框架,将人工智能应用分为四个领域:早期检测、疾病进展建模、治疗发现和现实世界整合。此外,我们将基于arima的预测应用于到2030年的项目研究轨迹,结果显示生成模型和变压器架构是增长最快、最有前途的方法。该综述强调了早期检测和多模态融合方面的实质性进展,特别是通过深度学习,同时也指出了持续存在的挑战,如有限的模型泛化性、伦理问题和未充分探索的临床实施。解决这些障碍需要多队列验证、可解释的人工智能和公平驱动的模型开发。通过巩固证据和预测未来方向,本综述为加速阿尔茨海默病治疗的精确驱动创新提供了路线图。
{"title":"Artificial neural networks fighting real neural decline: a systematic review of AI in Alzheimer's research.","authors":"Farzana Sharmin Mou, Tanvir Ahmed, Md Nazmul Huda, Asoke K Nandi","doi":"10.1007/s10462-025-11484-4","DOIUrl":"https://doi.org/10.1007/s10462-025-11484-4","url":null,"abstract":"<p><p>Alzheimer's disease (AD) is a major global health challenge, with Artificial Intelligence (AI) increasingly recognized as a transformative tool for early detection, disease progression modeling, and therapeutic discovery. This systematic review, conducted in accordance with PRISMA guidelines, analyzed 156 peer-reviewed studies published between 2010 and 2024, identified from four major databases (Scopus, PubMed, Web of Science, IEEE Xplore). A particular emphasis was placed on multimodal approaches that integrate neuroimaging, genetics, biomarkers, and clinical data to improve accuracy and translational value. To organize this fragmented field, we introduce a novel Layered Framework that categorizes AI applications into four domains: Early Detection, Disease Progression Modeling, Therapeutic Discovery, and Real-World Integration. In addition, we applied ARIMA-based forecasting to project research trajectories through 2030, which revealed generative models and transformer architectures as the fastest-growing and most promising methodologies. The review highlights substantial advances in early detection and multimodal fusion, particularly through deep learning, while also identifying persistent challenges such as limited model generalizability, ethical concerns, and underexplored clinical implementation. Addressing these barriers will require multi-cohort validation, interpretable AI, and equity-driven model development. By consolidating evidence and forecasting future directions, this review provides a roadmap for accelerating precision-driven innovations in Alzheimer's care.</p>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 4","pages":"124"},"PeriodicalIF":13.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12999701/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147497559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An optimized hybrid framework for car theft detection: comparative insights from deep transfer learning and feature-based machine learning 汽车盗窃检测的优化混合框架:深度迁移学习和基于特征的机器学习的比较见解
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-31 DOI: 10.1007/s10462-025-11480-8
Yashar Jebraeily, Yousef Sharafi, Mohammad Teshnehlab, Nastaran Ahmadi Ramezanloo

Car theft has become a significant issue in modern societies, with far-reaching individual and social consequences. This criminal act causes substantial financial losses for vehicle owners, undermines public trust in security systems, and increases social and governmental costs. Therefore, research on developing innovative and efficient methods for detecting and preventing car theft holds particular importance. In this study, advanced methods for detecting car theft have been evaluated and compared through two main approaches: deep learning and machine learning. First, pre-trained deep neural networks were examined. In the second phase, various image features were extracted using feature extraction methods, such as Edge Direction Histogram (EDH), Edge Oriented Histogram (EOH), and Histogram Oriented Gradient (HOG), followed by the assessment of machine learning approaches. Finally, a hybrid model based on Hybrid Edge and Gradient-Based Features (HFEM) combined with an XGBoost classifier was proposed, achieving an accuracy of 98.6% in predicting car theft.

汽车盗窃已经成为现代社会的一个重要问题,对个人和社会都有深远的影响。这种犯罪行为给车主造成了巨大的经济损失,破坏了公众对安全系统的信任,并增加了社会和政府的成本。因此,研究开发创新和有效的方法来检测和防止汽车盗窃具有特别重要的意义。在本研究中,通过深度学习和机器学习两种主要方法,对检测汽车盗窃的先进方法进行了评估和比较。首先,检查预训练的深度神经网络。在第二阶段,使用边缘方向直方图(EDH)、边缘定向直方图(EOH)和直方图定向梯度(HOG)等特征提取方法提取各种图像特征,然后对机器学习方法进行评估。最后,提出了基于混合边缘和梯度特征(hybrid Edge and Gradient-Based Features, HFEM)与XGBoost分类器相结合的混合模型,预测汽车盗窃的准确率达到98.6%。
{"title":"An optimized hybrid framework for car theft detection: comparative insights from deep transfer learning and feature-based machine learning","authors":"Yashar Jebraeily,&nbsp;Yousef Sharafi,&nbsp;Mohammad Teshnehlab,&nbsp;Nastaran Ahmadi Ramezanloo","doi":"10.1007/s10462-025-11480-8","DOIUrl":"10.1007/s10462-025-11480-8","url":null,"abstract":"<div><p>Car theft has become a significant issue in modern societies, with far-reaching individual and social consequences. This criminal act causes substantial financial losses for vehicle owners, undermines public trust in security systems, and increases social and governmental costs. Therefore, research on developing innovative and efficient methods for detecting and preventing car theft holds particular importance. In this study, advanced methods for detecting car theft have been evaluated and compared through two main approaches: deep learning and machine learning. First, pre-trained deep neural networks were examined. In the second phase, various image features were extracted using feature extraction methods, such as Edge Direction Histogram (EDH), Edge Oriented Histogram (EOH), and Histogram Oriented Gradient (HOG), followed by the assessment of machine learning approaches. Finally, a hybrid model based on Hybrid Edge and Gradient-Based Features (HFEM) combined with an XGBoost classifier was proposed, achieving an accuracy of 98.6% in predicting car theft.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11480-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Emotion-aware adaptation of CLIP model for facial expression recognition 基于情绪感知的CLIP模型面部表情识别
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-28 DOI: 10.1007/s10462-025-11468-4
Jing Huan, Mingxing Li, Haoliang Zhou

Facial expression recognition (FER) remains a challenging task due to subtle variations in facial details and unconstrained conditions such as changes in head posture, illumination, and occlusion. Current FER approaches primarily focus on capturing discriminative facial features in vision manner, often neglecting the rich semantic information available in textual modalities. Additionally, these methods typically rely on generic classification templates, which fail to capture instance-specific features, resulting in inadequate representation and fine-grained discrimination ability. To tackle the above issues, we propose a novel emotion-aware adaptation framework that integrates the pre-trained CLIP model for FER, leveraging both visual and textual modalities to enhance representation learning and capture fine-grained emotional details. Specifically, we introduce the Expression-aware adapter module to capture emotion-specific facial representations through task-specific fine-tuning while preserving the generalization capabilities of the CLIP model. Furthermore, the instance-enhanced expression classifier module is proposed to enhance textual descriptors with instance-specific visual embeddings using spherical linear interpolation, creating a more precise and discriminative classifier. Extensive experiments on three in-the-wild FER benchmarks demonstrate superiority of our proposed approach.

面部表情识别(FER)仍然是一项具有挑战性的任务,因为面部细节的微妙变化和不受约束的条件,如头部姿势、光照和遮挡的变化。目前的人脸识别方法主要侧重于以视觉方式捕捉人脸特征,往往忽略了文本模式中丰富的语义信息。此外,这些方法通常依赖于通用的分类模板,而这些模板无法捕获特定于实例的特征,从而导致不充分的表示和细粒度区分能力。为了解决上述问题,我们提出了一种新的情绪感知适应框架,该框架集成了预训练的FER CLIP模型,利用视觉和文本模式来增强表征学习并捕获细粒度的情绪细节。具体来说,我们引入了表情感知适配器模块,通过特定于任务的微调来捕获特定于情绪的面部表征,同时保留了CLIP模型的泛化能力。在此基础上,提出了实例增强的表达式分类器模块,利用球面线性插值对文本描述符进行特定于实例的视觉嵌入,从而创建一个更加精确和判别的分类器。在三个野外FER基准上的大量实验证明了我们提出的方法的优越性。
{"title":"Emotion-aware adaptation of CLIP model for facial expression recognition","authors":"Jing Huan,&nbsp;Mingxing Li,&nbsp;Haoliang Zhou","doi":"10.1007/s10462-025-11468-4","DOIUrl":"10.1007/s10462-025-11468-4","url":null,"abstract":"<div>\u0000 \u0000 <p>Facial expression recognition (FER) remains a challenging task due to subtle variations in facial details and unconstrained conditions such as changes in head posture, illumination, and occlusion. Current FER approaches primarily focus on capturing discriminative facial features in vision manner, often neglecting the rich semantic information available in textual modalities. Additionally, these methods typically rely on generic classification templates, which fail to capture instance-specific features, resulting in inadequate representation and fine-grained discrimination ability. To tackle the above issues, we propose a novel emotion-aware adaptation framework that integrates the pre-trained CLIP model for FER, leveraging both visual and textual modalities to enhance representation learning and capture fine-grained emotional details. Specifically, we introduce the Expression-aware adapter module to capture emotion-specific facial representations through task-specific fine-tuning while preserving the generalization capabilities of the CLIP model. Furthermore, the instance-enhanced expression classifier module is proposed to enhance textual descriptors with instance-specific visual embeddings using spherical linear interpolation, creating a more precise and discriminative classifier. Extensive experiments on three in-the-wild FER benchmarks demonstrate superiority of our proposed approach.</p>\u0000 </div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11468-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep reinforcement learning for robotic bipedal locomotion: a brief survey 机器人两足运动的深度强化学习:综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-27 DOI: 10.1007/s10462-025-11451-z
Lingfan Bao, Joseph Humphreys, Tianhu Peng, Chengxu Zhou

Bipedal robots are gaining global recognition due to their potential applications and the rapid advancements in artificial intelligence, particularly through deep reinforcement learning (DRL). While DRL has significantly advanced bipedal locomotion, the development of a unified framework capable of handling a wide range of tasks remains an ongoing challenge. This survey systematically categorises, compares, and analyses existing DRL frameworks for bipedal locomotion, organising them into end-to-end and hierarchical control schemes. End-to-end frameworks are evaluated based on their learning approaches, whereas hierarchical frameworks are examined in terms of their layered structures that integrate learning-based and traditional model-based methods. We provide a detailed evaluation of the composition, strengths, limitations, and capabilities of each framework. Furthermore, this survey identifies key research gaps and proposes future directions aimed at creating a more integrated and efficient unified framework for bipedal locomotion, with broad applicability in real-world environments.

由于其潜在的应用和人工智能的快速发展,特别是通过深度强化学习(DRL),双足机器人正在获得全球的认可。虽然DRL在两足运动方面取得了显著进展,但开发一个能够处理广泛任务的统一框架仍然是一个持续的挑战。本调查系统地分类、比较和分析了现有的双足运动DRL框架,将它们组织成端到端和分层控制方案。端到端框架根据其学习方法进行评估,而分层框架则根据其分层结构进行检查,该结构集成了基于学习的方法和传统的基于模型的方法。我们提供了对每个框架的组成、优势、限制和功能的详细评估。此外,本调查确定了关键的研究差距,并提出了未来的方向,旨在创建一个更集成、更高效的两足运动统一框架,在现实环境中具有广泛的适用性。
{"title":"Deep reinforcement learning for robotic bipedal locomotion: a brief survey","authors":"Lingfan Bao,&nbsp;Joseph Humphreys,&nbsp;Tianhu Peng,&nbsp;Chengxu Zhou","doi":"10.1007/s10462-025-11451-z","DOIUrl":"10.1007/s10462-025-11451-z","url":null,"abstract":"<div><p>Bipedal robots are gaining global recognition due to their potential applications and the rapid advancements in artificial intelligence, particularly through deep reinforcement learning (DRL). While DRL has significantly advanced bipedal locomotion, the development of a unified framework capable of handling a wide range of tasks remains an ongoing challenge. This survey systematically categorises, compares, and analyses existing DRL frameworks for bipedal locomotion, organising them into end-to-end and hierarchical control schemes. End-to-end frameworks are evaluated based on their learning approaches, whereas hierarchical frameworks are examined in terms of their layered structures that integrate learning-based and traditional model-based methods. We provide a detailed evaluation of the composition, strengths, limitations, and capabilities of each framework. Furthermore, this survey identifies key research gaps and proposes future directions aimed at creating a more integrated and efficient unified framework for bipedal locomotion, with broad applicability in real-world environments.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11451-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145887053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cryptography-based privacy-preserving large language models: a lifecycle survey of frameworks, methods, and future directions 基于密码学的隐私保护大型语言模型:框架、方法和未来方向的生命周期调查
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-26 DOI: 10.1007/s10462-025-11466-6
Jinglong Luo, Yehong Zhang, Zhuo Zhang, Shiyu Liu, Ye Dong, Haoran Li, Yue Yu, Hui Wang, Xun Zhou, Zenglin Xu

The rapid development of Transformer-based large language models (LLMs) has made them one of the most critical technological infrastructures in modern society. However, this rapid deployment has transformed the risk of privacy breaches from a theoretical concern into a systemic threat spanning the entire lifecycle of LLMs. These risks continually challenge existing data compliance and regulatory frameworks, directly limiting the large-scale adoption of LLMs in highly sensitive and heavily regulated industries. Cryptographic technologies, such as fully homomorphic encryption (FHE) and secure multi-party computation (MPC), have garnered significant attention due to their provable security guarantees, theoretically safeguarding the privacy of sensitive data and LLMs weights. These cryptographic techniques have rapidly permeated key stages of LLMs, including data selection, fine-tuning, and inference. Despite these advancements, there is currently no comprehensive survey summarizing the work related to cryptography-based privacy-preserving LLMs (CPLMs), leaving their research isolated and fragmented. To fill this gap, We provide a comprehensive review of existing CPLMs research and systematically classifies them, enabling researchers to effectively coordinate optimization strategies for the efficient design of CPLMs algorithms. Finally, based on the limitations of current CPLMs research, we outline several promising directions for future exploration.

基于transformer的大型语言模型(llm)的快速发展使其成为现代社会中最关键的技术基础设施之一。然而,这种快速部署已经将隐私泄露的风险从理论上的担忧转变为跨越法学硕士整个生命周期的系统性威胁。这些风险不断挑战现有的数据遵从性和监管框架,直接限制了llm在高度敏感和严格监管行业的大规模采用。加密技术,如完全同态加密(FHE)和安全多方计算(MPC),由于其可证明的安全保证,从理论上保护敏感数据和llm权重的隐私性,已经引起了极大的关注。这些加密技术已经迅速渗透到法学硕士的关键阶段,包括数据选择、微调和推理。尽管取得了这些进步,但目前还没有全面的调查总结与基于密码学的隐私保护llm (cplm)相关的工作,使他们的研究孤立和碎片化。为了填补这一空白,我们对现有的cplm研究进行了全面的回顾,并对它们进行了系统的分类,使研究人员能够有效地协调优化策略,以实现cplm算法的高效设计。最后,基于当前cplm研究的局限性,我们概述了未来探索的几个有希望的方向。
{"title":"Cryptography-based privacy-preserving large language models: a lifecycle survey of frameworks, methods, and future directions","authors":"Jinglong Luo,&nbsp;Yehong Zhang,&nbsp;Zhuo Zhang,&nbsp;Shiyu Liu,&nbsp;Ye Dong,&nbsp;Haoran Li,&nbsp;Yue Yu,&nbsp;Hui Wang,&nbsp;Xun Zhou,&nbsp;Zenglin Xu","doi":"10.1007/s10462-025-11466-6","DOIUrl":"10.1007/s10462-025-11466-6","url":null,"abstract":"<div><p>The rapid development of Transformer-based large language models (LLMs) has made them one of the most critical technological infrastructures in modern society. However, this rapid deployment has transformed the risk of privacy breaches from a theoretical concern into a systemic threat spanning the entire lifecycle of LLMs. These risks continually challenge existing data compliance and regulatory frameworks, directly limiting the large-scale adoption of LLMs in highly sensitive and heavily regulated industries. Cryptographic technologies, such as fully homomorphic encryption (FHE) and secure multi-party computation (MPC), have garnered significant attention due to their provable security guarantees, theoretically safeguarding the privacy of sensitive data and LLMs weights. These cryptographic techniques have rapidly permeated key stages of LLMs, including data selection, fine-tuning, and inference. Despite these advancements, there is currently no comprehensive survey summarizing the work related to cryptography-based privacy-preserving LLMs (CPLMs), leaving their research isolated and fragmented. To fill this gap, We provide a comprehensive review of existing CPLMs research and systematically classifies them, enabling researchers to effectively coordinate optimization strategies for the efficient design of CPLMs algorithms. Finally, based on the limitations of current CPLMs research, we outline several promising directions for future exploration.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11466-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The quality of AI-generated answers for patient inquiries on urolithiasis: a comparative study of ChatGPT and Deepseek 人工智能对尿石症患者问询的回答质量:ChatGPT和Deepseek的比较研究
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-23 DOI: 10.1007/s10462-025-11478-2
Wojciech Tomczak, Jan Łaszkiewicz, Łukasz Nowak, Łukasz Biesiadecki, Klaudia Molik, Katarzyna Grunwald, Joanna Chorbińska, Bartosz Małkiewicz, Tomasz Szydełko, Wojciech Krajewski

Patients increasingly rely on easily accessible online resources, often ignoring source credibility. Large Language Models such as ChatGPT and DeepSeek provide free, near human interaction on any imaginable topic, including medical conditions. While the benefits provided by this technology are evident and undeniable, concerns regarding the reliability and safety remain. In this study, we assessed the quality, safety, and reproducibility of responses generated by ChatGPT-4o mini and DeepSeek-R1 on the urolithiasis - an increasingly prevalent condition with complex aetiology and diverse management options. We screened for the most frequently asked questions on kidney stone disease. A set of 76 questions was generated and divided into six categories: general information, risk factors, symptoms, diagnosis, treatment and prognosis. Each question was entered into DeepSeek-R1 and ChatGPT-4o mini. Responses were independently evaluated by two attending urologists using a four-point scale based on clearly defined, pre-established criteria. Discrepancies were resolved by a third expert. Cosine similarity index was applied to evaluate the degree to which LLM responses remained stable over time in wording and meaning. Direct comparisons on the response lengths were conducted. Initial analysis with no category differentiation favoured DeepSeek R1 (p < 0.001). The worst outcomes for both models were recorded in the “treatment” category, yet with DeepSeek’s statistically significant advantage. Moreover, the Chinese LLM provided more accurate responses in “general information” category. The median cosine similarity score for responses generated by DeepSeek-R1 and ChatGPT-4o was 0.7 (IQR 0.655–0.736) and 0.86 (IQR 0.805–0.9), respectively. Responses from DeepSeek-R1 were significantly shorter, with a median word count of 385.5 (330.5–448.5) compared to and 672.5 (438–873.25) words for ChatGPT-4o mini (p < 0.001). Additionally, DeepSeek-R1 responses were more consistent in terms of length exhibiting a narrower distribution when compared to ChatGPT-4o mini. Among the evaluated LLMs available free of charge, DeepSeek-R1 emerged as a more accurate and concise source of patient information, while ChatGPT-4o mini demonstrated significantly greater reproducible responses. The reasoning process of DeepSeek-R1 has the potential to enhance patient comprehension of complex medical concepts thereby improving treatment adherence. Nevertheless, limitations of LLMs such as susceptibility to hallucinations and biases derived from their training data must be carefully considered.

患者越来越依赖易于获取的在线资源,往往忽略了来源的可信度。ChatGPT和DeepSeek等大型语言模型提供免费的、接近人类的互动,涉及任何可以想象的话题,包括医疗条件。虽然这项技术带来的好处是显而易见和不可否认的,但关于可靠性和安全性的担忧仍然存在。在这项研究中,我们评估了chatgpt - 40 mini和DeepSeek-R1对尿石症的疗效的质量、安全性和可重复性。尿石症是一种病因复杂、治疗方法多样的日益普遍的疾病。我们筛选了肾结石疾病最常见的问题。总共有76个问题,分为6类:一般信息、危险因素、症状、诊断、治疗和预后。每个问题都被输入DeepSeek-R1和chatgpt - 40 mini。反应由两名主治泌尿科医生使用基于明确定义,预先建立的标准的四分制独立评估。差异由第三位专家解决。余弦相似度指数用于评价LLM反应在措辞和意义上随时间保持稳定的程度。对反应长度进行了直接比较。没有类别区分的初步分析有利于DeepSeek R1 (p < 0.001)。两种模型的最差结果都记录在“治疗”类别中,但DeepSeek在统计上具有显著优势。此外,中国法学硕士在“一般信息”方面的回答更为准确。DeepSeek-R1和chatgpt - 40生成的响应的中位数余弦相似度评分分别为0.7 (IQR 0.655-0.736)和0.86 (IQR 0.805-0.9)。DeepSeek-R1的回复明显更短,中位数字数为385.5(330.5-448.5),而chatgpt - 40 mini的中位数字数为672.5 (438-873.25)(p < 0.001)。此外,与chatgpt - 40 mini相比,DeepSeek-R1的响应在长度方面更加一致,分布更窄。在可免费获得的评估llm中,DeepSeek-R1成为更准确和简洁的患者信息来源,而chatgpt - 40mini显示出更大的可重复性反应。DeepSeek-R1的推理过程有可能增强患者对复杂医学概念的理解,从而提高治疗依从性。然而,法学硕士的局限性,如对幻觉的敏感性和来自训练数据的偏见,必须仔细考虑。
{"title":"The quality of AI-generated answers for patient inquiries on urolithiasis: a comparative study of ChatGPT and Deepseek","authors":"Wojciech Tomczak,&nbsp;Jan Łaszkiewicz,&nbsp;Łukasz Nowak,&nbsp;Łukasz Biesiadecki,&nbsp;Klaudia Molik,&nbsp;Katarzyna Grunwald,&nbsp;Joanna Chorbińska,&nbsp;Bartosz Małkiewicz,&nbsp;Tomasz Szydełko,&nbsp;Wojciech Krajewski","doi":"10.1007/s10462-025-11478-2","DOIUrl":"10.1007/s10462-025-11478-2","url":null,"abstract":"<div><p>Patients increasingly rely on easily accessible online resources, often ignoring source credibility. Large Language Models such as ChatGPT and DeepSeek provide free, near human interaction on any imaginable topic, including medical conditions. While the benefits provided by this technology are evident and undeniable, concerns regarding the reliability and safety remain. In this study, we assessed the quality, safety, and reproducibility of responses generated by ChatGPT-4o mini and DeepSeek-R1 on the urolithiasis - an increasingly prevalent condition with complex aetiology and diverse management options. We screened for the most frequently asked questions on kidney stone disease. A set of 76 questions was generated and divided into six categories: general information, risk factors, symptoms, diagnosis, treatment and prognosis. Each question was entered into DeepSeek-R1 and ChatGPT-4o mini. Responses were independently evaluated by two attending urologists using a four-point scale based on clearly defined, pre-established criteria. Discrepancies were resolved by a third expert. Cosine similarity index was applied to evaluate the degree to which LLM responses remained stable over time in wording and meaning. Direct comparisons on the response lengths were conducted. Initial analysis with no category differentiation favoured DeepSeek R1 (<i>p</i> &lt; 0.001). The worst outcomes for both models were recorded in the “treatment” category, yet with DeepSeek’s statistically significant advantage. Moreover, the Chinese LLM provided more accurate responses in “general information” category. The median cosine similarity score for responses generated by DeepSeek-R1 and ChatGPT-4o was 0.7 (IQR 0.655–0.736) and 0.86 (IQR 0.805–0.9), respectively. Responses from DeepSeek-R1 were significantly shorter, with a median word count of 385.5 (330.5–448.5) compared to and 672.5 (438–873.25) words for ChatGPT-4o mini (<i>p</i> &lt; 0.001). Additionally, DeepSeek-R1 responses were more consistent in terms of length exhibiting a narrower distribution when compared to ChatGPT-4o mini. Among the evaluated LLMs available free of charge, DeepSeek-R1 emerged as a more accurate and concise source of patient information, while ChatGPT-4o mini demonstrated significantly greater reproducible responses. The reasoning process of DeepSeek-R1 has the potential to enhance patient comprehension of complex medical concepts thereby improving treatment adherence. Nevertheless, limitations of LLMs such as susceptibility to hallucinations and biases derived from their training data must be carefully considered.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11478-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Systematic taxonomic framework of metaheuristic algorithms using hierarchical clustering and structural criteria: how novel is the novelty? 使用层次聚类和结构标准的元启发式算法的系统分类框架:新颖性有多新颖?
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-23 DOI: 10.1007/s10462-025-11456-8
Manuel Soto Calvo, Han Soo Lee

The proliferation of metaheuristic optimization algorithms has led to concerns about their novelty. This study introduces three key contributions to address this challenge: (1) a novel systematic taxonomic framework that employs nineteen rigorously selected, metaphor-free criteria to evaluate algorithmic distinctiveness; (2) a comprehensive clustering methodology that combines Rogers-Tanimoto distance analysis with principal component analysis (PCA) and hierarchical clustering to quantify algorithmic similarities; and (3) an objective assessment method for evaluating genuine algorithmic innovations. Through the analysis of 145 metaheuristic algorithms, we demonstrate that 74 algorithms (51.0%) exhibit distances below the confidence interval threshold, indicating profound structural overlap. Network analysis reveals 26 algorithms with perfect structural identity (distance = 0.0) and 512 algorithm pairs showing high similarity (distance < 0.039), representing 18.9% of all pairwise comparisons. The results show that numerous algorithms claiming innovation deliver only incremental modifications to existing implementation patterns, lacking fundamental methodological advancement. The framework provides both a theoretical foundation for understanding algorithmic similarities and a practical tool for evaluating new algorithmic proposals, potentially transforming how the field assesses and develops novel optimization methods.

元启发式优化算法的激增导致了对其新颖性的担忧。本研究提出了三个关键贡献来应对这一挑战:(1)一个新的系统分类框架,该框架采用19个严格选择的、无隐喻的标准来评估算法的独特性;(2)采用Rogers-Tanimoto距离分析、主成分分析(PCA)和层次聚类相结合的综合聚类方法量化算法相似度;(3)一种评价真正算法创新的客观评价方法。通过对145种元启发式算法的分析,我们发现74种算法(51.0%)的距离低于置信区间阈值,表明存在严重的结构重叠。网络分析显示,26个算法具有完美的结构同一性(距离= 0.0),512个算法对具有高相似性(距离<; 0.039),占所有成对比较的18.9%。结果表明,许多声称创新的算法只提供了对现有实现模式的增量修改,缺乏基本的方法进步。该框架既为理解算法相似性提供了理论基础,也为评估新的算法建议提供了实用工具,可能会改变该领域评估和开发新的优化方法的方式。
{"title":"Systematic taxonomic framework of metaheuristic algorithms using hierarchical clustering and structural criteria: how novel is the novelty?","authors":"Manuel Soto Calvo,&nbsp;Han Soo Lee","doi":"10.1007/s10462-025-11456-8","DOIUrl":"10.1007/s10462-025-11456-8","url":null,"abstract":"<div><p>The proliferation of metaheuristic optimization algorithms has led to concerns about their novelty. This study introduces three key contributions to address this challenge: (1) a novel systematic taxonomic framework that employs nineteen rigorously selected, metaphor-free criteria to evaluate algorithmic distinctiveness; (2) a comprehensive clustering methodology that combines Rogers-Tanimoto distance analysis with principal component analysis (PCA) and hierarchical clustering to quantify algorithmic similarities; and (3) an objective assessment method for evaluating genuine algorithmic innovations. Through the analysis of 145 metaheuristic algorithms, we demonstrate that 74 algorithms (51.0%) exhibit distances below the confidence interval threshold, indicating profound structural overlap. Network analysis reveals 26 algorithms with perfect structural identity (distance = 0.0) and 512 algorithm pairs showing high similarity (distance &lt; 0.039), representing 18.9% of all pairwise comparisons. The results show that numerous algorithms claiming innovation deliver only incremental modifications to existing implementation patterns, lacking fundamental methodological advancement. The framework provides both a theoretical foundation for understanding algorithmic similarities and a practical tool for evaluating new algorithmic proposals, potentially transforming how the field assesses and develops novel optimization methods.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11456-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence Review
全部 Geobiology Appl. Clay Sci. Geochim. Cosmochim. Acta J. Hydrol. Org. Geochem. Carbon Balance Manage. Contrib. Mineral. Petrol. Int. J. Biometeorol. IZV-PHYS SOLID EART+ J. Atmos. Chem. Acta Oceanolog. Sin. Acta Geophys. ACTA GEOL POL ACTA PETROL SIN ACTA GEOL SIN-ENGL AAPG Bull. Acta Geochimica Adv. Atmos. Sci. Adv. Meteorol. Am. J. Phys. Anthropol. Am. J. Sci. Am. Mineral. Annu. Rev. Earth Planet. Sci. Appl. Geochem. Aquat. Geochem. Ann. Glaciol. Archaeol. Anthropol. Sci. ARCHAEOMETRY ARCT ANTARCT ALP RES Asia-Pac. J. Atmos. Sci. ATMOSPHERE-BASEL Atmos. Res. Aust. J. Earth Sci. Atmos. Chem. Phys. Atmos. Meas. Tech. Basin Res. Big Earth Data BIOGEOSCIENCES Geostand. Geoanal. Res. GEOLOGY Geosci. J. Geochem. J. Geochem. Trans. Geosci. Front. Geol. Ore Deposits Global Biogeochem. Cycles Gondwana Res. Geochem. Int. Geol. J. Geophys. Prospect. Geosci. Model Dev. GEOL BELG GROUNDWATER Hydrogeol. J. Hydrol. Earth Syst. Sci. Hydrol. Processes Int. J. Climatol. Int. J. Earth Sci. Int. Geol. Rev. Int. J. Disaster Risk Reduct. Int. J. Geomech. Int. J. Geog. Inf. Sci. Isl. Arc J. Afr. Earth. Sci. J. Adv. Model. Earth Syst. J APPL METEOROL CLIM J. Atmos. Oceanic Technol. J. Atmos. Sol. Terr. Phys. J. Clim. J. Earth Sci. J. Earth Syst. Sci. J. Environ. Eng. Geophys. J. Geog. Sci. Mineral. Mag. Miner. Deposita Mon. Weather Rev. Nat. Hazards Earth Syst. Sci. Nat. Clim. Change Nat. Geosci. Ocean Dyn. Ocean and Coastal Research npj Clim. Atmos. Sci. Ocean Modell. Ocean Sci. Ore Geol. Rev. OCEAN SCI J Paleontol. J. PALAEOGEOGR PALAEOCL PERIOD MINERAL PETROLOGY+ Phys. Chem. Miner. Polar Sci. Prog. Oceanogr. Quat. Sci. Rev. Q. J. Eng. Geol. Hydrogeol. RADIOCARBON Pure Appl. Geophys. Resour. Geol. Rev. Geophys. Sediment. Geol.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1