首页 > 最新文献

Artificial Intelligence Review最新文献

英文 中文
An optimized hybrid framework for car theft detection: comparative insights from deep transfer learning and feature-based machine learning 汽车盗窃检测的优化混合框架:深度迁移学习和基于特征的机器学习的比较见解
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-31 DOI: 10.1007/s10462-025-11480-8
Yashar Jebraeily, Yousef Sharafi, Mohammad Teshnehlab, Nastaran Ahmadi Ramezanloo

Car theft has become a significant issue in modern societies, with far-reaching individual and social consequences. This criminal act causes substantial financial losses for vehicle owners, undermines public trust in security systems, and increases social and governmental costs. Therefore, research on developing innovative and efficient methods for detecting and preventing car theft holds particular importance. In this study, advanced methods for detecting car theft have been evaluated and compared through two main approaches: deep learning and machine learning. First, pre-trained deep neural networks were examined. In the second phase, various image features were extracted using feature extraction methods, such as Edge Direction Histogram (EDH), Edge Oriented Histogram (EOH), and Histogram Oriented Gradient (HOG), followed by the assessment of machine learning approaches. Finally, a hybrid model based on Hybrid Edge and Gradient-Based Features (HFEM) combined with an XGBoost classifier was proposed, achieving an accuracy of 98.6% in predicting car theft.

汽车盗窃已经成为现代社会的一个重要问题,对个人和社会都有深远的影响。这种犯罪行为给车主造成了巨大的经济损失,破坏了公众对安全系统的信任,并增加了社会和政府的成本。因此,研究开发创新和有效的方法来检测和防止汽车盗窃具有特别重要的意义。在本研究中,通过深度学习和机器学习两种主要方法,对检测汽车盗窃的先进方法进行了评估和比较。首先,检查预训练的深度神经网络。在第二阶段,使用边缘方向直方图(EDH)、边缘定向直方图(EOH)和直方图定向梯度(HOG)等特征提取方法提取各种图像特征,然后对机器学习方法进行评估。最后,提出了基于混合边缘和梯度特征(hybrid Edge and Gradient-Based Features, HFEM)与XGBoost分类器相结合的混合模型,预测汽车盗窃的准确率达到98.6%。
{"title":"An optimized hybrid framework for car theft detection: comparative insights from deep transfer learning and feature-based machine learning","authors":"Yashar Jebraeily,&nbsp;Yousef Sharafi,&nbsp;Mohammad Teshnehlab,&nbsp;Nastaran Ahmadi Ramezanloo","doi":"10.1007/s10462-025-11480-8","DOIUrl":"10.1007/s10462-025-11480-8","url":null,"abstract":"<div><p>Car theft has become a significant issue in modern societies, with far-reaching individual and social consequences. This criminal act causes substantial financial losses for vehicle owners, undermines public trust in security systems, and increases social and governmental costs. Therefore, research on developing innovative and efficient methods for detecting and preventing car theft holds particular importance. In this study, advanced methods for detecting car theft have been evaluated and compared through two main approaches: deep learning and machine learning. First, pre-trained deep neural networks were examined. In the second phase, various image features were extracted using feature extraction methods, such as Edge Direction Histogram (EDH), Edge Oriented Histogram (EOH), and Histogram Oriented Gradient (HOG), followed by the assessment of machine learning approaches. Finally, a hybrid model based on Hybrid Edge and Gradient-Based Features (HFEM) combined with an XGBoost classifier was proposed, achieving an accuracy of 98.6% in predicting car theft.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11480-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Emotion-aware adaptation of CLIP model for facial expression recognition 基于情绪感知的CLIP模型面部表情识别
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-28 DOI: 10.1007/s10462-025-11468-4
Jing Huan, Mingxing Li, Haoliang Zhou

Facial expression recognition (FER) remains a challenging task due to subtle variations in facial details and unconstrained conditions such as changes in head posture, illumination, and occlusion. Current FER approaches primarily focus on capturing discriminative facial features in vision manner, often neglecting the rich semantic information available in textual modalities. Additionally, these methods typically rely on generic classification templates, which fail to capture instance-specific features, resulting in inadequate representation and fine-grained discrimination ability. To tackle the above issues, we propose a novel emotion-aware adaptation framework that integrates the pre-trained CLIP model for FER, leveraging both visual and textual modalities to enhance representation learning and capture fine-grained emotional details. Specifically, we introduce the Expression-aware adapter module to capture emotion-specific facial representations through task-specific fine-tuning while preserving the generalization capabilities of the CLIP model. Furthermore, the instance-enhanced expression classifier module is proposed to enhance textual descriptors with instance-specific visual embeddings using spherical linear interpolation, creating a more precise and discriminative classifier. Extensive experiments on three in-the-wild FER benchmarks demonstrate superiority of our proposed approach.

面部表情识别(FER)仍然是一项具有挑战性的任务,因为面部细节的微妙变化和不受约束的条件,如头部姿势、光照和遮挡的变化。目前的人脸识别方法主要侧重于以视觉方式捕捉人脸特征,往往忽略了文本模式中丰富的语义信息。此外,这些方法通常依赖于通用的分类模板,而这些模板无法捕获特定于实例的特征,从而导致不充分的表示和细粒度区分能力。为了解决上述问题,我们提出了一种新的情绪感知适应框架,该框架集成了预训练的FER CLIP模型,利用视觉和文本模式来增强表征学习并捕获细粒度的情绪细节。具体来说,我们引入了表情感知适配器模块,通过特定于任务的微调来捕获特定于情绪的面部表征,同时保留了CLIP模型的泛化能力。在此基础上,提出了实例增强的表达式分类器模块,利用球面线性插值对文本描述符进行特定于实例的视觉嵌入,从而创建一个更加精确和判别的分类器。在三个野外FER基准上的大量实验证明了我们提出的方法的优越性。
{"title":"Emotion-aware adaptation of CLIP model for facial expression recognition","authors":"Jing Huan,&nbsp;Mingxing Li,&nbsp;Haoliang Zhou","doi":"10.1007/s10462-025-11468-4","DOIUrl":"10.1007/s10462-025-11468-4","url":null,"abstract":"<div>\u0000 \u0000 <p>Facial expression recognition (FER) remains a challenging task due to subtle variations in facial details and unconstrained conditions such as changes in head posture, illumination, and occlusion. Current FER approaches primarily focus on capturing discriminative facial features in vision manner, often neglecting the rich semantic information available in textual modalities. Additionally, these methods typically rely on generic classification templates, which fail to capture instance-specific features, resulting in inadequate representation and fine-grained discrimination ability. To tackle the above issues, we propose a novel emotion-aware adaptation framework that integrates the pre-trained CLIP model for FER, leveraging both visual and textual modalities to enhance representation learning and capture fine-grained emotional details. Specifically, we introduce the Expression-aware adapter module to capture emotion-specific facial representations through task-specific fine-tuning while preserving the generalization capabilities of the CLIP model. Furthermore, the instance-enhanced expression classifier module is proposed to enhance textual descriptors with instance-specific visual embeddings using spherical linear interpolation, creating a more precise and discriminative classifier. Extensive experiments on three in-the-wild FER benchmarks demonstrate superiority of our proposed approach.</p>\u0000 </div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11468-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep reinforcement learning for robotic bipedal locomotion: a brief survey 机器人两足运动的深度强化学习:综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-27 DOI: 10.1007/s10462-025-11451-z
Lingfan Bao, Joseph Humphreys, Tianhu Peng, Chengxu Zhou

Bipedal robots are gaining global recognition due to their potential applications and the rapid advancements in artificial intelligence, particularly through deep reinforcement learning (DRL). While DRL has significantly advanced bipedal locomotion, the development of a unified framework capable of handling a wide range of tasks remains an ongoing challenge. This survey systematically categorises, compares, and analyses existing DRL frameworks for bipedal locomotion, organising them into end-to-end and hierarchical control schemes. End-to-end frameworks are evaluated based on their learning approaches, whereas hierarchical frameworks are examined in terms of their layered structures that integrate learning-based and traditional model-based methods. We provide a detailed evaluation of the composition, strengths, limitations, and capabilities of each framework. Furthermore, this survey identifies key research gaps and proposes future directions aimed at creating a more integrated and efficient unified framework for bipedal locomotion, with broad applicability in real-world environments.

由于其潜在的应用和人工智能的快速发展,特别是通过深度强化学习(DRL),双足机器人正在获得全球的认可。虽然DRL在两足运动方面取得了显著进展,但开发一个能够处理广泛任务的统一框架仍然是一个持续的挑战。本调查系统地分类、比较和分析了现有的双足运动DRL框架,将它们组织成端到端和分层控制方案。端到端框架根据其学习方法进行评估,而分层框架则根据其分层结构进行检查,该结构集成了基于学习的方法和传统的基于模型的方法。我们提供了对每个框架的组成、优势、限制和功能的详细评估。此外,本调查确定了关键的研究差距,并提出了未来的方向,旨在创建一个更集成、更高效的两足运动统一框架,在现实环境中具有广泛的适用性。
{"title":"Deep reinforcement learning for robotic bipedal locomotion: a brief survey","authors":"Lingfan Bao,&nbsp;Joseph Humphreys,&nbsp;Tianhu Peng,&nbsp;Chengxu Zhou","doi":"10.1007/s10462-025-11451-z","DOIUrl":"10.1007/s10462-025-11451-z","url":null,"abstract":"<div><p>Bipedal robots are gaining global recognition due to their potential applications and the rapid advancements in artificial intelligence, particularly through deep reinforcement learning (DRL). While DRL has significantly advanced bipedal locomotion, the development of a unified framework capable of handling a wide range of tasks remains an ongoing challenge. This survey systematically categorises, compares, and analyses existing DRL frameworks for bipedal locomotion, organising them into end-to-end and hierarchical control schemes. End-to-end frameworks are evaluated based on their learning approaches, whereas hierarchical frameworks are examined in terms of their layered structures that integrate learning-based and traditional model-based methods. We provide a detailed evaluation of the composition, strengths, limitations, and capabilities of each framework. Furthermore, this survey identifies key research gaps and proposes future directions aimed at creating a more integrated and efficient unified framework for bipedal locomotion, with broad applicability in real-world environments.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11451-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145887053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cryptography-based privacy-preserving large language models: a lifecycle survey of frameworks, methods, and future directions 基于密码学的隐私保护大型语言模型:框架、方法和未来方向的生命周期调查
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-26 DOI: 10.1007/s10462-025-11466-6
Jinglong Luo, Yehong Zhang, Zhuo Zhang, Shiyu Liu, Ye Dong, Haoran Li, Yue Yu, Hui Wang, Xun Zhou, Zenglin Xu

The rapid development of Transformer-based large language models (LLMs) has made them one of the most critical technological infrastructures in modern society. However, this rapid deployment has transformed the risk of privacy breaches from a theoretical concern into a systemic threat spanning the entire lifecycle of LLMs. These risks continually challenge existing data compliance and regulatory frameworks, directly limiting the large-scale adoption of LLMs in highly sensitive and heavily regulated industries. Cryptographic technologies, such as fully homomorphic encryption (FHE) and secure multi-party computation (MPC), have garnered significant attention due to their provable security guarantees, theoretically safeguarding the privacy of sensitive data and LLMs weights. These cryptographic techniques have rapidly permeated key stages of LLMs, including data selection, fine-tuning, and inference. Despite these advancements, there is currently no comprehensive survey summarizing the work related to cryptography-based privacy-preserving LLMs (CPLMs), leaving their research isolated and fragmented. To fill this gap, We provide a comprehensive review of existing CPLMs research and systematically classifies them, enabling researchers to effectively coordinate optimization strategies for the efficient design of CPLMs algorithms. Finally, based on the limitations of current CPLMs research, we outline several promising directions for future exploration.

基于transformer的大型语言模型(llm)的快速发展使其成为现代社会中最关键的技术基础设施之一。然而,这种快速部署已经将隐私泄露的风险从理论上的担忧转变为跨越法学硕士整个生命周期的系统性威胁。这些风险不断挑战现有的数据遵从性和监管框架,直接限制了llm在高度敏感和严格监管行业的大规模采用。加密技术,如完全同态加密(FHE)和安全多方计算(MPC),由于其可证明的安全保证,从理论上保护敏感数据和llm权重的隐私性,已经引起了极大的关注。这些加密技术已经迅速渗透到法学硕士的关键阶段,包括数据选择、微调和推理。尽管取得了这些进步,但目前还没有全面的调查总结与基于密码学的隐私保护llm (cplm)相关的工作,使他们的研究孤立和碎片化。为了填补这一空白,我们对现有的cplm研究进行了全面的回顾,并对它们进行了系统的分类,使研究人员能够有效地协调优化策略,以实现cplm算法的高效设计。最后,基于当前cplm研究的局限性,我们概述了未来探索的几个有希望的方向。
{"title":"Cryptography-based privacy-preserving large language models: a lifecycle survey of frameworks, methods, and future directions","authors":"Jinglong Luo,&nbsp;Yehong Zhang,&nbsp;Zhuo Zhang,&nbsp;Shiyu Liu,&nbsp;Ye Dong,&nbsp;Haoran Li,&nbsp;Yue Yu,&nbsp;Hui Wang,&nbsp;Xun Zhou,&nbsp;Zenglin Xu","doi":"10.1007/s10462-025-11466-6","DOIUrl":"10.1007/s10462-025-11466-6","url":null,"abstract":"<div><p>The rapid development of Transformer-based large language models (LLMs) has made them one of the most critical technological infrastructures in modern society. However, this rapid deployment has transformed the risk of privacy breaches from a theoretical concern into a systemic threat spanning the entire lifecycle of LLMs. These risks continually challenge existing data compliance and regulatory frameworks, directly limiting the large-scale adoption of LLMs in highly sensitive and heavily regulated industries. Cryptographic technologies, such as fully homomorphic encryption (FHE) and secure multi-party computation (MPC), have garnered significant attention due to their provable security guarantees, theoretically safeguarding the privacy of sensitive data and LLMs weights. These cryptographic techniques have rapidly permeated key stages of LLMs, including data selection, fine-tuning, and inference. Despite these advancements, there is currently no comprehensive survey summarizing the work related to cryptography-based privacy-preserving LLMs (CPLMs), leaving their research isolated and fragmented. To fill this gap, We provide a comprehensive review of existing CPLMs research and systematically classifies them, enabling researchers to effectively coordinate optimization strategies for the efficient design of CPLMs algorithms. Finally, based on the limitations of current CPLMs research, we outline several promising directions for future exploration.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11466-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The quality of AI-generated answers for patient inquiries on urolithiasis: a comparative study of ChatGPT and Deepseek 人工智能对尿石症患者问询的回答质量:ChatGPT和Deepseek的比较研究
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-23 DOI: 10.1007/s10462-025-11478-2
Wojciech Tomczak, Jan Łaszkiewicz, Łukasz Nowak, Łukasz Biesiadecki, Klaudia Molik, Katarzyna Grunwald, Joanna Chorbińska, Bartosz Małkiewicz, Tomasz Szydełko, Wojciech Krajewski

Patients increasingly rely on easily accessible online resources, often ignoring source credibility. Large Language Models such as ChatGPT and DeepSeek provide free, near human interaction on any imaginable topic, including medical conditions. While the benefits provided by this technology are evident and undeniable, concerns regarding the reliability and safety remain. In this study, we assessed the quality, safety, and reproducibility of responses generated by ChatGPT-4o mini and DeepSeek-R1 on the urolithiasis - an increasingly prevalent condition with complex aetiology and diverse management options. We screened for the most frequently asked questions on kidney stone disease. A set of 76 questions was generated and divided into six categories: general information, risk factors, symptoms, diagnosis, treatment and prognosis. Each question was entered into DeepSeek-R1 and ChatGPT-4o mini. Responses were independently evaluated by two attending urologists using a four-point scale based on clearly defined, pre-established criteria. Discrepancies were resolved by a third expert. Cosine similarity index was applied to evaluate the degree to which LLM responses remained stable over time in wording and meaning. Direct comparisons on the response lengths were conducted. Initial analysis with no category differentiation favoured DeepSeek R1 (p < 0.001). The worst outcomes for both models were recorded in the “treatment” category, yet with DeepSeek’s statistically significant advantage. Moreover, the Chinese LLM provided more accurate responses in “general information” category. The median cosine similarity score for responses generated by DeepSeek-R1 and ChatGPT-4o was 0.7 (IQR 0.655–0.736) and 0.86 (IQR 0.805–0.9), respectively. Responses from DeepSeek-R1 were significantly shorter, with a median word count of 385.5 (330.5–448.5) compared to and 672.5 (438–873.25) words for ChatGPT-4o mini (p < 0.001). Additionally, DeepSeek-R1 responses were more consistent in terms of length exhibiting a narrower distribution when compared to ChatGPT-4o mini. Among the evaluated LLMs available free of charge, DeepSeek-R1 emerged as a more accurate and concise source of patient information, while ChatGPT-4o mini demonstrated significantly greater reproducible responses. The reasoning process of DeepSeek-R1 has the potential to enhance patient comprehension of complex medical concepts thereby improving treatment adherence. Nevertheless, limitations of LLMs such as susceptibility to hallucinations and biases derived from their training data must be carefully considered.

患者越来越依赖易于获取的在线资源,往往忽略了来源的可信度。ChatGPT和DeepSeek等大型语言模型提供免费的、接近人类的互动,涉及任何可以想象的话题,包括医疗条件。虽然这项技术带来的好处是显而易见和不可否认的,但关于可靠性和安全性的担忧仍然存在。在这项研究中,我们评估了chatgpt - 40 mini和DeepSeek-R1对尿石症的疗效的质量、安全性和可重复性。尿石症是一种病因复杂、治疗方法多样的日益普遍的疾病。我们筛选了肾结石疾病最常见的问题。总共有76个问题,分为6类:一般信息、危险因素、症状、诊断、治疗和预后。每个问题都被输入DeepSeek-R1和chatgpt - 40 mini。反应由两名主治泌尿科医生使用基于明确定义,预先建立的标准的四分制独立评估。差异由第三位专家解决。余弦相似度指数用于评价LLM反应在措辞和意义上随时间保持稳定的程度。对反应长度进行了直接比较。没有类别区分的初步分析有利于DeepSeek R1 (p < 0.001)。两种模型的最差结果都记录在“治疗”类别中,但DeepSeek在统计上具有显著优势。此外,中国法学硕士在“一般信息”方面的回答更为准确。DeepSeek-R1和chatgpt - 40生成的响应的中位数余弦相似度评分分别为0.7 (IQR 0.655-0.736)和0.86 (IQR 0.805-0.9)。DeepSeek-R1的回复明显更短,中位数字数为385.5(330.5-448.5),而chatgpt - 40 mini的中位数字数为672.5 (438-873.25)(p < 0.001)。此外,与chatgpt - 40 mini相比,DeepSeek-R1的响应在长度方面更加一致,分布更窄。在可免费获得的评估llm中,DeepSeek-R1成为更准确和简洁的患者信息来源,而chatgpt - 40mini显示出更大的可重复性反应。DeepSeek-R1的推理过程有可能增强患者对复杂医学概念的理解,从而提高治疗依从性。然而,法学硕士的局限性,如对幻觉的敏感性和来自训练数据的偏见,必须仔细考虑。
{"title":"The quality of AI-generated answers for patient inquiries on urolithiasis: a comparative study of ChatGPT and Deepseek","authors":"Wojciech Tomczak,&nbsp;Jan Łaszkiewicz,&nbsp;Łukasz Nowak,&nbsp;Łukasz Biesiadecki,&nbsp;Klaudia Molik,&nbsp;Katarzyna Grunwald,&nbsp;Joanna Chorbińska,&nbsp;Bartosz Małkiewicz,&nbsp;Tomasz Szydełko,&nbsp;Wojciech Krajewski","doi":"10.1007/s10462-025-11478-2","DOIUrl":"10.1007/s10462-025-11478-2","url":null,"abstract":"<div><p>Patients increasingly rely on easily accessible online resources, often ignoring source credibility. Large Language Models such as ChatGPT and DeepSeek provide free, near human interaction on any imaginable topic, including medical conditions. While the benefits provided by this technology are evident and undeniable, concerns regarding the reliability and safety remain. In this study, we assessed the quality, safety, and reproducibility of responses generated by ChatGPT-4o mini and DeepSeek-R1 on the urolithiasis - an increasingly prevalent condition with complex aetiology and diverse management options. We screened for the most frequently asked questions on kidney stone disease. A set of 76 questions was generated and divided into six categories: general information, risk factors, symptoms, diagnosis, treatment and prognosis. Each question was entered into DeepSeek-R1 and ChatGPT-4o mini. Responses were independently evaluated by two attending urologists using a four-point scale based on clearly defined, pre-established criteria. Discrepancies were resolved by a third expert. Cosine similarity index was applied to evaluate the degree to which LLM responses remained stable over time in wording and meaning. Direct comparisons on the response lengths were conducted. Initial analysis with no category differentiation favoured DeepSeek R1 (<i>p</i> &lt; 0.001). The worst outcomes for both models were recorded in the “treatment” category, yet with DeepSeek’s statistically significant advantage. Moreover, the Chinese LLM provided more accurate responses in “general information” category. The median cosine similarity score for responses generated by DeepSeek-R1 and ChatGPT-4o was 0.7 (IQR 0.655–0.736) and 0.86 (IQR 0.805–0.9), respectively. Responses from DeepSeek-R1 were significantly shorter, with a median word count of 385.5 (330.5–448.5) compared to and 672.5 (438–873.25) words for ChatGPT-4o mini (<i>p</i> &lt; 0.001). Additionally, DeepSeek-R1 responses were more consistent in terms of length exhibiting a narrower distribution when compared to ChatGPT-4o mini. Among the evaluated LLMs available free of charge, DeepSeek-R1 emerged as a more accurate and concise source of patient information, while ChatGPT-4o mini demonstrated significantly greater reproducible responses. The reasoning process of DeepSeek-R1 has the potential to enhance patient comprehension of complex medical concepts thereby improving treatment adherence. Nevertheless, limitations of LLMs such as susceptibility to hallucinations and biases derived from their training data must be carefully considered.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11478-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Systematic taxonomic framework of metaheuristic algorithms using hierarchical clustering and structural criteria: how novel is the novelty? 使用层次聚类和结构标准的元启发式算法的系统分类框架:新颖性有多新颖?
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-23 DOI: 10.1007/s10462-025-11456-8
Manuel Soto Calvo, Han Soo Lee

The proliferation of metaheuristic optimization algorithms has led to concerns about their novelty. This study introduces three key contributions to address this challenge: (1) a novel systematic taxonomic framework that employs nineteen rigorously selected, metaphor-free criteria to evaluate algorithmic distinctiveness; (2) a comprehensive clustering methodology that combines Rogers-Tanimoto distance analysis with principal component analysis (PCA) and hierarchical clustering to quantify algorithmic similarities; and (3) an objective assessment method for evaluating genuine algorithmic innovations. Through the analysis of 145 metaheuristic algorithms, we demonstrate that 74 algorithms (51.0%) exhibit distances below the confidence interval threshold, indicating profound structural overlap. Network analysis reveals 26 algorithms with perfect structural identity (distance = 0.0) and 512 algorithm pairs showing high similarity (distance < 0.039), representing 18.9% of all pairwise comparisons. The results show that numerous algorithms claiming innovation deliver only incremental modifications to existing implementation patterns, lacking fundamental methodological advancement. The framework provides both a theoretical foundation for understanding algorithmic similarities and a practical tool for evaluating new algorithmic proposals, potentially transforming how the field assesses and develops novel optimization methods.

元启发式优化算法的激增导致了对其新颖性的担忧。本研究提出了三个关键贡献来应对这一挑战:(1)一个新的系统分类框架,该框架采用19个严格选择的、无隐喻的标准来评估算法的独特性;(2)采用Rogers-Tanimoto距离分析、主成分分析(PCA)和层次聚类相结合的综合聚类方法量化算法相似度;(3)一种评价真正算法创新的客观评价方法。通过对145种元启发式算法的分析,我们发现74种算法(51.0%)的距离低于置信区间阈值,表明存在严重的结构重叠。网络分析显示,26个算法具有完美的结构同一性(距离= 0.0),512个算法对具有高相似性(距离<; 0.039),占所有成对比较的18.9%。结果表明,许多声称创新的算法只提供了对现有实现模式的增量修改,缺乏基本的方法进步。该框架既为理解算法相似性提供了理论基础,也为评估新的算法建议提供了实用工具,可能会改变该领域评估和开发新的优化方法的方式。
{"title":"Systematic taxonomic framework of metaheuristic algorithms using hierarchical clustering and structural criteria: how novel is the novelty?","authors":"Manuel Soto Calvo,&nbsp;Han Soo Lee","doi":"10.1007/s10462-025-11456-8","DOIUrl":"10.1007/s10462-025-11456-8","url":null,"abstract":"<div><p>The proliferation of metaheuristic optimization algorithms has led to concerns about their novelty. This study introduces three key contributions to address this challenge: (1) a novel systematic taxonomic framework that employs nineteen rigorously selected, metaphor-free criteria to evaluate algorithmic distinctiveness; (2) a comprehensive clustering methodology that combines Rogers-Tanimoto distance analysis with principal component analysis (PCA) and hierarchical clustering to quantify algorithmic similarities; and (3) an objective assessment method for evaluating genuine algorithmic innovations. Through the analysis of 145 metaheuristic algorithms, we demonstrate that 74 algorithms (51.0%) exhibit distances below the confidence interval threshold, indicating profound structural overlap. Network analysis reveals 26 algorithms with perfect structural identity (distance = 0.0) and 512 algorithm pairs showing high similarity (distance &lt; 0.039), representing 18.9% of all pairwise comparisons. The results show that numerous algorithms claiming innovation deliver only incremental modifications to existing implementation patterns, lacking fundamental methodological advancement. The framework provides both a theoretical foundation for understanding algorithmic similarities and a practical tool for evaluating new algorithmic proposals, potentially transforming how the field assesses and develops novel optimization methods.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11456-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Topological data analysis and topological deep learning beyond persistent homology: a review 超越持久同调的拓扑数据分析和拓扑深度学习综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-21 DOI: 10.1007/s10462-025-11462-w
Zhe Su, Xiang Liu, Layal Bou Hamdan, Vasileios Maroulas, Jie Wu, Gunnar Carlsson, Guo-Wei Wei

Topological data analysis (TDA) is a rapidly evolving field in applied mathematics and data science that leverages tools from topology to uncover robust, shape-driven, and explainable insights in complex datasets. The main workhorse is persistent homology, a technique rooted in algebraic topology. Paired with topological deep learning (TDL) or topological machine learning, persistent homology has achieved tremendous success in a wide variety of applications in science, engineering, medicine, and industry. However, persistent homology has many limitations due to its high-level abstraction, insensitivity to non-topological changes, and restriction to point cloud data. This paper presents a comprehensive review of TDA and TDL beyond persistent homology. It analyzes how persistent topological Laplacians and Dirac operators provide spectral representations to capture both topological invariants and homotopic evolution. Other formulations are presented in terms of sheaf theory, Mayer topology, and interaction topology. For data on differentiable manifolds, techniques rooted in differential topology, such as persistent de Rham cohomology, persistent Hodge Laplacian, and Hodge decomposition, are reviewed. For one-dimensional (1D) curves embedded in 3-space, approaches from geometric topology are discussed, including multiscale Gauss-link integrals, persistent Jones polynomials, and persistent Khovanov homology. This paper further discusses the appropriate selection of topological tools for different input data, such as point clouds, sequential data, data on manifolds, curves embedded in 3-space, and data with additional non-geometric information. A review is also given of various topological representations, software packages, and machine learning vectorizations. Finally, this review ends with concluding remarks.

拓扑数据分析(TDA)是应用数学和数据科学中一个快速发展的领域,它利用拓扑工具来揭示复杂数据集中强大的、形状驱动的和可解释的见解。主要的工作是持久同调,一种植根于代数拓扑的技术。与拓扑深度学习(TDL)或拓扑机器学习相结合,持久同源性在科学、工程、医学和工业的广泛应用中取得了巨大的成功。然而,由于其高度抽象、对非拓扑变化不敏感以及对点云数据的限制,持久同构存在许多局限性。本文综述了TDA和TDL在持久同源性之外的研究进展。它分析了持久拓扑拉普拉斯算子和狄拉克算子如何提供谱表示来捕获拓扑不变量和同伦进化。其他公式是根据束理论、Mayer拓扑和相互作用拓扑提出的。对于可微流形的数据,基于微分拓扑的技术,如持久的de Rham上同调,持久的Hodge拉普拉斯算子和Hodge分解,进行了回顾。对于嵌入在三维空间中的一维曲线,讨论了从几何拓扑出发的方法,包括多尺度高斯链接积分、持久琼斯多项式和持久Khovanov同调。本文进一步讨论了不同输入数据,如点云、序列数据、流形数据、嵌入三维空间的曲线以及附加非几何信息的数据,拓扑工具的适当选择。回顾了各种拓扑表示、软件包和机器学习向量化。最后,本文以结束语结束。
{"title":"Topological data analysis and topological deep learning beyond persistent homology: a review","authors":"Zhe Su,&nbsp;Xiang Liu,&nbsp;Layal Bou Hamdan,&nbsp;Vasileios Maroulas,&nbsp;Jie Wu,&nbsp;Gunnar Carlsson,&nbsp;Guo-Wei Wei","doi":"10.1007/s10462-025-11462-w","DOIUrl":"10.1007/s10462-025-11462-w","url":null,"abstract":"<div><p>Topological data analysis (TDA) is a rapidly evolving field in applied mathematics and data science that leverages tools from topology to uncover robust, shape-driven, and explainable insights in complex datasets. The main workhorse is persistent homology, a technique rooted in algebraic topology. Paired with topological deep learning (TDL) or topological machine learning, persistent homology has achieved tremendous success in a wide variety of applications in science, engineering, medicine, and industry. However, persistent homology has many limitations due to its high-level abstraction, insensitivity to non-topological changes, and restriction to point cloud data. This paper presents a comprehensive review of TDA and TDL beyond persistent homology. It analyzes how persistent topological Laplacians and Dirac operators provide spectral representations to capture both topological invariants and homotopic evolution. Other formulations are presented in terms of sheaf theory, Mayer topology, and interaction topology. For data on differentiable manifolds, techniques rooted in differential topology, such as persistent de Rham cohomology, persistent Hodge Laplacian, and Hodge decomposition, are reviewed. For one-dimensional (1D) curves embedded in 3-space, approaches from geometric topology are discussed, including multiscale Gauss-link integrals, persistent Jones polynomials, and persistent Khovanov homology. This paper further discusses the appropriate selection of topological tools for different input data, such as point clouds, sequential data, data on manifolds, curves embedded in 3-space, and data with additional non-geometric information. A review is also given of various topological representations, software packages, and machine learning vectorizations. Finally, this review ends with concluding remarks. </p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11462-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convolutional neural network algorithms in diabetic retinopathy: how far does it go? 卷积神经网络算法在糖尿病视网膜病变中的应用:它能走多远?
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-18 DOI: 10.1007/s10462-025-11453-x
Zhanchi Hu, Jie Ji, Jian-Wei Lin, Chi Xiao, Ling-Ping Cen

Convolutional Neural Networks (CNNs) have rapidly transformed the landscape of medical image analysis, particularly in the detection and management of diabetic retinopathy (DR), a leading cause of irreversible vision loss worldwide. Since their inception in early architectures such as LeNet and AlexNet, CNNs have evolved into advanced designs including VGGNet, Inception, ResNet, U‑Net, DenseNet, EfficientNet, and ConvNeXt, enabling increasingly precise classification, detection, and segmentation of retinal lesions. This systematic review synthesizes current evidence on CNN applications across diverse DR tasks, including disease classification, lesion localization, vessel segmentation, diabetic macular edema detection, ischemia assessment, and disease monitoring. Comparative analyses highlight the performance of CNNs relative to traditional machine learning methods, as well as the benefits of ensemble strategies and customized architectures tailored to multimodal imaging. Despite remarkable progress, real‑world deployment faces persistent challenges related to data quality, computational demands, interpretability, clinical reliability, and ethical considerations. Emerging solutions such as lightweight CNNs, hybrid models, and ensemble learning approaches aim to enhance scalability, efficiency, and clinical integration. The findings reveal both the transformative potential and the limitations of CNNs in DR management, underscoring the need for future research on robust, interpretable, and ethically aligned systems. This review provides ophthalmologists, data scientists, and healthcare practitioners with the latest insights into CNN‑based diabetic retinopathy analysis, fostering the advancement of accessible and clinically reliable diagnostic technologies poised to transform patient care and outcomes.

卷积神经网络(cnn)已经迅速改变了医学图像分析的格局,特别是在糖尿病视网膜病变(DR)的检测和管理方面,糖尿病视网膜病变是全球范围内不可逆转的视力丧失的主要原因。从LeNet和AlexNet等早期架构开始,cnn已经发展成为包括VGGNet、inception、ResNet、U - Net、DenseNet、EfficientNet和ConvNeXt在内的先进设计,能够越来越精确地对视网膜病变进行分类、检测和分割。本系统综述综合了CNN在不同DR任务中的应用的现有证据,包括疾病分类、病变定位、血管分割、糖尿病黄斑水肿检测、缺血评估和疾病监测。对比分析强调了cnn相对于传统机器学习方法的性能,以及针对多模态成像的集成策略和定制架构的优势。尽管取得了显著进展,但现实世界的部署仍面临着与数据质量、计算需求、可解释性、临床可靠性和伦理考虑相关的持续挑战。诸如轻量级cnn、混合模型和集成学习方法等新兴解决方案旨在增强可扩展性、效率和临床集成。研究结果揭示了cnn在DR管理中的变革潜力和局限性,强调了未来对稳健、可解释和符合伦理的系统进行研究的必要性。本综述为眼科医生、数据科学家和医疗保健从业者提供了基于CNN的糖尿病视网膜病变分析的最新见解,促进了易于获取和临床可靠的诊断技术的进步,有望改变患者的护理和结果。
{"title":"Convolutional neural network algorithms in diabetic retinopathy: how far does it go?","authors":"Zhanchi Hu,&nbsp;Jie Ji,&nbsp;Jian-Wei Lin,&nbsp;Chi Xiao,&nbsp;Ling-Ping Cen","doi":"10.1007/s10462-025-11453-x","DOIUrl":"10.1007/s10462-025-11453-x","url":null,"abstract":"<div><p>Convolutional Neural Networks (CNNs) have rapidly transformed the landscape of medical image analysis, particularly in the detection and management of diabetic retinopathy (DR), a leading cause of irreversible vision loss worldwide. Since their inception in early architectures such as LeNet and AlexNet, CNNs have evolved into advanced designs including VGGNet, Inception, ResNet, U‑Net, DenseNet, EfficientNet, and ConvNeXt, enabling increasingly precise classification, detection, and segmentation of retinal lesions. This systematic review synthesizes current evidence on CNN applications across diverse DR tasks, including disease classification, lesion localization, vessel segmentation, diabetic macular edema detection, ischemia assessment, and disease monitoring. Comparative analyses highlight the performance of CNNs relative to traditional machine learning methods, as well as the benefits of ensemble strategies and customized architectures tailored to multimodal imaging. Despite remarkable progress, real‑world deployment faces persistent challenges related to data quality, computational demands, interpretability, clinical reliability, and ethical considerations. Emerging solutions such as lightweight CNNs, hybrid models, and ensemble learning approaches aim to enhance scalability, efficiency, and clinical integration. The findings reveal both the transformative potential and the limitations of CNNs in DR management, underscoring the need for future research on robust, interpretable, and ethically aligned systems. This review provides ophthalmologists, data scientists, and healthcare practitioners with the latest insights into CNN‑based diabetic retinopathy analysis, fostering the advancement of accessible and clinically reliable diagnostic technologies poised to transform patient care and outcomes.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11453-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Systematic review on aspect-based sentiment analysis in cross-domain 跨领域基于方面的情感分析系统综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-16 DOI: 10.1007/s10462-025-11437-x
René Vieira Santin, Solange Oliveira Rezende

Aspect-level sentiment analysis is crucial for consumers and institutions, enabling them to monitor satisfaction regarding specific aspects of products and services through user reviews. Over time, various artificial intelligence techniques have been implemented with significant success. However, most of these techniques rely heavily on a substantial amount of labeled data. In this context, Cross-Domain Aspect-Based Sentiment Analysis emerges, leveraging data from source domains to enhance performance in the target domain. This systematic review contributes to this framework by outlining the primary solutions developed to tackle this challenge. It presents their data sources, compared methods, and the evolution of the main technologies adopted while identifying gaps that may inspire future research endeavors. A new classification of models is proposed here, considering the cross-domain approach. This fresh perspective aims to assist researchers in their quest for innovation, clarifying the context of their proposal and suggesting relevant comparisons with existing works.

方面级情感分析对于消费者和机构来说是至关重要的,它使他们能够通过用户评论来监控产品和服务的特定方面的满意度。随着时间的推移,各种人工智能技术已经取得了巨大的成功。然而,大多数这些技术严重依赖于大量的标记数据。在这种情况下,跨领域基于方面的情感分析出现了,利用来自源领域的数据来增强目标领域的性能。本系统综述概述了为应对这一挑战而制定的主要解决方案,从而有助于建立这一框架。它介绍了它们的数据来源、比较的方法以及所采用的主要技术的演变,同时确定了可能激发未来研究努力的差距。本文提出了一种基于跨域方法的模型分类方法。这种新的视角旨在帮助研究人员寻求创新,澄清他们的建议的背景,并建议与现有作品进行相关比较。
{"title":"Systematic review on aspect-based sentiment analysis in cross-domain","authors":"René Vieira Santin,&nbsp;Solange Oliveira Rezende","doi":"10.1007/s10462-025-11437-x","DOIUrl":"10.1007/s10462-025-11437-x","url":null,"abstract":"<div><p>Aspect-level sentiment analysis is crucial for consumers and institutions, enabling them to monitor satisfaction regarding specific aspects of products and services through user reviews. Over time, various artificial intelligence techniques have been implemented with significant success. However, most of these techniques rely heavily on a substantial amount of labeled data. In this context, Cross-Domain Aspect-Based Sentiment Analysis emerges, leveraging data from source domains to enhance performance in the target domain. This systematic review contributes to this framework by outlining the primary solutions developed to tackle this challenge. It presents their data sources, compared methods, and the evolution of the main technologies adopted while identifying gaps that may inspire future research endeavors. A new classification of models is proposed here, considering the cross-domain approach. This fresh perspective aims to assist researchers in their quest for innovation, clarifying the context of their proposal and suggesting relevant comparisons with existing works.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11437-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review of network delay prediction and advances in large language models for air traffic 空中交通网络延迟预测及大语言模型研究进展
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-16 DOI: 10.1007/s10462-025-11400-w
Mengyuan Sun, Yong Tian, Jiangchen Li, Cheng-Lung Wu, Liqun Peng, Shucai Xu

Traffic network delays seriously affect the air transportation system’s safety, economy, and efficiency, and have always been a global concern. Flight delays usually propagate within airport networks, causing subsequent flights to be delayed. However, existing works lack in considering network causality, and the incorporation of emerging large language models (LLMs). Thus, this paper endeavours to examine the literature on network delay prediction that combines different background knowledge with journal paper publishing data. Particularly, the network delay prediction methods are categorized into four aspects: classic methods without explicit network topology modelling, traditional explicit network-based prediction methods, emerging deep learning methods, and the application of LLMs in transportation. Classic methods without explicit network topology modelling, including statistical analysis, operations research, traditional machine learning and causal inference without network structures, offer interpretable baselines but fail to capture the complexity and nonlinearity of air traffic systems. Traditional explicit network-based prediction methods often approach air traffic systems through frameworks such as complex networks and queuing theory, with an increasing focus on causal relationship analysis. However, these methods fall short in capturing the spatiotemporal dependencies of network delays, particularly in modelling spatiotemporal causality. In contrast, emerging deep learning methods have advanced significantly, enabling the construction of spatiotemporal causal networks and improving the accuracy of network delay prediction. In addition, some future trends are analyzed. It is concluded that graph neural networks with causality and emerging deep learning methods (e.g., spatiotemporal GCN) are identified as essential directions. Moreover, a conceptual AirTraffic LLM is suggested via a novel Spatial-Temporal Causal Large Language Model (STC-LLM) framework for high-precision flight delay prediction, which requires further experimental validation and real-world testing. Nevertheless, issues such as data privacy, model opacity, and high computational costs must be carefully addressed when applying LLMs. Finally, the findings are expected to enhance understanding of delay propagation among researchers, practitioners, and policymakers, while providing insights and guidance to airports, airlines, and air traffic control.

交通网络延误严重影响航空运输系统的安全性、经济性和效率,一直是全球关注的问题。航班延误通常会在机场网络内传播,导致后续航班延误。然而,现有的工作缺乏考虑网络因果关系,并纳入新兴的大语言模型(llm)。因此,本文试图对结合不同背景知识和期刊论文发表数据的网络延迟预测文献进行研究。其中,网络延迟预测方法主要分为经典的无显式网络拓扑建模方法、传统的基于显式网络的预测方法、新兴的深度学习方法和llm在交通运输中的应用四个方面。没有明确网络拓扑建模的经典方法,包括统计分析、运筹学、传统机器学习和没有网络结构的因果推理,提供了可解释的基线,但未能捕捉空中交通系统的复杂性和非线性。传统的基于显式网络的预测方法通常通过复杂网络和排队论等框架来研究空中交通系统,并越来越关注因果关系分析。然而,这些方法在捕捉网络延迟的时空依赖性方面存在不足,特别是在建模时空因果关系方面。相比之下,新兴的深度学习方法取得了显著进展,可以构建时空因果网络,提高网络延迟预测的准确性。此外,对未来的发展趋势进行了分析。结论是,具有因果关系的图神经网络和新兴的深度学习方法(如时空GCN)被确定为基本方向。此外,通过一种新颖的时空因果大语言模型(STC-LLM)框架,提出了用于高精度航班延误预测的概念性空中交通LLM,这需要进一步的实验验证和实际测试。然而,在应用法学硕士时,必须仔细解决数据隐私、模型不透明和高计算成本等问题。最后,研究结果有望增强研究人员、从业者和政策制定者对延误传播的理解,同时为机场、航空公司和空中交通管制提供见解和指导。
{"title":"A review of network delay prediction and advances in large language models for air traffic","authors":"Mengyuan Sun,&nbsp;Yong Tian,&nbsp;Jiangchen Li,&nbsp;Cheng-Lung Wu,&nbsp;Liqun Peng,&nbsp;Shucai Xu","doi":"10.1007/s10462-025-11400-w","DOIUrl":"10.1007/s10462-025-11400-w","url":null,"abstract":"<div><p>Traffic network delays seriously affect the air transportation system’s safety, economy, and efficiency, and have always been a global concern. Flight delays usually propagate within airport networks, causing subsequent flights to be delayed. However, existing works lack in considering network causality, and the incorporation of emerging large language models (LLMs). Thus, this paper endeavours to examine the literature on network delay prediction that combines different background knowledge with journal paper publishing data. Particularly, the network delay prediction methods are categorized into four aspects: classic methods without explicit network topology modelling, traditional explicit network-based prediction methods, emerging deep learning methods, and the application of LLMs in transportation. Classic methods without explicit network topology modelling, including statistical analysis, operations research, traditional machine learning and causal inference without network structures, offer interpretable baselines but fail to capture the complexity and nonlinearity of air traffic systems. Traditional explicit network-based prediction methods often approach air traffic systems through frameworks such as complex networks and queuing theory, with an increasing focus on causal relationship analysis. However, these methods fall short in capturing the spatiotemporal dependencies of network delays, particularly in modelling spatiotemporal causality. In contrast, emerging deep learning methods have advanced significantly, enabling the construction of spatiotemporal causal networks and improving the accuracy of network delay prediction. In addition, some future trends are analyzed. It is concluded that graph neural networks with causality and emerging deep learning methods (e.g., spatiotemporal GCN) are identified as essential directions. Moreover, a conceptual AirTraffic LLM is suggested via a novel Spatial-Temporal Causal Large Language Model (STC-LLM) framework for high-precision flight delay prediction, which requires further experimental validation and real-world testing. Nevertheless, issues such as data privacy, model opacity, and high computational costs must be carefully addressed when applying LLMs. Finally, the findings are expected to enhance understanding of delay propagation among researchers, practitioners, and policymakers, while providing insights and guidance to airports, airlines, and air traffic control.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11400-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence Review
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1