首页 > 最新文献

Artificial Intelligence Review最新文献

英文 中文
Finer monocular depth estimation with long range in various driving lighting environments 更精细的单目深度估计与远距离在各种驾驶照明环境
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10462-025-11436-y
Yan Liu, Mingyu Yan, Yanqiu Xiao, Guangzhen Cui, Li Han

Depth estimation methods for autonomous driving application face numerous challenges, such as capturing fine details and handling varying lighting conditions. Based on these challenges, LRDepth is proposed to improve the depth estimation task, which includes a simple high frequency enhancement module (HFEM) and a progressive residual denoising diffusion (PRDD) module. HFEM aids in extracting high-frequency components and amplifying the features, such as object edge details, generating more precise depth predictions. Inspired by the strong performance of diffusion models in various vision tasks, PRDD is designed to refine the depth predictions by reducing noise and enhancing edge details, which ensures the accurate representation of distant objects and subtle features. Extensive experiments on the KITTI and DIODE datasets demonstrated that the proposed network boosts the performance of monocular depth estimation, achieving more accurate long range depth predictions and improving model robustness in various lighting environments. The experiment results verified the method's adaptability, and the model is potential for real-world applications, which is beneficial for the optimization of visual perception module in intelligent driving system.

自动驾驶应用的深度估计方法面临许多挑战,例如捕捉精细细节和处理不同的照明条件。基于这些挑战,提出了LRDepth方法来改进深度估计任务,该方法包括一个简单的高频增强模块(HFEM)和一个渐进残差去噪扩散模块(PRDD)。HFEM有助于提取高频成分并放大特征,例如物体边缘细节,从而产生更精确的深度预测。受扩散模型在各种视觉任务中的强大性能的启发,PRDD旨在通过降低噪声和增强边缘细节来改进深度预测,从而确保对远处物体和细微特征的准确表示。在KITTI和DIODE数据集上进行的大量实验表明,所提出的网络提高了单目深度估计的性能,实现了更准确的远程深度预测,并提高了模型在各种照明环境下的鲁棒性。实验结果验证了该方法的适应性,该模型具有实际应用的潜力,有利于智能驾驶系统中视觉感知模块的优化。
{"title":"Finer monocular depth estimation with long range in various driving lighting environments","authors":"Yan Liu,&nbsp;Mingyu Yan,&nbsp;Yanqiu Xiao,&nbsp;Guangzhen Cui,&nbsp;Li Han","doi":"10.1007/s10462-025-11436-y","DOIUrl":"10.1007/s10462-025-11436-y","url":null,"abstract":"<div><p>Depth estimation methods for autonomous driving application face numerous challenges, such as capturing fine details and handling varying lighting conditions. Based on these challenges, LRDepth is proposed to improve the depth estimation task, which includes a simple high frequency enhancement module (HFEM) and a progressive residual denoising diffusion (PRDD) module. HFEM aids in extracting high-frequency components and amplifying the features, such as object edge details, generating more precise depth predictions. Inspired by the strong performance of diffusion models in various vision tasks, PRDD is designed to refine the depth predictions by reducing noise and enhancing edge details, which ensures the accurate representation of distant objects and subtle features. Extensive experiments on the KITTI and DIODE datasets demonstrated that the proposed network boosts the performance of monocular depth estimation, achieving more accurate long range depth predictions and improving model robustness in various lighting environments. The experiment results verified the method's adaptability, and the model is potential for real-world applications, which is beneficial for the optimization of visual perception module in intelligent driving system.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11436-y.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145909045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The intersection of artificial intelligence and assistive technologies in the diagnosis and intervention of mental health conditions 人工智能和辅助技术在心理健康状况诊断和干预中的交叉
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10462-025-11447-9
Muhammad Abrar, Mujeeb ur Rehman, Sohail Khalid, Rahmat Ullah

Mental health disorders are becoming a major global health concern and pose a significant burden on global healthcare systems. Nearly one billion people suffer from mental disorders, accounting for 13% of the global disease burden and $1 trillion in annual productivity loss. Depression is the leading cause of disability and suicide is the second leading cause of death among young individuals. Economic uncertainty, social isolation, climate change, shifting societal norms, political conflict, and increasing violence are key factors contributing to the high prevalence of mental health issues. In the future, increasing poverty and inequality are likely to worsen this trend, resulting in a greater incidence and burden of mental illness. Therefore, timely diagnosis and intervention are a high priority. Traditional diagnostic and intervention methods, such as self-report questionnaires, clinical interviews, psychotherapy, medication, electroconvulsive therapy, and occupational therapy, have drawbacks including subjectivity, time commitment, and the potential for prolonged treatment. Due to these limitations, advanced approaches are needed to improve diagnostic accuracy and precision and to develop more effective interventions. This review aims to explore and evaluate the applications of Artificial Intelligence in the diagnosis and treatment of mental health conditions. This study provides a thorough analysis of various artificial intelligence-driven techniques and their advancements in the diagnosis of mental health conditions. Artificial intelligence has the potential to greatly improve the accuracy and effectiveness of mental health conditions. Moreover, this work consolidates the research gaps in current techniques and provides research hypotheses on how to overcome the gaps using a proposed 3-tier solution.

精神健康障碍正在成为一个主要的全球卫生问题,并对全球卫生保健系统构成重大负担。近10亿人患有精神障碍,占全球疾病负担的13%,每年造成1万亿美元的生产力损失。抑郁症是导致残疾的主要原因,自杀是导致年轻人死亡的第二大原因。经济不确定性、社会孤立、气候变化、社会规范转变、政治冲突和暴力增加是导致精神卫生问题高比例流行的关键因素。在未来,日益增加的贫困和不平等可能会使这一趋势恶化,导致精神疾病的发病率和负担增加。因此,及时诊断和干预是重中之重。传统的诊断和干预方法,如自我报告问卷、临床访谈、心理治疗、药物治疗、电休克治疗和职业治疗等,存在主观性、时间投入和延长治疗时间等缺点。由于这些限制,需要先进的方法来提高诊断的准确性和精确性,并制定更有效的干预措施。本文旨在探讨和评价人工智能在精神疾病诊断和治疗中的应用。本研究提供了各种人工智能驱动的技术及其在心理健康状况诊断方面的进展的全面分析。人工智能有可能大大提高心理健康状况的准确性和有效性。此外,这项工作巩固了当前技术的研究差距,并提供了如何使用提议的三层解决方案来克服差距的研究假设。
{"title":"The intersection of artificial intelligence and assistive technologies in the diagnosis and intervention of mental health conditions","authors":"Muhammad Abrar,&nbsp;Mujeeb ur Rehman,&nbsp;Sohail Khalid,&nbsp;Rahmat Ullah","doi":"10.1007/s10462-025-11447-9","DOIUrl":"10.1007/s10462-025-11447-9","url":null,"abstract":"<div><p>Mental health disorders are becoming a major global health concern and pose a significant burden on global healthcare systems. Nearly one billion people suffer from mental disorders, accounting for 13% of the global disease burden and $1 trillion in annual productivity loss. Depression is the leading cause of disability and suicide is the second leading cause of death among young individuals. Economic uncertainty, social isolation, climate change, shifting societal norms, political conflict, and increasing violence are key factors contributing to the high prevalence of mental health issues. In the future, increasing poverty and inequality are likely to worsen this trend, resulting in a greater incidence and burden of mental illness. Therefore, timely diagnosis and intervention are a high priority. Traditional diagnostic and intervention methods, such as self-report questionnaires, clinical interviews, psychotherapy, medication, electroconvulsive therapy, and occupational therapy, have drawbacks including subjectivity, time commitment, and the potential for prolonged treatment. Due to these limitations, advanced approaches are needed to improve diagnostic accuracy and precision and to develop more effective interventions. This review aims to explore and evaluate the applications of Artificial Intelligence in the diagnosis and treatment of mental health conditions. This study provides a thorough analysis of various artificial intelligence-driven techniques and their advancements in the diagnosis of mental health conditions. Artificial intelligence has the potential to greatly improve the accuracy and effectiveness of mental health conditions. Moreover, this work consolidates the research gaps in current techniques and provides research hypotheses on how to overcome the gaps using a proposed 3-tier solution.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11447-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145909044","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-objective genetic programming-based algorithmic trading, using directional changes and a modified sharpe ratio score for identifying optimal trading strategies 基于多目标遗传规划的交易算法,利用方向变化和改进的夏普比率分数来确定最优交易策略
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10462-025-11390-9
Xinpeng Long, Michael Kampouridis, Tasos Papastylianou

This study explores the integration of directional changes (DC), genetic programming (GP), and multi-objective optimisation (MOO) to develop advanced algorithmic trading strategies. Directional changes offer a dynamic, event-based approach to market analysis, identifying significant price movements and trends. Genetic programming evolves trading rules to discover effective and profitable strategies. However, financial trading presents a multi-objective challenge, balancing conflicting objectives such as returns and risk. We propose a novel algorithmic trading framework, termed MOO3, which integrates genetic programming with the NSGA-II multi-objective optimisation algorithm to optimise three fitness functions: total return, expected rate of return, and risk. While the use of NSGA-II itself is well-established, our contribution lies in how we apply it within a trading context that combines (i) directional changes, (ii) genetic programming with both DC-based and physical-time indicators, and (iii) a modified Sharpe Ratio for post-optimisation strategy selection based on trader preferences. Utilising indicators from both paradigms allows the GP algorithm to create profitable trading strategies, while the multi-objective fitness function allows it to simultaneously optimise for risk. A definitive strategy is chosen from Pareto-optimal solutions using the modified Sharpe Ratio, allowing traders to prioritise multiple objectives. Our methodology is tested on 110 stock datasets from 10 international markets, aiming to demonstrate that the multi-objective framework can yield superior trading strategies with lower risk. Results indicate that the MOO3 algorithm consistently and significantly outperforms single-objective optimisation (SOO) methods, even when the same SOO criterion is employed for choosing a single, definitive investment strategy from the Pareto front.

本研究探讨了定向变化(DC)、遗传规划(GP)和多目标优化(MOO)的整合,以开发先进的算法交易策略。方向性变化为市场分析提供了一种动态的、基于事件的方法,可以识别重要的价格变动和趋势。遗传编程进化交易规则,以发现有效和有利可图的策略。然而,金融交易提出了一个多目标的挑战,平衡冲突的目标,如回报和风险。我们提出了一种新的算法交易框架,称为MOO3,它将遗传规划与NSGA-II多目标优化算法相结合,以优化三个适应度函数:总收益、预期收益率和风险。虽然NSGA-II本身的使用是完善的,但我们的贡献在于我们如何将其应用于交易环境中,该环境结合了(i)方向变化,(ii)基于dc和物理时间指标的遗传规划,以及(iii)基于交易者偏好的优化后策略选择的修改夏普比率。利用两种范式的指标,GP算法可以创建有利可图的交易策略,而多目标适应度函数可以同时优化风险。使用修改的夏普比率从帕累托最优解决方案中选择确定的策略,允许交易者优先考虑多个目标。我们的方法在来自10个国际市场的110个股票数据集上进行了测试,旨在证明多目标框架可以产生具有较低风险的卓越交易策略。结果表明,MOO3算法持续且显著优于单目标优化(SOO)方法,即使采用相同的SOO标准从Pareto前沿选择单一,确定的投资策略。
{"title":"Multi-objective genetic programming-based algorithmic trading, using directional changes and a modified sharpe ratio score for identifying optimal trading strategies","authors":"Xinpeng Long,&nbsp;Michael Kampouridis,&nbsp;Tasos Papastylianou","doi":"10.1007/s10462-025-11390-9","DOIUrl":"10.1007/s10462-025-11390-9","url":null,"abstract":"<div><p>This study explores the integration of directional changes (DC), genetic programming (GP), and multi-objective optimisation (MOO) to develop advanced algorithmic trading strategies. Directional changes offer a dynamic, event-based approach to market analysis, identifying significant price movements and trends. Genetic programming evolves trading rules to discover effective and profitable strategies. However, financial trading presents a multi-objective challenge, balancing conflicting objectives such as returns and risk. We propose a novel algorithmic trading framework, termed MOO3, which integrates genetic programming with the NSGA-II multi-objective optimisation algorithm to optimise three fitness functions: total return, expected rate of return, and risk. While the use of NSGA-II itself is well-established, our contribution lies in how we apply it within a trading context that combines (i) directional changes, (ii) genetic programming with both DC-based and physical-time indicators, and (iii) a modified Sharpe Ratio for post-optimisation strategy selection based on trader preferences. Utilising indicators from both paradigms allows the GP algorithm to create profitable trading strategies, while the multi-objective fitness function allows it to simultaneously optimise for risk. A definitive strategy is chosen from Pareto-optimal solutions using the modified Sharpe Ratio, allowing traders to prioritise multiple objectives. Our methodology is tested on 110 stock datasets from 10 international markets, aiming to demonstrate that the multi-objective framework can yield superior trading strategies with lower risk. Results indicate that the MOO3 algorithm consistently and significantly outperforms single-objective optimisation (SOO) methods, even when the same SOO criterion is employed for choosing a single, definitive investment strategy from the Pareto front.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11390-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145909002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Emerging computational intelligence based techniques for lung cancer diagnosis and classification on chest CT scan images: a comprehensive survey 基于新兴计算智能的胸部CT扫描图像肺癌诊断和分类技术:一项综合调查
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10462-025-11374-9
Pankaj Kumari, Lavika Goel

Worldwide lung cancer is a significant reason for death resulting from cancer with early diagnosis crucial for enhancing patient results. This comprehensive survey looks at the most recent developments in methods for detecting lung cancer by using chest CT scan images. The study describes a broad variety of approaches includes methods for machine learning such random forests support vector machines logistic regression and k-nearest neighbors in addition to deep learning frameworks such as variational autoencoders recurrent neural networks convolutional neural networks and generative adversarial networks. Additionally the survey explores hybrid models that combine deep learning and machine learning with nature-inspired optimization techniques to enhance performance. All the techniques discussed in this paper mainly focus on the diagnosis of NSCLC i.e. non-small cell lung cancer as it is more prevalent. The paper also reviews multiple advanced techniques used in diagnosis of lung cancer, including 3D-CNN i.e. Convolutional Neural Networks, multimodal logistic regression models and Cyclic Variational Autoencoders. It highlights key publicly available datasets frequently used in this research area such as LIDC-IDRI (Lung Image Database Consortium and Image Database Resource Initiative), LUNA16 (Lung Nodule Analysis 2016), the Kaggle lung cancer dataset, NSCLC Radiogenomics and the NIH (National Institutes of Health) chest X-ray database. This survey provides a detailed comparison of each technique, describing their advantages, limitations, and reported performance metrics, especially in terms of classification accuracy. Transfer learning with Vision Transformer achieves the highest accuracy of 94.6%, while 3D Convolutional Neural Network (3D -CNN) achieves an accuracy of 93.7%, both of which are showcasing highest performance on applicable datasets. Furthermore, the research demonstrates the potential of emerging techniques like federated learning and explainable AI in addressing challenges pertaining to data privacy and model interpretability. This survey paper reviews several techniques and finds that deep learning is the most extensively researched area in lung cancer diagnosis. This approach is not only widely used but also exhibits notable success in identifying and categorizing lung cancer with a high degree of accuracy.

在世界范围内,肺癌是导致癌症死亡的一个重要原因,早期诊断对于提高患者的治疗效果至关重要。这项综合调查着眼于使用胸部CT扫描图像检测肺癌方法的最新进展。该研究描述了各种各样的方法,包括机器学习方法,如随机森林,支持向量机,逻辑回归和k近邻,以及深度学习框架,如变分自编码器,循环神经网络,卷积神经网络和生成对抗网络。此外,该调查还探讨了将深度学习和机器学习与自然优化技术相结合的混合模型,以提高性能。本文讨论的所有技术主要集中在非小细胞肺癌的诊断,即非小细胞肺癌,因为它更普遍。本文还回顾了用于肺癌诊断的多种先进技术,包括3D-CNN即卷积神经网络,多模态逻辑回归模型和循环变分自编码器。它突出了该研究领域经常使用的关键公开可用数据集,如LIDC-IDRI(肺图像数据库联盟和图像数据库资源倡议),LUNA16(肺结节分析2016),Kaggle肺癌数据集,NSCLC放射基因组学和NIH(美国国立卫生研究院)胸部x射线数据库。本调查提供了每种技术的详细比较,描述了它们的优点、局限性和报告的性能指标,特别是在分类准确性方面。使用Vision Transformer的迁移学习达到了94.6%的最高准确率,而3D卷积神经网络(3D -CNN)达到了93.7%的准确率,两者都在适用的数据集上展示了最高的性能。此外,该研究还展示了联邦学习和可解释人工智能等新兴技术在解决与数据隐私和模型可解释性相关的挑战方面的潜力。本文综述了几种技术,发现深度学习是肺癌诊断中研究最广泛的领域。这种方法不仅被广泛使用,而且在肺癌的识别和分类方面也取得了显著的成功,准确率很高。
{"title":"Emerging computational intelligence based techniques for lung cancer diagnosis and classification on chest CT scan images: a comprehensive survey","authors":"Pankaj Kumari,&nbsp;Lavika Goel","doi":"10.1007/s10462-025-11374-9","DOIUrl":"10.1007/s10462-025-11374-9","url":null,"abstract":"<div><p>Worldwide lung cancer is a significant reason for death resulting from cancer with early diagnosis crucial for enhancing patient results. This comprehensive survey looks at the most recent developments in methods for detecting lung cancer by using chest CT scan images. The study describes a broad variety of approaches includes methods for machine learning such random forests support vector machines logistic regression and k-nearest neighbors in addition to deep learning frameworks such as variational autoencoders recurrent neural networks convolutional neural networks and generative adversarial networks. Additionally the survey explores hybrid models that combine deep learning and machine learning with nature-inspired optimization techniques to enhance performance. All the techniques discussed in this paper mainly focus on the diagnosis of NSCLC i.e. non-small cell lung cancer as it is more prevalent. The paper also reviews multiple advanced techniques used in diagnosis of lung cancer, including 3D-CNN i.e. Convolutional Neural Networks, multimodal logistic regression models and Cyclic Variational Autoencoders. It highlights key publicly available datasets frequently used in this research area such as LIDC-IDRI (Lung Image Database Consortium and Image Database Resource Initiative), LUNA16 (Lung Nodule Analysis 2016), the Kaggle lung cancer dataset, NSCLC Radiogenomics and the NIH (National Institutes of Health) chest X-ray database. This survey provides a detailed comparison of each technique, describing their advantages, limitations, and reported performance metrics, especially in terms of classification accuracy. Transfer learning with Vision Transformer achieves the highest accuracy of 94.6%, while 3D Convolutional Neural Network (3D -CNN) achieves an accuracy of 93.7%, both of which are showcasing highest performance on applicable datasets. Furthermore, the research demonstrates the potential of emerging techniques like federated learning and explainable AI in addressing challenges pertaining to data privacy and model interpretability. This survey paper reviews several techniques and finds that deep learning is the most extensively researched area in lung cancer diagnosis. This approach is not only widely used but also exhibits notable success in identifying and categorizing lung cancer with a high degree of accuracy.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11374-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145909007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Farthest better or nearest worse optimizer: a novel metaheuristic algorithm 最优或最劣优化器:一种新的元启发式算法
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10462-025-11443-z
Ahmad Taheri, Keyvan RahimiZadeh, Jan Baumbach, Amin Beheshti, Olga Zolotareva, Mohammed Azmi Al-Betar, Seyedali Mirjalili, Amir H. Gandomi

A novel metaheuristic optimization algorithm, namely the Farthest better or Nearest worse Optimizer (FNO) algorithm, is proposed in this paper. The idea behind the FNO algorithm is derived from the qualities and distances between agents’ positions in a search space. The process of searching in the FNO includes two phases. During the first phase of the FNO, it jumps over the nearest regions with lower potential to avoid local optima. In the second phase, the algorithm tries to explore the farthest positions with higher potential to reach or explore the global optimum. These operations aim to enhance population diversity and provide the FNO with opportunities to discover high-quality regions while avoiding low-quality regions. A structural component within FNO, called Dynamic Focus Strategy (DFS), is also presented for controlling the exploration ratio. The DFS applies a random vector as a coefficient to shrink the area around the farthest better positions throughout the search process. Several experimental studies have been conducted on well-known benchmark suites, comprising 45 benchmarks, to assess the efficacy of the FNO algorithm. Additionally, five engineering problems were used to evaluate the practical applicability of the proposed FNO algorithm. The Wilcoxon test, as a well-known non-parametric statistical test, is conducted to fairly compare results. The findings indicate that the FNO algorithm performs competitively against other state-of-the-art population-based metaheuristic algorithms on the tested problems.

提出了一种新的元启发式优化算法,即最优或最劣优化算法(FNO)。FNO算法背后的思想来源于搜索空间中代理位置之间的质量和距离。FNO中的搜索过程包括两个阶段。在FNO的第一阶段,它跳过最近的低电位区域,以避免局部最优。在第二阶段,算法试图探索具有更高潜力的最远位置,以达到或探索全局最优。这些行动的目的是增加人口多样性,并为FNO提供机会,发现优质地区,同时避开劣质地区。本文还提出了FNO中的一个结构组件,即动态聚焦策略(DFS),用于控制探测比。在整个搜索过程中,DFS应用一个随机向量作为系数来缩小最远的最佳位置周围的面积。为了评估FNO算法的有效性,已经在知名的基准测试套件上进行了一些实验研究,其中包括45个基准测试。此外,用5个工程问题来评价所提出的FNO算法的实际适用性。Wilcoxon检验是一种众所周知的非参数统计检验,用于公平比较结果。研究结果表明,在测试问题上,FNO算法与其他最先进的基于种群的元启发式算法相比具有竞争力。
{"title":"Farthest better or nearest worse optimizer: a novel metaheuristic algorithm","authors":"Ahmad Taheri,&nbsp;Keyvan RahimiZadeh,&nbsp;Jan Baumbach,&nbsp;Amin Beheshti,&nbsp;Olga Zolotareva,&nbsp;Mohammed Azmi Al-Betar,&nbsp;Seyedali Mirjalili,&nbsp;Amir H. Gandomi","doi":"10.1007/s10462-025-11443-z","DOIUrl":"10.1007/s10462-025-11443-z","url":null,"abstract":"<div><p>A novel metaheuristic optimization algorithm, namely the Farthest better or Nearest worse Optimizer (FNO) algorithm, is proposed in this paper. The idea behind the FNO algorithm is derived from the qualities and distances between agents’ positions in a search space. The process of searching in the FNO includes two phases. During the first phase of the FNO, it jumps over the nearest regions with lower potential to avoid local optima. In the second phase, the algorithm tries to explore the farthest positions with higher potential to reach or explore the global optimum. These operations aim to enhance population diversity and provide the FNO with opportunities to discover high-quality regions while avoiding low-quality regions. A structural component within FNO, called Dynamic Focus Strategy (DFS), is also presented for controlling the exploration ratio. The DFS applies a random vector as a coefficient to shrink the area around the farthest better positions throughout the search process. Several experimental studies have been conducted on well-known benchmark suites, comprising 45 benchmarks, to assess the efficacy of the FNO algorithm. Additionally, five engineering problems were used to evaluate the practical applicability of the proposed FNO algorithm. The Wilcoxon test, as a well-known non-parametric statistical test, is conducted to fairly compare results. The findings indicate that the FNO algorithm performs competitively against other state-of-the-art population-based metaheuristic algorithms on the tested problems.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11443-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-agent generalized cooperative optimization scheduling for multi-energy complementarity in microgrids 微电网多能互补的多智能体广义协同优化调度
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10462-025-11354-z
Na Xu, Chaoxu Mu, Ke Wang, Liang Ma, Zhaoyang Liu

In this paper, we study a collaborative optimization scheduling approach for high-proportion renewable energy smart microgrids to achieve multi-energy management in a distributed execution framework with centralized training. First, we construct a multi-agent distributed microgrid optimization model for this optimization problem based on different types of renewable energy sources, energy storage, power exchange with the upper grid, and time-of-use electricity prices. Then, multiple long-term optimization objectives are designed to transform the cooperative optimization scheduling problem into a multi-agent multi-objective optimization problem, addressing the challenges of dynamic optimization. To enhance the correlation of policy sampling, we propose a novel multi-objective generalized normal distribution optimization (MGNDO) algorithm. By updating the covariance matrix, the policy correlations between different agents are better captured, resulting in more cooperative action sequences. Compared to traditional action sampling methods, this approach can better accommodate complex dynamic constraints and multi-objective requirements. Finally, a smart distribution network connected to three microgrids is taken as an example to realize the cooperative optimal scheduling problem by using the proposed algorithm, MADDPG algorithm and PSO algorithm, respectively. Operational cost and new energy consumption are compared separately to further illustrate the effectiveness of the proposed approach.

本文研究了高比例可再生能源智能微电网协同优化调度方法,以实现分布式执行框架下集中训练的多能源管理。首先,针对该优化问题,构建了基于不同类型可再生能源、储能、与上网交换电力和分时电价的多智能体分布式微电网优化模型。然后,设计多个长期优化目标,将协同优化调度问题转化为多智能体多目标优化问题,解决了动态优化的挑战;为了增强策略抽样的相关性,提出了一种新的多目标广义正态分布优化算法。通过更新协方差矩阵,可以更好地捕获不同agent之间的策略相关性,从而产生更具协作性的动作序列。与传统的动作采样方法相比,该方法能更好地适应复杂的动态约束和多目标要求。最后,以连接三个微电网的智能配电网为例,分别采用本文提出的算法、madpg算法和PSO算法实现协同最优调度问题。分别比较了运行成本和新能源消耗,进一步说明了所提方法的有效性。
{"title":"Multi-agent generalized cooperative optimization scheduling for multi-energy complementarity in microgrids","authors":"Na Xu,&nbsp;Chaoxu Mu,&nbsp;Ke Wang,&nbsp;Liang Ma,&nbsp;Zhaoyang Liu","doi":"10.1007/s10462-025-11354-z","DOIUrl":"10.1007/s10462-025-11354-z","url":null,"abstract":"<div><p>In this paper, we study a collaborative optimization scheduling approach for high-proportion renewable energy smart microgrids to achieve multi-energy management in a distributed execution framework with centralized training. First, we construct a multi-agent distributed microgrid optimization model for this optimization problem based on different types of renewable energy sources, energy storage, power exchange with the upper grid, and time-of-use electricity prices. Then, multiple long-term optimization objectives are designed to transform the cooperative optimization scheduling problem into a multi-agent multi-objective optimization problem, addressing the challenges of dynamic optimization. To enhance the correlation of policy sampling, we propose a novel multi-objective generalized normal distribution optimization (MGNDO) algorithm. By updating the covariance matrix, the policy correlations between different agents are better captured, resulting in more cooperative action sequences. Compared to traditional action sampling methods, this approach can better accommodate complex dynamic constraints and multi-objective requirements. Finally, a smart distribution network connected to three microgrids is taken as an example to realize the cooperative optimal scheduling problem by using the proposed algorithm, MADDPG algorithm and PSO algorithm, respectively. Operational cost and new energy consumption are compared separately to further illustrate the effectiveness of the proposed approach.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11354-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145909003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A survey on large language models driven meta-optimizers for automated intelligent optimization 面向自动化智能优化的大型语言模型驱动元优化器综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-06 DOI: 10.1007/s10462-025-11470-w
Yan Zheng, Lida Zhang, Kaiwen Li, Rui Wang, Wenhua Li, Tao Zhang, Qingfu Zhang, Yaochu Jin

This review systematically summarizes the research progress of Large Language Models (LLMs) as meta-optimizers in the field of automated intelligent optimization algorithm design, aiming to establish a unified framework for this emerging research direction. The study first defines the core paradigm and research scope of LLM-driven meta-optimization, explaining how it overcomes the limitations of traditional optimization algorithms in terms of parameter sensitivity and cross-domain generalization. It then systematically summarizes representative methodological frameworks and typical methods for key stages such as algorithm auto-generation, dynamic selection, parameter configuration, and mutation control using LLMs, revealing the potential for synergies between generative AI and gradient-free optimization heuristics. Additionally, this paper integrates relevant performance evaluation metrics and benchmarking problems, providing practical references for researchers. Finally, the paper discusses the main challenges in this field and outlines future research directions, emphasizing the core role of LLMs as meta-optimizers in driving paradigm shifts toward algorithm design automation and intelligence.

本文系统总结了大语言模型作为元优化器在自动化智能优化算法设计领域的研究进展,旨在为这一新兴的研究方向建立一个统一的框架。本研究首先定义了llm驱动元优化的核心范式和研究范围,解释了它如何克服传统优化算法在参数敏感性和跨域泛化方面的局限性。然后系统地总结了具有代表性的方法框架和关键阶段的典型方法,如算法自动生成、动态选择、参数配置和使用llm的突变控制,揭示了生成人工智能和无梯度优化启发式之间协同作用的潜力。此外,本文还整合了相关绩效评估指标和标杆问题,为研究人员提供了实践参考。最后,本文讨论了该领域的主要挑战,并概述了未来的研究方向,强调法学硕士作为元优化器在推动范式转向算法设计自动化和智能方面的核心作用。
{"title":"A survey on large language models driven meta-optimizers for automated intelligent optimization","authors":"Yan Zheng,&nbsp;Lida Zhang,&nbsp;Kaiwen Li,&nbsp;Rui Wang,&nbsp;Wenhua Li,&nbsp;Tao Zhang,&nbsp;Qingfu Zhang,&nbsp;Yaochu Jin","doi":"10.1007/s10462-025-11470-w","DOIUrl":"10.1007/s10462-025-11470-w","url":null,"abstract":"<div>\u0000 \u0000 <p>This review systematically summarizes the research progress of Large Language Models (LLMs) as meta-optimizers in the field of automated intelligent optimization algorithm design, aiming to establish a unified framework for this emerging research direction. The study first defines the core paradigm and research scope of LLM-driven meta-optimization, explaining how it overcomes the limitations of traditional optimization algorithms in terms of parameter sensitivity and cross-domain generalization. It then systematically summarizes representative methodological frameworks and typical methods for key stages such as algorithm auto-generation, dynamic selection, parameter configuration, and mutation control using LLMs, revealing the potential for synergies between generative AI and gradient-free optimization heuristics. Additionally, this paper integrates relevant performance evaluation metrics and benchmarking problems, providing practical references for researchers. Finally, the paper discusses the main challenges in this field and outlines future research directions, emphasizing the core role of LLMs as meta-optimizers in driving paradigm shifts toward algorithm design automation and intelligence.</p>\u0000 </div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11470-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From language to action: a review of large language models as autonomous agents and tool users 从语言到行动:作为自主代理和工具用户的大型语言模型的回顾
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-06 DOI: 10.1007/s10462-025-11471-9
Sadia Sultana Chowa, Riasad Alvi, Subhey Sadi Rahman, Md Abdur Rahman, Mohaimenul Azam Khan Raiaan, Md Rafiqul Islam, Mukhtar Hussain, Sami Azam

The pursuit of human-level artificial intelligence (AI) has significantly advanced the development of autonomous agents and Large Language Models (LLMs). LLMs are now widely utilized as decision-making agents for their ability to interpret instructions, manage sequential tasks, and adapt through feedback. This review examines recent developments in employing LLMs as autonomous agents and tool users and comprises seven research questions. We only used the papers published between 2023 and 2025 in conferences of the A* and A-ranked and Q1 journals. A structured analysis of the LLM agents’ architectural design principles, dividing their applications into single-agent and multi-agent systems, and strategies for integrating external tools is presented. In addition, the cognitive mechanisms of LLMs, including reasoning, planning, and memory, and the impact of prompting methods and fine-tuning procedures on agent performance are also investigated. Furthermore, we have evaluated current benchmarks and assessment protocols and provided an analysis of 68 publicly available datasets to assess the performance of LLM-based agents in various tasks. In conducting this review, we have identified critical findings on verifiable reasoning of LLMs, the capacity for self-improvement, and the personalization of LLM-based agents. Finally, we have discussed ten future research directions to overcome these gaps.

对人类水平人工智能(AI)的追求极大地推动了自主代理和大型语言模型(llm)的发展。法学硕士现在被广泛用作决策代理,因为他们有能力解释指令,管理连续的任务,并通过反馈进行适应。这篇综述考察了最近在使用法学硕士作为自主代理和工具用户方面的发展,包括七个研究问题。我们只使用了2023 - 2025年在A*、A级和Q1期刊会议上发表的论文。结构化地分析了LLM智能体的体系结构设计原则,将其应用划分为单智能体和多智能体系统,并提出了集成外部工具的策略。此外,还研究了llm的认知机制,包括推理、计划和记忆,以及提示方法和微调程序对代理性能的影响。此外,我们评估了当前的基准和评估协议,并提供了68个公开可用数据集的分析,以评估基于llm的代理在各种任务中的性能。在进行这项审查时,我们已经确定了法学硕士的可验证推理,自我完善的能力和基于法学硕士的代理人的个性化的关键发现。最后,我们讨论了十个未来的研究方向,以克服这些差距。
{"title":"From language to action: a review of large language models as autonomous agents and tool users","authors":"Sadia Sultana Chowa,&nbsp;Riasad Alvi,&nbsp;Subhey Sadi Rahman,&nbsp;Md Abdur Rahman,&nbsp;Mohaimenul Azam Khan Raiaan,&nbsp;Md Rafiqul Islam,&nbsp;Mukhtar Hussain,&nbsp;Sami Azam","doi":"10.1007/s10462-025-11471-9","DOIUrl":"10.1007/s10462-025-11471-9","url":null,"abstract":"<div>\u0000 \u0000 <p>The pursuit of human-level artificial intelligence (AI) has significantly advanced the development of autonomous agents and Large Language Models (LLMs). LLMs are now widely utilized as decision-making agents for their ability to interpret instructions, manage sequential tasks, and adapt through feedback. This review examines recent developments in employing LLMs as autonomous agents and tool users and comprises seven research questions. We only used the papers published between 2023 and 2025 in conferences of the A* and A-ranked and Q1 journals. A structured analysis of the LLM agents’ architectural design principles, dividing their applications into single-agent and multi-agent systems, and strategies for integrating external tools is presented. In addition, the cognitive mechanisms of LLMs, including reasoning, planning, and memory, and the impact of prompting methods and fine-tuning procedures on agent performance are also investigated. Furthermore, we have evaluated current benchmarks and assessment protocols and provided an analysis of 68 publicly available datasets to assess the performance of LLM-based agents in various tasks. In conducting this review, we have identified critical findings on verifiable reasoning of LLMs, the capacity for self-improvement, and the personalization of LLM-based agents. Finally, we have discussed ten future research directions to overcome these gaps.</p>\u0000 </div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11471-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Aigc-driven human-machine intelligence in ITS: technologies, applications, evaluation framework, challenges, and future directions 智能交通系统中由ai驱动的人机智能:技术、应用、评估框架、挑战和未来方向
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-05 DOI: 10.1007/s10462-025-11467-5
Doreen Sebastian Sarwatt, Frank Kulwa, Adamu Gaston Philipo, Angela-Aida Karugila Runyoro, Huansheng Ning, Jianguo Ding

This paper explores the integration of Artificial Intelligence Generated Content (AIGC), a rapidly evolving branch of generative AI, with Human-Machine intelligence (HMI) to enhance the functionality of Intelligent Transportation Systems (ITS). As transportation systems grow increasingly complex, adaptive decision-making becomes essential for interpreting vast streams of real-time data from vehicles, infrastructure, and users. AIGC plays a transformative role in optimizing traffic flow through dynamic routing and real-time traffic management, while human intelligence ensures these systems remain responsive to evolving real-world conditions. For safety, AIGC is used to simulate complex driving scenarios for autonomous vehicle training and detect traffic anomalies, with human oversight providing contextual decisions in ambiguous situations. For sustainability, AIGC supports data-driven strategies to reduce emissions and energy use, while human expertise ensures alignment with ethical and environmental goals. This synergy enhances real-time decision-making, improving both accuracy and adaptability across ITS scenarios. The paper presents a comprehensive review of core and supporting AIGC technologies and their applications across key ITS domains. Case studies and initiatives from industry leaders demonstrate practical implementations of AIGC-driven HMI collaboration. To guide future deployments, we propose a conceptual five-layer evaluation framework for assessing AIGC-HMI systems, encompassing functional performance, human interaction, explainability, ethical compliance, and robustness. We also address challenges such as legacy system integration, data privacy, model bias, and scalability. The paper concludes by outlining future research directions, emphasizing the need for scalable, interpretable, and ethically aligned AIGC models. This work contributes to the development of intelligent, adaptive, and trustworthy transportation systems.

本文探讨了人工智能生成内容(AIGC)与人机智能(HMI)的集成,这是生成式人工智能的一个快速发展的分支,以增强智能交通系统(ITS)的功能。随着交通系统变得越来越复杂,适应性决策对于解释来自车辆、基础设施和用户的大量实时数据流变得至关重要。AIGC通过动态路由和实时交通管理在优化交通流量方面发挥着变革性作用,而人工智能则确保这些系统保持对不断变化的现实世界条件的响应。为了安全起见,AIGC用于模拟自动驾驶车辆训练的复杂驾驶场景,并检测交通异常,在模糊情况下,人工监督提供上下文决策。在可持续发展方面,AIGC支持以数据为导向的战略,以减少排放和能源使用,而人类专业知识则确保与道德和环境目标保持一致。这种协同作用增强了实时决策,提高了ITS场景的准确性和适应性。本文介绍了核心和支持AIGC技术及其在关键ITS领域的应用。来自行业领导者的案例研究和计划演示了aigc驱动的HMI协作的实际实现。为了指导未来的部署,我们提出了一个概念性的五层评估框架,用于评估AIGC-HMI系统,包括功能性能、人机交互、可解释性、道德遵从性和鲁棒性。我们还解决了遗留系统集成、数据隐私、模型偏差和可扩展性等挑战。论文最后概述了未来的研究方向,强调需要可扩展、可解释和符合伦理的AIGC模型。这项工作有助于智能、适应性和可信赖的交通系统的发展。
{"title":"Aigc-driven human-machine intelligence in ITS: technologies, applications, evaluation framework, challenges, and future directions","authors":"Doreen Sebastian Sarwatt,&nbsp;Frank Kulwa,&nbsp;Adamu Gaston Philipo,&nbsp;Angela-Aida Karugila Runyoro,&nbsp;Huansheng Ning,&nbsp;Jianguo Ding","doi":"10.1007/s10462-025-11467-5","DOIUrl":"10.1007/s10462-025-11467-5","url":null,"abstract":"<div><p>This paper explores the integration of Artificial Intelligence Generated Content (AIGC), a rapidly evolving branch of generative AI, with Human-Machine intelligence (HMI) to enhance the functionality of Intelligent Transportation Systems (ITS). As transportation systems grow increasingly complex, adaptive decision-making becomes essential for interpreting vast streams of real-time data from vehicles, infrastructure, and users. AIGC plays a transformative role in optimizing traffic flow through dynamic routing and real-time traffic management, while human intelligence ensures these systems remain responsive to evolving real-world conditions. For safety, AIGC is used to simulate complex driving scenarios for autonomous vehicle training and detect traffic anomalies, with human oversight providing contextual decisions in ambiguous situations. For sustainability, AIGC supports data-driven strategies to reduce emissions and energy use, while human expertise ensures alignment with ethical and environmental goals. This synergy enhances real-time decision-making, improving both accuracy and adaptability across ITS scenarios. The paper presents a comprehensive review of core and supporting AIGC technologies and their applications across key ITS domains. Case studies and initiatives from industry leaders demonstrate practical implementations of AIGC-driven HMI collaboration. To guide future deployments, we propose a conceptual five-layer evaluation framework for assessing AIGC-HMI systems, encompassing functional performance, human interaction, explainability, ethical compliance, and robustness. We also address challenges such as legacy system integration, data privacy, model bias, and scalability. The paper concludes by outlining future research directions, emphasizing the need for scalable, interpretable, and ethically aligned AIGC models. This work contributes to the development of intelligent, adaptive, and trustworthy transportation systems.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11467-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Watermarking techniques for large language models: a survey 大型语言模型的水印技术综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-05 DOI: 10.1007/s10462-025-11474-6
Yuqing Liang, Jiancheng Xiao, Wensheng Gan, Philip S. Yu

With the rapid advancement and extensive application of artificial intelligence technology, large language models (LLMs) are extensively used to enhance production, creativity, learning, and work efficiency across various domains. However, the abuse of LLMs also poses potential harm to human society, such as intellectual property rights issues, academic misconduct, false content, and hallucinations. Relevant research has proposed the use of LLM watermarking to achieve IP protection for LLMs and traceability of multimedia data output by LLMs. To our knowledge, this is the first thorough review that investigates and analyzes LLM watermarking technology in detail. This review begins by recounting the history of traditional watermarking technology, then analyzes the current state of LLM watermarking research, and thoroughly examines the inheritance and relevance of these techniques. By analyzing their inheritance and relevance, this review can provide research with ideas for applying traditional digital watermarking techniques to LLM watermarking, to promote the cross-integration and innovation of watermarking technology. In addition, this review examines the pros and cons of LLM watermarking. Considering the current multimodal development trend of LLMs, it provides a detailed analysis of emerging multimodal LLM watermarking, such as visual and audio data, to offer more reference ideas for relevant research. This review delves into the challenges and future prospects of current watermarking technologies, offering valuable insights for future LLM watermarking research and applications.

随着人工智能技术的快速发展和广泛应用,大型语言模型(llm)被广泛用于提高各个领域的生产、创造、学习和工作效率。然而,法学硕士的滥用也给人类社会带来了潜在的危害,如知识产权问题、学术不端、虚假内容、幻觉等。相关研究提出利用LLM水印实现LLM的IP保护和LLM多媒体数据输出的可追溯性。据我们所知,这是第一次详细调查和分析LLM水印技术的全面审查。本文首先回顾了传统水印技术的发展历史,然后分析了LLM水印研究的现状,并对这些技术的继承性和相关性进行了深入的研究。通过分析二者的继承性和相关性,为将传统数字水印技术应用于LLM水印提供研究思路,促进水印技术的交叉融合和创新。此外,本文还分析了LLM水印技术的优缺点。考虑到当前LLM的多模态发展趋势,对新兴的多模态LLM水印,如视觉和音频数据进行了详细的分析,为相关研究提供更多的参考思路。本文综述了当前水印技术面临的挑战和未来的发展前景,为今后法学硕士水印技术的研究和应用提供了有价值的见解。
{"title":"Watermarking techniques for large language models: a survey","authors":"Yuqing Liang,&nbsp;Jiancheng Xiao,&nbsp;Wensheng Gan,&nbsp;Philip S. Yu","doi":"10.1007/s10462-025-11474-6","DOIUrl":"10.1007/s10462-025-11474-6","url":null,"abstract":"<div><p>With the rapid advancement and extensive application of artificial intelligence technology, large language models (LLMs) are extensively used to enhance production, creativity, learning, and work efficiency across various domains. However, the abuse of LLMs also poses potential harm to human society, such as intellectual property rights issues, academic misconduct, false content, and hallucinations. Relevant research has proposed the use of LLM watermarking to achieve IP protection for LLMs and traceability of multimedia data output by LLMs. To our knowledge, this is the first thorough review that investigates and analyzes LLM watermarking technology in detail. This review begins by recounting the history of traditional watermarking technology, then analyzes the current state of LLM watermarking research, and thoroughly examines the inheritance and relevance of these techniques. By analyzing their inheritance and relevance, this review can provide research with ideas for applying traditional digital watermarking techniques to LLM watermarking, to promote the cross-integration and innovation of watermarking technology. In addition, this review examines the pros and cons of LLM watermarking. Considering the current multimodal development trend of LLMs, it provides a detailed analysis of emerging multimodal LLM watermarking, such as visual and audio data, to offer more reference ideas for relevant research. This review delves into the challenges and future prospects of current watermarking technologies, offering valuable insights for future LLM watermarking research and applications.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 2","pages":""},"PeriodicalIF":13.9,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11474-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146027085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence Review
全部 Contrib. Mineral. Petrol. Geobiology Gondwana Res. Paleontol. J. Geostand. Geoanal. Res. Geosci. J. Acta Oceanolog. Sin. Am. J. Phys. Anthropol. FRONT EARTH SCI-PRC Aquat. Geochem. ARCT ANTARCT ALP RES Geosci. Model Dev. Ocean and Coastal Research Environ. Prog. Sustainable Energy Nat. Geosci. J. Adv. Model. Earth Syst. Int. J. Disaster Risk Reduct. Geochim. Cosmochim. Acta J. Clim. B SOC GEOL MEX Phys. Chem. Miner. Clean Technol. Environ. Policy POL POLAR RES Sediment. Geol. Geochem. Trans. J. Hydrol. Fossil Rec. Energy Ecol Environ Estudios Demográficos y Urbanos Stud. Geophys. Geod. Conserv. Genet. Resour. Ocean Modell. Environ. Technol. Innovation QUATERNAIRE Turk. J. Earth Sci. Explor. Geophys. ECOTOXICOLOGY Geosci. Front. J PALAEONTOL SOC IND Norw. J. Geol. Terra Nova Energy Storage Expert Opin. Ther. Targets EUREKA: Physics and Engineering Ocean Sci. Quat. Res. J. Syst. Paleontol. Surv. Rev. Environmental Toxicology & Water Quality Stratigr. Geol. Correl. GEOTECH LETT ERN: Stock Market Risk (Topic) IDOJARAS Aust. J. Earth Sci. BIOGEOSCIENCES ACTA GEOL SIN-ENGL Atmos. Res. Ann. Glaciol. Acta Geochimica Environmental Epigenetics Big Earth Data Archaeol. Anthropol. Sci. Nat. Clim. Change Appl. Clay Sci. Int. J. Paleopathol. ACTA PETROL SIN Carbon Balance Manage. Int. J. Climatol. Nat. Hazards Earth Syst. Sci. Acta Geophys. Pure Appl. Geophys. IZV-PHYS SOLID EART+ Am. J. Sci. AAPG Bull. ACTA GEOL POL Am. Mineral. Hydrol. Processes GEOLOGY ECOL RESTOR OCEANOLOGIA Int. J. Biometeorol. Environ. Eng. Manage. J. Adv. Atmos. Sci. RADIOCARBON J. Atmos. Chem. Geochem. Int. ARCHAEOMETRY Basin Res. J. Atmos. Sol. Terr. Phys. Adv. Meteorol. J. Archaeol. Sci. J. Environ. Eng. Geophys. Geochem. J. Mon. Weather Rev. Energy Environ. GEOL BELG Org. Geochem. ECOSYSTEMS Appl. Geochem. Int. J. Geog. Inf. Sci.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1