首页 > 最新文献

Artificial Intelligence Review最新文献

英文 中文
The Pygmalion effect in AI: influence of cultural narratives and policies on technological development 人工智能中的皮格马利翁效应:文化叙事和政策对技术发展的影响
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-18 DOI: 10.1007/s10462-025-11407-3
T. J. Mateo Sanguino

Advances in generative artificial intelligence (AI), such as recent developments in text, audio, and video production, have amplified societal concerns, with threat probabilities estimated between 5 and 50%. This manuscript undertakes a comprehensive study to understand the factors influencing AI development, focusing on the interplay between AI research, cinematographic representations, and regulatory policies. The study reveals a strong interaction between scientific advances and cultural representations, indicating shared concerns and themes across both domains. It also highlights broad support for ethical and responsible AI development, with temporal analyses showing the significant influence of films on public perception and slower growth in policy implementation relative to cultural diffusion. The findings discuss the presence of a Pygmalion effect, where cultural representations shape perceptions of AI, and a potential Golem effect, where increased regulation may limit the dangerous development of AI and its societal impact. The study underscores the importance of balanced and ethical AI development, requiring continued monitoring and careful management of the relationship between research, cultural representations, and regulatory frameworks.

生成式人工智能(AI)的进步,如最近在文本、音频和视频制作方面的发展,加剧了社会的担忧,其威胁概率估计在5%至50%之间。本文进行了全面的研究,以了解影响人工智能发展的因素,重点是人工智能研究、电影表现和监管政策之间的相互作用。该研究揭示了科学进步和文化表现之间的强烈互动,表明这两个领域都有共同的关注点和主题。它还强调了对道德和负责任的人工智能发展的广泛支持,时间分析显示了电影对公众认知的重大影响,以及相对于文化传播,政策实施的缓慢增长。研究结果讨论了皮格马利翁效应(Pygmalion effect)的存在,即文化表征塑造了对人工智能的看法,以及潜在的魔像效应(Golem effect),即增加监管可能限制人工智能的危险发展及其社会影响。该研究强调了平衡和合乎道德的人工智能发展的重要性,需要持续监测和仔细管理研究、文化表征和监管框架之间的关系。
{"title":"The Pygmalion effect in AI: influence of cultural narratives and policies on technological development","authors":"T. J. Mateo Sanguino","doi":"10.1007/s10462-025-11407-3","DOIUrl":"10.1007/s10462-025-11407-3","url":null,"abstract":"<div><p>Advances in generative artificial intelligence (AI), such as recent developments in text, audio, and video production, have amplified societal concerns, with threat probabilities estimated between 5 and 50%. This manuscript undertakes a comprehensive study to understand the factors influencing AI development, focusing on the interplay between AI research, cinematographic representations, and regulatory policies. The study reveals a strong interaction between scientific advances and cultural representations, indicating shared concerns and themes across both domains. It also highlights broad support for ethical and responsible AI development, with temporal analyses showing the significant influence of films on public perception and slower growth in policy implementation relative to cultural diffusion. The findings discuss the presence of a Pygmalion effect, where cultural representations shape perceptions of AI, and a potential Golem effect, where increased regulation may limit the dangerous development of AI and its societal impact. The study underscores the importance of balanced and ethical AI development, requiring continued monitoring and careful management of the relationship between research, cultural representations, and regulatory frameworks.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11407-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145560954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the potential of explainable AI in brain tumor detection and classification: a systematic review 探索可解释的人工智能在脑肿瘤检测和分类中的潜力:系统综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-18 DOI: 10.1007/s10462-025-11410-8
Lincy Annet Abraham, Gopinath Palanisamy, Goutham Veerapu, J. S. Nisha

The analysis and treatment of brain tumors are among the most difficult medical conditions. Brain tumors must be detected accurately and promptly to improve patient outcomes and plan effective treatments. Recently used advanced technologies such as artificial intelligence (AI) and machine learning (ML) have increased interest in applying AI to detect brain tumors. However, concerns have emerged regarding the reliability and transparency of AI models in medical settings, as their decision-making processes are often opaque and difficult to interpret. This research is unique in its focus on explainability in AI-based brain tumor detection, prioritizing confidence, safety, and clinical adoption over mere accuracy. It gives a thorough overview of XAI methodologies, problems, and uses, linking scientific advances to the needs of real-world healthcare. XAI is a sub-section of artificial intelligence that seeks to solve this problem by offering understandable and straightforward and providing explanations for the choices made by AI representations. Applications such as healthcare, where the interpretability of AI models is essential for guaranteeing patient safety and fostering confidence between medical professionals and AI systems, have seen the introduction of XAI-based procedures. This paper reviews recent advancements in XAI-based brain tumor detection, focusing on methods that provide justifications for AI model predictions. The study highlights the advantages of XAI in improving patient outcomes and supporting medical decision-making. The findings reveal that ResNet 18 performed better, with 94% training accuracy, 96.86% testing accuracy, low loss (0.012), and a rapid time ((sim 6text {s})). ResNet 50 was a little slower ((sim 13text {s})) but stable, with 92.86% test accuracy. DenseNet121 (Adam W) achieved the highest accuracy at 97.71%, but it was not consistent across all optimizers. ViT-GRU also got 97% accuracy with very little loss (0.008), although it took a long time to compute (around 49 s). On the other hand, VGG models (around 94% test accuracy) and MobileNetV2 (loss up to 6.024) were less reliable, even though they trained faster. Additionally, it explores various opportunities, challenges, and clinical applications. Based on these findings, this research offers a comprehensive analysis of XAI-based brain tumor detection and encourages further investigation in specific areas.

脑肿瘤的分析和治疗是最困难的医疗条件之一。脑肿瘤必须准确、及时地检测出来,以改善患者的预后,并制定有效的治疗方案。最近,人工智能(AI)和机器学习(ML)等先进技术的应用使人们对人工智能在脑肿瘤检测中的应用越来越感兴趣。然而,人们对医疗环境中人工智能模型的可靠性和透明度感到担忧,因为它们的决策过程往往不透明且难以解释。这项研究的独特之处在于它专注于基于人工智能的脑肿瘤检测的可解释性,优先考虑信心、安全性和临床采用,而不仅仅是准确性。它全面概述了XAI方法、问题和用途,并将科学进步与现实世界的医疗保健需求联系起来。XAI是人工智能的一个分支,它试图通过提供可理解和直接的方法来解决这个问题,并为人工智能表示所做的选择提供解释。在医疗保健等应用中,人工智能模型的可解释性对于保证患者安全、培养医疗专业人员与人工智能系统之间的信任至关重要,这些应用已经引入了基于xai的程序。本文综述了基于xai的脑肿瘤检测的最新进展,重点介绍了为AI模型预测提供依据的方法。该研究强调了XAI在改善患者预后和支持医疗决策方面的优势。结果显示,ResNet 18的表现更好,为94% training accuracy, 96.86% testing accuracy, low loss (0.012), and a rapid time ((sim 6text {s})). ResNet 50 was a little slower ((sim 13text {s})) but stable, with 92.86% test accuracy. DenseNet121 (Adam W) achieved the highest accuracy at 97.71%, but it was not consistent across all optimizers. ViT-GRU also got 97% accuracy with very little loss (0.008), although it took a long time to compute (around 49 s). On the other hand, VGG models (around 94% test accuracy) and MobileNetV2 (loss up to 6.024) were less reliable, even though they trained faster. Additionally, it explores various opportunities, challenges, and clinical applications. Based on these findings, this research offers a comprehensive analysis of XAI-based brain tumor detection and encourages further investigation in specific areas.
{"title":"Exploring the potential of explainable AI in brain tumor detection and classification: a systematic review","authors":"Lincy Annet Abraham,&nbsp;Gopinath Palanisamy,&nbsp;Goutham Veerapu,&nbsp;J. S. Nisha","doi":"10.1007/s10462-025-11410-8","DOIUrl":"10.1007/s10462-025-11410-8","url":null,"abstract":"<div><p>The analysis and treatment of brain tumors are among the most difficult medical conditions. Brain tumors must be detected accurately and promptly to improve patient outcomes and plan effective treatments. Recently used advanced technologies such as artificial intelligence (AI) and machine learning (ML) have increased interest in applying AI to detect brain tumors. However, concerns have emerged regarding the reliability and transparency of AI models in medical settings, as their decision-making processes are often opaque and difficult to interpret. This research is unique in its focus on explainability in AI-based brain tumor detection, prioritizing confidence, safety, and clinical adoption over mere accuracy. It gives a thorough overview of XAI methodologies, problems, and uses, linking scientific advances to the needs of real-world healthcare. XAI is a sub-section of artificial intelligence that seeks to solve this problem by offering understandable and straightforward and providing explanations for the choices made by AI representations. Applications such as healthcare, where the interpretability of AI models is essential for guaranteeing patient safety and fostering confidence between medical professionals and AI systems, have seen the introduction of XAI-based procedures. This paper reviews recent advancements in XAI-based brain tumor detection, focusing on methods that provide justifications for AI model predictions. The study highlights the advantages of XAI in improving patient outcomes and supporting medical decision-making. The findings reveal that ResNet 18 performed better, with 94% training accuracy, 96.86% testing accuracy, low loss (0.012), and a rapid time <span>((sim 6text {s}))</span>. ResNet 50 was a little slower <span>((sim 13text {s}))</span> but stable, with 92.86% test accuracy. DenseNet121 (Adam W) achieved the highest accuracy at 97.71%, but it was not consistent across all optimizers. ViT-GRU also got 97% accuracy with very little loss (0.008), although it took a long time to compute (around 49 s). On the other hand, VGG models (around 94% test accuracy) and MobileNetV2 (loss up to 6.024) were less reliable, even though they trained faster. Additionally, it explores various opportunities, challenges, and clinical applications. Based on these findings, this research offers a comprehensive analysis of XAI-based brain tumor detection and encourages further investigation in specific areas.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11410-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145560956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Validation is the central challenge for generative social simulation: a critical review of LLMs in agent-based modeling 验证是生成社会模拟的核心挑战:对基于代理的建模的法学硕士的批判性回顾
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-18 DOI: 10.1007/s10462-025-11412-6
Maik Larooij, Petter Törnberg

Recent advances in Large Language Models (LLMs) have revitalized interest in Agent-Based Models (ABMs) by enabling “generative” simulations, with agents that can plan, reason, and interact through natural language. These developments promise greater realism and expressive power, but also revive long-standing concerns over empirical grounding, calibration, and validation—issues that have historically limited the uptake of ABMs in the social sciences. This paper systematically reviews the emerging literature on generative ABMs to assess how these long-standing challenges are being addressed. We map domains of application, categorize reported validation practices, and assess their alignment with the stated modeling goals. Our review suggests that the use of LLMs may exacerbate rather than alleviate the challenge of validating ABMs, given their black-box structure, cultural biases, and stochastic outputs. While the need for validation is increasingly acknowledged, studies often rely on face-validity or outcome measures that are only loosely tied to underlying mechanisms. Generative ABMs thus occupy an ambiguous methodological space—lacking both the parsimony of formal models and the empirical validity of data-driven approaches—and their contribution to cumulative social-scientific knowledge hinges on resolving this tension.

大型语言模型(llm)的最新进展通过启用“生成”模拟,使基于代理的模型(ABMs)重新受到关注,代理可以通过自然语言进行计划,推理和交互。这些发展承诺了更大的现实主义和表达能力,但也重新引发了对经验基础、校准和验证的长期关注,这些问题历来限制了社会科学对ABMs的吸收。本文系统地回顾了关于生成式ABMs的新兴文献,以评估如何解决这些长期存在的挑战。我们映射应用程序的领域,对报告的验证实践进行分类,并评估它们与声明的建模目标的一致性。我们的回顾表明,llm的使用可能会加剧而不是减轻验证abm的挑战,因为它们的黑箱结构、文化偏差和随机输出。虽然越来越多的人认识到验证的必要性,但研究往往依赖于面部效度或结果测量,而这些测量与潜在机制的联系并不紧密。因此,生成式ABMs占据了一个模糊的方法论空间——既缺乏形式模型的简约性,也缺乏数据驱动方法的经验有效性——它们对累积社会科学知识的贡献取决于解决这种紧张关系。
{"title":"Validation is the central challenge for generative social simulation: a critical review of LLMs in agent-based modeling","authors":"Maik Larooij,&nbsp;Petter Törnberg","doi":"10.1007/s10462-025-11412-6","DOIUrl":"10.1007/s10462-025-11412-6","url":null,"abstract":"<div><p>Recent advances in Large Language Models (LLMs) have revitalized interest in Agent-Based Models (ABMs) by enabling “generative” simulations, with agents that can plan, reason, and interact through natural language. These developments promise greater realism and expressive power, but also revive long-standing concerns over empirical grounding, calibration, and validation—issues that have historically limited the uptake of ABMs in the social sciences. This paper systematically reviews the emerging literature on generative ABMs to assess how these long-standing challenges are being addressed. We map domains of application, categorize reported validation practices, and assess their alignment with the stated modeling goals. Our review suggests that the use of LLMs may exacerbate rather than alleviate the challenge of validating ABMs, given their black-box structure, cultural biases, and stochastic outputs. While the need for validation is increasingly acknowledged, studies often rely on face-validity or outcome measures that are only loosely tied to underlying mechanisms. Generative ABMs thus occupy an ambiguous methodological space—lacking both the parsimony of formal models and the empirical validity of data-driven approaches—and their contribution to cumulative social-scientific knowledge hinges on resolving this tension.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11412-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145561098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning powered financial credit scoring: a systematic literature review 机器学习驱动的金融信用评分:系统文献综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-18 DOI: 10.1007/s10462-025-11416-2
Helmi Ayari, Pr. Ramzi Guetari, Pr. Naoufel Kraïem

Over the past few decades, credit scoring has become an important tool in the financial sector. It enables banks and financial institutions to assess the creditworthiness of individuals and reduce the risk of default. As a result of significant advances in artificial intelligence techniques. Machine learning (ML) has made it possible to improve credit scoring by distinguishing between people with good creditworthiness and those with poorer creditworthiness. In this article, we propose a systematic literature review of ML-based financial credit scoring methods published between 2018 and 2024. A total of 330 research papers were extracted from four different online databases and digital libraries. After the study selection procedure, 63 research papers were selected for this systematic review. This paper aims to identify the major ML methods used in credit scoring, assess their strengths and limitations, and highlight notable trends and advancements. In addition, the review addresses the critical challenges faced in the adoption of ML models for credit scoring. This study not only contributes to the understanding of effective ML techniques used for credit scoring but also guides future research by highlighting the promising avenues in ML-based credit scoring efforts.

在过去的几十年里,信用评分已经成为金融领域的一个重要工具。它使银行和金融机构能够评估个人的信誉并降低违约风险。由于人工智能技术的显著进步。机器学习(ML)可以通过区分信誉良好的人和信誉较差的人来提高信用评分。在本文中,我们对2018年至2024年间发表的基于ml的金融信用评分方法进行了系统的文献综述。共有330篇研究论文从四个不同的在线数据库和数字图书馆中提取。经过研究选择程序,本系统综述共选择63篇研究论文。本文旨在确定用于信用评分的主要机器学习方法,评估其优势和局限性,并强调值得注意的趋势和进步。此外,该审查还解决了采用ML模型进行信用评分所面临的关键挑战。这项研究不仅有助于理解用于信用评分的有效机器学习技术,而且还通过强调基于机器学习的信用评分工作的有前途的途径来指导未来的研究。
{"title":"Machine learning powered financial credit scoring: a systematic literature review","authors":"Helmi Ayari,&nbsp;Pr. Ramzi Guetari,&nbsp;Pr. Naoufel Kraïem","doi":"10.1007/s10462-025-11416-2","DOIUrl":"10.1007/s10462-025-11416-2","url":null,"abstract":"<div><p>Over the past few decades, credit scoring has become an important tool in the financial sector. It enables banks and financial institutions to assess the creditworthiness of individuals and reduce the risk of default. As a result of significant advances in artificial intelligence techniques. Machine learning (ML) has made it possible to improve credit scoring by distinguishing between people with good creditworthiness and those with poorer creditworthiness. In this article, we propose a systematic literature review of ML-based financial credit scoring methods published between 2018 and 2024. A total of 330 research papers were extracted from four different online databases and digital libraries. After the study selection procedure, 63 research papers were selected for this systematic review. This paper aims to identify the major ML methods used in credit scoring, assess their strengths and limitations, and highlight notable trends and advancements. In addition, the review addresses the critical challenges faced in the adoption of ML models for credit scoring. This study not only contributes to the understanding of effective ML techniques used for credit scoring but also guides future research by highlighting the promising avenues in ML-based credit scoring efforts.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11416-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145561091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vision-based fire management system using autonomous unmanned aerial vehicles: a comprehensive survey 基于视觉的自主无人机火灾管理系统:综合调查
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-18 DOI: 10.1007/s10462-025-11415-3
Sufyan Danish, Md. Jalil Piran, Samee Ullah Khan, Muhammad Attique Khan, L. Minh Dang, Yahya Zweiri, Hyoung-Kyu Song, Hyeonjoon Moon

In recent years, the intensity and frequency of fires have increased significantly, resulting in considerable damage to properties and the environment through wildfires, oil pipeline fires, hazardous gas emissions, and building fires. Effective fire management systems are essential for early detection, rapid response, and mitigation of fire impacts. To address this challenge, unmanned aerial vehicles (UAVs) integrated with advanced state-of-the-art deep learning techniques offer a transformative solution for real-time fire detection, monitoring, and response. As UAVs play an essential role in the detection, classification and segmentation of fire-affected regions, enhancing vision-based fire management through advanced computer vision and deep learning technologies. This comprehensive survey critically examines recent advancements in vision-based fire management systems enabled by autonomous UAVs. It explores how baseline deep learning models, including convolutional neural networks, attention mechanisms, YOLO variants, generative adversarial networks and transformers, enhance UAV capabilities for fire-related tasks. Unlike previous reviews that focus on conventional machine learning and general AI approaches, this survey emphasizes the unique advantages and applications of deep learning-driven UAV platforms in fire scenarios. It provides detailed insights into various architectures, performance and applications used in UAV-based fire management. Additionally, the paper provides detailed insights into the available fire datasets along with their download links and outlines critical challenges, including data imbalance, privacy concerns, and real-time processing limitations. Finally, the survey identifies promising future directions, including multimodal sensor fusion, lightweight neural network architectures optimized for UAV deployment, and vision-language models. By synthesizing current research and identifying future directions, this survey aims to support the development of robust, intelligent UAV-based solutions for next-generation fire management. Researchers and professionals can access the GitHub repository.

近年来,火灾的强度和频率显著增加,通过野火、石油管道火灾、有害气体排放和建筑火灾对财产和环境造成了相当大的破坏。有效的火灾管理系统对于早期发现、快速反应和减轻火灾影响至关重要。为了应对这一挑战,无人机与先进的深度学习技术相结合,为实时火灾探测、监控和响应提供了一种变革性的解决方案。由于无人机在火灾影响区域的检测、分类和分割中发挥着至关重要的作用,因此通过先进的计算机视觉和深度学习技术加强基于视觉的火灾管理。这项全面的调查严格审查了由自主无人机支持的基于视觉的火灾管理系统的最新进展。它探讨了基线深度学习模型,包括卷积神经网络、注意机制、YOLO变体、生成对抗网络和变压器,如何增强无人机执行火灾相关任务的能力。与以往关注传统机器学习和一般人工智能方法的综述不同,本次调查强调了深度学习驱动的无人机平台在五个场景中的独特优势和应用。它提供了对基于无人机的火灾管理中使用的各种架构、性能和应用的详细见解。此外,本文还提供了对现有数据集及其下载链接的详细见解,并概述了关键挑战,包括数据不平衡、隐私问题和实时处理限制。最后,该调查确定了有希望的未来方向,包括多模态传感器融合、针对无人机部署优化的轻量级神经网络架构和视觉语言模型。通过综合目前的研究和确定未来的方向,该调查旨在为下一代火灾管理提供强大的、智能的基于无人机的解决方案。研究人员和专业人士可以访问GitHub存储库。
{"title":"Vision-based fire management system using autonomous unmanned aerial vehicles: a comprehensive survey","authors":"Sufyan Danish,&nbsp;Md. Jalil Piran,&nbsp;Samee Ullah Khan,&nbsp;Muhammad Attique Khan,&nbsp;L. Minh Dang,&nbsp;Yahya Zweiri,&nbsp;Hyoung-Kyu Song,&nbsp;Hyeonjoon Moon","doi":"10.1007/s10462-025-11415-3","DOIUrl":"10.1007/s10462-025-11415-3","url":null,"abstract":"<div><p>In recent years, the intensity and frequency of fires have increased significantly, resulting in considerable damage to properties and the environment through wildfires, oil pipeline fires, hazardous gas emissions, and building fires. Effective fire management systems are essential for early detection, rapid response, and mitigation of fire impacts. To address this challenge, unmanned aerial vehicles (UAVs) integrated with advanced state-of-the-art deep learning techniques offer a transformative solution for real-time fire detection, monitoring, and response. As UAVs play an essential role in the detection, classification and segmentation of fire-affected regions, enhancing vision-based fire management through advanced computer vision and deep learning technologies. This comprehensive survey critically examines recent advancements in vision-based fire management systems enabled by autonomous UAVs. It explores how baseline deep learning models, including convolutional neural networks, attention mechanisms, YOLO variants, generative adversarial networks and transformers, enhance UAV capabilities for fire-related tasks. Unlike previous reviews that focus on conventional machine learning and general AI approaches, this survey emphasizes the unique advantages and applications of deep learning-driven UAV platforms in fire scenarios. It provides detailed insights into various architectures, performance and applications used in UAV-based fire management. Additionally, the paper provides detailed insights into the available fire datasets along with their download links and outlines critical challenges, including data imbalance, privacy concerns, and real-time processing limitations. Finally, the survey identifies promising future directions, including multimodal sensor fusion, lightweight neural network architectures optimized for UAV deployment, and vision-language models. By synthesizing current research and identifying future directions, this survey aims to support the development of robust, intelligent UAV-based solutions for next-generation fire management. Researchers and professionals can access the GitHub repository.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11415-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145561092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Agentic AI: a comprehensive survey of architectures, applications, and future directions 代理AI:对架构、应用和未来方向的全面调查
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-14 DOI: 10.1007/s10462-025-11422-4
Mohamad Abou Ali, Fadi Dornaika, Jinan Charafeddine

Agentic AI represents a transformative shift in artificial intelligence, but its rapid advancement has led to a fragmented understanding, often conflating modern neural systems with outdated symbolic models—a practice known as conceptual retrofitting. This survey cuts through this confusion by introducing a novel dual-paradigm framework that categorizes agentic systems into two distinct lineages: the symbolic/classical (relying on algorithmic planning and persistent state) and the neural/generative (leveraging stochastic generation and prompt-driven orchestration). Through a systematic PRISMA-based review of 90 studies (2018–2025), we provide a comprehensive analysis structured around this framework across three dimensions: (1) the theoretical foundations and architectural principles defining each paradigm; (2) domain-specific implementations in healthcare, finance, and robotics, demonstrating how application constraints dictate paradigm selection; and (3) paradigm-specific ethical and governance challenges, revealing divergent risks and mitigation strategies. Our analysis reveals that the choice of paradigm is strategic: symbolic systems dominate safety-critical domains (e.g., healthcare), while neural systems prevail in adaptive, data-rich environments (e.g., finance). Furthermore, we identify critical research gaps, including a significant deficit in governance models for symbolic systems and a pressing need for hybrid neuro-symbolic architectures. The findings culminate in a strategic roadmap arguing that the future of Agentic AI lies not in the dominance of one paradigm, but in their intentional integration to create systems that are both adaptable and reliable. This work provides the essential conceptual toolkit to guide future research, development, and policy toward robust and trustworthy hybrid intelligent systems.

人工智能代表了人工智能的变革,但它的快速发展导致了对它的理解支离破碎,经常将现代神经系统与过时的符号模型混为一谈——这种做法被称为概念改造。本研究通过引入一种新的双范式框架来消除这种困惑,该框架将代理系统分为两个不同的谱系:符号/经典(依赖于算法规划和持续状态)和神经/生成(利用随机生成和即时驱动的编排)。通过对90项研究(2018-2025)基于prisma的系统回顾,我们围绕这一框架从三个方面进行了全面分析:(1)定义每种范式的理论基础和架构原则;(2)在医疗保健、金融和机器人领域的特定实现,展示了应用约束如何决定范式选择;(3)特定范例的伦理和治理挑战,揭示了不同的风险和缓解策略。我们的分析表明,范式的选择是战略性的:符号系统主导着安全关键领域(例如,医疗保健),而神经系统在自适应、数据丰富的环境(例如,金融)中占主导地位。此外,我们确定了关键的研究空白,包括符号系统治理模型的重大缺陷和对混合神经符号架构的迫切需求。这些发现最终形成了一个战略路线图,认为人工智能的未来不在于一个范例的主导地位,而在于它们的有意整合,以创建既适应性强又可靠的系统。这项工作提供了基本的概念工具包,以指导未来的研究、开发和政策,以实现健壮和可信的混合智能系统。
{"title":"Agentic AI: a comprehensive survey of architectures, applications, and future directions","authors":"Mohamad Abou Ali,&nbsp;Fadi Dornaika,&nbsp;Jinan Charafeddine","doi":"10.1007/s10462-025-11422-4","DOIUrl":"10.1007/s10462-025-11422-4","url":null,"abstract":"<div><p>Agentic AI represents a transformative shift in artificial intelligence, but its rapid advancement has led to a fragmented understanding, often conflating modern neural systems with outdated symbolic models—a practice known as <i>conceptual retrofitting</i>. This survey cuts through this confusion by introducing a novel dual-paradigm framework that categorizes agentic systems into two distinct lineages: the symbolic/classical (relying on algorithmic planning and persistent state) and the neural/generative (leveraging stochastic generation and prompt-driven orchestration). Through a systematic PRISMA-based review of 90 studies (2018–2025), we provide a comprehensive analysis structured around this framework across three dimensions: (1) the theoretical foundations and architectural principles defining each paradigm; (2) domain-specific implementations in healthcare, finance, and robotics, demonstrating how application constraints dictate paradigm selection; and (3) paradigm-specific ethical and governance challenges, revealing divergent risks and mitigation strategies. Our analysis reveals that the choice of paradigm is strategic: symbolic systems dominate safety-critical domains (e.g., healthcare), while neural systems prevail in adaptive, data-rich environments (e.g., finance). Furthermore, we identify critical research gaps, including a significant deficit in governance models for symbolic systems and a pressing need for hybrid neuro-symbolic architectures. The findings culminate in a strategic roadmap arguing that the future of Agentic AI lies not in the dominance of one paradigm, but in their intentional integration to create systems that are both <i>adaptable</i> and <i>reliable</i>. This work provides the essential conceptual toolkit to guide future research, development, and policy toward robust and trustworthy hybrid intelligent systems.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11422-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145511072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decoding nature’s melody: significance and challenges of machine learning in assessing bird diversity via soundscape analysis 解码自然的旋律:通过声景分析评估鸟类多样性的机器学习的意义和挑战
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-14 DOI: 10.1007/s10462-025-11414-4
Jiangjian Xie, Shanshan Xie, Yang Liu, Xin Jing, Mengkun Zhu, Linlin Xie, Junguo Zhang, Kun Qian, Björn W. Schuller

The broad application of passive acoustic monitoring provides a critical data foundation for studying soundscape ecology, necessitating automated analysis methods to accurately extract ecological information from vast soundscape data. This review comprehensively and cohesively examines two predominant approaches in soundscape analysis: soundscape component recognition and acoustic indices methods. Focusing on machine learning (ML)-based analysis methods for bird diversity assessment over the past five years, this review surveys representative research within each category, outlining their respective strengths and limitations. This not only addresses the growing interest in this field but also identifies research gaps and poses key questions for future studies. The insights from this review are anticipated to significantly enhance the understanding of ML applications in soundscape analysis, guiding subsequent investigative efforts in this rapidly evolving discipline, and thereby better supporting long-term biodiversity monitoring and conservation initiatives.

被动声监测的广泛应用为声景生态研究提供了重要的数据基础,需要自动化分析方法从海量声景数据中准确提取生态信息。本文综述了声景分析的两种主要方法:声景成分识别法和声学指数法。回顾了近五年来基于机器学习(ML)的鸟类多样性评估分析方法,对每个类别的代表性研究进行了调查,并概述了各自的优势和局限性。这不仅解决了人们对这一领域日益增长的兴趣,而且还确定了研究空白,并为未来的研究提出了关键问题。本综述的见解有望显著增强对声景分析中ML应用的理解,指导这一快速发展的学科的后续调查工作,从而更好地支持长期的生物多样性监测和保护举措。
{"title":"Decoding nature’s melody: significance and challenges of machine learning in assessing bird diversity via soundscape analysis","authors":"Jiangjian Xie,&nbsp;Shanshan Xie,&nbsp;Yang Liu,&nbsp;Xin Jing,&nbsp;Mengkun Zhu,&nbsp;Linlin Xie,&nbsp;Junguo Zhang,&nbsp;Kun Qian,&nbsp;Björn W. Schuller","doi":"10.1007/s10462-025-11414-4","DOIUrl":"10.1007/s10462-025-11414-4","url":null,"abstract":"<div><p>The broad application of passive acoustic monitoring provides a critical data foundation for studying soundscape ecology, necessitating automated analysis methods to accurately extract ecological information from vast soundscape data. This review comprehensively and cohesively examines two predominant approaches in soundscape analysis: soundscape component recognition and acoustic indices methods. Focusing on machine learning (ML)-based analysis methods for bird diversity assessment over the past five years, this review surveys representative research within each category, outlining their respective strengths and limitations. This not only addresses the growing interest in this field but also identifies research gaps and poses key questions for future studies. The insights from this review are anticipated to significantly enhance the understanding of ML applications in soundscape analysis, guiding subsequent investigative efforts in this rapidly evolving discipline, and thereby better supporting long-term biodiversity monitoring and conservation initiatives.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11414-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145511071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large language models for mental health diagnosis and treatment: a survey 心理健康诊断和治疗的大型语言模型:一项调查
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-14 DOI: 10.1007/s10462-025-11418-0
Mohsen Ghorbian, Mostafa Ghobaei-Arani

Mental health (MeHE) is a fundamental dimension of human well-being that encompasses emotional, psychological, and social aspects. Effective MeHE management enables individuals to cope with stress, maintain healthy relationships, and achieve their personal and social goals. However, traditional approaches are often inadequate in addressing the multidimensional challenges of early detection, personalized interventions, and comprehensive MeHE education. Large Language Models offer a transformative approach to the field of MeHE. With the ability to process large and complex textual data, these models can identify behavioral patterns in patients’ responses, suggest personalized interventions, and improve access to MeHE resources. Despite these advances, significant challenges remain. Applying reinforcement learning techniques to MeHE applications necessitates addressing challenges such as model-driven bias, protecting sensitive information, and providing robust evidence of clinical performance. This review systematically examines the applications of large language models in MeHE, providing a comprehensive analysis of their capabilities and limitations. This study examined how large language models address existing challenges, including early diagnosis, personalized treatments, and effective public education. Findings show that large language models increased the accuracy of early diagnosis of mental disorders by 33%, the effectiveness of personalized treatment plans by 27%, and participation in MeHE education and awareness by 24%. Ultimately, this research underscores the pivotal role of large language models in promoting MeHE. By providing practical insights and suggesting strategies to overcome implementation challenges, this review lays the groundwork for developing innovative, effective, and equitable solutions in the field of MeHE.

心理健康(MeHE)是人类福祉的一个基本方面,包括情感、心理和社会方面。有效的MeHE管理使个人能够应对压力,保持健康的关系,实现个人和社会目标。然而,传统方法往往不足以解决早期发现、个性化干预和综合MeHE教育等多方面的挑战。大型语言模型为MeHE领域提供了一种变革性的方法。这些模型具有处理大量复杂文本数据的能力,可以识别患者反应中的行为模式,提出个性化干预建议,并改善对MeHE资源的访问。尽管取得了这些进展,但仍存在重大挑战。在MeHE应用中应用强化学习技术需要解决模型驱动偏差、保护敏感信息和提供临床表现的可靠证据等挑战。本文系统地考察了大型语言模型在MeHE中的应用,对其能力和局限性进行了全面的分析。这项研究考察了大型语言模型如何解决现有的挑战,包括早期诊断、个性化治疗和有效的公共教育。结果显示,大型语言模型使精神障碍早期诊断的准确率提高了33%,个性化治疗方案的有效性提高了27%,MeHE教育和意识的参与率提高了24%。最后,本研究强调了大型语言模型在促进MeHE中的关键作用。通过提供实际见解和建议策略来克服实施挑战,本综述为在MeHE领域制定创新、有效和公平的解决方案奠定了基础。
{"title":"Large language models for mental health diagnosis and treatment: a survey","authors":"Mohsen Ghorbian,&nbsp;Mostafa Ghobaei-Arani","doi":"10.1007/s10462-025-11418-0","DOIUrl":"10.1007/s10462-025-11418-0","url":null,"abstract":"<div><p>Mental health (MeHE) is a fundamental dimension of human well-being that encompasses emotional, psychological, and social aspects. Effective MeHE management enables individuals to cope with stress, maintain healthy relationships, and achieve their personal and social goals. However, traditional approaches are often inadequate in addressing the multidimensional challenges of early detection, personalized interventions, and comprehensive MeHE education. Large Language Models offer a transformative approach to the field of MeHE. With the ability to process large and complex textual data, these models can identify behavioral patterns in patients’ responses, suggest personalized interventions, and improve access to MeHE resources. Despite these advances, significant challenges remain. Applying reinforcement learning techniques to MeHE applications necessitates addressing challenges such as model-driven bias, protecting sensitive information, and providing robust evidence of clinical performance. This review systematically examines the applications of large language models in MeHE, providing a comprehensive analysis of their capabilities and limitations. This study examined how large language models address existing challenges, including early diagnosis, personalized treatments, and effective public education. Findings show that large language models increased the accuracy of early diagnosis of mental disorders by 33%, the effectiveness of personalized treatment plans by 27%, and participation in MeHE education and awareness by 24%. Ultimately, this research underscores the pivotal role of large language models in promoting MeHE. By providing practical insights and suggesting strategies to overcome implementation challenges, this review lays the groundwork for developing innovative, effective, and equitable solutions in the field of MeHE.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11418-0.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145511030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automating synthetic dataset generation for image-based 3D detection: a literature review 自动合成数据集生成基于图像的三维检测:文献综述
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-13 DOI: 10.1007/s10462-025-11431-3
Paul Schulz, Thorsten Hempel, Magnus Jung, Ayoub Al-Hamadi

Reliable 3D detection is fundamental to autonomous systems such as mobile robots, self-driving cars, and unmanned aerial vehicles (UAVs). To achieve this capability, researchers have developed and trained supervised networks, which require large amounts of diverse and precisely annotated data. Due to the complex, expensive, and time-consuming capturing and annotation process, synthetic dataset generation approaches have gained popularity over the last decade. With increasing computational resources and advances in simulation technologies, a variety of dataset generators have emerged. These methods rely on either traditional 3D modeling or neural image synthesis to generate data for specific scenarios or general-purpose 3D detection tasks. Their primary goal is to produce high-quality, annotated 3D datasets in an automated and scalable manner. In this review, we evaluate the extent to which state-of-the-art approaches fulfill this goal by introducing a categorization scheme and conducting a comprehensive analysis of both 3D modeling and neural synthesis methods. Our analysis includes techniques used to address the Sim-to-Real domain gap. Furthermore, we assess each method’s level of automation, prerequisites, and practical adoption. This review aims to guide the reader in selecting automated dataset generation workflows for specific detection problems. By considering dataset quality, prerequisites, and application scenarios, we offer practical insights into identifying suitable methods for diverse downstream tasks.

可靠的3D检测是移动机器人、自动驾驶汽车、无人驾驶飞行器(uav)等自主系统的基础。为了实现这种能力,研究人员开发并训练了监督网络,这需要大量不同的、精确注释的数据。由于复杂、昂贵和耗时的捕获和注释过程,合成数据集生成方法在过去十年中得到了普及。随着计算资源的增加和仿真技术的进步,出现了各种各样的数据集生成器。这些方法依靠传统的3D建模或神经图像合成来生成特定场景或通用3D检测任务的数据。他们的主要目标是以自动化和可扩展的方式生成高质量,带注释的3D数据集。在这篇综述中,我们通过引入一种分类方案并对3D建模和神经合成方法进行全面分析,评估了最先进的方法在多大程度上实现了这一目标。我们的分析包括用于解决模拟到真实领域差距的技术。此外,我们评估每个方法的自动化水平、先决条件和实际采用。这篇综述旨在指导读者为特定的检测问题选择自动数据集生成工作流程。通过考虑数据集质量、先决条件和应用场景,我们为确定适合不同下游任务的方法提供了实用的见解。
{"title":"Automating synthetic dataset generation for image-based 3D detection: a literature review","authors":"Paul Schulz,&nbsp;Thorsten Hempel,&nbsp;Magnus Jung,&nbsp;Ayoub Al-Hamadi","doi":"10.1007/s10462-025-11431-3","DOIUrl":"10.1007/s10462-025-11431-3","url":null,"abstract":"<div><p>Reliable 3D detection is fundamental to autonomous systems such as mobile robots, self-driving cars, and unmanned aerial vehicles (UAVs). To achieve this capability, researchers have developed and trained supervised networks, which require large amounts of diverse and precisely annotated data. Due to the complex, expensive, and time-consuming capturing and annotation process, synthetic dataset generation approaches have gained popularity over the last decade. With increasing computational resources and advances in simulation technologies, a variety of dataset generators have emerged. These methods rely on either traditional 3D modeling or neural image synthesis to generate data for specific scenarios or general-purpose 3D detection tasks. Their primary goal is to produce high-quality, annotated 3D datasets in an automated and scalable manner. In this review, we evaluate the extent to which state-of-the-art approaches fulfill this goal by introducing a categorization scheme and conducting a comprehensive analysis of both 3D modeling and neural synthesis methods. Our analysis includes techniques used to address the Sim-to-Real domain gap. Furthermore, we assess each method’s level of automation, prerequisites, and practical adoption. This review aims to guide the reader in selecting automated dataset generation workflows for specific detection problems. By considering dataset quality, prerequisites, and application scenarios, we offer practical insights into identifying suitable methods for diverse downstream tasks.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11431-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145510588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A systematic review of generative AI: importance of industry and startup-centered perspectives, agentic AI, ethical considerations & challenges, and future directions 对生成式人工智能的系统回顾:以行业和初创企业为中心的观点的重要性,人工智能代理,伦理考虑和挑战,以及未来方向
IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-12 DOI: 10.1007/s10462-025-11435-z
Kinjal Patel, Milind Shah, Karishma M. Qureshi, Mohamed Rafik N. Qureshi

Generative Artificial Intelligence (GenAI) is rapidly redefining the landscape of work organizations and society at large. GenAI has rapidly evolved from rule-based symbolic systems ofThe 1940 s to advanced deep learning architectures capable of producing human-like content across modalities, such as text, images, audio, and video. This review focuses on current emerging trends, such as large concept models and critical comparisons of tools, including ChatGPT, Gemini, and Claude. This study synthesizes evidence of GenAI’s essential role across major industries, revealing transformative applications in the finance, cloud and IT, healthcare, education, and energy sectors. The paper also highlights the unique opportunities GenAI offers for start-ups, enabling agile projects to leverage cutting-edge technology for competitive advantage. However, the deployment of GenAI systems through edge devices also raises critical challenges related to ethics, transparency, bias, accountability, computational issues, and many more. To address these complexities, this paper examines emerging approaches such as AI agents, agentic AI, and multi-agent systems that aim to extend the functionality of GenAI through autonomy, goal-directed behavior, and collaborative intelligence. It discovers novel incorporations with agentic AI architecture, such as BabyAGI, and discusses emerging issues of coordination, hallucination, and security risks. The findings reveal persistent challenges related to scalability, interpretability, and regulatory compliance while identifying future research directions toward developing more sophisticated, ethical, and accessible GenAI systems that will continue to reshape technological landscapes and societal interactions. This systematic review informs researchers, academicians, data scientists, and developers about the latest advancements in GenAI and highlights its applications and role across various industries, as well as supporting practitioners and scholars in staying current with the rapidly evolving landscape of generative technologies.

生成式人工智能(GenAI)正在迅速重新定义工作、组织和整个社会的格局。GenAI已经从20世纪40年代基于规则的符号系统迅速发展到先进的深度学习架构,能够跨模式(如文本、图像、音频和视频)产生类似人类的内容。这篇综述的重点是当前的新兴趋势,例如大型概念模型和工具的关键比较,包括ChatGPT、Gemini和Claude。本研究综合了GenAI在主要行业中发挥重要作用的证据,揭示了在金融、云和IT、医疗保健、教育和能源领域的变革性应用。该论文还强调了GenAI为初创企业提供的独特机会,使敏捷项目能够利用尖端技术获得竞争优势。然而,通过边缘设备部署GenAI系统也提出了与道德、透明度、偏见、问责制、计算问题等相关的关键挑战。为了解决这些复杂性,本文研究了人工智能代理、代理人工智能和多智能体系统等新兴方法,这些方法旨在通过自治、目标导向行为和协作智能来扩展GenAI的功能。它发现了与BabyAGI等人工智能架构的新结合,并讨论了协调、幻觉和安全风险等新出现的问题。研究结果揭示了与可扩展性、可解释性和法规遵从性相关的持续挑战,同时确定了未来的研究方向,即开发更复杂、更符合伦理、更易于访问的GenAI系统,这些系统将继续重塑技术景观和社会互动。本系统综述向研究人员、学者、数据科学家和开发人员介绍了GenAI的最新进展,并强调了其在各个行业的应用和作用,同时支持从业者和学者跟上快速发展的生成技术的现状。
{"title":"A systematic review of generative AI: importance of industry and startup-centered perspectives, agentic AI, ethical considerations & challenges, and future directions","authors":"Kinjal Patel,&nbsp;Milind Shah,&nbsp;Karishma M. Qureshi,&nbsp;Mohamed Rafik N. Qureshi","doi":"10.1007/s10462-025-11435-z","DOIUrl":"10.1007/s10462-025-11435-z","url":null,"abstract":"<div><p>Generative Artificial Intelligence (GenAI) is rapidly redefining the landscape of work organizations and society at large. GenAI has rapidly evolved from rule-based symbolic systems ofThe 1940 s to advanced deep learning architectures capable of producing human-like content across modalities, such as text, images, audio, and video. This review focuses on current emerging trends, such as large concept models and critical comparisons of tools, including ChatGPT, Gemini, and Claude. This study synthesizes evidence of GenAI’s essential role across major industries, revealing transformative applications in the finance, cloud and IT, healthcare, education, and energy sectors. The paper also highlights the unique opportunities GenAI offers for start-ups, enabling agile projects to leverage cutting-edge technology for competitive advantage. However, the deployment of GenAI systems through edge devices also raises critical challenges related to ethics, transparency, bias, accountability, computational issues, and many more. To address these complexities, this paper examines emerging approaches such as AI agents, agentic AI, and multi-agent systems that aim to extend the functionality of GenAI through autonomy, goal-directed behavior, and collaborative intelligence. It discovers novel incorporations with agentic AI architecture, such as BabyAGI, and discusses emerging issues of coordination, hallucination, and security risks. The findings reveal persistent challenges related to scalability, interpretability, and regulatory compliance while identifying future research directions toward developing more sophisticated, ethical, and accessible GenAI systems that will continue to reshape technological landscapes and societal interactions. This systematic review informs researchers, academicians, data scientists, and developers about the latest advancements in GenAI and highlights its applications and role across various industries, as well as supporting practitioners and scholars in staying current with the rapidly evolving landscape of generative technologies.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"59 1","pages":""},"PeriodicalIF":13.9,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11435-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145510492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence Review
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1