首页 > 最新文献

Frontiers in Artificial Intelligence最新文献

英文 中文
IndiaScene365: a transfer learning dataset for Indian scene understanding in diverse weather condition. IndiaScene365:一个用于在不同天气条件下理解印度场景的迁移学习数据集。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1669512
Deepa Mane, Sandhya Arora, Sachin Shelke
{"title":"IndiaScene365: a transfer learning dataset for Indian scene understanding in diverse weather condition.","authors":"Deepa Mane, Sandhya Arora, Sachin Shelke","doi":"10.3389/frai.2025.1669512","DOIUrl":"10.3389/frai.2025.1669512","url":null,"abstract":"","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1669512"},"PeriodicalIF":4.7,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12827691/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unmasking the Clever Hans effect in AI models: shortcut learning, spurious correlations, and the path toward robust intelligence. 揭示人工智能模型中的聪明汉斯效应:捷径学习,虚假关联,以及通往强大智能的道路。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1692454
Abhay Kumar Pathak, Manjari Gupta, Garima Jain

The Clever Hans (CH) effect is a historical analogy of a horse solving mathematical problems based on some cues, representing a critical failure in artificial intelligence (AI) systems, where models achieve higher performance by utilizing spurious correlations and artifacts presented in the datasets rather than relying on causal relationships or task-related features. This effect or phenomenon is prevalent across multiple domains of AI such as computer vision, natural language processing, medical imaging, and reinforcement learning. This review examines the Clever Hans effect, the conceptual foundation of spurious correlations, and current evaluation methods that obscure such behavior. We further survey state-of-the-art detection and mitigation strategies, focusing on both model-centric and data-centric techniques. Building on these insights, we propose a roadmap for robust AI development, which includes standard benchmarking, causal integration, human-in-the-loop auditing, and transparent policy frameworks. This study underscores that addressing the Clever Hans effect is not only necessary for technical robustness but also for the ethical and responsible deployment of AI systems in real-world, high-stakes environments.

聪明的汉斯(CH)效应是马基于一些线索解决数学问题的历史类比,代表了人工智能(AI)系统中的一个关键失败,其中模型通过利用数据集中呈现的虚假相关性和工件而不是依赖于因果关系或任务相关特征来实现更高的性能。这种效应或现象普遍存在于人工智能的多个领域,如计算机视觉、自然语言处理、医学成像和强化学习。这篇综述考察了聪明的汉斯效应,虚假相关性的概念基础,以及当前模糊这种行为的评估方法。我们进一步调查了最先进的检测和缓解策略,重点关注以模型为中心和以数据为中心的技术。在这些见解的基础上,我们提出了一个强大的人工智能发展路线图,其中包括标准基准、因果整合、人在环审计和透明的政策框架。这项研究强调,解决聪明汉斯效应不仅是技术稳健性的必要条件,也是在现实世界高风险环境中部署人工智能系统的道德和负责任的必要条件。
{"title":"Unmasking the Clever Hans effect in AI models: shortcut learning, spurious correlations, and the path toward robust intelligence.","authors":"Abhay Kumar Pathak, Manjari Gupta, Garima Jain","doi":"10.3389/frai.2025.1692454","DOIUrl":"10.3389/frai.2025.1692454","url":null,"abstract":"<p><p>The Clever Hans (CH) effect is a historical analogy of a horse solving mathematical problems based on some cues, representing a critical failure in artificial intelligence (AI) systems, where models achieve higher performance by utilizing spurious correlations and artifacts presented in the datasets rather than relying on causal relationships or task-related features. This effect or phenomenon is prevalent across multiple domains of AI such as computer vision, natural language processing, medical imaging, and reinforcement learning. This review examines the Clever Hans effect, the conceptual foundation of spurious correlations, and current evaluation methods that obscure such behavior. We further survey state-of-the-art detection and mitigation strategies, focusing on both model-centric and data-centric techniques. Building on these insights, we propose a roadmap for robust AI development, which includes standard benchmarking, causal integration, human-in-the-loop auditing, and transparent policy frameworks. This study underscores that addressing the Clever Hans effect is not only necessary for technical robustness but also for the ethical and responsible deployment of AI systems in real-world, high-stakes environments.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1692454"},"PeriodicalIF":4.7,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12827554/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146046971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tackling fraud detection with an enhanced Kepler optimization and ghost opposition-based learning. 通过增强的开普勒优化和基于幽灵对立的学习来解决欺诈检测。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1710387
Ria H Egami, Amr A Abd El-Mageed, Mona Gafar, Amr A Abohany

Introduction: The growing prevalence of fraud and malware, fueled by increased online activity and digital transactions, has exposed the shortcomings of conventional detection systems, particularly in handling novel or obfuscated threats, class imbalance, and high-dimensional data with many irrelevant features. This underscores the need for robust and adaptive detection methodologies.

Methods: This study proposes an advanced Fraud Detection (FD) methodology, BKOA-GOBL, that enhances the Binary Kepler Optimization Algorithm (BKOA) by integrating Ghost Opposition-Based Learning (GOBL) to improve Feature Selection (FS). The BKOA dynamically models gravitational attraction, planetary motion mechanics, and cyclic control to maintain a balance between exploration and exploitation. At the same time, the GOBL enhances broader search diversification and prevents early convergence, allowing the local optimum to be avoided. The Random Under-Sampling (RUS) technique is utilized to mitigate the class imbalance in fraud benchmarks.

Results and discussion: Experimental validation is conducted on five real-world benchmarks, including the Australian, European, CIC-MalMem-2022, Synthetic Financial Transaction Log, and Real vs Fake Job Postings datasets, using k-Nearest Neighbors (K-NN) and XGBoost (Xgb-tree) classifiers. The BKOA-GOBL achieves outstanding performance, reaching classification accuracies up to 99.96% in some benchmarks and corresponding feature reduction rates up to 81.82%. Precision, recall, ROC_AUC, and F1-scores were consistently high across most benchmarks, demonstrating reliable and balanced detection. However, some challenging benchmarks-such as the Real vs Fake Job Postings dataset using k-NN classifier-returned lower scores (Precision = 76.14%, Recall = 66.55%, F1-score = 71.00%, and ROC_AUC = 74.15%), reflecting the difficulty of the problem. Comparative analyses against 12 recent Metaheuristic Algorithms (MHAs) and Machine Learning (ML) classifiers confirmed BKOA-GOBL's dominance in terms of accuracy and computational efficiency. Its statistical superiority is confirmed by the Wilcoxon rank-sum test, underscoring its robustness, adaptability, and effectiveness in high-dimensional fraud and malware detection tasks and real-world fraud and malware detection scenarios.

导论:随着在线活动和数字交易的增加,欺诈和恶意软件的日益流行,暴露了传统检测系统的缺点,特别是在处理新颖或模糊的威胁、类别不平衡和具有许多不相关特征的高维数据方面。这强调了对鲁棒和自适应检测方法的需求。方法:本研究提出了一种先进的欺诈检测(FD)方法BKOA-GOBL,该方法通过集成基于幽灵对立的学习(GOBL)来改进特征选择(FS),从而增强了二进制开普勒优化算法(BKOA)。BKOA动态模拟万有引力、行星运动力学和循环控制,以保持勘探和开采之间的平衡。同时,GOBL增强了更广泛的搜索多样化,防止了早期收敛,从而避免了局部最优。利用随机欠采样(RUS)技术来缓解欺诈基准测试中的类不平衡。结果和讨论:使用k-Nearest Neighbors (K-NN)和XGBoost (Xgb-tree)分类器,在五个真实世界的基准上进行了实验验证,包括澳大利亚、欧洲、lic - malmem -2022、合成金融交易日志和真实与虚假的职位发布数据集。BKOA-GOBL取得了出色的性能,在一些基准测试中,分类准确率高达99.96%,相应的特征减少率高达81.82%。在大多数基准测试中,精度、召回率、ROC_AUC和f1得分始终很高,证明了检测的可靠性和平衡性。然而,一些具有挑战性的基准-例如使用k-NN分类器的真实与虚假职位发布数据集-返回较低的分数(Precision = 76.14%, Recall = 66.55%, F1-score = 71.00%, ROC_AUC = 74.15%),反映了问题的难度。与12种最新的元启发式算法(MHAs)和机器学习(ML)分类器进行比较分析,证实了BKOA-GOBL在准确性和计算效率方面的优势。Wilcoxon秩和检验证实了其统计上的优越性,强调了其在高维欺诈和恶意软件检测任务以及现实世界欺诈和恶意软件检测场景中的鲁棒性、适应性和有效性。
{"title":"Tackling fraud detection with an enhanced Kepler optimization and ghost opposition-based learning.","authors":"Ria H Egami, Amr A Abd El-Mageed, Mona Gafar, Amr A Abohany","doi":"10.3389/frai.2025.1710387","DOIUrl":"10.3389/frai.2025.1710387","url":null,"abstract":"<p><strong>Introduction: </strong>The growing prevalence of fraud and malware, fueled by increased online activity and digital transactions, has exposed the shortcomings of conventional detection systems, particularly in handling novel or obfuscated threats, class imbalance, and high-dimensional data with many irrelevant features. This underscores the need for robust and adaptive detection methodologies.</p><p><strong>Methods: </strong>This study proposes an advanced Fraud Detection (FD) methodology, BKOA-GOBL, that enhances the Binary Kepler Optimization Algorithm (BKOA) by integrating Ghost Opposition-Based Learning (GOBL) to improve Feature Selection (FS). The BKOA dynamically models gravitational attraction, planetary motion mechanics, and cyclic control to maintain a balance between exploration and exploitation. At the same time, the GOBL enhances broader search diversification and prevents early convergence, allowing the local optimum to be avoided. The Random Under-Sampling (RUS) technique is utilized to mitigate the class imbalance in fraud benchmarks.</p><p><strong>Results and discussion: </strong>Experimental validation is conducted on five real-world benchmarks, including the Australian, European, CIC-MalMem-2022, Synthetic Financial Transaction Log, and Real vs Fake Job Postings datasets, using k-Nearest Neighbors (<i>K</i>-NN) and XGBoost (Xgb-tree) classifiers. The BKOA-GOBL achieves outstanding performance, reaching classification accuracies up to 99.96% in some benchmarks and corresponding feature reduction rates up to 81.82%. Precision, recall, ROC_AUC, and F1-scores were consistently high across most benchmarks, demonstrating reliable and balanced detection. However, some challenging benchmarks-such as the Real vs Fake Job Postings dataset using <i>k</i>-NN classifier-returned lower scores (Precision = 76.14%, Recall = 66.55%, F1-score = 71.00%, and ROC_AUC = 74.15%), reflecting the difficulty of the problem. Comparative analyses against 12 recent Metaheuristic Algorithms (MHAs) and Machine Learning (ML) classifiers confirmed BKOA-GOBL's dominance in terms of accuracy and computational efficiency. Its statistical superiority is confirmed by the Wilcoxon rank-sum test, underscoring its robustness, adaptability, and effectiveness in high-dimensional fraud and malware detection tasks and real-world fraud and malware detection scenarios.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1710387"},"PeriodicalIF":4.7,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12827784/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of protein-losing enteropathy (PLE) ultrasonographic imaging features in dogs using deep learning neural networks. 利用深度学习神经网络检测狗的蛋白质丢失性肠病(PLE)超声成像特征。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-08 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1707957
Anne-Kathrin Reichert, Kariem Ali, Amna Asif, Romy M Heilmann

Artificial intelligence (AI)-based models and algorithms may aid in achieving overall more efficient and accurate diagnostics in various medical specialties. Such AI-based tools could be integrated and potentially offer advantages over currently used diagnostic and monitoring algorithms, enabling the pursue of more individualized treatment options with potentially improved patient outcomes in the future. However, very few studies exploring the potential of AI-based tools have been reported in veterinary medicine. Diagnosis and subclassification of chronic inflammatory enteropathy (CIE) and protein-losing enteropathy (PLE), requiring an integrated approach including several diagnostic modalities, remains a challenge in clinical canine gastroenterology and might benefit from AI-based tools. Thus, we aimed to use AI-based deep learning to develop a model that can differentiate clinical cases of protein-losing PLE from non-PLE CIE using ultrasonographic (B-mode) images. This pilot study included anonymized data extracted from the electronic medical records and diagnostic images from routine diagnostic evaluations of 59 dogs. Following several optimization steps, the final model had a high accuracy (91.57%), precision (0.9286), recall (0.9070), F1 score (0.9176), and AUC-ROC (0.9529). This model was highly sensitive and specific for the detection of ultrasonographic features associated with clinicopathologic and/or histological lesions consistent with a PLE diagnosis. Combining sonographic diagnostics with machine learning yielded a high degree of accuracy in PLE differentiation. The results of this study underscore the potential of integrating an AI-based model into CIE diagnostics and PLE differentiation in clinical canine gastroenterology.

基于人工智能(AI)的模型和算法可能有助于在各种医学专业中实现更有效和更准确的诊断。这种基于人工智能的工具可以集成,并且可能比目前使用的诊断和监测算法更具优势,从而能够追求更个性化的治疗选择,并可能在未来改善患者的治疗效果。然而,很少有研究探索基于人工智能的工具在兽医领域的潜力。慢性炎症性肠病(CIE)和蛋白质丢失性肠病(PLE)的诊断和分类,需要包括几种诊断方式的综合方法,仍然是临床犬胃肠病学的一个挑战,可能受益于基于人工智能的工具。因此,我们的目标是使用基于人工智能的深度学习来开发一个模型,该模型可以通过超声(b型)图像区分蛋白质丢失的PLE和非PLE CIE的临床病例。这项初步研究包括从59只狗的电子病历和常规诊断评估的诊断图像中提取的匿名数据。经过多次优化,最终模型具有较高的准确率(91.57%)、精密度(0.9286)、召回率(0.9070)、F1得分(0.9176)和AUC-ROC(0.9529)。该模型对于检测与PLE诊断一致的临床病理和/或组织学病变相关的超声特征具有高度敏感性和特异性。超声诊断与机器学习相结合,在PLE鉴别中产生了高度的准确性。本研究的结果强调了将基于人工智能的模型整合到临床犬胃肠病学CIE诊断和PLE鉴别中的潜力。
{"title":"Detection of protein-losing enteropathy (PLE) ultrasonographic imaging features in dogs using deep learning neural networks.","authors":"Anne-Kathrin Reichert, Kariem Ali, Amna Asif, Romy M Heilmann","doi":"10.3389/frai.2025.1707957","DOIUrl":"10.3389/frai.2025.1707957","url":null,"abstract":"<p><p>Artificial intelligence (AI)-based models and algorithms may aid in achieving overall more efficient and accurate diagnostics in various medical specialties. Such AI-based tools could be integrated and potentially offer advantages over currently used diagnostic and monitoring algorithms, enabling the pursue of more individualized treatment options with potentially improved patient outcomes in the future. However, very few studies exploring the potential of AI-based tools have been reported in veterinary medicine. Diagnosis and subclassification of chronic inflammatory enteropathy (CIE) and protein-losing enteropathy (PLE), requiring an integrated approach including several diagnostic modalities, remains a challenge in clinical canine gastroenterology and might benefit from AI-based tools. Thus, we aimed to use AI-based <i>deep learning</i> to develop a model that can differentiate clinical cases of protein-losing PLE from non-PLE CIE using ultrasonographic (B-mode) images. This pilot study included anonymized data extracted from the electronic medical records and diagnostic images from routine diagnostic evaluations of 59 dogs. Following several optimization steps, the final model had a high accuracy (91.57%), precision (0.9286), recall (0.9070), F1 score (0.9176), and AUC-ROC (0.9529). This model was highly sensitive and specific for the detection of ultrasonographic features associated with clinicopathologic and/or histological lesions consistent with a PLE diagnosis. Combining sonographic diagnostics with <i>machine learning</i> yielded a high degree of accuracy in PLE differentiation. The results of this study underscore the potential of integrating an AI-based model into CIE diagnostics and PLE differentiation in clinical canine gastroenterology.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1707957"},"PeriodicalIF":4.7,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12823995/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting adolescent depressive symptoms using teacher-reported textual descriptions of abnormal behaviors: a study based on machine learning. 使用教师报告的异常行为文本描述来预测青少年抑郁症状:一项基于机器学习的研究。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-08 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1732682
Nigela Wumaierjiang, Guoli Yan, Lidan Yuan, Jianan Song, Xiaofei Hou, Minghui Li, Ling Sun, Jiansong Zhou, Huifang Yin, Guangming Xu

Objective: This study aimed to develop and compare machine learning (ML) models for predicting depressive symptoms in adolescents, based on teacher-reported textual descriptions of student behaviors.

Methods: Participants were 441 adolescents from Tianjin, China. Their teachers provided written reports on behavioral or emotional concerns, while the students completed the Patient Health Questionnaire-9 (PHQ-9). Text data from reports were processed using Term Frequency-Inverse Document Frequency (TF-IDF). Four ML models-Random Forest (RF), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator (LASSO)-were trained and evaluated using a 80/20 data split and 5-fold cross-validation.

Results: PHQ-9 screening identified 71.7% (n = 316) of adolescents with clinically significant depressive symptoms (score ≥10). The Random Forest (RF) model demonstrated superior performance, achieving a recall of 0.97, accuracy of 0.91, precision of 0.92, and F1-score of 0.92. SVM and XGBoost also showed good performance, while LASSO was the weakest. The analysis demonstrated that teacher reports could identify depressive symptoms with up to 97% recall.

Conclusion: Machine learning, particularly Random Forest, can effectively predict adolescent depressive symptoms from teacher-reported text. This approach offers a practical and efficient tool for early identification in school settings, facilitating timely intervention.

目的:本研究旨在开发和比较基于教师报告的学生行为文本描述的预测青少年抑郁症状的机器学习(ML)模型。方法:参与者为来自中国天津的441名青少年。他们的老师提供了关于行为或情感问题的书面报告,而学生完成了患者健康问卷-9 (PHQ-9)。使用术语频率-逆文档频率(TF-IDF)处理报告中的文本数据。四个ML模型-随机森林(RF),支持向量机(SVM),极端梯度增强(XGBoost)和最小绝对收缩和选择算子(LASSO)-使用80/20数据分割和5倍交叉验证进行训练和评估。结果:PHQ-9筛查发现71.7% (n = 316)有临床显著抑郁症状(评分≥10)的青少年。随机森林(Random Forest, RF)模型表现出优异的性能,召回率为0.97,准确率为0.91,精密度为0.92,f1得分为0.92。SVM和XGBoost也表现出较好的性能,LASSO是最弱的。分析表明,教师报告可以识别抑郁症状,召回率高达97%。结论:机器学习,特别是随机森林,可以有效地从教师报告的文本中预测青少年抑郁症状。这种方法为学校环境中的早期识别提供了一种实用而有效的工具,有助于及时干预。
{"title":"Predicting adolescent depressive symptoms using teacher-reported textual descriptions of abnormal behaviors: a study based on machine learning.","authors":"Nigela Wumaierjiang, Guoli Yan, Lidan Yuan, Jianan Song, Xiaofei Hou, Minghui Li, Ling Sun, Jiansong Zhou, Huifang Yin, Guangming Xu","doi":"10.3389/frai.2025.1732682","DOIUrl":"10.3389/frai.2025.1732682","url":null,"abstract":"<p><strong>Objective: </strong>This study aimed to develop and compare machine learning (ML) models for predicting depressive symptoms in adolescents, based on teacher-reported textual descriptions of student behaviors.</p><p><strong>Methods: </strong>Participants were 441 adolescents from Tianjin, China. Their teachers provided written reports on behavioral or emotional concerns, while the students completed the Patient Health Questionnaire-9 (PHQ-9). Text data from reports were processed using Term Frequency-Inverse Document Frequency (TF-IDF). Four ML models-Random Forest (RF), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator (LASSO)-were trained and evaluated using a 80/20 data split and 5-fold cross-validation.</p><p><strong>Results: </strong>PHQ-9 screening identified 71.7% (<i>n</i> = 316) of adolescents with clinically significant depressive symptoms (score ≥10). The Random Forest (RF) model demonstrated superior performance, achieving a recall of 0.97, accuracy of 0.91, precision of 0.92, and F1-score of 0.92. SVM and XGBoost also showed good performance, while LASSO was the weakest. The analysis demonstrated that teacher reports could identify depressive symptoms with up to 97% recall.</p><p><strong>Conclusion: </strong>Machine learning, particularly Random Forest, can effectively predict adolescent depressive symptoms from teacher-reported text. This approach offers a practical and efficient tool for early identification in school settings, facilitating timely intervention.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1732682"},"PeriodicalIF":4.7,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12823932/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable AI-driven MRI-based brain tumor classification: a novel deep learning approach. 可解释的人工智能驱动的基于mri的脑肿瘤分类:一种新的深度学习方法。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-08 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1700214
Vinayaka R Srinivas, Ramasubramanian Parvathi

Introduction: Brain tumors are among the most aggressive forms of cancer, requiring precise diagnosis and treatment planning to improve patient outcomes. This study aims to develop an efficient deep learning-based framework for the classification of brain tumors using MRI data.

Methods: The methodology employs Convolutional Neural Networks (CNNs) to accurately classify tumors into four categories: normal, glioma, pituitary, and meningioma. Key preprocessing techniques, including noise reduction,resizing, and data augmentation, were applied to enhance the robustness of the model. Advanced architectures such as DenseNet50, VGG19, and other transfer learning models, along with CNN variants, were trained and evaluated for their performance. Explainable AI (XAI) techniques, including Grad-CAM, LIME, and feature map visualizations, played a crucial role in providing better visualizations of the model's decision-making process and identifying areas of improvement during model training and to establish a better model.

Results: The best-performing model, a 4-conv-1-dense-1-dropout CNN, achieved a classification accuracy of 95.86%, outperforming deeper architectures and transfer learning approaches. The findings underscore the potential of deep learning models for reliable and efficient brain tumor classification. This work concludes with recommendations for real-time deployment in clinical settings and explores future integration with Large Language Models (LLMs) to generate detailed diagnostic reports.

脑肿瘤是最具侵袭性的癌症之一,需要精确的诊断和治疗计划来改善患者的预后。本研究旨在开发一种高效的基于深度学习的框架,用于使用MRI数据对脑肿瘤进行分类。方法:采用卷积神经网络(cnn)将肿瘤准确分为正常、胶质瘤、垂体瘤和脑膜瘤4类。采用关键的预处理技术,包括降噪、调整大小和数据增强,以增强模型的鲁棒性。高级架构,如DenseNet50、VGG19和其他迁移学习模型,以及CNN的变体,被训练并评估了它们的性能。可解释的人工智能(XAI)技术,包括Grad-CAM、LIME和特征图可视化,在提供模型决策过程的更好的可视化和识别模型训练期间改进的领域以及建立更好的模型方面发挥了至关重要的作用。结果:表现最好的模型是4- convo -1-dense-1-dropout CNN,其分类准确率达到95.86%,优于更深层次的架构和迁移学习方法。这些发现强调了深度学习模型在可靠和有效的脑肿瘤分类方面的潜力。这项工作总结了在临床环境中实时部署的建议,并探索了未来与大型语言模型(llm)的集成,以生成详细的诊断报告。
{"title":"Explainable AI-driven MRI-based brain tumor classification: a novel deep learning approach.","authors":"Vinayaka R Srinivas, Ramasubramanian Parvathi","doi":"10.3389/frai.2025.1700214","DOIUrl":"10.3389/frai.2025.1700214","url":null,"abstract":"<p><strong>Introduction: </strong>Brain tumors are among the most aggressive forms of cancer, requiring precise diagnosis and treatment planning to improve patient outcomes. This study aims to develop an efficient deep learning-based framework for the classification of brain tumors using MRI data.</p><p><strong>Methods: </strong>The methodology employs Convolutional Neural Networks (CNNs) to accurately classify tumors into four categories: normal, glioma, pituitary, and meningioma. Key preprocessing techniques, including noise reduction,resizing, and data augmentation, were applied to enhance the robustness of the model. Advanced architectures such as DenseNet50, VGG19, and other transfer learning models, along with CNN variants, were trained and evaluated for their performance. Explainable AI (XAI) techniques, including Grad-CAM, LIME, and feature map visualizations, played a crucial role in providing better visualizations of the model's decision-making process and identifying areas of improvement during model training and to establish a better model.</p><p><strong>Results: </strong>The best-performing model, a 4-conv-1-dense-1-dropout CNN, achieved a classification accuracy of 95.86%, outperforming deeper architectures and transfer learning approaches. The findings underscore the potential of deep learning models for reliable and efficient brain tumor classification. This work concludes with recommendations for real-time deployment in clinical settings and explores future integration with Large Language Models (LLMs) to generate detailed diagnostic reports.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1700214"},"PeriodicalIF":4.7,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12823918/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Demographic identification of Greater Caribbean manatees via acoustic feature learning. 基于声学特征学习的大加勒比海牛人口特征识别。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-08 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1660388
Fernando Merchan, Kenji Contreras, Héctor Poveda, Rocío M Estévez, Hector M Guzman, Javier E Sanchez-Galan
<p><p>Demographic inference from vocalizations is essential for monitoring endangered Greater Caribbean manatees (<i>Trichechus manatus manatus</i>) in tropical environments where direct observation is limited. While passive acoustic monitoring has proven effective for manatee detection and individual identification, the ability to classify sex and age from vocalizations remains unexplored, limiting ecological insights into population structure and reproductive dynamics. We investigated whether machine learning can accurately classify sex and age from manatee acoustic signals using 1,285 vocalizations from 20 wild individuals captured in the Changuinola River, Panama. Acoustic features including spectral envelope descriptors (MFCCs), harmonic content (chroma), and temporal-frequency parameters were extracted and analyzed using two feature sets: SET1 (30 spectral-cepstral features) and SET2 (38 features augmented with explicit pitch and temporal descriptors). Four classification algorithms (Random Forest, XGBoost, SVM, LDA) were trained under Leave-One-Group-Out cross-validation with SMOTE oversampling to address class imbalance. Sex classification achieved 85%-87% accuracy (75%-78% macro-F1) with balanced performance across both classes (female: 86%, male: 79%), validating operational feasibility for passive monitoring applications. However, subject-level bootstrap analysis revealed substantial individual heterogeneity (female: 95% CI: 68.7%-96.4%, male: 75.1%-83.6%), indicating that approximately 10%-15% of individuals exhibit systematic misclassification due to atypical acoustic signatures. Spectral envelope characteristics (MFCCs, spectral skewness) rather than fundamental frequency were most discriminative, suggesting sex-related variation manifests in vocal tract resonance patterns. Age classification achieved 73%-85% global accuracy but exhibited severe juvenile under-detection (14%-26% recall), with bootstrap confidence intervals spanning 9.3%-86.3% for juveniles vs. 60.7%-84.7% for adults. Dimensionality reduction (PCA, t-SNE) revealed substantial overlap between juvenile and adult acoustic feature distributions, with clearer age structure visible primarily within female clusters, contributing to systematic misclassification of male juveniles. Threshold optimization improved juvenile recall to 63% but increased false positives to 37%, presenting trade-offs for conservation surveillance. Acoustic body size regression demonstrated promising continuous estimation (MAE = 0.208 m, <i>R</i> <sup>2</sup> = 0.33), offering an alternative to categorical age classification by enabling coarse demographic profiling when integrated with sex inference. These findings establish the operational viability of acoustic sex classification for manatee conservation while highlighting fundamental challenges in categorical age inference due to continuous ontogenetic variation and limited juvenile samples. However, acoustic body size regression offers a promising
在直接观察有限的热带环境中,从发声中得出的人口统计学推断对于监测濒临灭绝的大加勒比海牛(trichecchus manatus manatus)至关重要。虽然被动声学监测已被证明对海牛的探测和个体识别是有效的,但从发声中分类性别和年龄的能力仍未被探索,这限制了对种群结构和生殖动态的生态学见解。我们研究了机器学习是否可以从海牛的声音信号中准确地分类性别和年龄,使用了在巴拿马Changuinola河捕获的20只野生海牛的1,285种发声信号。声学特征包括频谱包络描述符(MFCCs)、谐波含量(色度)和时间频率参数,使用两个特征集:SET1(30个频谱倒谱特征)和SET2(38个特征增强了明确的音高和时间描述符)进行提取和分析。在Leave-One-Group-Out交叉验证和SMOTE过采样下,训练了4种分类算法(Random Forest、XGBoost、SVM、LDA)来解决分类不平衡问题。性别分类的准确率达到85%-87%(宏观f1为75%-78%),在两个类别(女性:86%,男性:79%)中表现平衡,验证了被动监测应用的操作可行性。然而,受试者水平的bootstrap分析显示了大量的个体异质性(女性:95% CI: 68.7%-96.4%,男性:75.1%-83.6%),表明大约10%-15%的个体由于非典型声学特征而表现出系统性的错误分类。谱包络特征(MFCCs,谱偏度)比基频更具歧视性,表明性别相关的变异表现在声道共振模式中。年龄分类达到了73%-85%的全球准确率,但表现出严重的青少年检测不足(14%-26%的召回率),青少年的自举置信区间为9.3%-86.3%,成人为60.7%-84.7%。降维分析(PCA, t-SNE)显示,幼鱼和成年鱼的声学特征分布存在明显的重叠,年龄结构更清晰,主要在雌性群集中可见,这导致了雄性幼鱼的系统误分类。阈值优化将幼鱼的召回率提高到63%,但将误报率提高到37%,为保护监测提供了折衷方案。声学体型回归显示出有希望的连续估计(MAE = 0.208 m, r2 = 0.33),通过与性别推断相结合的粗略人口统计分析,提供了分类年龄分类的另一种选择。这些发现确立了声学性别分类对海牛保护的可行性,同时强调了由于持续的个体发生变化和有限的幼崽样本,在分类年龄推断方面面临的基本挑战。然而,声学体型回归提供了一种很有前途的补充方法,可以实现跨体型类别而不是离散年龄类别的连续人口统计分析。与已建立的个体识别框架相结合,可以实现全面的声学标记重新捕获,同时从长期水听器部署中估计丰度、性别比例、大小分布和人口结构,而无需视觉确认身体尺寸。
{"title":"Demographic identification of Greater Caribbean manatees via acoustic feature learning.","authors":"Fernando Merchan, Kenji Contreras, Héctor Poveda, Rocío M Estévez, Hector M Guzman, Javier E Sanchez-Galan","doi":"10.3389/frai.2025.1660388","DOIUrl":"10.3389/frai.2025.1660388","url":null,"abstract":"&lt;p&gt;&lt;p&gt;Demographic inference from vocalizations is essential for monitoring endangered Greater Caribbean manatees (&lt;i&gt;Trichechus manatus manatus&lt;/i&gt;) in tropical environments where direct observation is limited. While passive acoustic monitoring has proven effective for manatee detection and individual identification, the ability to classify sex and age from vocalizations remains unexplored, limiting ecological insights into population structure and reproductive dynamics. We investigated whether machine learning can accurately classify sex and age from manatee acoustic signals using 1,285 vocalizations from 20 wild individuals captured in the Changuinola River, Panama. Acoustic features including spectral envelope descriptors (MFCCs), harmonic content (chroma), and temporal-frequency parameters were extracted and analyzed using two feature sets: SET1 (30 spectral-cepstral features) and SET2 (38 features augmented with explicit pitch and temporal descriptors). Four classification algorithms (Random Forest, XGBoost, SVM, LDA) were trained under Leave-One-Group-Out cross-validation with SMOTE oversampling to address class imbalance. Sex classification achieved 85%-87% accuracy (75%-78% macro-F1) with balanced performance across both classes (female: 86%, male: 79%), validating operational feasibility for passive monitoring applications. However, subject-level bootstrap analysis revealed substantial individual heterogeneity (female: 95% CI: 68.7%-96.4%, male: 75.1%-83.6%), indicating that approximately 10%-15% of individuals exhibit systematic misclassification due to atypical acoustic signatures. Spectral envelope characteristics (MFCCs, spectral skewness) rather than fundamental frequency were most discriminative, suggesting sex-related variation manifests in vocal tract resonance patterns. Age classification achieved 73%-85% global accuracy but exhibited severe juvenile under-detection (14%-26% recall), with bootstrap confidence intervals spanning 9.3%-86.3% for juveniles vs. 60.7%-84.7% for adults. Dimensionality reduction (PCA, t-SNE) revealed substantial overlap between juvenile and adult acoustic feature distributions, with clearer age structure visible primarily within female clusters, contributing to systematic misclassification of male juveniles. Threshold optimization improved juvenile recall to 63% but increased false positives to 37%, presenting trade-offs for conservation surveillance. Acoustic body size regression demonstrated promising continuous estimation (MAE = 0.208 m, &lt;i&gt;R&lt;/i&gt; &lt;sup&gt;2&lt;/sup&gt; = 0.33), offering an alternative to categorical age classification by enabling coarse demographic profiling when integrated with sex inference. These findings establish the operational viability of acoustic sex classification for manatee conservation while highlighting fundamental challenges in categorical age inference due to continuous ontogenetic variation and limited juvenile samples. However, acoustic body size regression offers a promising","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1660388"},"PeriodicalIF":4.7,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12823911/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction: Machine learning-based detection of cognitive decline using SSWTRT: classification performance and decision analysis. 更正:使用SSWTRT的基于机器学习的认知衰退检测:分类性能和决策分析。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-08 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1764066
Yuji Nozaki, Chihiro Kamohara, Ryota Abe, Taiki Ieda, Madoka Nakajima, Maki Sakamoto

[This corrects the article DOI: 10.3389/frai.2025.1689182.].

[这更正了文章DOI: 10.3389/frai.2025.1689182.]。
{"title":"Correction: Machine learning-based detection of cognitive decline using SSWTRT: classification performance and decision analysis.","authors":"Yuji Nozaki, Chihiro Kamohara, Ryota Abe, Taiki Ieda, Madoka Nakajima, Maki Sakamoto","doi":"10.3389/frai.2025.1764066","DOIUrl":"https://doi.org/10.3389/frai.2025.1764066","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.3389/frai.2025.1689182.].</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1764066"},"PeriodicalIF":4.7,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12824601/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-modal AI in precision medicine: integrating genomics, imaging, and EHR data for clinical insights. 精准医疗中的多模式人工智能:整合基因组学、成像和电子病历数据以获得临床见解。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1743921
Shahper Nazeer Khan, Danishuddin, Mohd Wajid Ali Khan, Luca Guarnera, Syed Mohammad Fauzan Akhtar

Precision healthcare is increasingly oriented toward the development of therapeutic strategies that are as individualized as the patients receiving them. Central to this paradigm shift is artificial intelligence (AI)-enabled multi-modal data integration, which consolidates heterogeneous data streams-including genomic, transcriptomic, proteomic, imaging, environmental, and electronic health record (EHR) data into a unified analytical framework. This integrative approach enhances early disease detection, facilitates the discovery of clinically actionable biomarkers, and accelerates rational drug development, with particularly significant implications for oncology, neurology, and cardiovascular medicine. Advanced machine learning (ML) and deep learning (DL) algorithms are capable of extracting complex, non-linear associations across data modalities, thereby improving diagnostic precision, enabling robust risk stratification, and informing patient-specific therapeutic interventions. Furthermore, AI-driven applications in digital health, such as wearable biosensors and real-time physiological monitoring, allow for continuous, dynamic refinement of treatment plans. This review examines the transformative potential of multi-modal AI in precision medicine, with emphasis on its role in multi-omics data integration, predictive modeling, and clinical decision support. In parallel, it critically evaluates prevailing challenges, including data interoperability, algorithmic bias, and ethical considerations surrounding patient privacy. The synergistic convergence of AI and multi-modal data represents not merely a technological innovation but a fundamental redefinition of individualized healthcare delivery.

精准医疗越来越趋向于治疗策略的发展,这些治疗策略与接受治疗的患者一样个性化。这种模式转变的核心是支持人工智能(AI)的多模式数据集成,它将异构数据流(包括基因组、转录组、蛋白质组、成像、环境和电子健康记录(EHR)数据整合到统一的分析框架中。这种综合方法增强了疾病的早期检测,促进了临床可操作生物标志物的发现,并加速了合理的药物开发,对肿瘤、神经病学和心血管医学具有特别重要的意义。先进的机器学习(ML)和深度学习(DL)算法能够从数据模式中提取复杂的非线性关联,从而提高诊断精度,实现稳健的风险分层,并为患者特定的治疗干预提供信息。此外,人工智能驱动的数字健康应用,如可穿戴生物传感器和实时生理监测,允许持续、动态地改进治疗计划。本文综述了多模态人工智能在精准医学中的变革潜力,重点介绍了其在多组学数据集成、预测建模和临床决策支持方面的作用。同时,它批判性地评估当前的挑战,包括数据互操作性、算法偏见和围绕患者隐私的伦理考虑。人工智能和多模态数据的协同融合不仅代表了一种技术创新,而且代表了对个性化医疗服务的根本重新定义。
{"title":"Multi-modal AI in precision medicine: integrating genomics, imaging, and EHR data for clinical insights.","authors":"Shahper Nazeer Khan, Danishuddin, Mohd Wajid Ali Khan, Luca Guarnera, Syed Mohammad Fauzan Akhtar","doi":"10.3389/frai.2025.1743921","DOIUrl":"10.3389/frai.2025.1743921","url":null,"abstract":"<p><p>Precision healthcare is increasingly oriented toward the development of therapeutic strategies that are as individualized as the patients receiving them. Central to this paradigm shift is artificial intelligence (AI)-enabled multi-modal data integration, which consolidates heterogeneous data streams-including genomic, transcriptomic, proteomic, imaging, environmental, and electronic health record (EHR) data into a unified analytical framework. This integrative approach enhances early disease detection, facilitates the discovery of clinically actionable biomarkers, and accelerates rational drug development, with particularly significant implications for oncology, neurology, and cardiovascular medicine. Advanced machine learning (ML) and deep learning (DL) algorithms are capable of extracting complex, non-linear associations across data modalities, thereby improving diagnostic precision, enabling robust risk stratification, and informing patient-specific therapeutic interventions. Furthermore, AI-driven applications in digital health, such as wearable biosensors and real-time physiological monitoring, allow for continuous, dynamic refinement of treatment plans. This review examines the transformative potential of multi-modal AI in precision medicine, with emphasis on its role in multi-omics data integration, predictive modeling, and clinical decision support. In parallel, it critically evaluates prevailing challenges, including data interoperability, algorithmic bias, and ethical considerations surrounding patient privacy. The synergistic convergence of AI and multi-modal data represents not merely a technological innovation but a fundamental redefinition of individualized healthcare delivery.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1743921"},"PeriodicalIF":4.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12819606/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146030996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Use of machine learning models to predict mechanical ventilation, ECMO, and mortality in COVID-19. 使用机器学习模型预测COVID-19患者的机械通气、ECMO和死亡率。
IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-06 eCollection Date: 2025-01-01 DOI: 10.3389/frai.2025.1661637
Nina Moorman, Erin Hedlund-Botti, Grace Gombolay, Matthew C Gombolay

Introduction: Patients with severe COVID-19 may require MV or ECMO. Predicting who will require interventions and the duration of those interventions are challenging due to the diverse responses among patients and the dynamic nature of the disease. As such, there is a need for better prediction of the duration and outcomes of MV use in patients, to improve patient care and aid with MV and ECMO allocation. Here we develop and examine the performance of ML models to predict MV duration, ECMO, and mortality for patients with COVID-19.

Methods: In this retrospective prognostic study, hierarchical machine-learning models were developed to predict MV duration and outcome prediction from demographic data and time-series data consisting of vital signs and laboratory results. We train our models on 10,378 patients with positive severe acute respiratory syndrome-related coronavirus (SARS-CoV-2) virus testing from Emory's COVID CRADLE Dataset who sought treatment at Emory University Hospital between February 28, 2020, to January 24, 2022. Analysis was conducted between January 10, 2022, and April 5, 2024. The main outcomes and measures were the AUROC, AUPRC and the F-score for MV duration, need for ECMO, and mortality prediction.

Results: Data from 10,378 patients with COVID-19 (median [IQR] age, 60 [48-72] years; 5,281 [50.89%] women) were included. Overall MV class distributions for 0 days, 1-4 days, 5-9 days, 10-14 days, 15-19 days, 20-24 days, 25-29 days, and ≥30 days of MV were 8,141 (78.44%), 812 (7.82%), 325 (3.13%), 241 (2.32%), 153 (1.47%), 97 (0.93%), 87 (0.84%), and 522 (5.03%), respectively. Overall ECMO use and mortality rates were 15 (0.14%) and 1,114 (10.73%), respectively. On MV duration, ECMO use, and mortality outcomes, the highest-performing model reached weighted average AUROC scores of 0.873, 0.902, and 0.774, and the highest-performing model reached weighted average AUPRC scores of 0.790, 0.999, and 0.893.

Conclusions and relevance: Hierarchical ML models trained on vital signs, laboratory results, and demographic data show promise for the prediction of MV duration, ECMO use, and mortality in COVID-19 patients.

重症COVID-19患者可能需要MV或ECMO。由于患者的不同反应和疾病的动态性质,预测谁将需要干预以及这些干预的持续时间具有挑战性。因此,有必要更好地预测患者使用MV的持续时间和结果,以改善患者护理并协助MV和ECMO的分配。在这里,我们开发并检验了ML模型的性能,以预测COVID-19患者的MV持续时间、ECMO和死亡率。方法:在这项回顾性预后研究中,开发了分层机器学习模型,根据人口统计数据和由生命体征和实验室结果组成的时间序列数据预测MV持续时间和结局预测。我们对10378名严重急性呼吸综合征相关冠状病毒(SARS-CoV-2)病毒检测呈阳性的患者进行了模型训练,这些患者来自埃默里大学的COVID - CRADLE数据集,他们在2020年2月28日至2022年1月24日期间在埃默里大学医院寻求治疗。分析时间为2022年1月10日至2024年4月5日。主要结果和指标为AUROC、AUPRC和MV持续时间f评分、ECMO需求和死亡率预测。结果:纳入10378例COVID-19患者的数据(中位[IQR]年龄为60[48-72]岁;5281例[50.89%]女性)。0天、1-4天、5-9天、10-14天、15-19天、20-24天、25-29天和≥30天的总MV级分布分别为8141(78.44%)、812(7.82%)、325(3.13%)、241(2.32%)、153(1.47%)、97(0.93%)、87(0.84%)和522(5.03%)。总体ECMO使用率和死亡率分别为15例(0.14%)和1114例(10.73%)。在MV持续时间、ECMO使用和死亡率结果方面,表现最好的模型AUROC加权平均得分分别为0.873、0.902和0.774,表现最好的模型AUPRC加权平均得分分别为0.790、0.999和0.893。结论和相关性:基于生命体征、实验室结果和人口统计学数据训练的分层机器学习模型有望预测COVID-19患者的MV持续时间、ECMO使用和死亡率。
{"title":"Use of machine learning models to predict mechanical ventilation, ECMO, and mortality in COVID-19.","authors":"Nina Moorman, Erin Hedlund-Botti, Grace Gombolay, Matthew C Gombolay","doi":"10.3389/frai.2025.1661637","DOIUrl":"10.3389/frai.2025.1661637","url":null,"abstract":"<p><strong>Introduction: </strong>Patients with severe COVID-19 may require MV or ECMO. Predicting who will require interventions and the duration of those interventions are challenging due to the diverse responses among patients and the dynamic nature of the disease. As such, there is a need for better prediction of the duration and outcomes of MV use in patients, to improve patient care and aid with MV and ECMO allocation. Here we develop and examine the performance of ML models to predict MV duration, ECMO, and mortality for patients with COVID-19.</p><p><strong>Methods: </strong>In this retrospective prognostic study, hierarchical machine-learning models were developed to predict MV duration and outcome prediction from demographic data and time-series data consisting of vital signs and laboratory results. We train our models on 10,378 patients with positive severe acute respiratory syndrome-related coronavirus (SARS-CoV-2) virus testing from Emory's COVID CRADLE Dataset who sought treatment at Emory University Hospital between February 28, 2020, to January 24, 2022. Analysis was conducted between January 10, 2022, and April 5, 2024. The main outcomes and measures were the AUROC, AUPRC and the F-score for MV duration, need for ECMO, and mortality prediction.</p><p><strong>Results: </strong>Data from 10,378 patients with COVID-19 (median [IQR] age, 60 [48-72] years; 5,281 [50.89%] women) were included. Overall MV class distributions for 0 days, 1-4 days, 5-9 days, 10-14 days, 15-19 days, 20-24 days, 25-29 days, and ≥30 days of MV were 8,141 (78.44%), 812 (7.82%), 325 (3.13%), 241 (2.32%), 153 (1.47%), 97 (0.93%), 87 (0.84%), and 522 (5.03%), respectively. Overall ECMO use and mortality rates were 15 (0.14%) and 1,114 (10.73%), respectively. On MV duration, ECMO use, and mortality outcomes, the highest-performing model reached weighted average AUROC scores of 0.873, 0.902, and 0.774, and the highest-performing model reached weighted average AUPRC scores of 0.790, 0.999, and 0.893.</p><p><strong>Conclusions and relevance: </strong>Hierarchical ML models trained on vital signs, laboratory results, and demographic data show promise for the prediction of MV duration, ECMO use, and mortality in COVID-19 patients.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1661637"},"PeriodicalIF":4.7,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12816323/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146019758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Frontiers in Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1