Artificial Intelligence in Medicine最新文献_第4页

QENNA: A quantum-enhanced neural network for early Alzheimer's detection using magnetic resonance imaging QENNA：一个量子增强的神经网络，用于早期阿尔茨海默氏症的磁共振成像检测。

IF 6.2 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-11-29 DOI: 10.1016/j.artmed.2025.103322

Chutchai Kaewta , Rapeepan Pitakaso , Surajet Khonjun , Thanatkij Srichok , Peerawat Luesak , Sarayut Gonwirat , Prem Enkvetchakul , Surasak Matitopanum , Thitinon Srisuwandee

Early detection of Alzheimer's disease (AD) is essential for effective clinical intervention and disease management. However, conventional Deep Learning (DL) methods face limitations in analyzing complex brain magnetic resonance imaging (MRI), especially when training data are scarce. In this study, we propose a Quantum-Enhanced Neural Network Architecture (QENNA) that integrates quantum convolutional layers with classical deep learning to improve diagnostic accuracy in early AD detection. The model also incorporates quantum data augmentation strategies, including Quantum Generative Adversarial Networks (QGANs) and quantum random walks, to generate high-fidelity synthetic MRI scans and address training data limitations. Experiments on two public MRI datasets demonstrate that QENNA achieves up to 93.0 % accuracy and 96.0 % Area Under the Curve (AUC), outperforming state-of-the-art classical models. Ablation studies confirm that the quantum components substantially enhance performance. These results suggest that quantum-enhanced learning frameworks can significantly advance Artificial Intelligence (AI)-driven diagnostic tools for neurodegenerative disorders and support scalable, early-stage AD screening in clinical practice.

早期发现阿尔茨海默病（AD）对于有效的临床干预和疾病管理至关重要。然而，传统的深度学习（DL）方法在分析复杂的脑磁共振成像（MRI）时面临局限性，特别是在训练数据稀缺的情况下。在本研究中，我们提出了一种量子增强神经网络架构（QENNA），该架构将量子卷积层与经典深度学习相结合，以提高早期AD检测的诊断准确性。该模型还结合了量子数据增强策略，包括量子生成对抗网络（qgan）和量子随机漫步，以生成高保真合成MRI扫描并解决训练数据的限制。在两个公开的MRI数据集上的实验表明，QENNA达到了高达93.0%的准确率和96.0%的曲线下面积（AUC），优于最先进的经典模型。烧蚀研究证实，量子元件大大提高了性能。这些结果表明，量子增强学习框架可以显著推进人工智能（AI）驱动的神经退行性疾病诊断工具，并支持临床实践中可扩展的早期AD筛查。

{"title":"QENNA: A quantum-enhanced neural network for early Alzheimer's detection using magnetic resonance imaging","authors":"Chutchai Kaewta , Rapeepan Pitakaso , Surajet Khonjun , Thanatkij Srichok , Peerawat Luesak , Sarayut Gonwirat , Prem Enkvetchakul , Surasak Matitopanum , Thitinon Srisuwandee","doi":"10.1016/j.artmed.2025.103322","DOIUrl":"10.1016/j.artmed.2025.103322","url":null,"abstract":"<div><div>Early detection of Alzheimer's disease (AD) is essential for effective clinical intervention and disease management. However, conventional Deep Learning (DL) methods face limitations in analyzing complex brain magnetic resonance imaging (MRI), especially when training data are scarce. In this study, we propose a Quantum-Enhanced Neural Network Architecture (QENNA) that integrates quantum convolutional layers with classical deep learning to improve diagnostic accuracy in early AD detection. The model also incorporates quantum data augmentation strategies, including Quantum Generative Adversarial Networks (QGANs) and quantum random walks, to generate high-fidelity synthetic MRI scans and address training data limitations. Experiments on two public MRI datasets demonstrate that QENNA achieves up to 93.0 % accuracy and 96.0 % Area Under the Curve (AUC), outperforming state-of-the-art classical models. Ablation studies confirm that the quantum components substantially enhance performance. These results suggest that quantum-enhanced learning frameworks can significantly advance Artificial Intelligence (AI)-driven diagnostic tools for neurodegenerative disorders and support scalable, early-stage AD screening in clinical practice.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"172 ","pages":"Article 103322"},"PeriodicalIF":6.2,"publicationDate":"2025-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145679400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Data Augmentation for Few-Shot Biomedical NER Using ChatGPT 基于ChatGPT的少量生物医学NER数据增强。

IF 6.2 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-11-29 DOI: 10.1016/j.artmed.2025.103314

Wenxuan Mu , Di Zhao , Jiana Meng , Peng Chen , Shichang Sun , Yumeng Yang , Jian Wang , Hongfei Lin

Data Augmentation (DA) aims to create a new dataset to address the lack of data in various domains. Particularly in few-shot scenarios of the biomedical Named Entity Recognition (NER) domain, an effective DA method can enhance data diversity, reduce overfitting, and significantly improve the model’s generalization ability. In this work, we propose a novel DA method for NER tasks, which uses ChatGPT and prompt learning to extract high-quality data from large language models. The entity recognition tasks are then performed via transfer learning and efficient decoding strategies. Moreover, this study conducted extensive experiments on four publicly available biomedical datasets (BC5CDR, NCBI, BioNLP11EPI, and BioNLP13GE), demonstrating that our methods exhibit strong stability and entity recognition capabilities even in extremely limited scenarios. In the 5-shot, 20-shot, and 50-shot scenarios, the average F1 scores of the four datasets reached 72.96%, 75.05%, and 77.42%, respectively.

数据增强（DA）旨在创建一个新的数据集，以解决各个领域的数据缺乏问题。特别是在生物医学命名实体识别（NER）领域的少量场景中，有效的数据挖掘方法可以增强数据多样性，减少过拟合，显著提高模型的泛化能力。在这项工作中，我们提出了一种用于NER任务的新的数据处理方法，该方法使用ChatGPT和提示学习从大型语言模型中提取高质量数据。然后通过迁移学习和高效解码策略执行实体识别任务。此外，本研究在四个公开的生物医学数据集（BC5CDR、NCBI、BioNLP11EPI和BioNLP13GE）上进行了广泛的实验，表明我们的方法即使在极其有限的场景下也具有很强的稳定性和实体识别能力。在5次射击、20次射击和50次射击场景下，4个数据集的平均F1得分分别达到72.96%、75.05%和77.42%。

引用次数: 0

Artificial intelligence in depression diagnostics: A systematic review of methodologies and clinical applications 人工智能在抑郁症诊断中的应用：方法和临床应用的系统回顾。

IF 6.2 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-11-28 DOI: 10.1016/j.artmed.2025.103320

Mahdi Ghorbankhani, Maryam Safara

The integration of artificial intelligence (AI) into the field of mental health diagnosis has garnered increasing scholarly and clinical attention, particularly in relation to the early detection and classification of depression. This study offers a comprehensive review of the current landscape of AI-driven approaches for depression diagnosis, examining the methodologies, data modalities, and performance metrics employed across recent empirical investigations. Emphasizing machine learning and deep learning techniques, the study critically evaluates the utility of linguistic, behavioral, and physiological data sourced from social media, clinical interviews, speech recordings, and wearable devices. The findings suggest that AI systems, particularly those incorporating multimodal data fusion and advanced neural network architectures, demonstrate promising diagnostic accuracy and the potential to augment traditional psychiatric assessments. However, the study also identifies significant methodological, ethical, and practical challenges, including issues of dataset bias, algorithmic transparency, and clinical applicability. In response, the paper outlines key future directions aimed at improving model generalizability, enhancing interpretability, and fostering ethically responsible deployment in real-world settings. This review not only elucidates the transformative capacity of AI in mental health diagnostics but also provides a roadmap for advancing the development of robust, transparent, and clinically integrated AI systems for the detection of depression.

人工智能（AI）与心理健康诊断领域的整合已经引起了越来越多的学术和临床关注，特别是在抑郁症的早期发现和分类方面。本研究全面回顾了人工智能驱动的抑郁症诊断方法的现状，检查了最近实证调查中采用的方法、数据模式和绩效指标。该研究强调机器学习和深度学习技术，批判性地评估了来自社交媒体、临床访谈、语音录音和可穿戴设备的语言、行为和生理数据的效用。研究结果表明，人工智能系统，特别是那些结合多模态数据融合和先进神经网络架构的系统，显示出有希望的诊断准确性，并有可能增强传统的精神病学评估。然而，该研究也发现了重大的方法、伦理和实践挑战，包括数据集偏差、算法透明度和临床适用性问题。作为回应，本文概述了未来的关键方向，旨在提高模型的通用性，增强可解释性，并促进在现实世界环境中的道德责任部署。这篇综述不仅阐明了人工智能在精神卫生诊断方面的变革能力，而且还为推进用于检测抑郁症的强大、透明和临床集成的人工智能系统的开发提供了路线图。

{"title":"Artificial intelligence in depression diagnostics: A systematic review of methodologies and clinical applications","authors":"Mahdi Ghorbankhani, Maryam Safara","doi":"10.1016/j.artmed.2025.103320","DOIUrl":"10.1016/j.artmed.2025.103320","url":null,"abstract":"<div><div>The integration of <em>artificial intelligence</em> (AI) into the field of mental health diagnosis has garnered increasing scholarly and clinical attention, particularly in relation to the early detection and classification of depression. This study offers a comprehensive review of the current landscape of AI-driven approaches for depression diagnosis, examining the methodologies, data modalities, and performance metrics employed across recent empirical investigations. Emphasizing machine learning and deep learning techniques, the study critically evaluates the utility of linguistic, behavioral, and physiological data sourced from social media, clinical interviews, speech recordings, and wearable devices. The findings suggest that AI systems, particularly those incorporating multimodal data fusion and advanced neural network architectures, demonstrate promising diagnostic accuracy and the potential to augment traditional psychiatric assessments. However, the study also identifies significant methodological, ethical, and practical challenges, including issues of dataset bias, algorithmic transparency, and clinical applicability. In response, the paper outlines key future directions aimed at improving model generalizability, enhancing interpretability, and fostering ethically responsible deployment in real-world settings. This review not only elucidates the transformative capacity of AI in mental health diagnostics but also provides a roadmap for advancing the development of robust, transparent, and clinically integrated AI systems for the detection of depression.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"172 ","pages":"Article 103320"},"PeriodicalIF":6.2,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145670930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Development and validation of deep continual learning model to sequentially learn multiple clinical prediction tasks for ICU patients 基于深度连续学习的ICU患者多项临床预测任务的开发与验证

IF 6.2 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-11-27 DOI: 10.1016/j.artmed.2025.103319

Zhixuan Zeng , Yang Liu , Shuo Yao , Xu Cai , Wenbin Nan , Yiyang Xie , Xun Gong

Background

ICU patients often suffer from critical and complex condition, and multiple potential risks should be monitored to provide them comprehensive care. However, no study proposes continual learning (CL) model that can effectively solve multiple clinical prediction tasks without catastrophic forgetting. This study proposes three deep CL models for ICU patients.

Methods

Three public ICU databases were employed. The included patients from MIMIC-III and MIMIC-IV were divided into eight task sets, and the patients from eICU-CRD composed the test set. We propose three CL models (CL_1, CL_2, CL_3) to sequentially learn eight prediction tasks on the eight task sets, and then externally validate them on the test set. We compare our models to three representative baseline CL models and the single-task (ST) and multi-task (MT) model. We train all the CL models under different orders, and evaluate their prediction performance by multiple metrics and their memory ability by backward transfer (BWT). We also analyzed the effect of previously learned tasks on learning new tasks.

Results

Our three CL models had comparable or slightly weaker performance compared to ST and MT model on the eight tasks. They effectively mitigated catastrophic forgetting, and their performance is robust to different training orders. CL_2 and CL_3 even have improved performance on the current task after learning some previous tasks. Our three CL models outperformed the baseline CL models in most experiments.

Conclusions

Our CL models are promising to sequentially learn multiple clinical prediction tasks for ICU patients. The CL_2 and CL_3 show the ability of utilizing information of previous tasks to improve learning new tasks. More new datasets and tasks are still needed to further verify the validity of the CL models.

重症监护室患者往往病情危重复杂，应监测多种潜在风险，为其提供综合护理。然而，没有研究提出持续学习（CL）模型可以有效地解决多个临床预测任务而不发生灾难性遗忘。本研究针对ICU患者提出了三种深度CL模型。方法采用3个ICU公共数据库。将纳入的MIMIC-III和MIMIC-IV患者分为8个任务集，eICU-CRD患者组成测试集。我们提出了三个CL模型（CL_1、CL_2、CL_3），在8个任务集上依次学习8个预测任务，然后在测试集上进行外部验证。我们将我们的模型与三个代表性的基线CL模型和单任务（ST）和多任务（MT）模型进行比较。我们在不同阶数下训练了所有CL模型，并通过多个指标评估了它们的预测性能，通过后向迁移（BWT）评估了它们的记忆能力。我们还分析了以前学习过的任务对学习新任务的影响。结果与ST和MT模型相比，我们的三个CL模型在8个任务上的表现相当或略弱。他们有效地减轻了灾难性遗忘，并且他们的表现对不同的训练顺序是稳健的。CL_2和CL_3甚至在学习了一些以前的任务后，在当前任务上的表现也有所提高。我们的三个CL模型在大多数实验中都优于基线CL模型。结论sour CL模型有希望依次学习ICU患者的多项临床预测任务。CL_2和CL_3表现出利用已有任务信息促进新任务学习的能力。还需要更多新的数据集和任务来进一步验证CL模型的有效性。

{"title":"Development and validation of deep continual learning model to sequentially learn multiple clinical prediction tasks for ICU patients","authors":"Zhixuan Zeng , Yang Liu , Shuo Yao , Xu Cai , Wenbin Nan , Yiyang Xie , Xun Gong","doi":"10.1016/j.artmed.2025.103319","DOIUrl":"10.1016/j.artmed.2025.103319","url":null,"abstract":"<div><h3>Background</h3><div>ICU patients often suffer from critical and complex condition, and multiple potential risks should be monitored to provide them comprehensive care. However, no study proposes continual learning (CL) model that can effectively solve multiple clinical prediction tasks without catastrophic forgetting. This study proposes three deep CL models for ICU patients.</div></div><div><h3>Methods</h3><div>Three public ICU databases were employed. The included patients from MIMIC-III and MIMIC-IV were divided into eight task sets, and the patients from eICU-CRD composed the test set. We propose three CL models (CL_1, CL_2, CL_3) to sequentially learn eight prediction tasks on the eight task sets, and then externally validate them on the test set. We compare our models to three representative baseline CL models and the single-task (ST) and multi-task (MT) model. We train all the CL models under different orders, and evaluate their prediction performance by multiple metrics and their memory ability by backward transfer (BWT). We also analyzed the effect of previously learned tasks on learning new tasks.</div></div><div><h3>Results</h3><div>Our three CL models had comparable or slightly weaker performance compared to ST and MT model on the eight tasks. They effectively mitigated catastrophic forgetting, and their performance is robust to different training orders. CL_2 and CL_3 even have improved performance on the current task after learning some previous tasks. Our three CL models outperformed the baseline CL models in most experiments.</div></div><div><h3>Conclusions</h3><div>Our CL models are promising to sequentially learn multiple clinical prediction tasks for ICU patients. The CL_2 and CL_3 show the ability of utilizing information of previous tasks to improve learning new tasks. More new datasets and tasks are still needed to further verify the validity of the CL models.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"172 ","pages":"Article 103319"},"PeriodicalIF":6.2,"publicationDate":"2025-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145685084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Deep learning for autism detection using clinical notes: A comparison of transfer learning for a transparent and black-box approach 使用临床记录进行自闭症检测的深度学习：透明和黑盒方法的迁移学习比较。

IF 6.2 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-11-27 DOI: 10.1016/j.artmed.2025.103318

Gondy Leroy , Prakash Bisht , Sai Madhuri Kandula , Nell Maltman , Sydney Rice

Autism spectrum disorder (ASD) is a complex neurodevelopmental condition whose rising prevalence places increasing demands on a lengthy diagnostic process. Machine learning (ML) has shown promise in automating ASD diagnosis, but most existing models operate as black boxes and are typically trained on a single dataset, limiting their generalizability.

In this study, we introduce a transparent and interpretable ML approach that leverages BioBERT, a state-of-the-art language model, to analyze unstructured clinical text. The model is trained to label descriptions of behaviors and map them to diagnostic criteria, which are then used to assign a final label (ASD or not). We evaluate transfer learning, the ability to transfer knowledge to new data, using two distinct real-world datasets. We trained on datasets sequentially and mixed together and compared the performance of the best models and their ability to transfer to new data. We also created a black-box approach and repeated this transfer process for comparison.

Our transparent model demonstrated robust performance, with the mixed-data training strategy yielding the best results (97 % sensitivity, 98 % specificity). Sequential training across datasets led to a slight drop in performance, highlighting the importance of training data order. The black-box model performed worse (90 % sensitivity, 96 % specificity) when trained sequentially or with mixed data.

Overall, our transparent approach outperformed the black-box approach. Mixing datasets during training resulted in slightly better performance and should be the preferred approach when practically possible. This work paves the way for more trustworthy, generalizable, and clinically actionable AI tools in neurodevelopmental diagnostics.

自闭症谱系障碍（ASD）是一种复杂的神经发育疾病，其患病率不断上升，对漫长的诊断过程提出了越来越高的要求。机器学习（ML）在自动化ASD诊断方面显示出了希望，但大多数现有模型都像黑箱一样运行，并且通常在单个数据集上进行训练，限制了它们的泛化性。在本研究中，我们引入了一种透明且可解释的ML方法，该方法利用BioBERT（一种最先进的语言模型）来分析非结构化临床文本。该模型被训练来标记行为描述，并将其映射到诊断标准，然后用于分配最终标签（是否为ASD）。我们评估迁移学习，将知识转移到新数据的能力，使用两个不同的现实世界数据集。我们按顺序对数据集进行训练，并将其混合在一起，比较最佳模型的性能及其转移到新数据的能力。我们还创建了一个黑盒方法，并重复这个转移过程进行比较。我们的透明模型表现出稳健的性能，混合数据训练策略产生了最佳结果（97%的灵敏度，98%的特异性）。跨数据集的顺序训练导致性能略有下降，突出了训练数据顺序的重要性。黑箱模型在顺序训练或混合数据时表现较差（90%灵敏度，96%特异性）。总的来说，我们的透明方法优于黑盒方法。在训练过程中混合数据集会产生稍微更好的性能，并且在实际可能的情况下应该是首选的方法。这项工作为神经发育诊断中更值得信赖、可推广和临床可操作的人工智能工具铺平了道路。

{"title":"Deep learning for autism detection using clinical notes: A comparison of transfer learning for a transparent and black-box approach","authors":"Gondy Leroy , Prakash Bisht , Sai Madhuri Kandula , Nell Maltman , Sydney Rice","doi":"10.1016/j.artmed.2025.103318","DOIUrl":"10.1016/j.artmed.2025.103318","url":null,"abstract":"<div><div>Autism spectrum disorder (ASD) is a complex neurodevelopmental condition whose rising prevalence places increasing demands on a lengthy diagnostic process. Machine learning (ML) has shown promise in automating ASD diagnosis, but most existing models operate as black boxes and are typically trained on a single dataset, limiting their generalizability.</div><div>In this study, we introduce a transparent and interpretable ML approach that leverages BioBERT, a state-of-the-art language model, to analyze unstructured clinical text. The model is trained to label descriptions of behaviors and map them to diagnostic criteria, which are then used to assign a final label (ASD or not). We evaluate transfer learning, the ability to transfer knowledge to new data, using two distinct real-world datasets. We trained on datasets sequentially and mixed together and compared the performance of the best models and their ability to transfer to new data. We also created a black-box approach and repeated this transfer process for comparison.</div><div>Our transparent model demonstrated robust performance, with the mixed-data training strategy yielding the best results (97 % sensitivity, 98 % specificity). Sequential training across datasets led to a slight drop in performance, highlighting the importance of training data order. The black-box model performed worse (90 % sensitivity, 96 % specificity) when trained sequentially or with mixed data.</div><div>Overall, our transparent approach outperformed the black-box approach. Mixing datasets during training resulted in slightly better performance and should be the preferred approach when practically possible. This work paves the way for more trustworthy, generalizable, and clinically actionable AI tools in neurodevelopmental diagnostics.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"172 ","pages":"Article 103318"},"PeriodicalIF":6.2,"publicationDate":"2025-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145662947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Artificial intelligence use and performance in detecting and predicting healthcare-associated infections: A systematic review 人工智能在检测和预测医疗保健相关感染中的应用和性能：系统综述。

IF 6.2 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-11-27 DOI: 10.1016/j.artmed.2025.103321

Chiara Barbati , Luca Viviani , Riccardo Vecchio , Guglielmo Arzilli , Luigi De Angelis , Francesco Baglivo , Lucia Sacchi , Riccardo Bellazzi , Caterina Rizzo , Anna Odone

Objectives

The increasing digitisation of healthcare data and the rapid development of Artificial Intelligence (AI) pave the way for innovative strategies for infectious disease management. This study aimed to systematically retrieve and summarize current evidence on the use and performance of AI-based models for healthcare-associated infection (HAI) detection (i.e., identifying infections already present in available data) and prediction (i.e., estimating future risk based on earlier patient information).

Methods

PubMed, Embase, Scopus and Web of Science were searched for experimental and observational studies published between 1 July 2018 and 12 February 2024. Primary outcomes included technical performance metrics for HAI detection and prediction (e.g. recall, precision, AUROC). Any reported clinical, organisational or economic impacts were evaluated as secondary outcomes.

Results

Of 4489 records initially identified, 121 studies were included. Twenty-five studies (20.6 %) focused on HAI detection, with more than half achieving an AUROC above 0.90. In contrast, studies on HAI prediction (n = 93, 76.9 %) reported more heterogeneous performance. Among studies comparing AI with traditional methods (n = 32), AI models outperformed conventional approaches in 81.3 % of cases (n = 26).

Conclusions

A growing body of evidence suggests that AI models are equal to or superior to traditional methods for HAI detection and prediction, but challenges remain in evaluating performance, with many studies lacking comparators, few prospective evaluations, and limited assessment of organisational impact.

目标：医疗数据的日益数字化和人工智能（AI）的快速发展为传染病管理的创新战略铺平了道路。本研究旨在系统地检索和总结基于人工智能的医疗保健相关感染（HAI）检测（即识别现有数据中已经存在的感染）和预测（即根据早期患者信息估计未来风险）模型的使用和性能的现有证据。方法：检索PubMed、Embase、Scopus和Web of Science，检索2018年7月1日至2024年2月12日发表的实验和观察性研究。主要结果包括HAI检测和预测的技术性能指标（如召回率、精度、AUROC）。任何报告的临床、组织或经济影响被评估为次要结局。结果：在最初确定的4489份记录中，121份研究被纳入。25项研究（20.6%）集中在HAI检测上，超过一半的AUROC高于0.90。相比之下，对HAI预测的研究（n = 93, 76.9%）报告了更多的异质性表现。在比较人工智能与传统方法的研究中（n = 32），人工智能模型在81.3%的情况下优于传统方法（n = 26）。结论：越来越多的证据表明，人工智能模型等于或优于传统的HAI检测和预测方法，但在评估性能方面仍然存在挑战，许多研究缺乏比较物，前瞻性评估很少，对组织影响的评估有限。

{"title":"Artificial intelligence use and performance in detecting and predicting healthcare-associated infections: A systematic review","authors":"Chiara Barbati , Luca Viviani , Riccardo Vecchio , Guglielmo Arzilli , Luigi De Angelis , Francesco Baglivo , Lucia Sacchi , Riccardo Bellazzi , Caterina Rizzo , Anna Odone","doi":"10.1016/j.artmed.2025.103321","DOIUrl":"10.1016/j.artmed.2025.103321","url":null,"abstract":"<div><h3>Objectives</h3><div>The increasing digitisation of healthcare data and the rapid development of Artificial Intelligence (AI) pave the way for innovative strategies for infectious disease management. This study aimed to systematically retrieve and summarize current evidence on the use and performance of AI-based models for healthcare-associated infection (HAI) detection (i.e., identifying infections already present in available data) and prediction (i.e., estimating future risk based on earlier patient information).</div></div><div><h3>Methods</h3><div>PubMed, Embase, Scopus and Web of Science were searched for experimental and observational studies published between 1 July 2018 and 12 February 2024. Primary outcomes included technical performance metrics for HAI detection and prediction (e.g. recall, precision, AUROC). Any reported clinical, organisational or economic impacts were evaluated as secondary outcomes.</div></div><div><h3>Results</h3><div>Of 4489 records initially identified, 121 studies were included. Twenty-five studies (20.6 %) focused on HAI detection, with more than half achieving an AUROC above 0.90. In contrast, studies on HAI prediction (<em>n</em> = 93, 76.9 %) reported more heterogeneous performance. Among studies comparing AI with traditional methods (<em>n</em> = 32), AI models outperformed conventional approaches in 81.3 % of cases (<em>n</em> = 26).</div></div><div><h3>Conclusions</h3><div>A growing body of evidence suggests that AI models are equal to or superior to traditional methods for HAI detection and prediction, but challenges remain in evaluating performance, with many studies lacking comparators, few prospective evaluations, and limited assessment of organisational impact.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"172 ","pages":"Article 103321"},"PeriodicalIF":6.2,"publicationDate":"2025-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145670925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

SurgflowNet: Leveraging unannotated video for consistent endoscopic pituitary surgery workflow recognition SurgflowNet：利用无注释的视频进行一致的内窥镜垂体手术工作流程识别。

IF 6.2 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-11-26 DOI: 10.1016/j.artmed.2025.103309

Anjana Wijekoon , Adrito Das , Zhehua Mao , Danyal Z. Khan , John G. Hanrahan , Danail Stoyanov , Hani J. Marcus , Sophia Bano

Surgical workflow recognition has the potential to accelerate training initiatives through the analysis of surgical videos, improve intraoperative efficiency, and support preemptive postoperative care. Unlike well-explored minimally invasive surgeries, where surgical workflows are consistent across patients, automating endoscopic pituitary surgery workflow recognition is challenging. Pituitary surgery involves a large number of steps, diverse sequences, optional steps, and frequent transitions, making it challenging for current state-of-the-art (SOTA) methods, which struggle with transferability. Progress is largely limited by the lack of annotated data that captures the complexity of pituitary surgery, and obtaining such annotations is both time-consuming and resource-intensive. This paper presents SurgflowNet, a novel spatio-temporal model for consistent pituitary workflow recognition leveraging unannotated data. We utilise a limited yet fully annotated dataset to infer quasi-labels for unannotated videos and curate a balanced dataset to train a robust frame encoder using the student–teacher framework. A spatio-temporal network that combines the resulting frame encoder and an LSTM network is trained with a consistency loss to ensure stability in step predictions. With a 5% improvement in macro F₁-score and 13.4% in Edit Score over the SOTA, SurgflowNetdemonstrates a significant improvement in workflow recognition for endoscopic pituitary surgery.

手术工作流程识别有可能通过分析手术视频来加快培训计划，提高术中效率，并支持先发制人的术后护理。与微创手术不同，微创手术的手术工作流程在患者之间是一致的，自动化内窥镜垂体手术工作流程识别是具有挑战性的。垂体手术涉及大量的步骤，不同的序列，可选的步骤和频繁的转换，使其对当前最先进的（SOTA）方法具有挑战性，这些方法难以转移。由于缺乏能够捕捉垂体手术复杂性的注释数据，进展在很大程度上受到限制，并且获得此类注释既耗时又耗费资源。本文提出了SurgflowNet，这是一种利用未注释数据进行一致垂体工作流识别的新型时空模型。我们利用有限但完全注释的数据集来推断未注释视频的准标签，并策划一个平衡的数据集来训练使用学生-教师框架的鲁棒帧编码器。结合生成的帧编码器和LSTM网络的时空网络进行了一致性损失训练，以确保步长预测的稳定性。与SOTA相比，surgflownetmacro f1评分提高了5%，Edit评分提高了13.4%，在垂体内窥镜手术的工作流程识别方面有了显著的提高。

{"title":"SurgflowNet: Leveraging unannotated video for consistent endoscopic pituitary surgery workflow recognition","authors":"Anjana Wijekoon , Adrito Das , Zhehua Mao , Danyal Z. Khan , John G. Hanrahan , Danail Stoyanov , Hani J. Marcus , Sophia Bano","doi":"10.1016/j.artmed.2025.103309","DOIUrl":"10.1016/j.artmed.2025.103309","url":null,"abstract":"<div><div>Surgical workflow recognition has the potential to accelerate training initiatives through the analysis of surgical videos, improve intraoperative efficiency, and support preemptive postoperative care. Unlike well-explored minimally invasive surgeries, where surgical workflows are consistent across patients, automating endoscopic pituitary surgery workflow recognition is challenging. Pituitary surgery involves a large number of steps, diverse sequences, optional steps, and frequent transitions, making it challenging for current state-of-the-art (SOTA) methods, which struggle with transferability. Progress is largely limited by the lack of annotated data that captures the complexity of pituitary surgery, and obtaining such annotations is both time-consuming and resource-intensive. This paper presents SurgflowNet, a novel spatio-temporal model for consistent pituitary workflow recognition leveraging unannotated data. We utilise a limited yet fully annotated dataset to infer quasi-labels for unannotated videos and curate a balanced dataset to train a robust frame encoder using the student–teacher framework. A spatio-temporal network that combines the resulting frame encoder and an LSTM network is trained with a consistency loss to ensure stability in step predictions. With a 5% improvement in macro F<sub>1</sub>-score and 13.4% in Edit Score over the SOTA, SurgflowNetdemonstrates a significant improvement in workflow recognition for endoscopic pituitary surgery.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"172 ","pages":"Article 103309"},"PeriodicalIF":6.2,"publicationDate":"2025-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145662970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Leveraging artificial intelligence in advance care planning: A scoping review 利用人工智能提前护理计划：范围审查

IF 6.2 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-11-26 DOI: 10.1016/j.artmed.2025.103315

Minghui Tan , Siyuan Tang , Zhao Ni , Shichao Kan , Paul Macharia , Haojie Zhang , Hao Yi , Guo Li , Jinfeng Ding

Background

Advance care planning (ACP) is a process that enables individuals to discuss future health care decisions before they become seriously ill or unable to communicate. Artificial intelligence (AI) has demonstrated promising outcomes in facilitating healthcare, offering the potential to facilitate ACP. However, the current status of using AI to facilitate ACP is unclear. This study aimed to investigate how AI has been leveraged to facilitate ACP, with a particular focus on the intended purposes, AI algorithms used, data sources, and the performance of AI in achieving the intended purposes.

Methods

The methodology employed in this study adhered to the Scoping Review Methodological Framework. PubMed, EMBASE, Web of Science, CINAHL, Cochrane Library, and IEEE Xplore databases were searched from their inception to July 2025. Descriptive analyses and narrative synthesis were used to summarize findings from the included studies.

Results

A total of 42 eligible studies were analyzed. The studies were primarily used to detect ACP conversations and documents, identify patients needing ACP, promote ACP education, and explore linguistic features in ACP conversations. Rule-based natural language processing emerged as the most commonly used AI algorithm, with textual data being the primary modality employed. The included studies exhibited significant variation in performance evaluation.

Finding

The current use of AI in ACP remains limited in scope, primarily focusing on extracting ACP documentation from electronic health records and identifying patients who may benefit from ACP. The use of advanced technologies such as generative AI is limited, and performance evaluation primarily relies on discrimination metrics.

预先护理计划（ACP）是一个过程，使个人能够讨论未来的医疗保健决策之前，他们变得严重疾病或无法沟通。人工智能（AI）在促进医疗保健方面已显示出有希望的成果，为促进ACP提供了潜力。然而，利用人工智能促进ACP的现状尚不清楚。本研究旨在调查人工智能如何被利用来促进ACP，特别关注预期目的、使用的人工智能算法、数据源以及人工智能在实现预期目的方面的表现。方法本研究采用的方法学遵循范围审查方法学框架。检索了PubMed、EMBASE、Web of Science、CINAHL、Cochrane Library和IEEE explore数据库，检索时间从它们成立到2025年7月。使用描述性分析和叙述性综合来总结纳入研究的结果。结果共分析了42项符合条件的研究。本研究主要用于检测ACP会话和文献，识别需要ACP的患者，促进ACP教育，探索ACP会话的语言特征。基于规则的自然语言处理成为最常用的人工智能算法，文本数据是使用的主要形式。纳入的研究在绩效评价方面表现出显著差异。目前人工智能在ACP中的应用范围仍然有限，主要集中在从电子健康记录中提取ACP文档和识别可能受益于ACP的患者。生成式人工智能等先进技术的使用受到限制，绩效评估主要依赖于歧视指标。

{"title":"Leveraging artificial intelligence in advance care planning: A scoping review","authors":"Minghui Tan , Siyuan Tang , Zhao Ni , Shichao Kan , Paul Macharia , Haojie Zhang , Hao Yi , Guo Li , Jinfeng Ding","doi":"10.1016/j.artmed.2025.103315","DOIUrl":"10.1016/j.artmed.2025.103315","url":null,"abstract":"<div><h3>Background</h3><div>Advance care planning (ACP) is a process that enables individuals to discuss future health care decisions before they become seriously ill or unable to communicate. Artificial intelligence (AI) has demonstrated promising outcomes in facilitating healthcare, offering the potential to facilitate ACP. However, the current status of using AI to facilitate ACP is unclear. This study aimed to investigate how AI has been leveraged to facilitate ACP, with a particular focus on the intended purposes, AI algorithms used, data sources, and the performance of AI in achieving the intended purposes.</div></div><div><h3>Methods</h3><div>The methodology employed in this study adhered to the Scoping Review Methodological Framework. PubMed, EMBASE, Web of Science, CINAHL, Cochrane Library, and IEEE Xplore databases were searched from their inception to July 2025. Descriptive analyses and narrative synthesis were used to summarize findings from the included studies.</div></div><div><h3>Results</h3><div>A total of 42 eligible studies were analyzed. The studies were primarily used to detect ACP conversations and documents, identify patients needing ACP, promote ACP education, and explore linguistic features in ACP conversations. Rule-based natural language processing emerged as the most commonly used AI algorithm, with textual data being the primary modality employed. The included studies exhibited significant variation in performance evaluation.</div></div><div><h3>Finding</h3><div>The current use of AI in ACP remains limited in scope, primarily focusing on extracting ACP documentation from electronic health records and identifying patients who may benefit from ACP. The use of advanced technologies such as generative AI is limited, and performance evaluation primarily relies on discrimination metrics.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"172 ","pages":"Article 103315"},"PeriodicalIF":6.2,"publicationDate":"2025-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145685037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Using artificial intelligence to predict patient wait times in the emergency department: A scoping review 使用人工智能预测急诊科病人的等待时间：范围审查

IF 6.2 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-11-25 DOI: 10.1016/j.artmed.2025.103316

Troy Gloyn , Christina Seo , Alexandra Godinho , Rahul Rahul , Siona Phadke , Hilary Fotheringham , Pete Wegier

<div><h3>Objective</h3><div>The purpose of this review was to comprehensively explore the landscape of recently published literature on the applications of artificial intelligence (AI) in predicting individualized patient waiting times in an emergency department (ED) and identify pertinent considerations for practitioners and hospital decision-makers.</div></div><div><h3>Introduction</h3><div>ED overcrowding is being experienced by hospitals around the globe and has worsened in the post COVID-19 era. The negative patient and staff experiences and poor clinical outcomes from overcrowding are evident and necessitate solutions to address this ongoing problem. Hospitals providing ED waiting time estimates to patients and staff are becoming popular; however, the more common methods, such as using rolling averages, suffer from an inability to capture the nuanced relationships within an ED. Recent applications of AI and machine learning (ML) in healthcare raises the possibility of applying these techniques to individualized waiting time predictions in the ED; although, literature on the topic is sparse.</div></div><div><h3>Methods</h3><div>A systematized search was conducted on November 10th, 2025, using the electronic databases CINAHL, EMBASE (OVID), Medline (OVID), PsychINFO, Web of Science, and PubMed. Articles were considered for review if written in English, peer-reviewed, published after 2014, and used AI techniques. Descriptive analysis was performed on the final extracted data to facilitate the identification of common themes across studies. Themes were inferred from the proportional usage among studies, of different data preparation, feature selection, and modeling strategies.</div></div><div><h3>Results</h3><div>The search identified 8613 citations that, after a rigorous screening process and critical appraisal, were narrowed down to 15 studies for final review. Most included studies were observational, using historical medical record data to compare modeling techniques or demonstrate a proof of concept. Studies commonly used one or more of ED queue-based, staff/resource-based, patient-based, and time-based feature categories. Incorporated AI methods included Random Forest, Linear Regression, and Least Absolute Shrinkage and Selection Operator (LASSO) techniques, among several others. All forms of AI and ML outperformed traditional rolling average estimates used by hospitals.</div></div><div><h3>Conclusions</h3><div>This review identified applications of AI in predicting individualized patient waiting times in the ED that outperform current waiting time estimate strategies. The use of nonlinear techniques, such as the Random Forest method, or incorporating queue-based feature categories, appeared to provide better performance in predictive estimates. Depending on the end user and modality in which the wait time estimate is conveyed, the importance of model selection is highlighted as a consideration to be made if overestimates or underestimate

目的：本综述的目的是全面探讨最近发表的关于人工智能（AI）在预测急诊室（ED）个性化患者等待时间方面的应用的文献，并确定从业人员和医院决策者的相关考虑因素。全球各地的医院都在经历着过度拥挤的情况，并且在后COVID-19时代恶化了。过度拥挤给患者和工作人员带来的负面体验以及糟糕的临床结果是显而易见的，需要解决这一持续存在的问题。向病人和工作人员提供急诊科等待时间估计的医院越来越受欢迎；然而，更常见的方法，如使用滚动平均，无法捕捉急诊科内部的细微关系。最近人工智能和机器学习（ML）在医疗保健领域的应用，提高了将这些技术应用于急诊科个性化等待时间预测的可能性；虽然，关于这个话题的文献很少。方法于2025年11月10日系统检索，检索的电子数据库为CINAHL、EMBASE （OVID）、Medline （OVID）、PsychINFO、Web of Science和PubMed。如果文章是用英文写的，经过同行评议，在2014年之后发表，并且使用了人工智能技术，则会被考虑进行审查。对最终提取的数据进行描述性分析，以促进识别研究中的共同主题。主题是从研究中不同数据准备、特征选择和建模策略的比例使用中推断出来的。经过严格的筛选过程和严格的评估，搜索确定了8613条引用，最终被缩小到15项研究。大多数纳入的研究是观察性的，使用历史医疗记录数据来比较建模技术或证明概念。研究通常使用一种或多种基于急诊队列的、基于员工/资源的、基于患者的和基于时间的特征类别。整合的人工智能方法包括随机森林、线性回归、最小绝对收缩和选择算子（LASSO）技术等。所有形式的人工智能和机器学习都优于医院使用的传统滚动平均估计。本综述确定了人工智能在预测急诊科个体化患者等待时间方面的应用，优于当前的等待时间估计策略。使用非线性技术，如随机森林方法，或结合基于队列的特征类别，似乎在预测估计中提供了更好的性能。根据最终用户和传递等待时间估计的方式，如果需要过高估计或过低估计，则强调模型选择的重要性。

{"title":"Using artificial intelligence to predict patient wait times in the emergency department: A scoping review","authors":"Troy Gloyn , Christina Seo , Alexandra Godinho , Rahul Rahul , Siona Phadke , Hilary Fotheringham , Pete Wegier","doi":"10.1016/j.artmed.2025.103316","DOIUrl":"10.1016/j.artmed.2025.103316","url":null,"abstract":"<div><h3>Objective</h3><div>The purpose of this review was to comprehensively explore the landscape of recently published literature on the applications of artificial intelligence (AI) in predicting individualized patient waiting times in an emergency department (ED) and identify pertinent considerations for practitioners and hospital decision-makers.</div></div><div><h3>Introduction</h3><div>ED overcrowding is being experienced by hospitals around the globe and has worsened in the post COVID-19 era. The negative patient and staff experiences and poor clinical outcomes from overcrowding are evident and necessitate solutions to address this ongoing problem. Hospitals providing ED waiting time estimates to patients and staff are becoming popular; however, the more common methods, such as using rolling averages, suffer from an inability to capture the nuanced relationships within an ED. Recent applications of AI and machine learning (ML) in healthcare raises the possibility of applying these techniques to individualized waiting time predictions in the ED; although, literature on the topic is sparse.</div></div><div><h3>Methods</h3><div>A systematized search was conducted on November 10th, 2025, using the electronic databases CINAHL, EMBASE (OVID), Medline (OVID), PsychINFO, Web of Science, and PubMed. Articles were considered for review if written in English, peer-reviewed, published after 2014, and used AI techniques. Descriptive analysis was performed on the final extracted data to facilitate the identification of common themes across studies. Themes were inferred from the proportional usage among studies, of different data preparation, feature selection, and modeling strategies.</div></div><div><h3>Results</h3><div>The search identified 8613 citations that, after a rigorous screening process and critical appraisal, were narrowed down to 15 studies for final review. Most included studies were observational, using historical medical record data to compare modeling techniques or demonstrate a proof of concept. Studies commonly used one or more of ED queue-based, staff/resource-based, patient-based, and time-based feature categories. Incorporated AI methods included Random Forest, Linear Regression, and Least Absolute Shrinkage and Selection Operator (LASSO) techniques, among several others. All forms of AI and ML outperformed traditional rolling average estimates used by hospitals.</div></div><div><h3>Conclusions</h3><div>This review identified applications of AI in predicting individualized patient waiting times in the ED that outperform current waiting time estimate strategies. The use of nonlinear techniques, such as the Random Forest method, or incorporating queue-based feature categories, appeared to provide better performance in predictive estimates. Depending on the end user and modality in which the wait time estimate is conveyed, the importance of model selection is highlighted as a consideration to be made if overestimates or underestimate","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"171 ","pages":"Article 103316"},"PeriodicalIF":6.2,"publicationDate":"2025-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145624191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Context-aware heterogeneous graph neural network for multi-level description and invasiveness prediction in renal cell carcinoma 上下文感知的异构图神经网络用于肾细胞癌的多级描述和侵袭性预测

IF 6.2 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-11-25 DOI: 10.1016/j.artmed.2025.103313

Xiaoming Jiang , Guoying Ji , Ye Yan , Xiongjun Ye , Chao Liang , Bao Li , Wei Wang , Shudong Zhang , Lizhi Shao

The invasiveness prediction in renal cell carcinoma (RCC) is of significant importance for the decision of clinical surgical plans and the patients' prognosis. Currently, besides invasive pathological assessment, it mainly relies on observation through computed tomography (CT) imaging. However, limitations of human vision and qualitative descriptions restrict the accuracy of the diagnosis of renal sinus invasion (RSI). Recently, artificial intelligence approaches have shown promising prospects in cancer diagnosis. Due to the complex imaging characteristics of invasiveness, prediction models that only focus on tumor regions are inadequate, requiring comprehensive evaluation of intratumoral heterogeneity, peritumoral information, and the kidney in which the tumor resides. Therefore, in this study, we propose a context-aware heterogeneous graph neural network for multi-level description and invasiveness prediction in RCC. The superiority of the proposed model lies in its ability to integrate imaging features at multi-level, and to learn disturbance invariant features through a data-driven diffusion perturbation strategy. To evaluate the effectiveness and generalization of our model, we conduct extensive experiments on a multi-center dataset (including CT scan images of 437 patients) to compare our model with a series of state-of-the-art (SOTA) classification models. The experimental results show the superiority of our model for RSI classification (

AUC = 0.88

). Additionally, we also perform a comparative study with clinical experts, and the proposed method is significantly better than existing assessment methods and clinical experts (

p < 0.05

). In general, our work provides an effective assessment tool for automated diagnosis of RSI in RCC and also offers new insights for constructing more precise tumor prediction models.

肾细胞癌（RCC）的侵袭性预测对临床手术方案的制定和患者的预后具有重要意义。目前，除有创性病理评估外，主要依靠CT成像观察。然而，人类视觉和定性描述的局限性限制了肾窦侵犯（RSI）诊断的准确性。最近，人工智能方法在癌症诊断中显示出了良好的前景。由于侵袭性的复杂影像学特征，仅关注肿瘤区域的预测模型是不够的，需要综合评估肿瘤内异质性、肿瘤周围信息和肿瘤所在肾脏。因此，在本研究中，我们提出了一个上下文感知的异构图神经网络，用于RCC的多层次描述和入侵预测。该模型的优势在于能够多层次地整合成像特征，并通过数据驱动的扩散摄动策略学习扰动不变特征。为了评估我们模型的有效性和泛化性，我们在一个多中心数据集（包括437名患者的CT扫描图像）上进行了广泛的实验，将我们的模型与一系列最先进的（SOTA）分类模型进行了比较。实验结果表明，该模型在RSI分类上具有优势（AUC=0.88）。此外，我们还与临床专家进行了对比研究，提出的方法明显优于现有的评估方法和临床专家（p<0.05）。总的来说，我们的工作为RCC中RSI的自动诊断提供了有效的评估工具，也为构建更精确的肿瘤预测模型提供了新的见解。

{"title":"Context-aware heterogeneous graph neural network for multi-level description and invasiveness prediction in renal cell carcinoma","authors":"Xiaoming Jiang , Guoying Ji , Ye Yan , Xiongjun Ye , Chao Liang , Bao Li , Wei Wang , Shudong Zhang , Lizhi Shao","doi":"10.1016/j.artmed.2025.103313","DOIUrl":"10.1016/j.artmed.2025.103313","url":null,"abstract":"<div><div>The invasiveness prediction in renal cell carcinoma (RCC) is of significant importance for the decision of clinical surgical plans and the patients' prognosis. Currently, besides invasive pathological assessment, it mainly relies on observation through computed tomography (CT) imaging. However, limitations of human vision and qualitative descriptions restrict the accuracy of the diagnosis of renal sinus invasion (RSI). Recently, artificial intelligence approaches have shown promising prospects in cancer diagnosis. Due to the complex imaging characteristics of invasiveness, prediction models that only focus on tumor regions are inadequate, requiring comprehensive evaluation of intratumoral heterogeneity, peritumoral information, and the kidney in which the tumor resides. Therefore, in this study, we propose a context-aware heterogeneous graph neural network for multi-level description and invasiveness prediction in RCC. The superiority of the proposed model lies in its ability to integrate imaging features at multi-level, and to learn disturbance invariant features through a data-driven diffusion perturbation strategy. To evaluate the effectiveness and generalization of our model, we conduct extensive experiments on a multi-center dataset (including CT scan images of 437 patients) to compare our model with a series of state-of-the-art (SOTA) classification models. The experimental results show the superiority of our model for RSI classification (<span><math><mi>AUC</mi><mo>=</mo><mn>0.88</mn></math></span>). Additionally, we also perform a comparative study with clinical experts, and the proposed method is significantly better than existing assessment methods and clinical experts (<span><math><mi>p</mi><mo><</mo><mn>0.05</mn></math></span>). In general, our work provides an effective assessment tool for automated diagnosis of RSI in RCC and also offers new insights for constructing more precise tumor prediction models.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"172 ","pages":"Article 103313"},"PeriodicalIF":6.2,"publicationDate":"2025-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145625344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0