首页 > 最新文献

Artificial Intelligence in Medicine最新文献

英文 中文
Context-aware heterogeneous graph neural network for multi-level description and invasiveness prediction in renal cell carcinoma 上下文感知的异构图神经网络用于肾细胞癌的多级描述和侵袭性预测
IF 6.2 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-25 DOI: 10.1016/j.artmed.2025.103313
Xiaoming Jiang , Guoying Ji , Ye Yan , Xiongjun Ye , Chao Liang , Bao Li , Wei Wang , Shudong Zhang , Lizhi Shao
The invasiveness prediction in renal cell carcinoma (RCC) is of significant importance for the decision of clinical surgical plans and the patients' prognosis. Currently, besides invasive pathological assessment, it mainly relies on observation through computed tomography (CT) imaging. However, limitations of human vision and qualitative descriptions restrict the accuracy of the diagnosis of renal sinus invasion (RSI). Recently, artificial intelligence approaches have shown promising prospects in cancer diagnosis. Due to the complex imaging characteristics of invasiveness, prediction models that only focus on tumor regions are inadequate, requiring comprehensive evaluation of intratumoral heterogeneity, peritumoral information, and the kidney in which the tumor resides. Therefore, in this study, we propose a context-aware heterogeneous graph neural network for multi-level description and invasiveness prediction in RCC. The superiority of the proposed model lies in its ability to integrate imaging features at multi-level, and to learn disturbance invariant features through a data-driven diffusion perturbation strategy. To evaluate the effectiveness and generalization of our model, we conduct extensive experiments on a multi-center dataset (including CT scan images of 437 patients) to compare our model with a series of state-of-the-art (SOTA) classification models. The experimental results show the superiority of our model for RSI classification (AUC=0.88). Additionally, we also perform a comparative study with clinical experts, and the proposed method is significantly better than existing assessment methods and clinical experts (p<0.05). In general, our work provides an effective assessment tool for automated diagnosis of RSI in RCC and also offers new insights for constructing more precise tumor prediction models.
肾细胞癌(RCC)的侵袭性预测对临床手术方案的制定和患者的预后具有重要意义。目前,除有创性病理评估外,主要依靠CT成像观察。然而,人类视觉和定性描述的局限性限制了肾窦侵犯(RSI)诊断的准确性。最近,人工智能方法在癌症诊断中显示出了良好的前景。由于侵袭性的复杂影像学特征,仅关注肿瘤区域的预测模型是不够的,需要综合评估肿瘤内异质性、肿瘤周围信息和肿瘤所在肾脏。因此,在本研究中,我们提出了一个上下文感知的异构图神经网络,用于RCC的多层次描述和入侵预测。该模型的优势在于能够多层次地整合成像特征,并通过数据驱动的扩散摄动策略学习扰动不变特征。为了评估我们模型的有效性和泛化性,我们在一个多中心数据集(包括437名患者的CT扫描图像)上进行了广泛的实验,将我们的模型与一系列最先进的(SOTA)分类模型进行了比较。实验结果表明,该模型在RSI分类上具有优势(AUC=0.88)。此外,我们还与临床专家进行了对比研究,提出的方法明显优于现有的评估方法和临床专家(p<0.05)。总的来说,我们的工作为RCC中RSI的自动诊断提供了有效的评估工具,也为构建更精确的肿瘤预测模型提供了新的见解。
{"title":"Context-aware heterogeneous graph neural network for multi-level description and invasiveness prediction in renal cell carcinoma","authors":"Xiaoming Jiang ,&nbsp;Guoying Ji ,&nbsp;Ye Yan ,&nbsp;Xiongjun Ye ,&nbsp;Chao Liang ,&nbsp;Bao Li ,&nbsp;Wei Wang ,&nbsp;Shudong Zhang ,&nbsp;Lizhi Shao","doi":"10.1016/j.artmed.2025.103313","DOIUrl":"10.1016/j.artmed.2025.103313","url":null,"abstract":"<div><div>The invasiveness prediction in renal cell carcinoma (RCC) is of significant importance for the decision of clinical surgical plans and the patients' prognosis. Currently, besides invasive pathological assessment, it mainly relies on observation through computed tomography (CT) imaging. However, limitations of human vision and qualitative descriptions restrict the accuracy of the diagnosis of renal sinus invasion (RSI). Recently, artificial intelligence approaches have shown promising prospects in cancer diagnosis. Due to the complex imaging characteristics of invasiveness, prediction models that only focus on tumor regions are inadequate, requiring comprehensive evaluation of intratumoral heterogeneity, peritumoral information, and the kidney in which the tumor resides. Therefore, in this study, we propose a context-aware heterogeneous graph neural network for multi-level description and invasiveness prediction in RCC. The superiority of the proposed model lies in its ability to integrate imaging features at multi-level, and to learn disturbance invariant features through a data-driven diffusion perturbation strategy. To evaluate the effectiveness and generalization of our model, we conduct extensive experiments on a multi-center dataset (including CT scan images of 437 patients) to compare our model with a series of state-of-the-art (SOTA) classification models. The experimental results show the superiority of our model for RSI classification (<span><math><mi>AUC</mi><mo>=</mo><mn>0.88</mn></math></span>). Additionally, we also perform a comparative study with clinical experts, and the proposed method is significantly better than existing assessment methods and clinical experts (<span><math><mi>p</mi><mo>&lt;</mo><mn>0.05</mn></math></span>). In general, our work provides an effective assessment tool for automated diagnosis of RSI in RCC and also offers new insights for constructing more precise tumor prediction models.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"172 ","pages":"Article 103313"},"PeriodicalIF":6.2,"publicationDate":"2025-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145625344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A labeled ophthalmic ultrasound dataset with medical report generation based on cross-modal deep learning 基于跨模态深度学习的带医疗报告生成的标记眼科超声数据集
IF 6.2 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-24 DOI: 10.1016/j.artmed.2025.103317
Jing Wang , Junyan Fan , Meng Zhou , Yanzhu Zhang , Mingyu Shi
Ultrasound imaging reveals eye morphology and aids in diagnosing and treating eye diseases. However, interpreting diagnostic reports requires specialized physicians. We present a labeled ophthalmic dataset for the precise analysis and the automated exploration of medical images along with their associated reports. It collects three modal data, including the ultrasound images, blood flow information and examination reports from 1,0361 patients at an ophthalmology hospital in Shenyang, China, during the year 2016 to 2020, in which the patient information is de-identified for privacy protection. To the best of our knowledge, it is the only ophthalmic dataset that contains the three modal information simultaneously. It incrementally consists of 2,2173 images with the corresponding free-text reports, which describe 10 typical imaging findings of intraocular diseases and the corresponding anatomical locations. Each image shows three kinds of blood flow indices at three specific arteries, i.e., nine parameter values to describe the spectral characteristics of blood flow distribution. The reports were written by ophthalmologists during the clinical care. In addition, the knowledge fusion cross modal network (KFCMN) is proposed to generate report according to the proposed dataset. The experimental results demonstrate that our dataset is suitable for training supervised models concerning cross-modal medical data.
超声成像显示眼部形态,有助于眼部疾病的诊断和治疗。然而,解释诊断报告需要专业的医生。我们提出了一个标记的眼科数据集,用于精确分析和自动探索医学图像及其相关报告。它收集了2016年至2020年中国沈阳某眼科医院10361名患者的超声图像、血流信息和检查报告三种模态数据,其中患者信息被去识别以保护隐私。据我们所知,这是唯一同时包含三种模态信息的眼科数据集。它逐渐由2,2173张图像和相应的自由文本报告组成,这些图像描述了10种典型的眼内疾病的影像学表现和相应的解剖位置。每张图像显示三个特定动脉的三种血流指标,即9个参数值来描述血流分布的频谱特征。报告由眼科医生在临床护理过程中撰写。此外,提出了知识融合跨模态网络(KFCMN),根据提出的数据集生成报告。实验结果表明,我们的数据集适合于训练跨模式医学数据的监督模型。
{"title":"A labeled ophthalmic ultrasound dataset with medical report generation based on cross-modal deep learning","authors":"Jing Wang ,&nbsp;Junyan Fan ,&nbsp;Meng Zhou ,&nbsp;Yanzhu Zhang ,&nbsp;Mingyu Shi","doi":"10.1016/j.artmed.2025.103317","DOIUrl":"10.1016/j.artmed.2025.103317","url":null,"abstract":"<div><div>Ultrasound imaging reveals eye morphology and aids in diagnosing and treating eye diseases. However, interpreting diagnostic reports requires specialized physicians. We present a labeled ophthalmic dataset for the precise analysis and the automated exploration of medical images along with their associated reports. It collects three modal data, including the ultrasound images, blood flow information and examination reports from 1,0361 patients at an ophthalmology hospital in Shenyang, China, during the year 2016 to 2020, in which the patient information is de-identified for privacy protection. To the best of our knowledge, it is the only ophthalmic dataset that contains the three modal information simultaneously. It incrementally consists of 2,2173 images with the corresponding free-text reports, which describe 10 typical imaging findings of intraocular diseases and the corresponding anatomical locations. Each image shows three kinds of blood flow indices at three specific arteries, i.e., nine parameter values to describe the spectral characteristics of blood flow distribution. The reports were written by ophthalmologists during the clinical care. In addition, the knowledge fusion cross modal network (KFCMN) is proposed to generate report according to the proposed dataset. The experimental results demonstrate that our dataset is suitable for training supervised models concerning cross-modal medical data.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"171 ","pages":"Article 103317"},"PeriodicalIF":6.2,"publicationDate":"2025-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145624204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MedSumGraph: enhancing GraphRAG for medical QA with summarization and optimized prompts MedSumGraph:通过摘要和优化提示增强GraphRAG用于医疗质量保证
IF 6.2 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-24 DOI: 10.1016/j.artmed.2025.103311
DaeHo Kim , SoYeop Yoo , OkRan Jeong
The rapid development of large language models (LLMs) has accelerated research into applying artificial intelligence (AI) to domains such as medical question answering and clinical decision support. However, LLMs face substantial limitations in medical contexts due to challenges in understanding specialized terminology, complex contextual information, hallucination issues (i.e., generating incorrect responses), and the black-box nature of their reasoning processes. To address these issues, methods like retrieval-augmented generation (RAG) and its graph-based variant, GraphRAG, have been proposed to incorporate external knowledge into LLMs. Nonetheless, these approaches often rely heavily on external resources and increase system complexity. In this study, we introduce MedSumGraph, a medical question-answering system that enhances GraphRAG by integrating structured medical knowledge summaries and optimized prompt designs. Our method enables LLMs to better interpret domain-specific knowledge without requiring additional training, and it enhances the reliability and interpretability of responses by directly embedding factual evidence and graph-based reasoning into the generation process. MedSumGraph achieves competitive performance on two out of eight multiple-choice medical QA benchmarks, including MedQA (USMLE), outperforming closed-source LLMs and domain-specific foundation models. Moreover, it generalizes effectively to open-domain QA tasks, yielding significant gains in reasoning over common knowledge and evaluating the truthfulness of answers. These findings demonstrate the potential of structured summarization and graph-based reasoning in enhancing the trustworthiness and versatility of LLM-driven medical AI systems.
大型语言模型(llm)的快速发展加速了人工智能(AI)在医学问答和临床决策支持等领域的应用研究。然而,法学硕士在医学环境中面临着很大的限制,因为在理解专业术语、复杂的上下文信息、幻觉问题(即产生错误的反应)和他们推理过程的黑箱性质方面存在挑战。为了解决这些问题,已经提出了诸如检索增强生成(RAG)及其基于图的变体GraphRAG之类的方法来将外部知识合并到llm中。尽管如此,这些方法通常严重依赖外部资源并增加系统复杂性。在本研究中,我们介绍MedSumGraph,这是一个医学问答系统,通过集成结构化的医学知识摘要和优化的提示设计来增强GraphRAG。我们的方法使法学硕士能够在不需要额外培训的情况下更好地解释特定领域的知识,并且通过直接将事实证据和基于图的推理嵌入到生成过程中,它增强了响应的可靠性和可解释性。MedSumGraph在包括MedQA (USMLE)在内的8项多项选择医学QA基准测试中的2项上取得了具有竞争力的表现,优于闭源法学硕士和特定领域基础模型。此外,它有效地推广到开放领域的QA任务,在对常识的推理和评估答案的真实性方面取得了重大进展。这些发现证明了结构化摘要和基于图形的推理在增强法学硕士驱动的医疗人工智能系统的可信度和多功能性方面的潜力。
{"title":"MedSumGraph: enhancing GraphRAG for medical QA with summarization and optimized prompts","authors":"DaeHo Kim ,&nbsp;SoYeop Yoo ,&nbsp;OkRan Jeong","doi":"10.1016/j.artmed.2025.103311","DOIUrl":"10.1016/j.artmed.2025.103311","url":null,"abstract":"<div><div>The rapid development of large language models (LLMs) has accelerated research into applying artificial intelligence (AI) to domains such as medical question answering and clinical decision support. However, LLMs face substantial limitations in medical contexts due to challenges in understanding specialized terminology, complex contextual information, hallucination issues (i.e., generating incorrect responses), and the black-box nature of their reasoning processes. To address these issues, methods like retrieval-augmented generation (RAG) and its graph-based variant, GraphRAG, have been proposed to incorporate external knowledge into LLMs. Nonetheless, these approaches often rely heavily on external resources and increase system complexity. In this study, we introduce MedSumGraph, a medical question-answering system that enhances GraphRAG by integrating structured medical knowledge summaries and optimized prompt designs. Our method enables LLMs to better interpret domain-specific knowledge without requiring additional training, and it enhances the reliability and interpretability of responses by directly embedding factual evidence and graph-based reasoning into the generation process. MedSumGraph achieves competitive performance on two out of eight multiple-choice medical QA benchmarks, including MedQA (USMLE), outperforming closed-source LLMs and domain-specific foundation models. Moreover, it generalizes effectively to open-domain QA tasks, yielding significant gains in reasoning over common knowledge and evaluating the truthfulness of answers. These findings demonstrate the potential of structured summarization and graph-based reasoning in enhancing the trustworthiness and versatility of LLM-driven medical AI systems.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"172 ","pages":"Article 103311"},"PeriodicalIF":6.2,"publicationDate":"2025-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145625343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Do machine learning methods make better predictions than conventional ones in pharmacoepidemiology? A systematic review, meta-analysis, and network meta-analysis 在药物流行病学中,机器学习方法是否比传统方法做出更好的预测?系统回顾、元分析和网络元分析
IF 6.2 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-23 DOI: 10.1016/j.artmed.2025.103312
Ana Paula Bruno Pena-Gralle , Mireille E. Schnitzer , Sofia-Nada Boureguaa , Félix Morin , Marc-André Legault , Caroline Sirois , Alice Dragomir , Lucie Blais

Objective

To synthesize existing evidence and compare the predictive performance of conventional statistical (CS) models versus machine learning (ML) methods in pharmacoepidemiology.

Methods

Medline, Embase, PsycINFO, CINAHL and Web of Science databases were systematically searched for predictive pharmacoepidemiologic studies published between January 2018 and September 2025. Independent reviewers extracted predictive metrics and other data from each study and assessed the quality of the comparison between methods. The relative performance of ML compared to CS was estimated for each prediction objective. Performance metrics were pooled in meta-analyses and Bayesian network meta-analyses (NMA).

Results

Among 9106 records identified, 65 studies met inclusion criteria, encompassing 83 prediction objectives. For 84 % of these objectives, CS was outperformed by at least one ML method. The median sample size across these studies was 2691 subjects (50 to 1,807,159), and, for binary outcomes, the median number of events per candidate predictor was 17.9 (range 0.28 to 24,260). We observed a risk of bias in the comparison according to at least one of eight major criteria for 39 prediction objectives (47 %).
The pooled area under the receiver-operator curve (AUC) ratio for the highest-performing ML method in studies with low risk of bias was estimated as 1.07 (95 % confidence interval 1.03–1.12) in favor of ML, but with very high heterogeneity. NMA of 197 comparisons estimated an AUC ratio of 1.07 (95 % credible interval 1.04–1.12) for boosted methods compared with logistic regression and ranked Gradient Boosting Machine and XGBoost consistently among the best-performing methods.

Conclusion

Machine learning methods applied to structured pharmacoepidemiologic data demonstrated a consistent yet modest advantage in discriminative performance relative to conventional statistical models. This advantage was most evident for boosted methods such as GBM and XGBoost. However, greater rigor in reporting methodological details is recommended to improve the comprehension, transparency, and reproducibility of studies.

Registration

PROSPERO 2023 registration number: CRD42023426986.
目的综合现有证据,比较传统统计(CS)模型与机器学习(ML)方法在药物流行病学中的预测效果。方法系统检索medline、Embase、PsycINFO、CINAHL和Web of Science数据库,检索2018年1月至2025年9月期间发表的预测性药物流行病学研究。独立审稿人从每个研究中提取预测指标和其他数据,并评估方法之间比较的质量。对于每个预测目标,估计ML与CS的相对性能。在荟萃分析和贝叶斯网络荟萃分析(NMA)中汇总了绩效指标。结果在9106份文献中,65项研究符合纳入标准,包括83个预测目标。对于84%的目标,CS至少被一种ML方法优于。这些研究的中位数样本量为2691名受试者(50至1,807,159名),对于二元结果,每个候选预测因子的中位数事件数为17.9(范围为0.28至24,260)。根据39个预测目标(47%)的8个主要标准中的至少一个,我们观察到在比较中存在偏倚风险。在低偏倚风险的研究中,表现最好的ML方法的受试者-操作者曲线下的汇总面积(AUC)比估计为1.07(95%置信区间1.03-1.12),有利于ML,但异质性非常高。197个比较的NMA估计,与逻辑回归相比,增强方法的AUC比率为1.07(95%可信区间1.04-1.12),梯度增强机和XGBoost一直是表现最好的方法之一。结论机器学习方法应用于结构化药物流行病学数据,与传统统计模型相比,在判别性能上表现出一致但适度的优势。这种优势在GBM和XGBoost等增强方法中最为明显。然而,建议在报告方法细节方面更加严格,以提高研究的理解、透明度和可重复性。普洛斯彼罗2023注册号:CRD42023426986。
{"title":"Do machine learning methods make better predictions than conventional ones in pharmacoepidemiology? A systematic review, meta-analysis, and network meta-analysis","authors":"Ana Paula Bruno Pena-Gralle ,&nbsp;Mireille E. Schnitzer ,&nbsp;Sofia-Nada Boureguaa ,&nbsp;Félix Morin ,&nbsp;Marc-André Legault ,&nbsp;Caroline Sirois ,&nbsp;Alice Dragomir ,&nbsp;Lucie Blais","doi":"10.1016/j.artmed.2025.103312","DOIUrl":"10.1016/j.artmed.2025.103312","url":null,"abstract":"<div><h3>Objective</h3><div>To synthesize existing evidence and compare the predictive performance of conventional statistical (CS) models versus machine learning (ML) methods in pharmacoepidemiology.</div></div><div><h3>Methods</h3><div>Medline, Embase, PsycINFO, CINAHL and Web of Science databases were systematically searched for predictive pharmacoepidemiologic studies published between January 2018 and September 2025. Independent reviewers extracted predictive metrics and other data from each study and assessed the quality of the comparison between methods. The relative performance of ML compared to CS was estimated for each prediction objective. Performance metrics were pooled in meta-analyses and Bayesian network meta-analyses (NMA).</div></div><div><h3>Results</h3><div>Among 9106 records identified, 65 studies met inclusion criteria, encompassing 83 prediction objectives. For 84 % of these objectives, CS was outperformed by at least one ML method. The median sample size across these studies was 2691 subjects (50 to 1,807,159), and, for binary outcomes, the median number of events per candidate predictor was 17.9 (range 0.28 to 24,260). We observed a risk of bias in the comparison according to at least one of eight major criteria for 39 prediction objectives (47 %).</div><div>The pooled area under the receiver-operator curve (AUC) ratio for the highest-performing ML method in studies with low risk of bias was estimated as 1.07 (95 % confidence interval 1.03–1.12) in favor of ML, but with very high heterogeneity. NMA of 197 comparisons estimated an AUC ratio of 1.07 (95 % credible interval 1.04–1.12) for boosted methods compared with logistic regression and ranked Gradient Boosting Machine and XGBoost consistently among the best-performing methods.</div></div><div><h3>Conclusion</h3><div>Machine learning methods applied to structured pharmacoepidemiologic data demonstrated a consistent yet modest advantage in discriminative performance relative to conventional statistical models. This advantage was most evident for boosted methods such as GBM and XGBoost. However, greater rigor in reporting methodological details is recommended to improve the comprehension, transparency, and reproducibility of studies.</div></div><div><h3>Registration</h3><div>PROSPERO 2023 registration number: CRD42023426986.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"171 ","pages":"Article 103312"},"PeriodicalIF":6.2,"publicationDate":"2025-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145624192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building trustworthy large language model-driven generative recommender system for healthcare decision support: A scoping review of corpus sources, customization techniques, and evaluation frameworks 为医疗保健决策支持构建可信赖的大型语言模型驱动的生成推荐系统:语料库来源、定制技术和评估框架的范围审查
IF 6.2 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-22 DOI: 10.1016/j.artmed.2025.103310
Shuqi Yang , Mingrui Jing , Shuai Wang , Zongan Huang , Jiaqing Wang , Jiaxin Kou , Manfei Shi , Zhentao Xia , Qipeng Wei , Weijie Xing , Yan Hu , Zheng Zhu

Introduction

Large Language Model-Driven Generative Recommender Systems (LLM-GRSs) are playing a growing role in healthcare, particularly in clinical question-answering. This study reviews their corpus sources, customization techniques, and evaluation metrics.

Methods

We conducted a systematic search of PubMed, Embase, Scopus, and Web of Science for studies published between January 2021 and August 2025 that applied LLM-GRSs to deliver medical or healthcare information. Eligible studies included publications describing LLMs designed to emulate clinical decision-making by providing diagnostic or therapeutic recommendations through dialogue-based interfaces. Two reviewers independently screened studies and extracted data on corpus sources, model architectures, customization methods, and evaluation metrics.

Results

A total of 61 articles were included. Corpus sources were grouped into clinical data (n = 25), literature (n = 34), open datasets (n = 37), and web-crawled data (n = 15), with many using multiple types. Most studies (n = 43) combined multiple approaches. Customization techniques included prompt engineering, retrieval-augmented generation and model fine-tuning. Twenty-four studies used a single customization technique, while 37 studies combined these methods during model development. The evaluation metrics were classified into three main domains: process metrics, usability metrics, and outcome metrics. The outcome metrics included both model-based and manual-assessed evaluations.

Conclusion

LLM-GRSs hold considerable promise in healthcare; however, their safety and reliability hinge on the use of evidence-based training corpora, transparent system design, and standardized evaluation protocols within real-world clinical environments.
大型语言模型驱动的生成推荐系统(LLM-GRSs)在医疗保健中发挥着越来越大的作用,特别是在临床问答方面。本研究回顾了它们的语料库来源、定制技术和评估指标。方法系统检索PubMed、Embase、Scopus和Web of Science,检索2021年1月至2025年8月期间发表的应用LLM-GRSs提供医疗保健信息的研究。符合条件的研究包括描述法学硕士通过基于对话的界面提供诊断或治疗建议来模拟临床决策的出版物。两位审稿人独立筛选研究并提取语料库来源、模型架构、定制方法和评估指标方面的数据。结果共纳入61篇文献。语料库来源分为临床数据(n = 25)、文献(n = 34)、开放数据集(n = 37)和网络抓取数据(n = 15),其中许多使用多种类型。大多数研究(n = 43)采用了多种方法。定制技术包括提示工程、检索增强生成和模型微调。24项研究使用了单一的定制技术,而37项研究在模型开发过程中结合了这些方法。评估度量被分为三个主要领域:过程度量、可用性度量和结果度量。结果指标包括基于模型和手工评估的评估。结论llm - grss在医疗卫生领域具有广阔的应用前景;然而,它们的安全性和可靠性取决于基于证据的训练语料库的使用、透明的系统设计和真实临床环境中的标准化评估方案。
{"title":"Building trustworthy large language model-driven generative recommender system for healthcare decision support: A scoping review of corpus sources, customization techniques, and evaluation frameworks","authors":"Shuqi Yang ,&nbsp;Mingrui Jing ,&nbsp;Shuai Wang ,&nbsp;Zongan Huang ,&nbsp;Jiaqing Wang ,&nbsp;Jiaxin Kou ,&nbsp;Manfei Shi ,&nbsp;Zhentao Xia ,&nbsp;Qipeng Wei ,&nbsp;Weijie Xing ,&nbsp;Yan Hu ,&nbsp;Zheng Zhu","doi":"10.1016/j.artmed.2025.103310","DOIUrl":"10.1016/j.artmed.2025.103310","url":null,"abstract":"<div><h3>Introduction</h3><div>Large Language Model-Driven Generative Recommender Systems (LLM-GRSs) are playing a growing role in healthcare, particularly in clinical question-answering. This study reviews their corpus sources, customization techniques, and evaluation metrics.</div></div><div><h3>Methods</h3><div>We conducted a systematic search of PubMed, Embase, Scopus, and Web of Science for studies published between January 2021 and August 2025 that applied LLM-GRSs to deliver medical or healthcare information. Eligible studies included publications describing LLMs designed to emulate clinical decision-making by providing diagnostic or therapeutic recommendations through dialogue-based interfaces. Two reviewers independently screened studies and extracted data on corpus sources, model architectures, customization methods, and evaluation metrics.</div></div><div><h3>Results</h3><div>A total of 61 articles were included. Corpus sources were grouped into clinical data (n = 25), literature (n = 34), open datasets (n = 37), and web-crawled data (n = 15), with many using multiple types. Most studies (n = 43) combined multiple approaches. Customization techniques included prompt engineering, retrieval-augmented generation and model fine-tuning. Twenty-four studies used a single customization technique, while 37 studies combined these methods during model development. The evaluation metrics were classified into three main domains: process metrics, usability metrics, and outcome metrics. The outcome metrics included both model-based and manual-assessed evaluations.</div></div><div><h3>Conclusion</h3><div>LLM-GRSs hold considerable promise in healthcare; however, their safety and reliability hinge on the use of evidence-based training corpora, transparent system design, and standardized evaluation protocols within real-world clinical environments.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"171 ","pages":"Article 103310"},"PeriodicalIF":6.2,"publicationDate":"2025-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145624203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CA-OCL and CHAN: A novel diagnostic framework for rheumatoid arthritis integrating contradiction-aware orthogonal contrastive learning with confidence-guided hierarchical attention CA-OCL和CHAN:结合矛盾意识正交对比学习和自信引导的层次注意的类风湿关节炎诊断框架。
IF 6.2 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-17 DOI: 10.1016/j.artmed.2025.103307
Zhao Huang , QingMei Zeng , NanNan Gai
Rheumatoid arthritis (RA) is a chronic systemic autoimmune disorder characterized by progressive destruction of synovial joints, for which precise early diagnosis is critical to effective clinical management. Although current computer-aided diagnosis (CAD) systems show promising potential, their practical deployment remains challenged by heterogeneity in multimodal data and the inherent complexity of pathophysiological manifestations. A key limitation of existing approaches is the absence of dedicated mechanisms to reconcile inter-modal conflicts and to adequately leverage the diagnostic information present in contradictory samples, such as discordances between laboratory findings and clinical descriptions, which often leads to suboptimal diagnostic performance. To overcome these challenges, this study introduces a novel auxiliary diagnostic framework for RA based on Contradiction-Aware Orthogonal Contrastive Learning (CA-OCL) and a Confidence-guided Hierarchical Attention Network (CHAN). The proposed architecture incorporates three major innovations. First, in the data processing stage, a lightweight feature-crossing network is employed to derive robust representations from structured data, while a multi-task adapted extension of the BERT model is utilized to extract rich semantic features from unstructured textual inputs. Second, the CA-OCL module is designed to explicitly identify and learn from contradictory negative samples—a capability largely absent in conventional contrastive learning frameworks. Additionally, orthogonal constraints are applied to minimize feature redundancy across modalities, thereby preserving discriminative modality-specific information that is often obscured by methods promoting excessive feature alignment. Finally, the CHAN module dynamically modulates inter-modal contributions using confidence estimates, mitigating the risk of unilateral dominance by any single modality, a common drawback in attention-based fusion mechanisms and facilitating refined integration of conflicting information. Comprehensive experimental evaluations demonstrate that the proposed framework achieves superior performance compared to state-of-the-art methods. These results not only validate the efficacy of our approach in handling multimodal conflicts and exploiting contradictory evidence, but also highlight its significant clinical utility through effective multimodal data integration. This work addresses critical limitations in conventional CAD systems and provides an advanced paradigm for intelligent diagnostic assessment of RA.
类风湿性关节炎(RA)是一种以滑膜关节进行性破坏为特征的慢性系统性自身免疫性疾病,准确的早期诊断对有效的临床治疗至关重要。尽管当前的计算机辅助诊断(CAD)系统显示出良好的潜力,但其实际部署仍然受到多模态数据异质性和病理生理表现固有复杂性的挑战。现有方法的一个关键限制是缺乏专门的机制来调和模式间冲突,并充分利用存在于矛盾样本中的诊断信息,例如实验室发现与临床描述之间的不一致,这通常导致次优诊断性能。为了克服这些挑战,本研究引入了一种基于矛盾感知正交对比学习(CA-OCL)和信心引导层次注意网络(CHAN)的RA辅助诊断框架。提出的体系结构包含三个主要的创新。首先,在数据处理阶段,采用轻量级特征交叉网络从结构化数据中提取鲁棒表征,同时利用BERT模型的多任务自适应扩展从非结构化文本输入中提取丰富的语义特征。其次,CA-OCL模块被设计为明确地识别和从矛盾的负样本中学习——这是传统对比学习框架中基本缺乏的能力。此外,正交约束应用于最小化模态之间的特征冗余,从而保留了鉴别的特定于模态的信息,这些信息通常被促进过度特征对齐的方法所掩盖。最后,CHAN模块使用置信度估计动态调节多模式贡献,减轻任何单一模式单方面主导的风险,这是基于注意力的融合机制的共同缺点,并促进冲突信息的精细集成。综合实验评估表明,与最先进的方法相比,所提出的框架具有优越的性能。这些结果不仅验证了我们的方法在处理多模态冲突和利用矛盾证据方面的有效性,而且通过有效的多模态数据整合突出了其重要的临床实用性。这项工作解决了传统CAD系统的关键局限性,并为RA的智能诊断评估提供了一个先进的范例。
{"title":"CA-OCL and CHAN: A novel diagnostic framework for rheumatoid arthritis integrating contradiction-aware orthogonal contrastive learning with confidence-guided hierarchical attention","authors":"Zhao Huang ,&nbsp;QingMei Zeng ,&nbsp;NanNan Gai","doi":"10.1016/j.artmed.2025.103307","DOIUrl":"10.1016/j.artmed.2025.103307","url":null,"abstract":"<div><div>Rheumatoid arthritis (RA) is a chronic systemic autoimmune disorder characterized by progressive destruction of synovial joints, for which precise early diagnosis is critical to effective clinical management. Although current computer-aided diagnosis (CAD) systems show promising potential, their practical deployment remains challenged by heterogeneity in multimodal data and the inherent complexity of pathophysiological manifestations. A key limitation of existing approaches is the absence of dedicated mechanisms to reconcile inter-modal conflicts and to adequately leverage the diagnostic information present in contradictory samples, such as discordances between laboratory findings and clinical descriptions, which often leads to suboptimal diagnostic performance. To overcome these challenges, this study introduces a novel auxiliary diagnostic framework for RA based on Contradiction-Aware Orthogonal Contrastive Learning (CA-OCL) and a Confidence-guided Hierarchical Attention Network (CHAN). The proposed architecture incorporates three major innovations. First, in the data processing stage, a lightweight feature-crossing network is employed to derive robust representations from structured data, while a multi-task adapted extension of the BERT model is utilized to extract rich semantic features from unstructured textual inputs. Second, the CA-OCL module is designed to explicitly identify and learn from contradictory negative samples—a capability largely absent in conventional contrastive learning frameworks. Additionally, orthogonal constraints are applied to minimize feature redundancy across modalities, thereby preserving discriminative modality-specific information that is often obscured by methods promoting excessive feature alignment. Finally, the CHAN module dynamically modulates inter-modal contributions using confidence estimates, mitigating the risk of unilateral dominance by any single modality, a common drawback in attention-based fusion mechanisms and facilitating refined integration of conflicting information. Comprehensive experimental evaluations demonstrate that the proposed framework achieves superior performance compared to state-of-the-art methods. These results not only validate the efficacy of our approach in handling multimodal conflicts and exploiting contradictory evidence, but also highlight its significant clinical utility through effective multimodal data integration. This work addresses critical limitations in conventional CAD systems and provides an advanced paradigm for intelligent diagnostic assessment of RA.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"171 ","pages":"Article 103307"},"PeriodicalIF":6.2,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence in 4D flow MRI: Review of technological aspects and clinical applications 人工智能在4D血流MRI中的应用:技术方面和临床应用综述
IF 6.2 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-17 DOI: 10.1016/j.artmed.2025.103308
Jiwon Pung , Gyu-Han Lee , Hyungkyu Huh , Dong Hyun Yang , Hojin Ha

Background

Four-dimensional (4D) flow magnetic resonance imaging (MRI) has evolved into an advanced non-invasive imaging technique that enables comprehensive assessment of blood flow in cardiovascular system. Hemodynamic data from 4D flow MRI provide key biomarkers such as wall shear stress (WSS), turbulent kinetic energy (TKE), and viscous energy loss, which aid in the analysis of cardiovascular diseases. However, its clinical application is limited by challenges such as prolonged scan times and limited spatiotemporal resolution. Conventional algorithms have been developed to automate the reconstruction of sparse data and perform image segmentation for hemodynamic quantification, but these methods are often time-consuming and require user expertise for accurate postprocessing. To address these limitations, artificial intelligence (AI), particularly deep learning (DL) techniques, has been introduced. DL models have shown promise in accelerating scan times by reconstructing sparsely sampled data into fully sampled datasets and enhancing image resolution by combining computational fluid dynamics (CFD) with 4D flow MRI data, as well as improving data quality through noise reduction. In addition, automated segmentation techniques have been developed to reduce user intervention, enabling more consistent and efficient analysis. Many researchers are working on DL-based approaches to 4D flow MRI using limited datasets. Recently, the lack of systematic methodologies has made it difficult to identify appropriate approaches. This paper aims to provide a comprehensive review of the latest AI applications using 4D flow MRI, enabling researchers to access and evaluate existing research more effectively.

Conclusions

This review explores the latest research on integrating AI with 4D flow MRI, covering the entire process from data acquisition to postprocessing, and emphasizes the need for novel AI-based techniques to enhance clinical applicability. Furthermore, this review suggests it can serve as a foundation for developing innovative strategies toward fully automated AI-based approaches.
四维(4D)血流磁共振成像(MRI)已经发展成为一种先进的无创成像技术,可以全面评估心血管系统的血流。来自4D血流MRI的血流动力学数据提供了关键的生物标志物,如壁剪切应力(WSS)、湍流动能(TKE)和粘性能量损失,有助于心血管疾病的分析。然而,它的临床应用受到诸如长时间扫描和有限的时空分辨率等挑战的限制。传统的算法已经被开发用于自动重建稀疏数据和执行血流动力学量化的图像分割,但这些方法通常是耗时的,并且需要用户的专业知识来进行准确的后处理。为了解决这些限制,人工智能(AI),特别是深度学习(DL)技术已经被引入。DL模型通过将稀疏采样的数据重建为完全采样的数据集来加快扫描时间,通过将计算流体动力学(CFD)与四维流MRI数据相结合来提高图像分辨率,以及通过降低噪声来提高数据质量。此外,自动分割技术已经开发,以减少用户的干预,使更一致和有效的分析。许多研究人员正在使用有限的数据集研究基于dl的4D血流MRI方法。最近,由于缺乏系统的方法,很难确定适当的方法。本文旨在全面回顾使用4D流MRI的最新人工智能应用,使研究人员能够更有效地访问和评估现有研究。本文综述了人工智能与四维血流MRI结合的最新研究,涵盖了从数据采集到后处理的整个过程,并强调需要新的人工智能技术来提高临床适用性。此外,这篇综述表明,它可以作为开发基于全自动人工智能方法的创新策略的基础。
{"title":"Artificial intelligence in 4D flow MRI: Review of technological aspects and clinical applications","authors":"Jiwon Pung ,&nbsp;Gyu-Han Lee ,&nbsp;Hyungkyu Huh ,&nbsp;Dong Hyun Yang ,&nbsp;Hojin Ha","doi":"10.1016/j.artmed.2025.103308","DOIUrl":"10.1016/j.artmed.2025.103308","url":null,"abstract":"<div><h3>Background</h3><div>Four-dimensional (4D) flow magnetic resonance imaging (MRI) has evolved into an advanced non-invasive imaging technique that enables comprehensive assessment of blood flow in cardiovascular system. Hemodynamic data from 4D flow MRI provide key biomarkers such as wall shear stress (WSS), turbulent kinetic energy (TKE), and viscous energy loss, which aid in the analysis of cardiovascular diseases. However, its clinical application is limited by challenges such as prolonged scan times and limited spatiotemporal resolution. Conventional algorithms have been developed to automate the reconstruction of sparse data and perform image segmentation for hemodynamic quantification, but these methods are often time-consuming and require user expertise for accurate postprocessing. To address these limitations, artificial intelligence (AI), particularly deep learning (DL) techniques, has been introduced. DL models have shown promise in accelerating scan times by reconstructing sparsely sampled data into fully sampled datasets and enhancing image resolution by combining computational fluid dynamics (CFD) with 4D flow MRI data, as well as improving data quality through noise reduction. In addition, automated segmentation techniques have been developed to reduce user intervention, enabling more consistent and efficient analysis. Many researchers are working on DL-based approaches to 4D flow MRI using limited datasets. Recently, the lack of systematic methodologies has made it difficult to identify appropriate approaches. This paper aims to provide a comprehensive review of the latest AI applications using 4D flow MRI, enabling researchers to access and evaluate existing research more effectively.</div></div><div><h3>Conclusions</h3><div>This review explores the latest research on integrating AI with 4D flow MRI, covering the entire process from data acquisition to postprocessing, and emphasizes the need for novel AI-based techniques to enhance clinical applicability. Furthermore, this review suggests it can serve as a foundation for developing innovative strategies toward fully automated AI-based approaches.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"171 ","pages":"Article 103308"},"PeriodicalIF":6.2,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145579410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weakly-supervised ultrasound image segmentation with elliptical shape prior constraint 基于椭圆形状先验约束的弱监督超声图像分割。
IF 6.2 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-15 DOI: 10.1016/j.artmed.2025.103306
Changyan Wang , Yehua Cai , Ruyi Yang , Haobo Chen , Jiang Shang , Hong Ding , Qi Zhang
Accurate pixel-level segmentation of ultrasound (US) images is vital for computer-aided disease screening, diagnosis, and treatment response evaluation. The weakly supervised methods have the potential to reduce the time-consuming and labor-intensive workload for radiologists, paving the way for further automation in the quantitative analysis of US images. Among these methods, the multiple instance learning (MIL) has proven effective and is often applied to prediction tasks with insufficiently labeled data. In US examinations, the elliptical region formed by intersecting lines used by radiologists for target annotation serves as a crucial prior information. Therefore, we propose a novel weakly supervised method called elliptical shape prior constraint MIL (ESPC-MIL) for pixel-level segmentation of US images. ESPC-MIL incorporates an elliptical shape prior constraint into the MIL framework, delivering more accurate foreground and background candidate regions for MIL, which enhances its predictive performance for tissues and organs with approximately elliptical shapes. Furthermore, the method utilizes elliptical shape prior information for global supervision, improving edge segmentation and localization accuracy. Compared to other weakly supervised methods, ESPC-MIL achieves state-of-the-art results on four US image datasets: Achilles tendon dataset, median nerve dataset, private breast tumor dataset, and public breast ultrasound image dataset, with Dice similarity coefficients of 0.855, 0.849, 0.876, and 0.748, respectively. It demonstrates performance comparable to fully supervised segmentation methods while significantly reducing annotation requirements. Notably, the method demonstrates a more significant performance improvement in segmenting objects with approximately elliptical shapes compared to those with complex shapes. Source codes and models are available at https://github.com/CYWang-kayla/ESPC-MIL-Model.
超声(US)图像的精确像素级分割对于计算机辅助疾病筛查、诊断和治疗反应评估至关重要。弱监督方法有可能减少放射科医生耗时和劳动密集型的工作量,为美国图像定量分析的进一步自动化铺平道路。在这些方法中,多实例学习(MIL)已被证明是有效的,通常用于标记数据不足的预测任务。在美国检查中,放射科医生用于目标注释的相交线形成的椭圆区域作为关键的先验信息。因此,我们提出了一种新的弱监督方法,称为椭圆形状先验约束MIL (ESPC-MIL),用于美国图像的像素级分割。ESPC-MIL在MIL框架中引入了椭圆形状的先验约束,为MIL提供了更精确的前景和背景候选区域,提高了其对近似椭圆形状的组织和器官的预测性能。此外,该方法利用椭圆形状先验信息进行全局监督,提高了边缘分割和定位精度。与其他弱监督方法相比,ESPC-MIL在4个美国图像数据集(跟腱数据集、中神经数据集、乳房肿瘤数据集和公共乳房超声图像数据集)上取得了最先进的结果,Dice相似系数分别为0.855、0.849、0.876和0.748。它展示了与完全监督分割方法相当的性能,同时显着减少了注释需求。值得注意的是,与复杂形状的物体相比,该方法在分割近似椭圆形状的物体时表现出更显著的性能改进。源代码和模型可在https://github.com/CYWang-kayla/ESPC-MIL-Model上获得。
{"title":"Weakly-supervised ultrasound image segmentation with elliptical shape prior constraint","authors":"Changyan Wang ,&nbsp;Yehua Cai ,&nbsp;Ruyi Yang ,&nbsp;Haobo Chen ,&nbsp;Jiang Shang ,&nbsp;Hong Ding ,&nbsp;Qi Zhang","doi":"10.1016/j.artmed.2025.103306","DOIUrl":"10.1016/j.artmed.2025.103306","url":null,"abstract":"<div><div>Accurate pixel-level segmentation of ultrasound (US) images is vital for computer-aided disease screening, diagnosis, and treatment response evaluation. The weakly supervised methods have the potential to reduce the time-consuming and labor-intensive workload for radiologists, paving the way for further automation in the quantitative analysis of US images. Among these methods, the multiple instance learning (MIL) has proven effective and is often applied to prediction tasks with insufficiently labeled data. In US examinations, the elliptical region formed by intersecting lines used by radiologists for target annotation serves as a crucial prior information. Therefore, we propose a novel weakly supervised method called elliptical shape prior constraint MIL (ESPC-MIL) for pixel-level segmentation of US images. ESPC-MIL incorporates an elliptical shape prior constraint into the MIL framework, delivering more accurate foreground and background candidate regions for MIL, which enhances its predictive performance for tissues and organs with approximately elliptical shapes. Furthermore, the method utilizes elliptical shape prior information for global supervision, improving edge segmentation and localization accuracy. Compared to other weakly supervised methods, ESPC-MIL achieves state-of-the-art results on four US image datasets: Achilles tendon dataset, median nerve dataset, private breast tumor dataset, and public breast ultrasound image dataset, with Dice similarity coefficients of 0.855, 0.849, 0.876, and 0.748, respectively. It demonstrates performance comparable to fully supervised segmentation methods while significantly reducing annotation requirements. Notably, the method demonstrates a more significant performance improvement in segmenting objects with approximately elliptical shapes compared to those with complex shapes. Source codes and models are available at <span><span>https://github.com/CYWang-kayla/ESPC-MIL-Model</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"171 ","pages":"Article 103306"},"PeriodicalIF":6.2,"publicationDate":"2025-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145558119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deformable phrase level attention: A flexible approach for improving AI based medical coding 可变形短语级注意力:一种改进基于AI的医学编码的灵活方法。
IF 6.2 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-15 DOI: 10.1016/j.artmed.2025.103299
Christoph Metzner , Shang Gao , Drahomira Herrmannova , John Gounley , Heidi A. Hanson

Objective:

Improving the AI-driven automated medical encoding of clinical text plays a vital role in gathering information on the occurrence of diseases to improve population-level health. This work presents a novel attention mechanism designed to enhance text classification models and ensure appropriate classification of medical concepts in unstructured electronic health records.

Materials and Methods:

We developed a deformable, phrase-level attention mechanism to identify important lexical word-level and contextual phrase-level information from clinical text documents. We evaluated conventional and transformer-based deep learning models that we extended with our attention mechanism on the extraction of critical cancer information (e.g., site, subsite, laterality, histology, behavior) from 629,908 electronic pathology reports and on the automated medical encoding of 52,722 hospital discharge summaries.

Results:

Transformer-based models with the deformable, phrase-level attention mechanism achieved the best performance on the extraction of critical cancer information from pathology reports. Conventional- and transformer-based models show similar or better performance than their baseline counterparts on the automated medical encoding of clinical documents.

Discussion:

The addition of phrase-level information allowed models extended with our proposed method to outperform standard word-level attention. Our method showed favorable properties for the real-world application in terms of model robustness and phenotyping. These results indicate that our method is promising for automated data harmonization for common data models.

Conclusion:

This work proposes a novel deformable, phrase-level attention mechanism that enhances text classification models in the extraction of medical concepts from clinical text documents. We demonstrate strong performances on two clinical text datasets and showcase real-world deployability of our method.
目的:改进人工智能驱动的临床文本自动医学编码,对收集疾病发生信息,提高人群健康水平具有重要意义。这项工作提出了一种新的注意力机制,旨在增强文本分类模型,并确保在非结构化电子健康记录中对医学概念进行适当分类。材料和方法:我们开发了一种可变形的短语级注意机制,用于从临床文本文档中识别重要的词汇级、词级和上下文级短语级信息。我们评估了传统的和基于变压器的深度学习模型,我们扩展了我们的注意力机制,从629,908份电子病理报告和52,722份医院出院摘要的自动医疗编码中提取关键的癌症信息(例如,部位,子部位,侧边性,组织学,行为)。结果:基于变压器的模型具有可变形的、短语级关注机制,在从病理报告中提取关键癌症信息方面表现最佳。传统和基于变压器的模型在临床文件的自动医学编码方面表现出与基线相似或更好的性能。讨论:短语级信息的添加允许使用我们提出的方法扩展的模型优于标准单词级注意力。我们的方法在模型稳健性和表型方面显示了对现实世界应用的有利特性。这些结果表明,我们的方法有望实现通用数据模型的自动数据协调。结论:本文提出了一种新的可变形的短语级注意机制,增强了从临床文本文档中提取医学概念的文本分类模型。我们在两个临床文本数据集上展示了强大的性能,并展示了我们的方法在现实世界中的可部署性。
{"title":"Deformable phrase level attention: A flexible approach for improving AI based medical coding","authors":"Christoph Metzner ,&nbsp;Shang Gao ,&nbsp;Drahomira Herrmannova ,&nbsp;John Gounley ,&nbsp;Heidi A. Hanson","doi":"10.1016/j.artmed.2025.103299","DOIUrl":"10.1016/j.artmed.2025.103299","url":null,"abstract":"<div><h3>Objective:</h3><div>Improving the AI-driven automated medical encoding of clinical text plays a vital role in gathering information on the occurrence of diseases to improve population-level health. This work presents a novel attention mechanism designed to enhance text classification models and ensure appropriate classification of medical concepts in unstructured electronic health records.</div></div><div><h3>Materials and Methods:</h3><div>We developed a deformable, phrase-level attention mechanism to identify important lexical word-level and contextual phrase-level information from clinical text documents. We evaluated conventional and transformer-based deep learning models that we extended with our attention mechanism on the extraction of critical cancer information (e.g., site, subsite, laterality, histology, behavior) from 629,908 electronic pathology reports and on the automated medical encoding of 52,722 hospital discharge summaries.</div></div><div><h3>Results:</h3><div>Transformer-based models with the deformable, phrase-level attention mechanism achieved the best performance on the extraction of critical cancer information from pathology reports. Conventional- and transformer-based models show similar or better performance than their baseline counterparts on the automated medical encoding of clinical documents.</div></div><div><h3>Discussion:</h3><div>The addition of phrase-level information allowed models extended with our proposed method to outperform standard word-level attention. Our method showed favorable properties for the real-world application in terms of model robustness and phenotyping. These results indicate that our method is promising for automated data harmonization for common data models.</div></div><div><h3>Conclusion:</h3><div>This work proposes a novel deformable, phrase-level attention mechanism that enhances text classification models in the extraction of medical concepts from clinical text documents. We demonstrate strong performances on two clinical text datasets and showcase real-world deployability of our method.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"171 ","pages":"Article 103299"},"PeriodicalIF":6.2,"publicationDate":"2025-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145558160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A deep representation learning algorithm on drug-target interaction to screen novel drug candidates for Alzheimer's disease 基于药物-靶标相互作用的深度表征学习算法筛选阿尔茨海默病新候选药物。
IF 6.2 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-13 DOI: 10.1016/j.artmed.2025.103301
Xin Yuan , Lihui Gao , Yonglin Peng , Tiantian She , Ju Wang
Alzheimer's disease (AD) is a serious neurodegenerative brain disorder with complex pathophysiology. While currently available drugs can provide symptomatic benefits, they often fail to cure the disease. Thus, there is an urgent need to explore new therapeutic agents. In this study, we developed DTIP (Drug-Target Interaction Prediction), a machine learning-based approach to search novel drugs for AD by utilizing the information of drug-target interaction (DTI). By training a Skip-gram model on drug-target sequences derived from known DTI information, the algorithm learned the drug-target relationship embeddings and to predict potential drug candidates for diseases like AD. For AD, we compiled 917 risk genes and identified 292 potential drugs via the new algorithm. We further performed molecular docking by AutoDock Vina and conducted Inverted Gene Set Enrichment Analysis (IGSEA) on these drug candidates. Our results identified that several drugs could be promising for AD treatment, including human C1-esterase inhibitor, quetiapine, dasatinib, miconazole, aniracetam, chlorpromazine, hypericin, entrectinib, torcetrapib, bosutinib, sunitinib, aniracetam, rosiglitazone, tarenflurbil, milrinone, and MITO-4509. Results from this study also provided insights for understanding the molecular mechanisms underlying AD. As a systematic and versatile method, our approach can also be applied to identify efficacious therapies for other complex diseases.
阿尔茨海默病(AD)是一种严重的神经退行性脑疾病,具有复杂的病理生理。虽然目前可用的药物可以提供对症治疗,但它们往往不能治愈这种疾病。因此,迫切需要探索新的治疗药物。在这项研究中,我们开发了DTIP (Drug-Target Interaction Prediction,药物-靶标相互作用预测),这是一种基于机器学习的方法,利用药物-靶标相互作用(DTI)的信息来搜索AD的新药。该算法通过对已知DTI信息衍生的药物-靶标序列进行Skip-gram模型训练,学习药物-靶标关系嵌入,预测AD等疾病的潜在候选药物。对于AD,我们通过新的算法编译了917个风险基因,并鉴定了292种潜在药物。我们进一步通过AutoDock Vina进行分子对接,并对这些候选药物进行了倒置基因集富集分析(IGSEA)。我们的研究结果表明,几种药物有望治疗阿尔茨海默病,包括人c1酯酶抑制剂、喹硫平、达沙替尼、咪康唑、阿尼西坦、氯丙嗪、金丝桃素、enterrectinib、torcetrapib、bosutinib、舒尼替尼、阿尼西坦、罗格列酮、他伦氟比尔、米立酮和MITO-4509。这项研究的结果也为理解AD的分子机制提供了见解。作为一种系统和通用的方法,我们的方法也可以应用于其他复杂疾病的有效治疗。
{"title":"A deep representation learning algorithm on drug-target interaction to screen novel drug candidates for Alzheimer's disease","authors":"Xin Yuan ,&nbsp;Lihui Gao ,&nbsp;Yonglin Peng ,&nbsp;Tiantian She ,&nbsp;Ju Wang","doi":"10.1016/j.artmed.2025.103301","DOIUrl":"10.1016/j.artmed.2025.103301","url":null,"abstract":"<div><div>Alzheimer's disease (AD) is a serious neurodegenerative brain disorder with complex pathophysiology. While currently available drugs can provide symptomatic benefits, they often fail to cure the disease. Thus, there is an urgent need to explore new therapeutic agents. In this study, we developed DTIP (Drug-Target Interaction Prediction), a machine learning-based approach to search novel drugs for AD by utilizing the information of drug-target interaction (DTI). By training a Skip-gram model on drug-target sequences derived from known DTI information, the algorithm learned the drug-target relationship embeddings and to predict potential drug candidates for diseases like AD. For AD, we compiled 917 risk genes and identified 292 potential drugs via the new algorithm. We further performed molecular docking by AutoDock Vina and conducted Inverted Gene Set Enrichment Analysis (IGSEA) on these drug candidates. Our results identified that several drugs could be promising for AD treatment, including human C1-esterase inhibitor, quetiapine, dasatinib, miconazole, aniracetam, chlorpromazine, hypericin, entrectinib, torcetrapib, bosutinib, sunitinib, aniracetam, rosiglitazone, tarenflurbil, milrinone, and MITO-4509. Results from this study also provided insights for understanding the molecular mechanisms underlying AD. As a systematic and versatile method, our approach can also be applied to identify efficacious therapies for other complex diseases.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"171 ","pages":"Article 103301"},"PeriodicalIF":6.2,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence in Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1