首页 > 最新文献

International Journal of Medical Informatics最新文献

英文 中文
Advancing healthcare with large language models: A scoping review of applications and future directions 使用大型语言模型推进医疗保健:对应用程序和未来方向的范围审查。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-03-15 Epub Date: 2025-12-22 DOI: 10.1016/j.ijmedinf.2025.106231
Zhihong Zhang , Mohamad Javad Momeni Nezhad , Seyed Mohammad Bagher Hosseini , Ali Zolnour , Zahra Zonour , Seyedeh Mahdis Hosseini , Maxim Topaz , Maryam Zolnoori

Background

The release of ChatGPT has spurred the widespread adoption of generative large language models (LLMs) in healthcare. This scoping review systematically examines their use in healthcare.

Methods

A systematic search was conducted using PubMed, a comprehensive and representative database on biomedical and health science, to identify studies published between January 1, 2023, and July 30, 2024. Studies were included if they assessed the performance of generative LLMs in healthcare applications; review or perspective articles were excluded.

Results

A total of 415 studies were included, with a significant increase in publications observed after April 2023. Generative LLMs were applied across various medical specialties, primarily supporting clinical decision-making (26.7%) and providing patient information (23.9%). Smaller proportions were focused on professional education and training (18.1%), research (16.1%), and workflow support (12.5%). These applications were mainly supported by three key NLP tasks: question answering (36.1%), text classification (27.5%), and text generation (26.3%). Public datasets appeared in 20% of studies, and 15% used clinical patient data. Of the 98 LLMs used, GPT-4 (51.3%), GPT-3.5 (36.6%), and ChatGPT (22.4%) were the most common. Direct prompting was the most common adaptation method (92.5%), with reinforcement learning rarely utilized (1.4%). Accuracy was the most frequently assessed metric, while errors and safety (9.4%) and time efficiency (7.0%) were less commonly evaluated.

Conclusion

LLMs hold promise across healthcare applications. Expanding their use in workflow optimization, trainee education, and research tools could enhance healthcare delivery and innovation. Comprehensive evaluation using standardized criteria is essential for LLMs integration into healthcare.
背景:ChatGPT的发布促进了生成式大型语言模型(llm)在医疗保健领域的广泛采用。本综述系统地考察了它们在医疗保健中的应用。方法:系统检索具有代表性的综合性生物医学与健康科学数据库PubMed,检索2023年1月1日至2024年7月30日期间发表的研究。如果研究评估生成法学硕士在医疗保健应用中的表现,则纳入研究;综述或透视文章被排除在外。结果:共纳入415项研究,在2023年4月之后观察到的出版物显著增加。生成法学硕士应用于不同的医学专业,主要是支持临床决策(26.7%)和提供患者信息(23.9%)。较小的比例集中在专业教育和培训(18.1%),研究(16.1%)和工作流程支持(12.5%)。这些应用主要由三个关键的NLP任务支持:问答(36.1%)、文本分类(27.5%)和文本生成(26.3%)。20%的研究使用了公共数据集,15%的研究使用了临床患者数据。在使用的98个llm中,最常见的是GPT-4(51.3%)、GPT-3.5(36.6%)和ChatGPT(22.4%)。直接提示是最常见的适应方法(92.5%),强化学习很少使用(1.4%)。准确性是最常被评估的指标,而错误和安全性(9.4%)以及时间效率(7.0%)则不常被评估。结论:llm在医疗保健应用中具有前景。扩大它们在工作流程优化、培训生教育和研究工具中的应用,可以增强医疗保健服务和创新。使用标准化标准的综合评估对于法学硕士融入医疗保健至关重要。
{"title":"Advancing healthcare with large language models: A scoping review of applications and future directions","authors":"Zhihong Zhang ,&nbsp;Mohamad Javad Momeni Nezhad ,&nbsp;Seyed Mohammad Bagher Hosseini ,&nbsp;Ali Zolnour ,&nbsp;Zahra Zonour ,&nbsp;Seyedeh Mahdis Hosseini ,&nbsp;Maxim Topaz ,&nbsp;Maryam Zolnoori","doi":"10.1016/j.ijmedinf.2025.106231","DOIUrl":"10.1016/j.ijmedinf.2025.106231","url":null,"abstract":"<div><h3>Background</h3><div>The release of ChatGPT has spurred the widespread adoption of generative large language models (LLMs) in healthcare. This scoping review systematically examines their use in healthcare.</div></div><div><h3>Methods</h3><div>A systematic search was conducted using PubMed, a comprehensive and representative database on biomedical and health science, to identify studies published between January 1, 2023, and July 30, 2024. Studies were included if they assessed the performance of generative LLMs in healthcare applications; review or perspective articles were excluded.</div></div><div><h3>Results</h3><div>A total of 415 studies were included, with a significant increase in publications observed after April 2023. Generative LLMs were applied across various medical specialties, primarily supporting clinical decision-making (26.7%) and providing patient information (23.9%). Smaller proportions were focused on professional education and training (18.1%), research (16.1%), and workflow support (12.5%). These applications were mainly supported by three key NLP tasks: question answering (36.1%), text classification (27.5%), and text generation (26.3%). Public datasets appeared in 20% of studies, and 15% used clinical patient data. Of the 98 LLMs used, GPT-4 (51.3%), GPT-3.5 (36.6%), and ChatGPT (22.4%) were the most common. Direct prompting was the most common adaptation method (92.5%), with reinforcement learning rarely utilized (1.4%). Accuracy was the most frequently assessed metric, while errors and safety (9.4%) and time efficiency (7.0%) were less commonly evaluated.</div></div><div><h3>Conclusion</h3><div>LLMs hold promise across healthcare applications. Expanding their use in workflow optimization, trainee education, and research tools could enhance healthcare delivery and innovation. Comprehensive evaluation using standardized criteria is essential for LLMs integration into healthcare.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106231"},"PeriodicalIF":4.1,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145829162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Commentary on “Towards practical federated learning and evaluation for medical prediction models” 对“医学预测模型的实用联合学习和评估”的评论。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-03-15 Epub Date: 2025-12-28 DOI: 10.1016/j.ijmedinf.2025.106249
Wen-Jiang Yang
{"title":"Commentary on “Towards practical federated learning and evaluation for medical prediction models”","authors":"Wen-Jiang Yang","doi":"10.1016/j.ijmedinf.2025.106249","DOIUrl":"10.1016/j.ijmedinf.2025.106249","url":null,"abstract":"","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106249"},"PeriodicalIF":4.1,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145879347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Limitations of SHAP-based interpretability in sepsis progression models and paths to more robust feature validation 基于shap的脓毒症进展模型可解释性的局限性和更稳健的特征验证途径
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-03-15 Epub Date: 2025-12-24 DOI: 10.1016/j.ijmedinf.2025.106238
Yuto Arai , Yoshiyasu Takefuji
{"title":"Limitations of SHAP-based interpretability in sepsis progression models and paths to more robust feature validation","authors":"Yuto Arai ,&nbsp;Yoshiyasu Takefuji","doi":"10.1016/j.ijmedinf.2025.106238","DOIUrl":"10.1016/j.ijmedinf.2025.106238","url":null,"abstract":"","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106238"},"PeriodicalIF":4.1,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comment on “Medication-based mortality prediction in COPD using machine learning and conventional statistical methods” 对“利用机器学习和传统统计方法预测COPD药物死亡率”的评论
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-03-15 Epub Date: 2025-12-19 DOI: 10.1016/j.ijmedinf.2025.106228
Siyi Liu , Zekai Yu
{"title":"Comment on “Medication-based mortality prediction in COPD using machine learning and conventional statistical methods”","authors":"Siyi Liu ,&nbsp;Zekai Yu","doi":"10.1016/j.ijmedinf.2025.106228","DOIUrl":"10.1016/j.ijmedinf.2025.106228","url":null,"abstract":"","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106228"},"PeriodicalIF":4.1,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145799805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient medical NER with limited data: Enhancing LLM performance through annotation guidelines 有限数据的高效医疗NER:通过注释指南增强LLM性能。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-03-15 Epub Date: 2025-12-18 DOI: 10.1016/j.ijmedinf.2025.106230
Emiko Shinohara, Yoshimasa Kawazoe

Background

Named entity recognition (NER) is critical in natural language processing (NLP), particularly in the medical field, where accurate identification of entities, such as patient information and clinical events, is essential. Traditional NER approaches rely heavily on large, annotated corpora, which are resource intensive. Large language models (LLMs) offer new NER approaches, particularly through in-context and few-shot learning.

Objective

This study investigates the effects of incorporating annotation guidelines into prompts for NER via LLMs, with a specific focus on their impact on few-shot learning performance across various medical corpora.

Methods

We designed eight different prompt patterns, combining few-shot examples with annotation guidelines of varying complexity, and evaluated their performance via three prominent LLMs: GPT-4o, Claude 3.5 Sonnet, and gpt-oss-120b. Additionally, we employed three diverse medical corpora: i2b2-2014, i2b2-2012, and MedTxt-CR. Accuracy was assessed via precision, recall, and the F1 score, with evaluation methods aligned with those used in relevant shared tasks to ensure the comparability of the results.

Results

Our findings indicate that adding detailed annotation guidelines to few-shot prompts improves the recall and F1 score in most cases.

Conclusion

Including annotation guidelines in prompts enhances the performance of LLMs in NER tasks, making this a practical approach for developing accurate NLP systems in resource-constrained environments. Although annotation guidelines are essential for evaluation and example creation, their integration into LLM prompts can further optimize few-shot learning, especially within specialized domains such as medical NLP.
背景:命名实体识别(NER)在自然语言处理(NLP)中至关重要,特别是在医学领域,准确识别实体(如患者信息和临床事件)至关重要。传统的NER方法严重依赖于大型的、带注释的语料库,这是资源密集型的。大型语言模型(llm)提供了新的NER方法,特别是通过上下文学习和少镜头学习。目的:本研究探讨了通过llm将注释指南纳入NER提示的效果,并特别关注了它们对跨各种医学语料库的少射学习性能的影响。方法:我们设计了8种不同的提示模式,结合了不同复杂性的注释指南,并通过三个著名的llm: gpt- 40、Claude 3.5 Sonnet和gpt- ss-120b来评估它们的性能。此外,我们还采用了三种不同的医疗资料库:i2b2-2014、i2b2-2012和MedTxt-CR。准确性通过精密度、召回率和F1分数来评估,评估方法与相关共享任务中使用的方法一致,以确保结果的可比性。结果:我们的研究结果表明,在大多数情况下,为少量提示添加详细的注释指南可以提高召回率和F1分数。结论:在提示中包含注释指南可以增强llm在NER任务中的性能,使其成为在资源受限环境中开发准确的NLP系统的实用方法。尽管注释指南对于评估和示例创建至关重要,但将它们集成到LLM提示中可以进一步优化少量学习,特别是在医疗NLP等专业领域中。
{"title":"Efficient medical NER with limited data: Enhancing LLM performance through annotation guidelines","authors":"Emiko Shinohara,&nbsp;Yoshimasa Kawazoe","doi":"10.1016/j.ijmedinf.2025.106230","DOIUrl":"10.1016/j.ijmedinf.2025.106230","url":null,"abstract":"<div><h3>Background</h3><div>Named entity recognition (NER) is critical in natural language processing (NLP), particularly in the medical field, where accurate identification of entities, such as patient information and clinical events, is essential. Traditional NER approaches rely heavily on large, annotated corpora, which are resource intensive. Large language models (LLMs) offer new NER approaches, particularly through in-context and few-shot learning.</div></div><div><h3>Objective</h3><div>This study investigates the effects of incorporating annotation guidelines into prompts for NER via LLMs, with a specific focus on their impact on few-shot learning performance across various medical corpora.</div></div><div><h3>Methods</h3><div>We designed eight different prompt patterns, combining few-shot examples with annotation guidelines of varying complexity, and evaluated their performance via three prominent LLMs: GPT-4o, Claude 3.5 Sonnet, and gpt-oss-120b. Additionally, we employed three diverse medical corpora: i2b2-2014, i2b2-2012, and MedTxt-CR. Accuracy was assessed via precision, recall, and the F1 score, with evaluation methods aligned with those used in relevant shared tasks to ensure the comparability of the results.</div></div><div><h3>Results</h3><div>Our findings indicate that adding detailed annotation guidelines to few-shot prompts improves the recall and F1 score in most cases.</div></div><div><h3>Conclusion</h3><div>Including annotation guidelines in prompts enhances the performance of LLMs in NER tasks, making this a practical approach for developing accurate NLP systems in resource-constrained environments. Although annotation guidelines are essential for evaluation and example creation, their integration into LLM prompts can further optimize few-shot learning, especially within specialized domains such as medical NLP.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106230"},"PeriodicalIF":4.1,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145821980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vision-language models in diagnostic imaging: review of technical advances, clinical validation, and practical deployment 诊断成像中的视觉语言模型:技术进步、临床验证和实际部署的回顾
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-03-15 Epub Date: 2025-12-28 DOI: 10.1016/j.ijmedinf.2025.106227
Niharika Dutta , Kartik Bose , Emir Syailendra , Linda Chu , Pankaj Gupta

Background

Radiology faces an unprecedented workload crisis, creating demand for AI solutions to enhance efficiency and quality. Vision-language models (VLMs) represent a paradigm shift from narrow AI tools to integrated systems for image interpretation and report generation. However, their rapid technical progress has outpaced rigorous clinical validation, creating a critical gap between their theoretical potential and safe, practical deployment.

Objective

To critically review the state of VLMs in diagnostic imaging by evaluating their clinical validation, identifying deployment challenges, and assessing their impact on the radiological workflow. This review provides a roadmap for responsible clinical integration by analyzing the gap between model performance and real-world utility.

Method

A narrative review of literature was conducted from January 2017 to May 2025. The search focused on VLM applications in radiology, including automated report generation and visual question answering. We synthesized findings from technical and clinical validation studies, thematically organized around architectural evolution, applications, validation, and implementation barriers.

Results

A clear progression from encoder-decoder models to sophisticated LLM-integrated foundation models was identified. While these models achieve high performance on NLP metrics, their clinical utility is limited. Key findings include: (1) Pervasive model hallucination, with factual errors in ∼ 22 % of AI-generated reports; (2) A lack of external validation on diverse, multi-institutional datasets; (3) Significant implementation barriers, including high computational costs, poor workflow integration, and unresolved liability. Human expert evaluations show that while AI-generated reports for routine cases are often acceptable (77.7 % in one study), accuracy declines significantly in complex cases.

Conclusion

VLMs hold transformative potential but are not ready for autonomous clinical use. Their primary value lies in augmenting radiologists’ workflow. For successful adoption, the field must shift focus from algorithmic metrics to proving clinical safety and efficacy through rigorous validation, developing robust hallucination mitigation strategies, and designing seamless workflow integrations.
放射学面临着前所未有的工作量危机,创造了对人工智能解决方案的需求,以提高效率和质量。视觉语言模型(VLMs)代表了从狭窄的人工智能工具到用于图像解释和报告生成的集成系统的范式转变。然而,它们的快速技术进步已经超过了严格的临床验证,在它们的理论潜力和安全、实际部署之间形成了一个关键的差距。目的通过评估VLMs在诊断成像中的临床有效性、确定部署挑战以及评估其对放射工作流程的影响,批判性地回顾VLMs在诊断成像中的状态。本综述通过分析模型性能与现实世界效用之间的差距,为负责任的临床整合提供了路线图。方法对2017年1月至2025年5月的文献进行叙述性综述。搜索的重点是VLM在放射学中的应用,包括自动报告生成和可视化问题回答。我们综合了来自技术和临床验证研究的发现,并围绕架构演变、应用程序、验证和实现障碍进行了主题组织。结果确定了从编码器-解码器模型到复杂的llm集成基础模型的明显进展。虽然这些模型在NLP指标上实现了高性能,但它们的临床效用有限。主要发现包括:(1)普遍存在的模型幻觉,在人工智能生成的报告中,约22%存在事实错误;(2)缺乏对多样化、多机构数据集的外部验证;(3)显著的实现障碍,包括高计算成本、差的工作流集成和未解决的责任。人类专家的评估表明,虽然人工智能生成的报告在常规病例中通常是可以接受的(在一项研究中为77.7%),但在复杂病例中准确性显著下降。结论vlms具有变革潜力,但尚未具备自主临床应用的条件。它们的主要价值在于增强放射科医生的工作流程。为了成功采用,该领域必须将重点从算法指标转移到通过严格的验证来证明临床安全性和有效性,开发强大的幻觉缓解策略,并设计无缝的工作流程集成。
{"title":"Vision-language models in diagnostic imaging: review of technical advances, clinical validation, and practical deployment","authors":"Niharika Dutta ,&nbsp;Kartik Bose ,&nbsp;Emir Syailendra ,&nbsp;Linda Chu ,&nbsp;Pankaj Gupta","doi":"10.1016/j.ijmedinf.2025.106227","DOIUrl":"10.1016/j.ijmedinf.2025.106227","url":null,"abstract":"<div><h3>Background</h3><div>Radiology faces an unprecedented workload crisis, creating demand for AI solutions to enhance efficiency and quality. Vision-language models (VLMs) represent a paradigm shift from narrow AI tools to integrated systems for image interpretation and report generation. However, their rapid technical progress has outpaced rigorous clinical validation, creating a critical gap between their theoretical potential and safe, practical deployment.</div></div><div><h3>Objective</h3><div>To critically review the state of VLMs in diagnostic imaging by evaluating their clinical validation, identifying deployment challenges, and assessing their impact on the radiological workflow. This review provides a roadmap for responsible clinical integration by analyzing the gap between model performance and real-world utility.</div></div><div><h3>Method</h3><div>A narrative review of literature was conducted from January 2017 to May 2025. The search focused on VLM applications in radiology, including automated report generation and visual question answering. We synthesized findings from technical and clinical validation studies, thematically organized around architectural evolution, applications, validation, and implementation barriers.</div></div><div><h3>Results</h3><div>A clear progression from encoder-decoder models to sophisticated LLM-integrated foundation models was identified. While these models achieve high performance on NLP metrics, their clinical utility is limited. Key findings include: (1) Pervasive model hallucination, with factual errors in ∼ 22 % of AI-generated reports; (2) A lack of external validation on diverse, multi-institutional datasets; (3) Significant implementation barriers, including high computational costs, poor workflow integration, and unresolved liability. Human expert evaluations show that while AI-generated reports for routine cases are often acceptable (77.7 % in one study), accuracy declines significantly in complex cases.</div></div><div><h3>Conclusion</h3><div>VLMs hold transformative potential but are not ready for autonomous clinical use. Their primary value lies in augmenting radiologists’ workflow. For successful adoption, the field must shift focus from algorithmic metrics to proving clinical safety and efficacy through rigorous validation, developing robust hallucination mitigation strategies, and designing seamless workflow integrations.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106227"},"PeriodicalIF":4.1,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A roadmap for federated learning projects using health data to guide sustainable artificial intelligence development in the European Union 使用健康数据指导欧洲联盟可持续人工智能发展的联合学习项目路线图。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-03-15 Epub Date: 2025-12-25 DOI: 10.1016/j.ijmedinf.2025.106242
Janne Kommusaar , Silja Elunurm , Taridzo Chomutare , Mari Kangasniemi , Sanna Salanterä , Laura-Maria Peltonen
<div><h3>Background</h3><div>The rise of digital health data has expanded opportunities for data-driven innovation, yet privacy, legal and ethical barriers frame data sharing and collaborative artificial intelligence development. Federated Learning (FL) offers a privacy-preserving alternative, but current research considers mainly technical aspects. There is no end-to-end roadmap that integrates ethical, legal, technical and administrative principles tailored to FL projects in healthcare. This study addresses that gap by developing a roadmap to guide responsible and scalable FL research in the European context.</div></div><div><h3>Methods</h3><div>A multi-method participatory approach was used to develop a roadmap for scientific projects using FL on health data. The iterative process involved three phases. First, key questions were defined and existing evidence was explored through (i) a survey of domain experts (researchers, data governance specialists and infrastructure providers), (ii) a targeted literature review of FL applications in health research and (iii) systematic mapping of relevant EU-level legislation and policy frameworks. Evidence from these sources was synthesized to identify technical, organizational, legal and sustainability-related requirements for FL-based research. Second, preliminary roadmap components were refined through stakeholder engagement in an online workshop, where feasibility, scalability and sustainability considerations were explicitly discussed. Third, the roadmap was validated and iteratively refined by an expert panel through a structured group discussion, focusing on long-term sustainability, governance and transferability across research contexts. The process was carried out within a Baltic-Nordic collaboration in 2023–2025.</div></div><div><h3>Results</h3><div>The developed roadmap integrates ethical, legal, technical, administrative and sustainability-related considerations essential for applying FL to health data. It emphasizes the importance of multidisciplinary collaboration throughout the FL project lifecycle, with particular attention to long-term governance, scalability and reuse of infrastructures and practices. The process is structured into six phases: (1) Planning, (2) Execution refinement, (3) Data, (4) FL platform, (5) FL experiment and (6) Dissemination. Across these phases, sustainability is addressed through mechanisms such as regulatory alignment, shared governance models, capacity building and integration with existing research and health data infrastructures. By merging ethical, legal, technical and administrative aspects into a unified, end-to-end framework, the roadmap provides actionable, novel guidance beyond existing recommendations.</div></div><div><h3>Conclusions</h3><div>This work consolidates early lessons from FL in healthcare into a practical, step-by-step roadmap that integrates ethical, legal, technical and administrative aspects in the European context. By offering a shared
背景:数字健康数据的兴起扩大了数据驱动创新的机会,但隐私、法律和道德障碍阻碍了数据共享和协作式人工智能的发展。联邦学习(FL)提供了一种保护隐私的替代方案,但目前的研究主要考虑技术方面的问题。目前还没有一个端到端的路线图,可以整合为医疗保健领域的FL项目量身定制的道德、法律、技术和管理原则。本研究通过制定路线图来指导欧洲范围内负责任和可扩展的FL研究,从而解决了这一差距。方法:采用多方法参与式方法,为利用FL处理卫生数据的科学项目制定路线图。迭代过程包括三个阶段。首先,通过(i)对领域专家(研究人员、数据治理专家和基础设施提供商)的调查,(ii)对FL在卫生研究中的应用进行有针对性的文献综述,以及(iii)系统地绘制相关欧盟层面的立法和政策框架,定义了关键问题并探索了现有证据。综合了这些来源的证据,以确定基于fl的研究的技术、组织、法律和可持续性相关要求。其次,通过在线研讨会的利益相关者参与,对初步路线图组件进行了细化,其中明确讨论了可行性、可扩展性和可持续性考虑因素。第三,路线图由专家小组通过结构化的小组讨论进行验证和迭代完善,重点关注长期可持续性、治理和跨研究背景的可转移性。该过程是在2023-2025年的波罗的海-北欧合作中进行的。结果:制定的路线图整合了将FL应用于健康数据所必需的伦理、法律、技术、行政和可持续性相关考虑因素。它强调了在整个FL项目生命周期中多学科协作的重要性,特别关注基础结构和实践的长期治理、可伸缩性和重用。该过程分为六个阶段:(1)计划,(2)执行改进,(3)数据,(4)FL平台,(5)FL实验和(6)传播。在这些阶段,可持续性是通过监管协调、共享治理模式、能力建设以及与现有研究和卫生数据基础设施的整合等机制来解决的。通过将道德、法律、技术和管理方面合并到一个统一的端到端框架中,路线图提供了超越现有建议的可操作的新颖指导。结论:这项工作将早期FL在医疗保健方面的经验教训整合到一个实用的、逐步的路线图中,该路线图在欧洲背景下整合了伦理、法律、技术和行政方面。通过为不同的利益相关者提供共享框架,它支持跨医疗保健系统的更值得信赖、可扩展和兼容的人工智能协作。
{"title":"A roadmap for federated learning projects using health data to guide sustainable artificial intelligence development in the European Union","authors":"Janne Kommusaar ,&nbsp;Silja Elunurm ,&nbsp;Taridzo Chomutare ,&nbsp;Mari Kangasniemi ,&nbsp;Sanna Salanterä ,&nbsp;Laura-Maria Peltonen","doi":"10.1016/j.ijmedinf.2025.106242","DOIUrl":"10.1016/j.ijmedinf.2025.106242","url":null,"abstract":"&lt;div&gt;&lt;h3&gt;Background&lt;/h3&gt;&lt;div&gt;The rise of digital health data has expanded opportunities for data-driven innovation, yet privacy, legal and ethical barriers frame data sharing and collaborative artificial intelligence development. Federated Learning (FL) offers a privacy-preserving alternative, but current research considers mainly technical aspects. There is no end-to-end roadmap that integrates ethical, legal, technical and administrative principles tailored to FL projects in healthcare. This study addresses that gap by developing a roadmap to guide responsible and scalable FL research in the European context.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Methods&lt;/h3&gt;&lt;div&gt;A multi-method participatory approach was used to develop a roadmap for scientific projects using FL on health data. The iterative process involved three phases. First, key questions were defined and existing evidence was explored through (i) a survey of domain experts (researchers, data governance specialists and infrastructure providers), (ii) a targeted literature review of FL applications in health research and (iii) systematic mapping of relevant EU-level legislation and policy frameworks. Evidence from these sources was synthesized to identify technical, organizational, legal and sustainability-related requirements for FL-based research. Second, preliminary roadmap components were refined through stakeholder engagement in an online workshop, where feasibility, scalability and sustainability considerations were explicitly discussed. Third, the roadmap was validated and iteratively refined by an expert panel through a structured group discussion, focusing on long-term sustainability, governance and transferability across research contexts. The process was carried out within a Baltic-Nordic collaboration in 2023–2025.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Results&lt;/h3&gt;&lt;div&gt;The developed roadmap integrates ethical, legal, technical, administrative and sustainability-related considerations essential for applying FL to health data. It emphasizes the importance of multidisciplinary collaboration throughout the FL project lifecycle, with particular attention to long-term governance, scalability and reuse of infrastructures and practices. The process is structured into six phases: (1) Planning, (2) Execution refinement, (3) Data, (4) FL platform, (5) FL experiment and (6) Dissemination. Across these phases, sustainability is addressed through mechanisms such as regulatory alignment, shared governance models, capacity building and integration with existing research and health data infrastructures. By merging ethical, legal, technical and administrative aspects into a unified, end-to-end framework, the roadmap provides actionable, novel guidance beyond existing recommendations.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Conclusions&lt;/h3&gt;&lt;div&gt;This work consolidates early lessons from FL in healthcare into a practical, step-by-step roadmap that integrates ethical, legal, technical and administrative aspects in the European context. By offering a shared","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106242"},"PeriodicalIF":4.1,"publicationDate":"2026-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145879368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Study of bladder cancer detection in standard white light versus AI-supported endoscopy-01 (RAISE-01) - Development and validation of an AI-based support tool. 标准白光下膀胱癌检测与人工智能支持的内窥镜-01 (RAISE-01)的研究-基于人工智能的支持工具的开发和验证。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-03-14 DOI: 10.1016/j.ijmedinf.2026.106390
Peter B Hjort, Jacob E Jensen, Jørgen B Jensen, Andreas Ernst

Background: Diagnosis and surveillance of bladder cancer rely on white-light cystoscopy (WLC). However, this modality is operator-dependent and associated with a risk of missed lesions, contributing to high recurrence rates, especially in non-muscle invasive bladder cancer. Recent advances in artificial intelligence (AI) enable software-based decision support for bladder lesion detection, with potential for vendor-independent deployment and broad integration into routine clinical workflows.

Objective: To develop and externally validate an AI-based clinical decision support system for real-time bladder lesion detection during cystoscopy.

Methods: CystoAID, a convolutional neural network-based object detection system, was trained on prospectively collected video recordings from flexible cystoscopies and transurethral resections of bladder tumors. Diagnostic accuracy was evaluated using a retrospective external validation dataset representative of routine clinical practice, in accordance with STARD-AI recommendations.

Results: In the external validation cohort, CystoAID achieved a sensitivity of 1.00 (95% CI 0.95-1.00). Precision was 88.1% (95% CI 81.3-92.7), exceeding published estimates for WLC. Precision-recall analysis showed consistently high precision (>0.8) across clinically relevant recall levels, with declining precision at higher recall, reflecting the expected trade-off between sensitivity and false-positive detections. The system operated with low processing latency, supporting feasibility for real-time clinical use. Sensitivity was prioritized to mitigate the clinical risk associated with false-negative findings.

Conclusions: CystoAID is a real-time, AI-based decision support tool for cystoscopy that demonstrated high sensitivity and favorable precision in external validation. These findings support its potential role as an assistive technology in routine urologic practice. Prospective studies are warranted to evaluate clinical impact, workflow integration, and performance in detecting challenging lesion subtypes, including flat lesions and carcinoma in situ.

背景:膀胱癌的诊断和监测依赖于白光膀胱镜检查。然而,这种手术方式依赖于手术者,并且与漏诊的风险相关,导致高复发率,特别是在非肌肉浸润性膀胱癌中。人工智能(AI)的最新进展使基于软件的膀胱病变检测决策支持成为可能,具有独立于供应商的部署和广泛集成到常规临床工作流程中的潜力。目的:开发并外部验证基于人工智能的膀胱镜下病变实时检测临床决策支持系统。方法:采用基于卷积神经网络的目标检测系统CystoAID,对前瞻性收集的柔性膀胱镜和经尿道膀胱肿瘤切除术的视频记录进行训练。根据star - ai建议,使用代表常规临床实践的回顾性外部验证数据集评估诊断准确性。结果:在外部验证队列中,CystoAID的敏感性为1.00 (95% CI 0.95-1.00)。精密度为88.1% (95% CI 81.3-92.7),超过了已发表的WLC估计值。查准率-查全率分析显示,在临床相关的查全率水平上,准确率始终较高(>.8),查全率越高,准确率越低,反映了灵敏度和假阳性检测之间的预期权衡。该系统运行时处理延迟低,支持实时临床应用的可行性。敏感性被优先考虑,以减轻与假阴性结果相关的临床风险。结论:CystoAID是一种基于人工智能的实时膀胱镜决策支持工具,在外部验证中具有较高的灵敏度和良好的精度。这些发现支持其在泌尿外科常规实践中作为辅助技术的潜在作用。有必要进行前瞻性研究,以评估临床影响、工作流程整合以及检测具有挑战性的病变亚型(包括扁平病变和原位癌)的性能。
{"title":"Study of bladder cancer detection in standard white light versus AI-supported endoscopy-01 (RAISE-01) - Development and validation of an AI-based support tool.","authors":"Peter B Hjort, Jacob E Jensen, Jørgen B Jensen, Andreas Ernst","doi":"10.1016/j.ijmedinf.2026.106390","DOIUrl":"https://doi.org/10.1016/j.ijmedinf.2026.106390","url":null,"abstract":"<p><strong>Background: </strong>Diagnosis and surveillance of bladder cancer rely on white-light cystoscopy (WLC). However, this modality is operator-dependent and associated with a risk of missed lesions, contributing to high recurrence rates, especially in non-muscle invasive bladder cancer. Recent advances in artificial intelligence (AI) enable software-based decision support for bladder lesion detection, with potential for vendor-independent deployment and broad integration into routine clinical workflows.</p><p><strong>Objective: </strong>To develop and externally validate an AI-based clinical decision support system for real-time bladder lesion detection during cystoscopy.</p><p><strong>Methods: </strong>CystoAID, a convolutional neural network-based object detection system, was trained on prospectively collected video recordings from flexible cystoscopies and transurethral resections of bladder tumors. Diagnostic accuracy was evaluated using a retrospective external validation dataset representative of routine clinical practice, in accordance with STARD-AI recommendations.</p><p><strong>Results: </strong>In the external validation cohort, CystoAID achieved a sensitivity of 1.00 (95% CI 0.95-1.00). Precision was 88.1% (95% CI 81.3-92.7), exceeding published estimates for WLC. Precision-recall analysis showed consistently high precision (>0.8) across clinically relevant recall levels, with declining precision at higher recall, reflecting the expected trade-off between sensitivity and false-positive detections. The system operated with low processing latency, supporting feasibility for real-time clinical use. Sensitivity was prioritized to mitigate the clinical risk associated with false-negative findings.</p><p><strong>Conclusions: </strong>CystoAID is a real-time, AI-based decision support tool for cystoscopy that demonstrated high sensitivity and favorable precision in external validation. These findings support its potential role as an assistive technology in routine urologic practice. Prospective studies are warranted to evaluate clinical impact, workflow integration, and performance in detecting challenging lesion subtypes, including flat lesions and carcinoma in situ.</p>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"213 ","pages":"106390"},"PeriodicalIF":4.1,"publicationDate":"2026-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147464393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
"Clicking without understanding": A mixed-methods analysis of user agreements in digital mental health services. “点击而不理解”:对数字心理健康服务用户协议的混合方法分析。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-03-13 DOI: 10.1016/j.ijmedinf.2026.106384
Gerard Chung Siew Keong

Objectives: This study evaluates privacy policies and terms of service agreements from digital mental health platforms, focusing on accessibility, comprehensibility, and alignment with informed consent principles in healthcare informatics.

Materials and methods: We applied mixed methods combining content analysis and computational linguistic assessment to 139 user agreements from international mental health applications and Singaporean providers, including commercial platforms and social service agencies serving vulnerable populations. We evaluated readability, communicative practices, regulatory compliance, and power asymmetries. Only 1.67% of services implemented comprehension verification for informed consent. User agreements required approximately 16 years of education for comprehension and exhibited significant linguistic power asymmetries favoring providers. Privacy policies comprehensively addressed data collection but systematically neglected post-service communication regarding data retention and deletion. Among local services, only 8.33% adequately communicated data breach notification procedures as required by Singapore's Personal Data Protection Act. Terms of service failed to establish bidirectional communicative exchange necessary for meaningful healthcare informed consent. Findings reveal fundamental misalignment between digital mental health agreements and collaborative communication principles essential to therapeutic relationships and healthcare informatics best practices. Communication barriers pose particular risks for individuals with serious mental illness requiring accessible health information for decision-making. Results have implications for health informatics policy, consumer health technology design, and digital health regulatory frameworks. Digital mental health platforms demonstrate significant user communication deficiencies. Our findings point to the need for user agreements that are written in plain language, that incorporate essential informed consent components, that balance linguistic power between providers and users, and that accommodate the cognitive needs of vulnerable populations seeking mental health support.

目的:本研究评估了数字心理健康平台的隐私政策和服务协议条款,重点关注可访问性、可理解性以及与医疗信息学中知情同意原则的一致性。材料和方法:我们将内容分析和计算语言评估相结合的混合方法应用于来自国际心理健康应用程序和新加坡提供者的139份用户协议,包括为弱势群体服务的商业平台和社会服务机构。我们评估了可读性、沟通实践、法规遵从性和权力不对称。只有1.67%的服务实施了知情同意的理解性验证。用户协议需要大约16年的教育才能理解,并表现出明显的语言权力不对称,有利于提供者。隐私政策全面解决了数据收集问题,但系统性地忽视了有关数据保留和删除的离职后沟通。在本地服务中,只有8.33%的服务按照新加坡《个人数据保护法》的要求充分传达了数据泄露通知程序。服务条款未能建立有意义的医疗知情同意所必需的双向沟通交流。研究结果揭示了数字心理健康协议与治疗关系和医疗信息学最佳实践所必需的协作沟通原则之间的根本不一致。沟通障碍对需要获取健康信息以供决策的严重精神疾病患者构成特别的风险。研究结果对健康信息政策、消费者健康技术设计和数字健康监管框架具有启示意义。数字心理健康平台显示出严重的用户沟通缺陷。我们的研究结果指出,需要用通俗易懂的语言编写用户协议,其中包括基本的知情同意组成部分,平衡提供者和用户之间的语言力量,并适应寻求心理健康支持的弱势群体的认知需求。
{"title":"\"Clicking without understanding\": A mixed-methods analysis of user agreements in digital mental health services.","authors":"Gerard Chung Siew Keong","doi":"10.1016/j.ijmedinf.2026.106384","DOIUrl":"https://doi.org/10.1016/j.ijmedinf.2026.106384","url":null,"abstract":"<p><strong>Objectives: </strong>This study evaluates privacy policies and terms of service agreements from digital mental health platforms, focusing on accessibility, comprehensibility, and alignment with informed consent principles in healthcare informatics.</p><p><strong>Materials and methods: </strong>We applied mixed methods combining content analysis and computational linguistic assessment to 139 user agreements from international mental health applications and Singaporean providers, including commercial platforms and social service agencies serving vulnerable populations. We evaluated readability, communicative practices, regulatory compliance, and power asymmetries. Only 1.67% of services implemented comprehension verification for informed consent. User agreements required approximately 16 years of education for comprehension and exhibited significant linguistic power asymmetries favoring providers. Privacy policies comprehensively addressed data collection but systematically neglected post-service communication regarding data retention and deletion. Among local services, only 8.33% adequately communicated data breach notification procedures as required by Singapore's Personal Data Protection Act. Terms of service failed to establish bidirectional communicative exchange necessary for meaningful healthcare informed consent. Findings reveal fundamental misalignment between digital mental health agreements and collaborative communication principles essential to therapeutic relationships and healthcare informatics best practices. Communication barriers pose particular risks for individuals with serious mental illness requiring accessible health information for decision-making. Results have implications for health informatics policy, consumer health technology design, and digital health regulatory frameworks. Digital mental health platforms demonstrate significant user communication deficiencies. Our findings point to the need for user agreements that are written in plain language, that incorporate essential informed consent components, that balance linguistic power between providers and users, and that accommodate the cognitive needs of vulnerable populations seeking mental health support.</p>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"213 ","pages":"106384"},"PeriodicalIF":4.1,"publicationDate":"2026-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147464283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised identification of muscle phenotypes in adults with obesity: a data-driven framework for the identification of sarcopenia in absence of a gold standard. 成人肥胖患者肌肉表型的无监督鉴定:在缺乏金标准的情况下识别肌肉减少症的数据驱动框架。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-03-13 DOI: 10.1016/j.ijmedinf.2026.106398
Lo Conte Sofia, Bufano Annalisa, Cevenini Gabriele, Barbini Paolo, Castagna Maria Grazia, Cartocci Alessandra

Background: Sarcopenia is characterized by progressive loss of skeletal muscle mass and strength and is associated with increased disability and mortality. However, the diagnosis of sarcopenia remains challenging due to the absence of a universally accepted gold standard and validated cut-off values for skeletal muscle indices. Data-driven approaches based on unsupervised clustering may overcome these limitations by identifying muscle-related phenotypes directly from anthropometric and body composition data.

Methods: In this study, 600 adults with obesity were analyzed and stratified by sex. The dataset was randomly divided into a training set (80%) and a testing set (20%). After data standardization, principal component analysis (PCA) was applied separately in males and females. Unsupervised clustering was then performed on the preserved principal components, and the optimal number of clusters was determined using internal validation indices. Linear Discriminant Analysis (LDA) was applied to assign patients in the test set, and posterior probabilities were correlated with Skeletal Muscle Index (SMI).

Results: Clustering consistently identified two distinct groups in both sexes: one with higher SMI and another with lower SMI, consistent with reduced muscle status. Stepwise LDA accurately classified individuals, and posterior probabilities of belonging to the pathological cluster were negatively correlated with SMI in both sexes, despite SMI not being used in clustering or classification. Individuals in the pathological group exhibited significantly lower SMI, particularly among females.

Conclusions: The combined use of unsupervised clustering and LDA allows reliable identification of distinct muscle-related phenotypes in adults with obesity. This framework provides reproducible classifications, correlates with skeletal muscle index, and offers a quantitative approach to stratify patients by muscle status, even in the absence of predefined diagnostic criteria. These findings support the potential of data-driven phenotyping to improve early detection of sarcopenic obesity.

背景:骨骼肌减少症的特征是骨骼肌质量和力量的进行性损失,并与残疾和死亡率增加有关。然而,由于缺乏普遍接受的骨骼肌指数的金标准和有效的临界值,肌肉减少症的诊断仍然具有挑战性。基于无监督聚类的数据驱动方法可以通过直接从人体测量和身体成分数据中识别肌肉相关表型来克服这些局限性。方法:本研究对600例成人肥胖患者进行分析,并按性别分层。数据集随机分为训练集(80%)和测试集(20%)。数据标准化后,分别对男女进行主成分分析(PCA)。然后对保留的主成分进行无监督聚类,并使用内部验证指标确定最佳聚类数。采用线性判别分析(LDA)对测试集中的患者进行分配,后验概率与骨骼肌指数(SMI)相关。结果:聚类一致地确定了两种不同的性别群体:一种具有较高的SMI,另一种具有较低的SMI,与减少的肌肉状态一致。逐步LDA准确地对个体进行了分类,尽管SMI没有用于聚类或分类,但在两性中,属于病理聚类的后验概率与SMI呈负相关。病理组的个体表现出明显较低的重度精神分裂症,尤其是女性。结论:联合使用无监督聚类和LDA可以可靠地识别成人肥胖患者不同的肌肉相关表型。该框架提供了可重复的分类,与骨骼肌指数相关,并提供了一种定量方法,根据肌肉状态对患者进行分层,即使在没有预定义的诊断标准的情况下。这些发现支持数据驱动型表型的潜力,以提高肌肉减少性肥胖的早期检测。
{"title":"Unsupervised identification of muscle phenotypes in adults with obesity: a data-driven framework for the identification of sarcopenia in absence of a gold standard.","authors":"Lo Conte Sofia, Bufano Annalisa, Cevenini Gabriele, Barbini Paolo, Castagna Maria Grazia, Cartocci Alessandra","doi":"10.1016/j.ijmedinf.2026.106398","DOIUrl":"https://doi.org/10.1016/j.ijmedinf.2026.106398","url":null,"abstract":"<p><strong>Background: </strong>Sarcopenia is characterized by progressive loss of skeletal muscle mass and strength and is associated with increased disability and mortality. However, the diagnosis of sarcopenia remains challenging due to the absence of a universally accepted gold standard and validated cut-off values for skeletal muscle indices. Data-driven approaches based on unsupervised clustering may overcome these limitations by identifying muscle-related phenotypes directly from anthropometric and body composition data.</p><p><strong>Methods: </strong>In this study, 600 adults with obesity were analyzed and stratified by sex. The dataset was randomly divided into a training set (80%) and a testing set (20%). After data standardization, principal component analysis (PCA) was applied separately in males and females. Unsupervised clustering was then performed on the preserved principal components, and the optimal number of clusters was determined using internal validation indices. Linear Discriminant Analysis (LDA) was applied to assign patients in the test set, and posterior probabilities were correlated with Skeletal Muscle Index (SMI).</p><p><strong>Results: </strong>Clustering consistently identified two distinct groups in both sexes: one with higher SMI and another with lower SMI, consistent with reduced muscle status. Stepwise LDA accurately classified individuals, and posterior probabilities of belonging to the pathological cluster were negatively correlated with SMI in both sexes, despite SMI not being used in clustering or classification. Individuals in the pathological group exhibited significantly lower SMI, particularly among females.</p><p><strong>Conclusions: </strong>The combined use of unsupervised clustering and LDA allows reliable identification of distinct muscle-related phenotypes in adults with obesity. This framework provides reproducible classifications, correlates with skeletal muscle index, and offers a quantitative approach to stratify patients by muscle status, even in the absence of predefined diagnostic criteria. These findings support the potential of data-driven phenotyping to improve early detection of sarcopenic obesity.</p>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"213 ","pages":"106398"},"PeriodicalIF":4.1,"publicationDate":"2026-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1