International Journal of Medical Informatics最新文献_第2页

Digital literacy training within interventions to support older adults with cardiovascular disease in using technologies: a systematic review 在支持患有心血管疾病的老年人使用技术的干预措施中进行数字扫盲培训：系统回顾。

IF 4.1 2区医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Medical Informatics

Pub Date : 2026-02-05 DOI: 10.1016/j.ijmedinf.2026.106312

Kathy L. Rush , Cherisse L. Seaton , Rowan Ross , Taylor Robertson , Angeliki-Iliana Louloudi , Peter Loewen , Kristen R. Haase , Jennifer Jakobi , Robert Janke

Background

Advancement in digitalization in the health sector have created numerous opportunities for cardiovascular disease (CVD) self-management but also challenges, especially for older adults with lower digital health literacy. Reviews have examined impacts of digital health technology interventions on health outcomes without examining the role of training provided. The aim of this review is to synthesize evidence about the impacts of digital literacy training (DLT) and its characteristics as a component of digital interventions related to cardiovascular health on patient reported outcome and experience measures among older adults with CVD.

Methods

In accordance with the 2020 Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines, a search of MEDLINE, EMBASE, CINAHL, and PsycINFO databases for articles published between inception to March 31, 2025 was conducted. Empirical studies reporting digital health technology training with adults (M age 60 + years) with CVD were eligible for inclusion. Articles included were quality-rated using the Mixed Methods Appraisal Tool. Data were extracted according to the DLT and health technologies alongside patient-reported outcome (i.e technology- and health-related) and experience measures.

Results

Of the 56 included studies (totaling 7698 participants), DLT varied considerably, with 51 describing in-person training. Two studies (totaling 519 participants) examined the role of training with positive impacts on technology- and health-related outcomes. In many of the remaining studies, positive technology-related outcomes were evident but could not be linked back to DLT separate from the overall intervention. In studies (n = 10) where training was evaluated, feedback from patients largely affirmed the training was needed.

Discussion

The collective evidence suggests DLT overall is useful and needed in digital interventions for older adults with CVD. More work is needed to elucidate the distinct role of DLT characteristics and to determine for whom and under what conditions DLT impacts health and technology-related outcomes.

Registration

The protocol for this review was registered Aug 12, 2024 in Open Science Framework (OSF) (See: https://osf.io/unhd9)

背景：卫生部门数字化的进步为心血管疾病（CVD）的自我管理创造了许多机会，但也带来了挑战，特别是对数字健康素养较低的老年人。审查审查了数字卫生技术干预措施对健康结果的影响，但没有审查所提供培训的作用。本综述的目的是综合有关数字素养培训（DLT）及其特征作为心血管健康相关数字干预的组成部分对老年心血管疾病患者报告的结果和体验测量的影响的证据。方法：根据2020年系统评价和荟萃分析首选报告项目指南，检索MEDLINE、EMBASE、CINAHL和PsycINFO数据库，检索创刊至2025年3月31日期间发表的文章。报告对心血管疾病成人（M年龄60 + 岁）进行数字健康技术培训的实证研究符合纳入条件。使用混合方法评价工具对纳入的文章进行质量评价。根据DLT和卫生技术以及患者报告的结果（即技术和健康相关）和经验措施提取数据。结果：在纳入的56项研究（共计7698名参与者）中，DLT差异很大，其中51项描述了亲自培训。两项研究（共519名参与者）考察了培训对技术和健康相关结果的积极影响。在许多剩余的研究中，与技术相关的积极结果是明显的，但不能与整体干预分开的DLT联系起来。在评估培训的研究中（n = 10），患者的反馈在很大程度上肯定了培训的必要性。讨论：集体证据表明，DLT总体上是有用的，并且需要用于老年心血管疾病患者的数字干预。需要做更多的工作来阐明DLT特征的独特作用，并确定DLT对谁以及在什么条件下影响健康和技术相关成果。注册：本综述的方案于2024年8月12日在开放科学框架（OSF）注册（见：https://osf.io/unhd9）。

{"title":"Digital literacy training within interventions to support older adults with cardiovascular disease in using technologies: a systematic review","authors":"Kathy L. Rush , Cherisse L. Seaton , Rowan Ross , Taylor Robertson , Angeliki-Iliana Louloudi , Peter Loewen , Kristen R. Haase , Jennifer Jakobi , Robert Janke","doi":"10.1016/j.ijmedinf.2026.106312","DOIUrl":"10.1016/j.ijmedinf.2026.106312","url":null,"abstract":"<div><h3>Background</h3><div>Advancement in digitalization in the health sector have created numerous opportunities for cardiovascular disease (CVD) self-management but also challenges, especially for older adults with lower digital health literacy. Reviews have examined impacts of digital health technology interventions on health outcomes without examining the role of training provided. The aim of this review is to synthesize evidence about the impacts of digital literacy training (DLT) and its characteristics as a component of digital interventions related to cardiovascular health on patient reported outcome and experience measures among older adults with CVD.</div></div><div><h3>Methods</h3><div>In accordance with the 2020 Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines, a search of MEDLINE, EMBASE, CINAHL, and PsycINFO databases for articles published between inception to March 31, 2025 was conducted. Empirical studies reporting digital health technology training with adults (<em>M</em> age 60 + years) with CVD were eligible for inclusion. Articles included were quality-rated using the Mixed Methods Appraisal Tool. Data were extracted according to the DLT and health technologies alongside patient-reported outcome (i.e technology- and health-related) and experience measures.</div></div><div><h3>Results</h3><div>Of the 56 included studies (totaling 7698 participants), DLT varied considerably, with 51 describing in-person training. Two studies (totaling 519 participants) examined the role of training with positive impacts on technology- and health-related outcomes. In many of the remaining studies, positive technology-related outcomes were evident but could not be linked back to DLT separate from the overall intervention. In studies (n = 10) where training was evaluated, feedback from patients largely affirmed the training was needed.</div></div><div><h3>Discussion</h3><div>The collective evidence suggests DLT overall is useful and needed in digital interventions for older adults with CVD. More work is needed to elucidate the distinct role of DLT characteristics and to determine for whom and under what conditions DLT impacts health and technology-related outcomes.</div></div><div><h3>Registration</h3><div>The protocol for this review was registered Aug 12, 2024 in Open Science Framework (OSF) (See: https://osf.io/unhd9)</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"211 ","pages":"Article 106312"},"PeriodicalIF":4.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146133684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The effect of artificial intelligence–assisted pulmonary rehabilitation on exercise capacity: A systematic review and meta-analysis 人工智能辅助肺康复对运动能力的影响：一项系统综述和荟萃分析。

IF 4.1 2区医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Medical Informatics

Pub Date : 2026-02-05 DOI: 10.1016/j.ijmedinf.2026.106336

Ecran Cinkavuk, Ebru Calik, Naciye Vardar-Yagli

Introduction

Artificial intelligence (AI) technologies are increasingly being integrated into pulmonary rehabilitation (PR) to improve individualization, real-time monitoring, and adherence in individuals with chronic respiratory diseases. However, their clinical impact on exercise capacity remains unclear. This systematic review and meta-analysis aimed to evaluate the effectiveness of AI-supported PR programs compared to usual care in improving exercise capacity and respiratory function in adults with chronic respiratory diseases.

Methods

This systematic review and meta-analysis followed PRISMA guidelines and was registered with PROSPERO (ID: CRD420251075622). A comprehensive search was conducted across five electronic databases (PubMed, Web of Science, Scopus, Cochrane Central Register of Controlled Trials (CENTRAL) and PEDro) from inception to July 2025. Statistical analyses for the meta-analysis were conducted using RevMan 5.4.

Results

Three eligible RCTs with a total of 456 participants were included. Pooled analysis showed a significant improvement in 6-minute walk distance (6MWD) after AI-assisted PR group compared to control (MD: 22.08 m; 95% CI: 4.96–39.20; p = 0.01). Moderate heterogeneity was observed (I² = 40%). No meta-analysis was conducted for respiratory function due to insufficient pre-post data. Risk of bias was generally low, though participant blinding was absent in all studies. Methodological quality was good, with a mean PEDro score of 6.0 ± 0.82.

Conclusion

AI-supported PR can significantly improve exercise capacity in individuals with chronic respiratory diseases. Despite promising results, high-quality studies in different pulmonary patient groups are needed to address existing limitations, particularly regarding standardization, cost-effectiveness, and clinical integration of AI-technology.

人工智能（AI）技术越来越多地被整合到肺康复（PR）中，以改善慢性呼吸道疾病患者的个体化、实时监测和依从性。然而，它们对运动能力的临床影响尚不清楚。本系统综述和荟萃分析旨在评估人工智能支持的PR项目与常规护理相比，在改善慢性呼吸系统疾病成人的运动能力和呼吸功能方面的有效性。方法：本系统评价和荟萃分析遵循PRISMA指南，并在PROSPERO注册（ID: CRD420251075622）。我们对5个电子数据库（PubMed、Web of Science、Scopus、Cochrane Central Register of Controlled Trials （Central）和PEDro）进行了全面的检索，检索时间从成立到2025年7月。meta分析采用RevMan 5.4进行统计分析。结果：纳入3项符合条件的随机对照试验，共纳入456名受试者。合并分析显示，人工智能辅助PR组患者6分钟步行距离（6MWD）较对照组有显著改善（MD: 22.08 m; 95% CI: 4.96 ~ 39.20; p = 0.01）。观察到中度异质性（I2 = 40%）。由于前后数据不足，未对呼吸功能进行meta分析。偏倚风险一般较低，但所有研究均未采用受试者盲法。方法质量良好，平均PEDro评分为6.0±0.82。结论：人工智能辅助PR可显著提高慢性呼吸系统疾病患者的运动能力。尽管取得了令人鼓舞的结果，但需要对不同肺部患者群体进行高质量的研究，以解决现有的局限性，特别是在标准化、成本效益和人工智能技术的临床整合方面。

{"title":"The effect of artificial intelligence–assisted pulmonary rehabilitation on exercise capacity: A systematic review and meta-analysis","authors":"Ecran Cinkavuk, Ebru Calik, Naciye Vardar-Yagli","doi":"10.1016/j.ijmedinf.2026.106336","DOIUrl":"10.1016/j.ijmedinf.2026.106336","url":null,"abstract":"<div><h3>Introduction</h3><div>Artificial intelligence (AI) technologies are increasingly being integrated into pulmonary rehabilitation (PR) to improve individualization, real-time monitoring, and adherence in individuals with chronic respiratory diseases. However, their clinical impact on exercise capacity remains unclear. This systematic review and <em>meta</em>-analysis aimed to evaluate the effectiveness of AI-supported PR programs compared to usual care in improving exercise capacity and respiratory function in adults with chronic respiratory diseases.</div></div><div><h3>Methods</h3><div>This systematic review and <em>meta</em>-analysis followed PRISMA guidelines and was registered with PROSPERO (ID: CRD420251075622). A comprehensive search was conducted across five electronic databases (PubMed, Web of Science, Scopus, Cochrane Central Register of Controlled Trials (CENTRAL) and PEDro) from inception to July 2025. Statistical analyses for the <em>meta</em>-analysis were conducted using RevMan 5.4.</div></div><div><h3>Results</h3><div>Three eligible RCTs with a total of 456 participants were included. Pooled analysis showed a significant improvement in 6-minute walk distance (6MWD) after AI-assisted PR group compared to control (MD: 22.08 m; 95% CI: 4.96–39.20; p = 0.01). Moderate heterogeneity was observed (I<sup>2</sup> = 40%). No <em>meta</em>-analysis was conducted for respiratory function due to insufficient pre-post data. Risk of bias was generally low, though participant blinding was absent in all studies. Methodological quality was good, with a mean PEDro score of 6.0 ± 0.82.</div></div><div><h3>Conclusion</h3><div>AI-supported PR can significantly improve exercise capacity in individuals with chronic respiratory diseases. Despite promising results, high-quality studies in different pulmonary patient groups are needed to address existing limitations, particularly regarding standardization, cost-effectiveness, and clinical integration of AI-technology.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"211 ","pages":"Article 106336"},"PeriodicalIF":4.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146144685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Machine learning models for identifying urinary incontinence in women with a history of hysterectomy using basic demographic and clinical characteristics: A cross-sectional study 使用基本人口学和临床特征识别子宫切除术史女性尿失禁的机器学习模型：一项横断面研究。

IF 4.1 2区医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Medical Informatics

Pub Date : 2026-02-05 DOI: 10.1016/j.ijmedinf.2026.106334

Lu Liu , Wei Chen , Lili Li , Ping Zhang

Background

Urinary incontinence (UI) in women with a history of hysterectomy represents a significant global health concern. It is crucial to clarify the association between hysterectomy for benign indications and UI to avoid unnecessary surgery.

Objective

This study aimed to develop a machine learning (ML) model to identify factors associated with UI in women with a history of hysterectomy.

Methods

We analyzed 2021 patients from the National Health and Nutrition Examination Survey (NHANES) database who underwent hysterectomy for benign indications as our derivation cohort. Thirteen demographic and clinical features were evaluated: age, educational, anthropometric measurements (height, weight, waist), medical history diabetes mellitus (DM), and reproductive history. Six ML algorithms were employed: logistic regression (LR), naïve Bayes (NB), multilayer perceptron (MLP), extreme gradient boosting (XGBoost), random forest (RF), and support vector machine (SVM). External validation was performed on a cohort consisting of 556 patients from the Second Qilu Hospital of Shandong University. To improve interpretability, the predictive process was graphically illustrated employing a nomogram and SHapley Additive exPlanations (SHAP). Finally, the model was deployed as an online clinical decision support platform for applications.

Results

A comparison of receiver operating characteristic (ROC) curves using LR as the reference model revealed no statistically significant differences across the six ML algorithms. In the internal validation cohorts, the models achieved area-under-the-curve (AUC) values of 0.753–0.763 and accuracies between 0.627 and 0.664. This predictive performance was sustained in the external-validation cohort, with AUC values ranging from 0.702 to 0.718 and accuracies ranging from 0.661 to 0.697.

Conclusion

Our findings demonstrated that ML models could effectively identify UI in women with a history of hysterectomy. This approach, facilitated by the nomogram and online tool, enhanced the feasibility and accessibility of identifying women at risk.

背景：子宫切除术史女性尿失禁（UI）是一个重要的全球健康问题。为了避免不必要的手术，明确良性子宫切除术与尿失禁之间的关系是至关重要的。目的：本研究旨在开发一种机器学习（ML）模型，以识别子宫切除术史女性尿失禁的相关因素。方法：我们分析了来自国家健康和营养检查调查（NHANES）数据库中因良性适应症接受子宫切除术的2021例患者作为我们的衍生队列。评估了13项人口统计学和临床特征：年龄、教育程度、人体测量（身高、体重、腰围）、糖尿病病史和生殖史。采用了六种机器学习算法：逻辑回归（LR）、naïve贝叶斯（NB）、多层感知器（MLP）、极端梯度增强（XGBoost）、随机森林（RF）和支持向量机（SVM）。外部验证对象为来自山东大学齐鲁第二医院的556例患者。为了提高可解释性，预测过程以图形方式说明采用nomogram和SHapley Additive explanation （SHAP）。最后，将该模型部署为应用程序的在线临床决策支持平台。结果：以LR为参考模型的受试者工作特征（ROC）曲线比较显示，六种ML算法之间无统计学差异。在内部验证队列中，模型的曲线下面积（AUC）值为0.753-0.763，精度在0.627 - 0.664之间。这种预测性能在外部验证队列中保持不变，AUC值范围为0.702至0.718，准确度范围为0.661至0.697。结论：我们的研究结果表明，ML模型可以有效地识别子宫切除术史女性的尿失禁。在nomogram和在线工具的推动下，这种方法提高了识别处于危险中的妇女的可行性和可及性。

{"title":"Machine learning models for identifying urinary incontinence in women with a history of hysterectomy using basic demographic and clinical characteristics: A cross-sectional study","authors":"Lu Liu , Wei Chen , Lili Li , Ping Zhang","doi":"10.1016/j.ijmedinf.2026.106334","DOIUrl":"10.1016/j.ijmedinf.2026.106334","url":null,"abstract":"<div><h3>Background</h3><div>Urinary incontinence (UI) in women with a history of hysterectomy represents a significant global health concern. It is crucial to clarify the association between hysterectomy for benign indications and UI to avoid unnecessary surgery.</div></div><div><h3>Objective</h3><div>This study aimed to develop a machine learning (ML) model to identify factors associated with UI in women with a history of hysterectomy.</div></div><div><h3>Methods</h3><div>We analyzed 2021 patients from the National Health and Nutrition Examination Survey (NHANES) database who underwent hysterectomy for benign indications as our derivation cohort. Thirteen demographic and clinical features were evaluated: age, educational, anthropometric measurements (height, weight, waist), medical history diabetes mellitus (DM), and reproductive history. Six ML algorithms were employed: logistic regression (LR), naïve Bayes (NB), multilayer perceptron (MLP), extreme gradient boosting (XGBoost), random forest (RF), and support vector machine (SVM). External validation was performed on a cohort consisting of 556 patients from the Second Qilu Hospital of Shandong University. To improve interpretability, the predictive process was graphically illustrated employing a nomogram and SHapley Additive exPlanations (SHAP). Finally, the model was deployed as an online clinical decision support platform for applications.</div></div><div><h3>Results</h3><div> Our findings demonstrated that ML models could effectively identify UI in women with a history of hysterectomy. This approach, facilitated by the nomogram and online tool, enhanced the feasibility and accessibility of identifying women at risk.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"211 ","pages":"Article 106334"},"PeriodicalIF":4.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146167897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A systematic review of the causes of morbidity data quality issues 系统回顾发病原因的数据质量问题。

IF 4.1 2区医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Medical Informatics

Pub Date : 2026-02-05 DOI: 10.1016/j.ijmedinf.2026.106333

Sam Yan , Jessica Dickson , Brandon Cheong , Heather Grain , John Oldroyd

Background

The quality of hospital morbidity data collected with the International Classification of Diseases is unknown. A systematic review of the causes of morbidity data quality issues is urgently needed.

Objectives

We aimed to systematically identify and investigate the root causes of issues associated with hospital morbidity data collected using the International Classification of Diseases 10th edition, Australian Modification (ICD-10-AM) and Australian Classification of Health Interventions (ACHI).

Methods

This review included studies related to morbidity data collection issues arising from using ICD-10-AM and ACHI from Scopus, Embase, Medline and other data sources from 2017 to January 2025 in English. The quality of included studies was assessed using SQUIRE and STROBE checklists. A narrative synthesis was undertaken with themes and sub-categories of issues identified. The review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 Statement.

Results

Fifty-two studies were included, 37 from Australia, 3 from Canada, 2 each from Ireland and New Zealand, and 1 each from France, Germany, Turkey, US. Four themes were identified: 1) quality issues in standards, 2) technology, 3) education and training, and 4) issues related to clinical practice. There exists ambiguity in standards due to optional guidelines in data processing and jurisdictional differences. The standards do not provide sufficient granularity for precise disease identification. The standards are not capable of linking complex diagnostic, causal and procedural relationships and are leading to technical and other categories of issues. The complexity of issues associated with the standard leads to insufficient training resources for staff worldwide. Fragmented information structure and changes in clinical documentation rules lead to inconsistent coding.

Interpretation.

The root causes of the morbidity data collection errors are mainly associated with the quality of the standards. Further research is needed to address the root causes of morbidity data quality issues, including the structure of data capture systems and the use of more consistent approaches to standards writing, such as those applied by the International Organisation for Standardisation (ISO), which is not investigated by this research.

背景：使用国际疾病分类收集的医院发病率数据的质量尚不清楚。迫切需要对发病原因、数据质量问题进行系统审查。目的：我们旨在系统地识别和调查与使用国际疾病分类第10版、澳大利亚修订（ICD-10-AM）和澳大利亚健康干预分类（ACHI）收集的医院发病率数据相关的问题的根本原因。方法：本综述纳入2017年至2025年1月Scopus、Embase、Medline等数据源中使用ICD-10-AM和ACHI引起的发病率数据收集问题的相关研究。采用SQUIRE和STROBE检查表评估纳入研究的质量。对确定的主题和问题分类别进行了叙述综合。该综述遵循了系统评价和荟萃分析（PRISMA） 2020声明的首选报告项目。结果：共纳入52项研究，其中澳大利亚37项，加拿大3项，爱尔兰和新西兰各2项，法国、德国、土耳其、美国各1项。确定了四个主题：1)标准中的质量问题，2)技术，3)教育和培训，以及4)与临床实践相关的问题。由于数据处理的可选准则和管辖权差异，标准存在歧义。这些标准没有为精确的疾病识别提供足够的粒度。这些标准不能将复杂的诊断、因果和程序关系联系起来，并导致技术和其他类别的问题。与该标准有关的问题的复杂性导致全世界工作人员的培训资源不足。信息结构的碎片化和临床文件规则的变化导致编码不一致。解释：发病率数据收集错误的根本原因主要与标准的质量有关。需要进一步的研究来解决发病率数据质量问题的根本原因，包括数据捕获系统的结构和使用更一致的标准编写方法，例如国际标准化组织（ISO）所应用的方法，本研究未对此进行调查。

{"title":"A systematic review of the causes of morbidity data quality issues","authors":"Sam Yan , Jessica Dickson , Brandon Cheong , Heather Grain , John Oldroyd","doi":"10.1016/j.ijmedinf.2026.106333","DOIUrl":"10.1016/j.ijmedinf.2026.106333","url":null,"abstract":"<div><h3>Background</h3><div>The quality of hospital morbidity data collected with the International Classification of Diseases is unknown. A systematic review of the causes of morbidity data quality issues is urgently needed.</div></div><div><h3>Objectives</h3><div>We aimed to systematically identify and investigate the root causes of issues associated with hospital morbidity data collected using the International Classification of Diseases 10th edition, Australian Modification (ICD-10-AM) and Australian Classification of Health Interventions (ACHI).</div></div><div><h3>Methods</h3><div>This review included studies related to morbidity data collection issues arising from using ICD-10-AM and ACHI from Scopus, Embase, Medline and other data sources from 2017 to January 2025 in English. The quality of included studies was assessed using SQUIRE and STROBE checklists. A narrative synthesis was undertaken with themes and sub-categories of issues identified. The review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 Statement.</div></div><div><h3>Results</h3><div>Fifty-two studies were included, 37 from Australia, 3 from Canada, 2 each from Ireland and New Zealand, and 1 each from France, Germany, Turkey, US. Four themes were identified: 1) quality issues in standards, 2) technology, 3) education and training, and 4) issues related to clinical practice. There exists ambiguity in standards due to optional guidelines in data processing and jurisdictional differences. The standards do not provide sufficient granularity for precise disease identification. The standards are not capable of linking complex diagnostic, causal and procedural relationships and are leading to technical and other categories of issues. The complexity of issues associated with the standard leads to insufficient training resources for staff worldwide. Fragmented information structure and changes in clinical documentation rules lead to inconsistent coding.</div><div>Interpretation.</div><div>The root causes of the morbidity data collection errors are mainly associated with the quality of the standards. Further research is needed to address the root causes of morbidity data quality issues, including the structure of data capture systems and the use of more consistent approaches to standards writing, such as those applied by the International Organisation for Standardisation (ISO), which is not investigated by this research.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"211 ","pages":"Article 106333"},"PeriodicalIF":4.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146167939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The state of standardized musculoskeletal terminology for healthcare reuse:A scoping review 医疗保健重用的标准化肌肉骨骼术语的现状：范围审查。

IF 4.1 2区医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Medical Informatics

Pub Date : 2026-02-04 DOI: 10.1016/j.ijmedinf.2026.106318

Melinda Wassell , Kerryn Butler-Henderson , Peter McCann , Henry Pollard , Salma Arabi , Wei Wang , Karin Verspoor

Objective

Standardizing terminology offers opportunities for improved communication and care outcomes. With increasing adoption of clinical terminologies, questions remain about whether they adequately capture the scope of musculoskeletal (MSK) primary care practice. This scoping review examines global development efforts on MSK-relevant standardized terminology and its implementation in clinical practice.

Methods

A scoping review was conducted of 6 databases to May 2025. Identified studies (n = 3668) were included (n = 60) if they addressed standardized terminology relevant to the MSK primary care professions of chiropractic, osteopathy, and physiotherapy. Data were extracted on use cases, documentation of MSK information, alignment with national interoperability standards, and implementation status.

Results

Global development efforts span diverse MSK domains across condition types. Five studies achieved consensus around domain-specific terms (including tendinopathies, groin pain, and weight-bearing rehabilitation); in contrast, many studies developed extensive clinical terminology sets. Most studies (82.4%) address the development of terminologies, with few yet addressing how they have been implemented into clinical practice (2.7%).

Analysis revealed MSK clinicians require documentation beyond existing core interoperability data groups, including 1) function and movement, 2) pain characteristics, 3) psychosocial factors, 4) social determinants of health (environmental factors and participation barriers), 5) intervention effectiveness and clinical outcomes, and 6) person-centered factors.

Multiple barriers emerged, including technical (EHR integration, cognitive burden), workflow (time requirements, clinical value), professional (training, profession-specific terminology), and knowledge gaps (impact on care quality).

Conclusion

Extensive terminology development has begun yet gaps exist between development and clinical adoption. Terms evolve as research evolves; therefore, MSK professions should actively engage with interoperability groups to establish hierarchical ontologies that incorporate the identified data groups and balance standardization at higher conceptual levels with flexible lexicons to enable terminology growth over time. Establishing feedback mechanisms with EHR vendors to minimize clinicians’ cognitive burden will accelerate adoption and maximize clinical value.

目的：标准化术语为改善沟通和护理结果提供了机会。随着越来越多的临床术语的采用，问题仍然存在，他们是否充分捕捉肌肉骨骼（MSK）初级保健实践的范围。这一范围审查审查了msk相关标准化术语的全球发展努力及其在临床实践中的实施。方法：对截至2025年5月的6个数据库进行范围综述。确定的研究（n = 3668）被纳入（n = 60），如果它们涉及与MSK初级保健专业（脊椎指压疗法、整骨疗法和物理疗法）相关的标准化术语。从用例、MSK信息文档、与国家互操作性标准的一致性和实施状态中提取数据。结果：全球发展努力跨越不同条件类型的MSK领域。五项研究围绕特定领域的术语（包括肌腱病变、腹股沟疼痛和负重康复）达成了共识；相比之下，许多研究开发了广泛的临床术语集。大多数研究（82.4%）涉及术语的发展，很少有研究涉及如何将术语应用于临床实践（2.7%）。分析显示，MSK临床医生需要的文件超出了现有的核心互操作性数据组，包括1)功能和运动，2)疼痛特征，3)心理社会因素，4)健康的社会决定因素（环境因素和参与障碍），5)干预效果和临床结果，以及6)以人为中心的因素。出现了多种障碍，包括技术（EHR整合、认知负担）、工作流程（时间要求、临床价值）、专业（培训、专业特定术语）和知识差距（对护理质量的影响）。结论：广泛的术语开发已经开始，但在开发和临床应用之间存在差距。术语随着研究的发展而演变；因此，MSK专业人员应积极参与互操作性组，以建立包含已识别数据组的分层本体，并在更高的概念级别上平衡标准化和灵活的词汇，以支持术语随着时间的推移而增长。与电子病历供应商建立反馈机制，以减少临床医生的认知负担，将加速采用和最大化临床价值。

{"title":"The state of standardized musculoskeletal terminology for healthcare reuse:A scoping review","authors":"Melinda Wassell , Kerryn Butler-Henderson , Peter McCann , Henry Pollard , Salma Arabi , Wei Wang , Karin Verspoor","doi":"10.1016/j.ijmedinf.2026.106318","DOIUrl":"10.1016/j.ijmedinf.2026.106318","url":null,"abstract":"<div><h3>Objective</h3><div>Standardizing terminology offers opportunities for improved communication and care outcomes. With increasing adoption of clinical terminologies, questions remain about whether they adequately capture the scope of musculoskeletal (MSK) primary care practice. This scoping review examines global development efforts on MSK-relevant standardized terminology and its implementation in clinical practice.</div></div><div><h3>Methods</h3><div>A scoping review was conducted of 6 databases to May 2025. Identified studies (n = 3668) were included (n = 60) if they addressed standardized terminology relevant to the MSK primary care professions of chiropractic, osteopathy, and physiotherapy. Data were extracted on use cases, documentation of MSK information, alignment with national interoperability standards, and implementation status.</div></div><div><h3>Results</h3><div>Global development efforts span diverse MSK domains across condition types. Five studies achieved consensus around domain-specific terms (including tendinopathies, groin pain, and weight-bearing rehabilitation); in contrast, many studies developed extensive clinical terminology sets. Most studies (82.4%) address the development of terminologies, with few yet addressing how they have been implemented into clinical practice (2.7%).</div><div>Analysis revealed MSK clinicians require documentation beyond existing core interoperability data groups, including 1) function and movement, 2) pain characteristics, 3) psychosocial factors, 4) social determinants of health (environmental factors and participation barriers), 5) intervention effectiveness and clinical outcomes, and 6) person-centered factors.</div><div>Multiple barriers emerged, including technical (EHR integration, cognitive burden), workflow (time requirements, clinical value), professional (training, profession-specific terminology), and knowledge gaps (impact on care quality).</div></div><div><h3>Conclusion</h3><div>Extensive terminology development has begun yet gaps exist between development and clinical adoption. Terms evolve as research evolves; therefore, MSK professions should actively engage with interoperability groups to establish hierarchical ontologies that incorporate the identified data groups and balance standardization at higher conceptual levels with flexible lexicons to enable terminology growth over time. Establishing feedback mechanisms with EHR vendors to minimize clinicians’ cognitive burden will accelerate adoption and maximize clinical value.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"211 ","pages":"Article 106318"},"PeriodicalIF":4.1,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146159304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Clinical-radiological machine learning model for non-invasive diagnosis and stratification of peripheral artery disease: a multicenter study 外周动脉疾病无创诊断和分层的临床-放射学机器学习模型：一项多中心研究。

IF 4.1 2区医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Medical Informatics

Pub Date : 2026-02-03 DOI: 10.1016/j.ijmedinf.2026.106338

Bowen Hou , Jinhan Qiao , Zheng Ran , Yitong Li , Zhongyichen Huang , Xiaolong Luo , Xiaoming Li

Background and Objective

Peripheral artery disease (PAD) is an atherosclerotic disorder prevalent in the elderly that leads to peripheral function decline and body composition changes. Current diagnostic approaches lack sensitivity for early PAD detection and staging. This study aimed to develop and validate machine learning (ML) models of clinical and CT-based radiological features to improve PAD diagnosis and severity stratification.

Methods

A retrospective multicenter study was conducted using data from two institutions. Clinical and radiological features (including volumetric body composition and muscle texture parameters extracted from calf and thigh segments) were analyzed. Participants were randomly divided into training (70%) and test (30%) sets, stratified by PAD status. Models with different ML algorithms were developed and compared. Model interpretability was assessed with Shapley additive explanation (SHAP) analysis, and performance was evaluated through receiver operating characteristic analysis, Hosmer-Lemeshow testing, Brier score and calibration curves.

Results

This study comprised 342 participants, divided into training (n = 176), test set (n = 76) from Institute 1, external validation (n = 90) from Institute 2. Three models were developed: clinical model (CM), radiological model (RM), and combined clinical-radiological model (CRM). The calf-based CRM using random forest algorithm achieved area under the curves of 0.871 (training), 0.870 (test), and 0.828 (validation), demonstrating good calibration (p ≥ 0.05) and the low Brier score. SHAP analysis visually interpreted feature contributions toward PAD diagnosis and staging.

Conclusions

The CRM model effectively integrated calf-derived radiological and clinical features into a noninvasive, interpretable tool for PAD diagnosis and severity stratification, demonstrating strong clinical applicability.

背景与目的：外周动脉疾病（PAD）是一种常见于老年人的动脉粥样硬化性疾病，可导致外周功能下降和身体成分改变。目前的诊断方法缺乏对早期PAD检测和分期的敏感性。本研究旨在开发和验证临床和基于ct的放射学特征的机器学习（ML）模型，以改善PAD的诊断和严重程度分层。方法：采用两所机构的资料进行回顾性多中心研究。分析临床和放射学特征（包括从小腿和大腿段提取的体积体组成和肌肉纹理参数）。参与者随机分为训练组（70%）和测试组（30%），按PAD状态分层。开发了不同ML算法的模型并进行了比较。采用Shapley加性解释（SHAP）分析评价模型的可解释性，通过受试者工作特征分析、Hosmer-Lemeshow检验、Brier评分和校准曲线评价模型的性能。结果：本研究共纳入342名参与者，分为训练组（n = 176），测试组（n = 76）来自研究所1，外部验证组（n = 90）来自研究所2。建立了临床模型（CM）、放射学模型（RM）和临床-放射学联合模型（CRM）。采用随机森林算法的基于小牛的CRM曲线下面积分别为0.871（训练）、0.870（检验）和0.828（验证），具有较好的校正效果（p≥0.05）和较低的Brier评分。SHAP分析直观地解释了对PAD诊断和分期的特征贡献。结论：CRM模型有效地将小牛衍生的影像学和临床特征整合为一种无创、可解释的PAD诊断和严重程度分层工具，具有很强的临床适用性。

{"title":"Clinical-radiological machine learning model for non-invasive diagnosis and stratification of peripheral artery disease: a multicenter study","authors":"Bowen Hou , Jinhan Qiao , Zheng Ran , Yitong Li , Zhongyichen Huang , Xiaolong Luo , Xiaoming Li","doi":"10.1016/j.ijmedinf.2026.106338","DOIUrl":"10.1016/j.ijmedinf.2026.106338","url":null,"abstract":"<div><h3>Background and Objective</h3><div>Peripheral artery disease (PAD) is an atherosclerotic disorder prevalent in the elderly that leads to peripheral function decline and body composition changes. Current diagnostic approaches lack sensitivity for early PAD detection and staging. This study aimed to develop and validate machine learning (ML) models of clinical and CT-based radiological features to improve PAD diagnosis and severity stratification.</div></div><div><h3>Methods</h3><div>A retrospective multicenter study was conducted using data from two institutions. Clinical and radiological features (including volumetric body composition and muscle texture parameters extracted from calf and thigh segments) were analyzed. Participants were randomly divided into training (70%) and test (30%) sets, stratified by PAD status. Models with different ML algorithms were developed and compared. Model interpretability was assessed with Shapley additive explanation (SHAP) analysis, and performance was evaluated through receiver operating characteristic analysis, Hosmer-Lemeshow testing, Brier score and calibration curves.</div></div><div><h3>Results</h3><div>This study comprised 342 participants, divided into training (n = 176), test set (n = 76) from Institute 1, external validation (n = 90) from Institute 2. Three models were developed: clinical model (CM), radiological model (RM), and combined clinical-radiological model (CRM). The calf-based CRM using random forest algorithm achieved area under the curves of 0.871 (training), 0.870 (test), and 0.828 (validation), demonstrating good calibration (<em>p</em> ≥ 0.05) and the low Brier score. SHAP analysis visually interpreted feature contributions toward PAD diagnosis and staging.</div></div><div><h3>Conclusions</h3><div>The CRM model effectively integrated calf-derived radiological and clinical features into a noninvasive, interpretable tool for PAD diagnosis and severity stratification, demonstrating strong clinical applicability.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"211 ","pages":"Article 106338"},"PeriodicalIF":4.1,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146133629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Applying a statistical model-based AI method to identify prognostic factors for long-term cognitive decline in Alzheimer’s disease: Evidence from pooled placebo data of four phase III trials 应用基于统计模型的人工智能方法识别阿尔茨海默病长期认知能力下降的预后因素：来自四个III期试验安慰剂数据汇总的证据

IF 4.1 2区医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Medical Informatics

Pub Date : 2026-02-03 DOI: 10.1016/j.ijmedinf.2026.106337

Ryoichi Hanazawa , Hiroyuki Sato , Keisuke Suzuki , Akihiro Hirakawa

Background

Heterogeneity in the long-term progression of Alzheimer’s disease (AD) challenges the efficiency of clinical trials. Identifying long-term prognostic factors is critical for enhancing trial efficiency, although it has been limited by the lack of appropriate statistical approaches. We applied a recently developed statistical model-based AI method to identify the baseline prognostic factors for long-term cognitive decline in a clinical trial population.

Methods

We analyzed pooled placebo arm data (N = 1,597) from four Phase III trials in patients with mild-to-moderate AD. Long-term trajectories for the Mini-Mental State Examination (MMSE), 11- and 14-item versions of the Alzheimer’s Disease Assessment Scale-Cognitive subscale (ADAS-Cog11, ADAS-Cog14), and Clinical Dementia Rating-Sum of Boxes (CDR-SB) were predicted from their short-term data (≤80 weeks). Trajectories were compared between subgroups defined by six baseline factors (age, sex, apolipoprotein E ε4 [APOE ε4] status, years of education, years from diagnosis, and years from disease onset) using the area under the curve (AUC).

Results

Longer years of education (≥13 years) was the most robust predictor associated with faster progression across all four outcomes (e.g., for 20-year ADAS-Cog11, AUC ratio, 1.11, p < 0.001). Younger age (<74 years) was associated with a faster decline in MMSE and ADAS-Cog scores, but not in CDR-SB. APOE ε4 status, sex, years from diagnosis, and years from disease onset were not significantly associated with long-term progression.

Conclusions

Baseline educational level and age were significant prognostic factors of long-term cognitive decline. These findings will help optimize patient stratification in future clinical trials on AD.

阿尔茨海默病（AD）长期进展的异质性对临床试验的效率提出了挑战。确定长期预后因素对于提高试验效率至关重要，尽管由于缺乏适当的统计方法而受到限制。我们应用了最近开发的基于统计模型的人工智能方法来确定临床试验人群长期认知能力下降的基线预后因素。方法：我们分析了来自4个轻至中度AD患者的III期临床试验的安慰剂组数据（N = 1597）。根据短期数据（≤80周）预测迷你精神状态检查（MMSE）、11项和14项版本的阿尔茨海默病评估量表-认知亚量表（ADAS-Cog11、ADAS-Cog14）和临床痴呆评分-盒和（CDR-SB）的长期轨迹。使用曲线下面积（AUC）比较由六个基线因素（年龄、性别、载脂蛋白E ε4 [APOE ε4]状态、受教育年限、诊断年限和发病年限）定义的亚组之间的轨迹。结果：较长的受教育年限（≥13年）是与所有四种结局（例如，对于20年adas， cog11， AUC比为1.11,p < 0.001）中更快进展相关的最可靠预测因子。年龄较小（74岁）与MMSE和ADAS-Cog评分下降较快相关，但与CDR-SB无关。APOE ε4状态、性别、诊断年龄和发病年龄与长期进展无显著相关。结论基线受教育程度和年龄是长期认知能力下降的重要预后因素。这些发现将有助于在未来的阿尔茨海默病临床试验中优化患者分层。

{"title":"Applying a statistical model-based AI method to identify prognostic factors for long-term cognitive decline in Alzheimer’s disease: Evidence from pooled placebo data of four phase III trials","authors":"Ryoichi Hanazawa , Hiroyuki Sato , Keisuke Suzuki , Akihiro Hirakawa","doi":"10.1016/j.ijmedinf.2026.106337","DOIUrl":"10.1016/j.ijmedinf.2026.106337","url":null,"abstract":"<div><h3>Background</h3><div>Heterogeneity in the long-term progression of Alzheimer’s disease (AD) challenges the efficiency of clinical trials. Identifying long-term prognostic factors is critical for enhancing trial efficiency, although it has been limited by the lack of appropriate statistical approaches. We applied a recently developed statistical model-based AI method to identify the baseline prognostic factors for long-term cognitive decline in a clinical trial population.</div></div><div><h3>Methods</h3><div>We analyzed pooled placebo arm data (N = 1,597) from four Phase III trials in patients with mild-to-moderate AD. Long-term trajectories for the Mini-Mental State Examination (MMSE), 11- and 14-item versions of the Alzheimer’s Disease Assessment Scale-Cognitive subscale (ADAS-Cog11, ADAS-Cog14), and Clinical Dementia Rating-Sum of Boxes (CDR-SB) were predicted from their short-term data (≤80 weeks). Trajectories were compared between subgroups defined by six baseline factors (age, sex, <em>apolipoprotein E ε4</em> [<em>APOE ε4</em>] status, years of education, years from diagnosis, and years from disease onset) using the area under the curve (AUC).</div></div><div><h3>Results</h3><div>Longer years of education (≥13 years) was the most robust predictor associated with faster progression across all four outcomes (e.g., for 20-year ADAS-Cog11, AUC ratio, 1.11, p < 0.001). Younger age (<74 years) was associated with a faster decline in MMSE and ADAS-Cog scores, but not in CDR-SB. <em>APOE ε4</em> status, sex, years from diagnosis, and years from disease onset were not significantly associated with long-term progression.</div></div><div><h3>Conclusions</h3><div>Baseline educational level and age were significant prognostic factors of long-term cognitive decline. These findings will help optimize patient stratification in future clinical trials on AD.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"211 ","pages":"Article 106337"},"PeriodicalIF":4.1,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146190933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Harmonizing patient-reported outcome measures for nasal complaints using traditional and machine learning methods 使用传统和机器学习方法协调患者报告的鼻部投诉结果测量。

IF 4.1 2区医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Medical Informatics

Pub Date : 2026-02-02 DOI: 10.1016/j.ijmedinf.2026.106319

Miljan Jović , Esther Hof , Maryam Amir Haeri , Jasper J. Hoorweg , Stéphanie M. van den Berg

Background

Nasal obstruction measurement instruments are widely used in the field of nasal surgery. There are various scales that measure nasal obstruction and they differ regarding the number of items, their wording, and the type of response options. In order to pool the data and analyze it together, it is necessary to harmonize it so that we can compare participants’ nasal obstruction scores irrespective of instrument they filled in. Data harmonization is still not used in the field of nasal obstruction assessment.

The Aim

The aim of this study was to find the best harmonization method in terms of predicting the scores on a target instrument based on the scores from another instrument as precise as possible in the case of four different nasal complaints instruments. A method was sought to find a transformation of scores on the NOSE, Utrecht-Q and SCHNOS that makes them equivalent to ENFAS scores.

Methods

A total of 1324 unique patients completed all four measurement instruments. We tried linear equating, Item Response Theory (IRT), and the following machine learning methods: linear regression, random forest regression, support vector machine regression, and neural network. We used the root-mean-square error (RMSE) of differences between predicted and observed scores to evaluate the quality of harmonization in 5-fold cross-validation.

Results

The ML methods gave overall the best results (the lowest RMSEs) and outperformed IRT (which is considered as a common choice for data harmonization in psychometrics).

Conclusion

The ML methods led to the best quality of the results, confirming their strong potential for data harmonization. This study shows that next to linear equating and IRT that are commonly used for data harmonization, we can also use ML methods for the same purpose and, by doing so, to even increase the quality of the harmonization in certain use cases.

背景：鼻阻塞测量仪器在鼻外科领域应用广泛。有各种各样的测量鼻塞的量表，它们在项目的数量、措辞和反应选择的类型上有所不同。为了汇集数据并对其进行分析，有必要对其进行协调，以便我们可以比较参与者的鼻塞分数，而不管他们填写的是什么仪器。数据协调仍未应用于鼻塞评估领域。目的：本研究的目的是在四种不同的鼻部抱怨仪器的情况下，根据另一种仪器的分数尽可能精确地预测目标仪器的分数，找到最佳的协调方法。寻求一种方法来找到NOSE， Utrecht-Q和SCHNOS分数的转换，使它们与ENFAS分数相等。方法：1324例特殊患者完成了所有四种测量工具。我们尝试了线性方程、项目反应理论（IRT）和以下机器学习方法：线性回归、随机森林回归、支持向量机回归和神经网络。我们使用预测和观察评分之间差异的均方根误差（RMSE）来评估5倍交叉验证的一致性质量。结果：ML方法总体上给出了最好的结果（最低rmse），并且优于IRT（这被认为是心理测量学中数据协调的常见选择）。结论：机器学习方法的结果质量最好，证实了它们在数据协调方面的强大潜力。这项研究表明，除了通常用于数据协调的线性方程和IRT之外，我们还可以使用ML方法来达到相同的目的，通过这样做，甚至可以在某些用例中提高协调的质量。

{"title":"Harmonizing patient-reported outcome measures for nasal complaints using traditional and machine learning methods","authors":"Miljan Jović , Esther Hof , Maryam Amir Haeri , Jasper J. Hoorweg , Stéphanie M. van den Berg","doi":"10.1016/j.ijmedinf.2026.106319","DOIUrl":"10.1016/j.ijmedinf.2026.106319","url":null,"abstract":"<div><h3>Background</h3><div>Nasal obstruction measurement instruments are widely used in the field of nasal surgery. There are various scales that measure nasal obstruction and they differ regarding the number of items, their wording, and the type of response options. In order to pool the data and analyze it together, it is necessary to harmonize it so that we can compare participants’ nasal obstruction scores irrespective of instrument they filled in. Data harmonization is still not used in the field of nasal obstruction assessment.</div></div><div><h3>The Aim</h3><div>The aim of this study was to find the best harmonization method in terms of predicting the scores on a target instrument based on the scores from another instrument as precise as possible in the case of four different nasal complaints instruments. A method was sought to find a transformation of scores on the NOSE, Utrecht-Q and SCHNOS that makes them equivalent to ENFAS scores.</div></div><div><h3>Methods</h3><div>A total of 1324 unique patients completed all four measurement instruments. We tried linear equating, Item Response Theory (IRT), and the following machine learning methods: linear regression, random forest regression, support vector machine regression, and neural network. We used the root-mean-square error (RMSE) of differences between predicted and observed scores to evaluate the quality of harmonization in 5-fold cross-validation.</div></div><div><h3>Results</h3><div>The ML methods gave overall the best results (the lowest RMSEs) and outperformed IRT (which is considered as a common choice for data harmonization in psychometrics).</div></div><div><h3>Conclusion</h3><div>The ML methods led to the best quality of the results, confirming their strong potential for data harmonization. This study shows that next to linear equating and IRT that are commonly used for data harmonization, we can also use ML methods for the same purpose and, by doing so, to even increase the quality of the harmonization in certain use cases.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"211 ","pages":"Article 106319"},"PeriodicalIF":4.1,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146133655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Explainable AI in Cardiology Diagnostics: A Systematic Review of Machine Learning, Meta-heuristic Optimization, and Clinical Text Mining for Coronary Artery Disease 心脏病诊断中可解释的人工智能：冠状动脉疾病机器学习、元启发式优化和临床文本挖掘的系统综述。

IF 4.1 2区医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Medical Informatics

Pub Date : 2026-02-02 DOI: 10.1016/j.ijmedinf.2026.106321

Majdi Jaradat , Mohammed Awad

Background

This systematic review compiles evidence and examines how various artificial intelligence (AI) approaches, including machine learning (ML), natural language processing (NLP), meta-heuristic optimization, and explainable AI (XAI), are utilized to predict and diagnose coronary artery disease (CAD). We aim to identify the most commonly used models, evaluate their performance, and explore how interpretability and optimization enhance their usefulness in clinical practice.

Method

A thorough search was conducted across five major databases (PubMed, Scopus, IEEE Xplore, ACM Digital Library, and SpringerLink) to identify relevant studies published between January 2022 and August 2025, in accordance with the PRISMA guidelines. Dual independent reviewers performed study selection and data extraction. The quality of the included studies was evaluated using a checklist based on QUADAS-2. Data were collected on study characteristics, model types, validation methods, and performance metrics, which will be the cornerstone of the analysis.

Results

Sixty-one studies met the inclusion criteria. ML and deep learning models demonstrated strong performance and achieved high accuracy in benchmark datasets, but showed limited clinical validation. Transformer-based models (e.g., BioBERT, ClinicalBERT) showed high efficacy for medical text analysis, but require substantial data and computational resources. Meta-heuristic algorithms (e.g., Genetic Algorithms, Particle Swarm Optimization) effectively improved model efficiency but were rarely applied to unstructured clinical narratives. XAI tools (e.g., SHAP, LIME) improved model transparency, though most studies highlight a need for more rigorous evaluation.

Conclusion

Integrated ML, NLP, meta-heuristic optimization, and XAI hold significant promise in advancing the diagnosis of CAD by improving both accuracy and interpretability. However, challenges such as data scarcity, limited external validation, and a lack of standardized, clinician-centric explainability impede clinical adoption. Future research should focus on hybrid frameworks validated for large, diverse, and real-world datasets.

背景：本系统综述收集证据并研究了各种人工智能（AI）方法，包括机器学习（ML）、自然语言处理（NLP）、元启发式优化和可解释人工智能（XAI），如何用于预测和诊断冠状动脉疾病（CAD）。我们的目标是确定最常用的模型，评估它们的性能，并探索如何可解释性和优化增强它们在临床实践中的有用性。方法：根据PRISMA指南，在五个主要数据库（PubMed, Scopus, IEEE Xplore， ACM Digital Library和SpringerLink）中进行了彻底的检索，以确定2022年1月至2025年8月期间发表的相关研究。双独立审稿人进行研究选择和数据提取。采用基于QUADAS-2的检查表对纳入研究的质量进行评估。收集有关研究特征、模型类型、验证方法和性能度量的数据，这将是分析的基石。结果：61项研究符合纳入标准。ML和深度学习模型在基准数据集中表现出很强的性能和较高的准确性，但临床验证有限。基于转换器的模型（如BioBERT、ClinicalBERT）在医学文本分析中显示出很高的效率，但需要大量的数据和计算资源。元启发式算法（如遗传算法、粒子群优化）有效地提高了模型效率，但很少应用于非结构化临床叙述。XAI工具（例如，SHAP， LIME）提高了模型的透明度，尽管大多数研究强调需要更严格的评估。结论：整合ML、NLP、元启发式优化和XAI，通过提高准确性和可解释性，在推进CAD诊断方面具有重要的前景。然而，诸如数据稀缺、有限的外部验证以及缺乏标准化、以临床为中心的可解释性等挑战阻碍了临床应用。未来的研究应该集中在大型、多样化和真实世界数据集验证的混合框架上。

{"title":"Explainable AI in Cardiology Diagnostics: A Systematic Review of Machine Learning, Meta-heuristic Optimization, and Clinical Text Mining for Coronary Artery Disease","authors":"Majdi Jaradat , Mohammed Awad","doi":"10.1016/j.ijmedinf.2026.106321","DOIUrl":"10.1016/j.ijmedinf.2026.106321","url":null,"abstract":"<div><h3>Background</h3><div>This systematic review compiles evidence and examines how various artificial intelligence (AI) approaches, including machine learning (ML), natural language processing (NLP), <em>meta</em>-heuristic optimization, and explainable AI (XAI), are utilized to predict and diagnose coronary artery disease (CAD). We aim to identify the most commonly used models, evaluate their performance, and explore how interpretability and optimization enhance their usefulness in clinical practice.</div></div><div><h3>Method</h3><div>A thorough search was conducted across five major databases (PubMed, Scopus, IEEE Xplore, ACM Digital Library, and SpringerLink) to identify relevant studies published between January 2022 and August 2025, in accordance with the PRISMA guidelines. Dual independent reviewers performed study selection and data extraction. The quality of the included studies was evaluated using a checklist based on QUADAS-2. Data were collected on study characteristics, model types, validation methods, and performance metrics, which will be the cornerstone of the analysis.</div></div><div><h3>Results</h3><div>Sixty-one studies met the inclusion criteria. ML and deep learning models demonstrated strong performance and achieved high accuracy in benchmark datasets, but showed limited clinical validation. Transformer-based models (e.g., BioBERT, ClinicalBERT) showed high efficacy for medical text analysis, but require substantial data and computational resources. Meta-heuristic algorithms (e.g., Genetic Algorithms, Particle Swarm Optimization) effectively improved model efficiency but were rarely applied to unstructured clinical narratives. XAI tools (e.g., SHAP, LIME) improved model transparency, though most studies highlight a need for more rigorous evaluation.</div></div><div><h3>Conclusion</h3><div>Integrated ML, NLP, <em>meta</em>-heuristic optimization, and XAI hold significant promise in advancing the diagnosis of CAD by improving both accuracy and interpretability. However, challenges such as data scarcity, limited external validation, and a lack of standardized, clinician-centric explainability impede clinical adoption. Future research should focus on hybrid frameworks validated for large, diverse, and real-world datasets.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"211 ","pages":"Article 106321"},"PeriodicalIF":4.1,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146137980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Using causal rule mining to identify opportunities for value improvement in regional CABG care: A proof-of-concept study 使用因果规则挖掘来识别区域CABG护理价值改进的机会：一项概念验证研究。

IF 4.1 2区医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Medical Informatics

Pub Date : 2026-02-01 DOI: 10.1016/j.ijmedinf.2026.106317

Sophie van Heuveln , Gijs J. van Steenbergen , Mileen R.D. van de Kar , Erwin S.H. Tan , Mohamed A. Soliman-Hamad Veghel , Rik Eshuis , Lukas R.C. Dekker , Dennis van Veghel

Objective

To explore the potential of causal rule mining (CRM) as a complementary method to outcome monitoring in identifying plausible causal patterns that may explain undesired clinical outcomes or elevated care consumption in cardiac surgery.

Methods

In this proof-of-concept study, CRM was applied to data from 1,068 patients who underwent elective isolated coronary artery bypass grafting between January 2016 and March 2021 at a single heart center and its referral network in the Netherlands. Outcomes of interest included: 1-year and 120-day mortality, in-hospital stroke, 30-day deep sternal wound infection (DSWI), 30-day re-explorations, 1-year coronary reinterventions, event-free survival, 30-day emergency department (ED) visits, postoperative length of stay, and preoperative fractional flow reserve (FFR) testing. Causal rules were considered relevant if both the odds ratio (OR) and its 95 % confidence interval (CI) were > 1. Identified rules were independently reviewed by clinical experts.

Results

CRM identified 114 significant rules. Five rules were rated as ‘new and interesting’ and two additional rules were included based on special interest. In follow-up discussions, clinical experts agreed that three rules warrant further clinical investigation: (1) the absence of fractional flow reserve (FFR) testing reducing the likelihood of coronary reintervention, (2) absence of red blood cell (RBC) transfusion during admission reducing the likelihood of 30-day re-explorations, and (3) RBC transfusion increasing the likelihood of 30-day re-explorations.

Conclusion

CRM helped identify potential explanations for certain outcomes and care consumption, providing structured input for hypothesis-driven quality improvement and supporting efforts to enhance patient value.

目的：探讨因果规则挖掘（CRM）作为结果监测的补充方法的潜力，以识别可能解释心脏手术中不良临床结果或护理消耗增加的合理因果模式。方法：在这项概念验证研究中，CRM应用于2016年1月至2021年3月期间在荷兰单一心脏中心及其转诊网络接受选择性孤立冠状动脉旁路移植术的1,068例患者的数据。研究结果包括：1年和120天死亡率、院内卒中、30天深胸骨伤口感染（DSWI）、30天再探查、1年冠状动脉再介入、无事件生存、30天急诊科（ED）就诊、术后住院时间和术前血流储备分数（FFR）测试。如果比值比（OR）及其95% %置信区间（CI）均为 > 1，则认为因果规则相关。确定的规则由临床专家独立审查。结果：CRM识别出114条重要规则。其中5条规则被评为“新颖有趣”，另外2条规则被评为“特殊兴趣”。在后续讨论中，临床专家一致认为有三条规则值得进一步的临床研究：(1)缺乏分数血流储备（FFR）测试降低了冠状动脉再介入的可能性，(2)入院时缺乏红细胞（RBC）输血降低了30天再探查的可能性，(3)红细胞输血增加了30天再探查的可能性。结论：客户关系管理有助于确定某些结果和护理消费的潜在解释，为假设驱动的质量改进提供结构化输入，并支持提高患者价值的努力。

{"title":"Using causal rule mining to identify opportunities for value improvement in regional CABG care: A proof-of-concept study","authors":"Sophie van Heuveln , Gijs J. van Steenbergen , Mileen R.D. van de Kar , Erwin S.H. Tan , Mohamed A. Soliman-Hamad Veghel , Rik Eshuis , Lukas R.C. Dekker , Dennis van Veghel","doi":"10.1016/j.ijmedinf.2026.106317","DOIUrl":"10.1016/j.ijmedinf.2026.106317","url":null,"abstract":"<div><h3>Objective</h3><div>To explore the potential of causal rule mining (CRM) as a complementary method to outcome monitoring in identifying plausible causal patterns that may explain undesired clinical outcomes or elevated care consumption in cardiac surgery.</div></div><div><h3>Methods</h3><div>In this proof-of-concept study, CRM was applied to data from 1,068 patients who underwent elective isolated coronary artery bypass grafting between January 2016 and March 2021 at a single heart center and its referral network in the Netherlands. Outcomes of interest included: 1-year and 120-day mortality, in-hospital stroke, 30-day deep sternal wound infection (DSWI), 30-day re-explorations, 1-year coronary reinterventions, event-free survival, 30-day emergency department (ED) visits, postoperative length of stay, and preoperative fractional flow reserve (FFR) testing. Causal rules were considered relevant if both the odds ratio (OR) and its 95 % confidence interval (CI) were > 1. Identified rules were independently reviewed by clinical experts.</div></div><div><h3>Results</h3><div>CRM identified 114 significant rules. Five rules were rated as ‘new and interesting’ and two additional rules were included based on special interest. In follow-up discussions, clinical experts agreed that three rules warrant further clinical investigation: (1) the absence of fractional flow reserve (FFR) testing reducing the likelihood of coronary reintervention, (2) absence of red blood cell (RBC) transfusion during admission reducing the likelihood of 30-day re-explorations, and (3) RBC transfusion increasing the likelihood of 30-day re-explorations.</div></div><div><h3>Conclusion</h3><div>CRM helped identify potential explanations for certain outcomes and care consumption, providing structured input for hypothesis-driven quality improvement and supporting efforts to enhance patient value.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"211 ","pages":"Article 106317"},"PeriodicalIF":4.1,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146121141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0