首页 > 最新文献

International Journal of Medical Informatics最新文献

英文 中文
Machine learning in stroke and its sequelae: a narrative review of clinical applications and emerging trends 中风及其后遗症中的机器学习:临床应用和新趋势的叙述性回顾
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-27 DOI: 10.1016/j.ijmedinf.2025.106245
Hengjun Liu , Tianwei Meng , Rui Qie

Objective

This narrative review synthesizes machine learning (ML) applications across the stroke and post-stroke continuum from acute imaging and diagnosis to long-term sequelae prognosis and rehabilitation.

Method

We searched PubMed, Embase, and WOS from inception to October 17, 2025, for a comprehensive review. We used a combination of search terms, including “machine learning,” “deep learning,” “post stroke.” These terms were carefully selected to capture a wide range of relevant studies and articles related to stroke and ML.

Results

ML has been successfully deployed in six core domains: Image reading, where deep learning enables automated lesion segmentation on MRI/CT and prediction of tissue fate; Diagnosis, including etiology, atrial fibrillation screening; Overall prognosis, with high-accuracy models for functional outcome, mortality, and readmission; Sequelae prediction, such as cognitive impairment, motor dysfunction, aphasia, depression, fatigue, and organ diseases; Treatment response, including outcome prediction after thrombectomy and rehabilitation; Rehabilitation monitoring, using wearable sensors and robotics for objective, granular assessment of motor recovery. A clear trend toward multimodal data integration and model interpretability was observed, enhancing both predictive power and biological plausibility.

Conclusion

ML has evolved from a research tool into a transformative force in stroke care, enabling precise, individualized prediction and monitoring across the entire post-stroke trajectory. Future efforts must prioritize prospective validation, standardized reporting, and seamless integration into clinical workflows to realize its full potential for precision medicine.
目的本文综述了机器学习(ML)在中风和中风后连续体中的应用,从急性成像和诊断到长期后遗症预后和康复。方法检索PubMed、Embase、WOS自成立至2025年10月17日的文献进行综合评价。我们使用了一系列搜索词,包括“机器学习”、“深度学习”、“中风后”。这些术语经过精心挑选,以捕获与中风和ml相关的广泛相关研究和文章。结果sml已成功应用于六个核心领域:图像读取,其中深度学习能够在MRI/CT上自动分割病变并预测组织命运;诊断,包括病因、房颤筛查;总体预后,功能结局、死亡率和再入院的高精度模型;后遗症预测,如认知障碍、运动功能障碍、失语、抑郁、疲劳和器官疾病;治疗反应,包括取栓和康复后的预后预测;康复监测,使用可穿戴传感器和机器人技术对运动恢复进行客观、细致的评估。观察到多模态数据集成和模型可解释性的明显趋势,增强了预测能力和生物学合理性。ml已经从一种研究工具发展成为卒中护理的变革力量,能够在整个卒中后轨迹中实现精确、个性化的预测和监测。未来的工作必须优先考虑前瞻性验证、标准化报告和无缝集成到临床工作流程中,以实现其在精准医疗方面的全部潜力。
{"title":"Machine learning in stroke and its sequelae: a narrative review of clinical applications and emerging trends","authors":"Hengjun Liu ,&nbsp;Tianwei Meng ,&nbsp;Rui Qie","doi":"10.1016/j.ijmedinf.2025.106245","DOIUrl":"10.1016/j.ijmedinf.2025.106245","url":null,"abstract":"<div><h3>Objective</h3><div>This narrative review synthesizes machine learning (ML) applications across the stroke and post-stroke continuum from acute imaging and diagnosis to long-term sequelae prognosis and rehabilitation.</div></div><div><h3>Method</h3><div>We searched PubMed, Embase, and WOS from inception to October 17, 2025, for a comprehensive review. We used a combination of search terms, including “machine learning,” “deep learning,” “post stroke.” These terms were carefully selected to capture a wide range of relevant studies and articles related to stroke and ML.</div></div><div><h3>Results</h3><div>ML has been successfully deployed in six core domains: Image reading, where deep learning enables automated lesion segmentation on MRI/CT and prediction of tissue fate; Diagnosis, including etiology, atrial fibrillation screening; Overall prognosis, with high-accuracy models for functional outcome, mortality, and readmission; Sequelae prediction, such as cognitive impairment, motor dysfunction, aphasia, depression, fatigue, and organ diseases; Treatment response, including outcome prediction after thrombectomy and rehabilitation; Rehabilitation monitoring, using wearable sensors and robotics for objective, granular assessment of motor recovery. A clear trend toward multimodal data integration and model interpretability was observed, enhancing both predictive power and biological plausibility.</div></div><div><h3>Conclusion</h3><div>ML has evolved from a research tool into a transformative force in stroke care, enabling precise, individualized prediction and monitoring across the entire post-stroke trajectory. Future efforts must prioritize prospective validation, standardized reporting, and seamless integration into clinical workflows to realize its full potential for precision medicine.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106245"},"PeriodicalIF":4.1,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Construction and validation of a machine learning-based risk prediction model for invasive mechanical ventilation in AECOPD patients complicated with respiratory failure 基于机器学习的AECOPD合并呼吸衰竭患者有创机械通气风险预测模型构建与验证
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-27 DOI: 10.1016/j.ijmedinf.2025.106244
Xin Jiang , Ji Li , Jingjing Ju , Hao Ding , Sufang Yang

Objective

This study aimed to create and validate a machine learning (ML) model to predict the likelihood of invasive mechanical ventilation (IMV) in patients with acute exacerbation of chronic obstructive pulmonary disease (AECOPD) complicated by respiratory failure.

Methods

Data from patients diagnosed with AECOPD and respiratory failure were retrospectively extracted from the Medical Information Mart for Intensive Care-IV (MIMIC-IV). A total of 551 cases were split 7:3 into a training set (385 cases) for model construction and an internal validation set (166 cases). The IMV served as the outcome event. Features were selected with the Boruta algorithm and least absolute shrinkage and selection operator (LASSO). Eight ML algorithms—XGBoost, decision tree (DT), random forest (RF), support-vector machine (SVM), LightGBM, CatBoost, Gaussian naïve Bayes (NB) and K-nearest neighbor (NN)—were trained with 10-fold cross-validation. Model performance was assessed by the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, F1 score, calibration curve, decision curve and clinical impact curve. An external validation cohort of 100 AECOPD-respiratory failure patients admitted to Baoying People’s Hospital between January 2020 and August 2025 was collected. The final best model was interpreted with SHapley Additive exPlanations (SHAP) to clarify feature importance and decision logic, and an interactive dynamic nomogram was plotted to increase readability.

Results

Boruta plus LASSO identified total calcium, partial pressure of oxygen (PO2), oxygen saturation (SpO2) and sepsis as significant predictors. XGBoost outperformed the other algorithms, achieving an internal validation accuracy of 72.2 %, sensitivity of 64.6 %, specificity of 79.8 %, F1 score of 69.7 % and AUC of 0.813 (95 % CI 0.748–0.878). The external validation accuracy reached 76.4 %, the sensitivity reached 82.6 %, the specificity reached 70.0 %, the F1 score reached 78.7 %, and the AUC reached 0.840 (95 % CI 0.801–0.879). SHAP analysis further indicated that PO2 and SpO2 were the primary drivers of model decisions. An interactive dynamic nomogram was successfully constructed.

Conclusion

IMV in AECOPD patients with respiratory failure was associated with total calcium, PO2, and SpO2 levels and sepsis. The developed XGBoost model demonstrated good predictive value for IMV in this clinical population.
目的:本研究旨在建立并验证机器学习(ML)模型,以预测慢性阻塞性肺疾病(AECOPD)急性加重期合并呼吸衰竭患者进行有创机械通气(IMV)的可能性。方法:回顾性地从重症监护医学信息市场- iv (MIMIC-IV)中提取诊断为AECOPD和呼吸衰竭的患者的资料。551个案例以7:3的比例分成用于模型构建的训练集(385例)和内部验证集(166例)。国际货币基金组织会议是最后的会议。使用Boruta算法和最小绝对收缩和选择算子(LASSO)选择特征。8种ML算法——xgboost、决策树(DT)、随机森林(RF)、支持向量机(SVM)、LightGBM、CatBoost、高斯naïve贝叶斯(NB)和k近邻(NN)——进行了10倍交叉验证的训练。通过受试者工作特征曲线下面积(AUC)、准确度、灵敏度、特异性、F1评分、校准曲线、决策曲线和临床影响曲线评价模型性能。收集2020年1月至2025年8月在宝应市人民医院住院的100例aecopd -呼吸衰竭患者的外部验证队列。利用SHapley加性解释(SHAP)对最终的最佳模型进行解释,以明确特征重要性和决策逻辑,并绘制交互式动态nomogram以提高可读性。结果:Boruta + LASSO发现总钙、氧分压(PO2)、氧饱和度(SpO2)和脓毒症是显著的预测因素。XGBoost优于其他算法,其内部验证准确率为72.2%,灵敏度为64.6%,特异性为79.8%,F1评分为69.7%,AUC为0.813 (95% CI 0.748 ~ 0.878)。外部验证准确度达76.4%,灵敏度达82.6%,特异性达70.0%,F1评分达78.7%,AUC达0.840 (95% CI 0.801 ~ 0.879)。SHAP分析进一步表明,PO2和SpO2是模型决策的主要驱动因素。成功地构造了一个交互式动态图。结论:AECOPD合并呼吸衰竭患者IMV与总钙、PO2、SpO2水平及脓毒症相关。开发的XGBoost模型在该临床人群中显示出良好的IMV预测价值。
{"title":"Construction and validation of a machine learning-based risk prediction model for invasive mechanical ventilation in AECOPD patients complicated with respiratory failure","authors":"Xin Jiang ,&nbsp;Ji Li ,&nbsp;Jingjing Ju ,&nbsp;Hao Ding ,&nbsp;Sufang Yang","doi":"10.1016/j.ijmedinf.2025.106244","DOIUrl":"10.1016/j.ijmedinf.2025.106244","url":null,"abstract":"<div><h3>Objective</h3><div>This study aimed to create and validate a machine learning (ML) model to predict the likelihood of invasive mechanical ventilation (IMV) in patients with acute exacerbation of chronic obstructive pulmonary disease (AECOPD) complicated by respiratory failure.</div></div><div><h3>Methods</h3><div>Data from patients diagnosed with AECOPD and respiratory failure were retrospectively extracted from the Medical Information Mart for Intensive Care-IV (MIMIC-IV). A total of 551 cases were split 7:3 into a training set (385 cases) for model construction and an internal validation set (166 cases). The IMV served as the outcome event. Features were selected with the Boruta algorithm and least absolute shrinkage and selection operator (LASSO). Eight ML algorithms—XGBoost, decision tree (DT), random forest (RF), support-vector machine (SVM), LightGBM, CatBoost, Gaussian naïve Bayes (NB) and K-nearest neighbor (NN)—were trained with 10-fold cross-validation. Model performance was assessed by the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, F1 score, calibration curve, decision curve and clinical impact curve. An external validation cohort of 100 AECOPD-respiratory failure patients admitted to Baoying People’s Hospital between January 2020 and August 2025 was collected. The final best model was interpreted with SHapley Additive exPlanations (SHAP) to clarify feature importance and decision logic, and an interactive dynamic nomogram was plotted to increase readability.</div></div><div><h3>Results</h3><div>Boruta plus LASSO identified total calcium, partial pressure of oxygen (PO<sub>2</sub>), oxygen saturation (SpO<sub>2</sub>) and sepsis as significant predictors. XGBoost outperformed the other algorithms, achieving an internal validation accuracy of 72.2<!--> <!-->%, sensitivity of 64.6<!--> <!-->%, specificity of 79.8 %, F1 score of 69.7<!--> <!-->% and AUC of 0.813 (95<!--> <!-->% CI 0.748–0.878). The external validation accuracy reached 76.4<!--> <!-->%, the sensitivity reached 82.6<!--> <!-->%, the specificity reached 70.0<!--> <!-->%, the F1 score reached 78.7<!--> <!-->%, and the AUC reached 0.840 (95<!--> <!-->% CI 0.801–0.879). SHAP analysis further indicated that PO<sub>2</sub> and SpO<sub>2</sub> were the primary drivers of model decisions. An interactive dynamic nomogram was successfully constructed.</div></div><div><h3>Conclusion</h3><div>IMV in AECOPD patients with respiratory failure was associated with total calcium, PO<sub>2</sub>, and SpO<sub>2</sub> levels and sepsis. The developed XGBoost model demonstrated good predictive value for IMV in this clinical population.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106244"},"PeriodicalIF":4.1,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145866463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synthetic data generation in paediatrics and paediatric nursing: what, how, and why? 儿科和儿科护理中的合成数据生成:什么,如何生成,为什么生成?
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-27 DOI: 10.1016/j.ijmedinf.2025.106236
Elisabetta Mezzalira , Maria Paola Boaro , Giulia Reggiani , Riccardo Biondi , Gastone Castellani , Raffaella Colombatti

Introduction

This paper explores the potential benefits and limitations of synthetic data (SD) in paediatrics, addressing the challenges of data scarcity and privacy concerns in paediatric research.

Methodology

A narrative literature review was conducted, searching PubMed and Scopus databases for relevant publications up to August 2025. The review focused on studies addressing the use, development, or application of SD in paediatric healthcare settings.

Findings

Synthetic data offers numerous benefits in paediatrics, including enhancing dataset diversity, protecting patient privacy, and enabling AI model development, especially in areas with limited real datasets such as rare diseases. Applications of SD in paediatrics span various fields, including neonatology, oncology, radiology, and neurodevelopmental disorders. However, challenges persist, including potential data bias, ensuring accuracy and quality, privacy concerns, and the lack of standardized guidelines for data generation and validation.

Conclusions and future directions

While SD demonstrates potential in specific paediatric applications, such as improving AI early warning systems and augmenting datasets for rare conditions, its use requires a structured, actionable framework for evaluation. Future efforts should focus through multi-stakeholder engagement, on developing paediatric-specific guidelines, ensuring fair and safe use of SD, and addressing unique aspects of child development in data synthesis.
前言:本文探讨了合成数据(SD)在儿科研究中的潜在优势和局限性,解决了儿科研究中数据稀缺和隐私问题的挑战。方法:采用叙述性文献综述,检索PubMed和Scopus数据库中截至2025年8月的相关出版物。这篇综述的重点是研究SD在儿科医疗机构的使用、发展或应用。研究结果:合成数据为儿科提供了许多好处,包括增强数据集多样性,保护患者隐私,并使人工智能模型开发成为可能,特别是在罕见疾病等真实数据集有限的领域。SD在儿科的应用涉及多个领域,包括新生儿学、肿瘤学、放射学和神经发育障碍。然而,挑战依然存在,包括潜在的数据偏差、确保准确性和质量、隐私问题以及缺乏数据生成和验证的标准化指南。结论和未来方向:虽然SD在特定的儿科应用中显示出潜力,例如改进人工智能预警系统和增加罕见疾病的数据集,但其使用需要一个结构化的、可操作的评估框架。未来的努力应侧重于多方利益相关者的参与,制定针对儿科的指南,确保公平和安全地使用可持续发展指标,并在数据综合中解决儿童发展的独特方面。
{"title":"Synthetic data generation in paediatrics and paediatric nursing: what, how, and why?","authors":"Elisabetta Mezzalira ,&nbsp;Maria Paola Boaro ,&nbsp;Giulia Reggiani ,&nbsp;Riccardo Biondi ,&nbsp;Gastone Castellani ,&nbsp;Raffaella Colombatti","doi":"10.1016/j.ijmedinf.2025.106236","DOIUrl":"10.1016/j.ijmedinf.2025.106236","url":null,"abstract":"<div><h3>Introduction</h3><div>This paper explores the potential benefits and limitations of synthetic data (SD) in paediatrics, addressing the challenges of data scarcity and privacy concerns in paediatric research.</div></div><div><h3>Methodology</h3><div>A narrative literature review was conducted, searching PubMed and Scopus databases for relevant publications up to August 2025. The review focused on studies addressing the use, development, or application of SD in paediatric healthcare settings.</div></div><div><h3>Findings</h3><div>Synthetic data offers numerous benefits in paediatrics, including enhancing dataset diversity, protecting patient privacy, and enabling AI model development, especially in areas with limited real datasets such as rare diseases. Applications of SD in paediatrics span various fields, including neonatology, oncology, radiology, and neurodevelopmental disorders. However, challenges persist, including potential data bias, ensuring accuracy and quality, privacy concerns, and the lack of standardized guidelines for data generation and validation.</div></div><div><h3>Conclusions and future directions</h3><div>While SD demonstrates potential in specific paediatric applications, such as improving AI early warning systems and augmenting datasets for rare conditions, its use requires a structured, actionable framework for evaluation. Future efforts should focus through multi-stakeholder engagement, on developing paediatric-specific guidelines, ensuring fair and safe use of SD, and addressing unique aspects of child development in data synthesis.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"209 ","pages":"Article 106236"},"PeriodicalIF":4.1,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145913972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reviewer Acknowledgement 2025 审稿人致谢2025。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-25 DOI: 10.1016/j.ijmedinf.2025.106243
{"title":"Reviewer Acknowledgement 2025","authors":"","doi":"10.1016/j.ijmedinf.2025.106243","DOIUrl":"10.1016/j.ijmedinf.2025.106243","url":null,"abstract":"","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"207 ","pages":"Article 106243"},"PeriodicalIF":4.1,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145866390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning-based diagnosis of autism spectrum disorder in children and adolescents using eye-tracking data: a systematic review and meta-analysis 基于机器学习的儿童和青少年自闭症谱系障碍的眼动追踪诊断:系统回顾和荟萃分析
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-25 DOI: 10.1016/j.ijmedinf.2025.106235
Wenhao Han , Xinyu Yang , Xin Li , Jiacheng Wang , Juan Liu , Wei Pang

Objective

Eye-tracking technology has been increasingly investigated as an objective approach for distinguishing individuals with Autism Spectrum Disorder (ASD) from typically developing (TD) individuals. Artificial intelligence and machine learning (ML) methods have been widely applied to support ASD diagnosis and treatment, and prior studies suggest that ML models leveraging eye-tracking data can achieve high diagnostic accuracy. This systematic review and meta-analysis aimed to evaluate the diagnostic performance of machine-learning models using eye-tracking data to distinguish children and adolescents with ASD from TD peers.

Methods

We systematically searched PubMed, Embase, Web of Science, IEEE Xplore, Scopus, and the Cochrane Library from inception to August 3, 2025. We included studies that applied ML methods to eye-tracking data to distinguish children with ASD from TD children. We extracted data on participant characteristics, model performance, eye-tracking protocols, and machine-learning algorithms. The review protocol was registered in PROSPERO (CRD420251162462).

Results

We identified 1,045 records, of which 25 studies were included in the meta-analysis. The included studies comprised 2,319 participants, with sample sizes ranging from 32 to 529 per study. The pooled accuracy, sensitivity, and specificity of machine-learning models using eye-tracking data to distinguish children with ASD from TD children were 85 % (95 % CI, 81–89 %), 86 % (95 % CI, 82–89 %), and 86 % (95 % CI, 79–91 %), respectively. These results suggest that eye-tracking–based machine-learning approaches have good diagnostic performance for identifying ASD.

Conclusion

Eye-tracking–based machine-learning approaches show considerable potential for distinguishing children with ASD from TD children. However, the robustness and generalizability of these findings are limited by the lack of external validation, small sample sizes, and substantial between-study heterogeneity. To establish generalizability, future research should prioritize standardized eye-tracking paradigms and large-scale, prospective, multicenter study designs with external validation. Such efforts may facilitate the translation of these models into clinical practice as objective and efficient adjunctive screening tools.
眼动追踪技术作为一种区分自闭症谱系障碍(ASD)和正常发育(TD)个体的客观方法,已得到越来越多的研究。人工智能和机器学习(ML)方法已广泛应用于支持ASD的诊断和治疗,先前的研究表明,利用眼动追踪数据的ML模型可以实现较高的诊断准确性。本系统综述和荟萃分析旨在评估使用眼动追踪数据的机器学习模型的诊断性能,以区分自闭症儿童和青少年与TD同龄人。方法系统检索PubMed、Embase、Web of Science、IEEE explore、Scopus、Cochrane Library自成立至2025年8月3日的文献。我们纳入了将ML方法应用于眼动追踪数据以区分ASD儿童和TD儿童的研究。我们提取了参与者特征、模型性能、眼动追踪协议和机器学习算法的数据。该审查方案已在PROSPERO注册(CRD420251162462)。结果我们确定了1045条记录,其中25项研究被纳入meta分析。纳入的研究包括2319名参与者,每项研究的样本量从32到529不等。使用眼动追踪数据的机器学习模型区分ASD儿童和TD儿童的总准确性、灵敏度和特异性分别为85% (95% CI, 81 - 89%)、86% (95% CI, 82 - 89%)和86% (95% CI, 79 - 91%)。这些结果表明,基于眼动追踪的机器学习方法在识别ASD方面具有良好的诊断性能。结论基于眼动追踪的机器学习方法在区分ASD儿童和TD儿童方面具有很大的潜力。然而,这些发现的稳健性和普遍性受到缺乏外部验证、小样本量和大量研究间异质性的限制。为了建立普遍性,未来的研究应优先考虑标准化的眼动追踪范式和具有外部验证的大规模、前瞻性、多中心研究设计。这些努力可能有助于将这些模型转化为临床实践,作为客观有效的辅助筛查工具。
{"title":"Machine learning-based diagnosis of autism spectrum disorder in children and adolescents using eye-tracking data: a systematic review and meta-analysis","authors":"Wenhao Han ,&nbsp;Xinyu Yang ,&nbsp;Xin Li ,&nbsp;Jiacheng Wang ,&nbsp;Juan Liu ,&nbsp;Wei Pang","doi":"10.1016/j.ijmedinf.2025.106235","DOIUrl":"10.1016/j.ijmedinf.2025.106235","url":null,"abstract":"<div><h3>Objective</h3><div>Eye-tracking technology has been increasingly investigated as an objective approach for distinguishing individuals with Autism Spectrum Disorder (ASD) from typically developing (TD) individuals. Artificial intelligence and machine learning (ML) methods have been widely applied to support ASD diagnosis and treatment, and prior studies suggest that ML models leveraging eye-tracking data can achieve high diagnostic accuracy. This systematic review and meta-analysis aimed to evaluate the diagnostic performance of machine-learning models using eye-tracking data to distinguish children and adolescents with ASD from TD peers.</div></div><div><h3>Methods</h3><div>We systematically searched PubMed, Embase, Web of Science, IEEE Xplore, Scopus, and the Cochrane Library from inception to August 3, 2025. We included studies that applied ML methods to eye-tracking data to distinguish children with ASD from TD children. We extracted data on participant characteristics, model performance, eye-tracking protocols, and machine-learning algorithms. The review protocol was registered in PROSPERO (CRD420251162462).</div></div><div><h3>Results</h3><div>We identified 1,045 records, of which 25 studies were included in the meta-analysis. The included studies comprised 2,319 participants, with sample sizes ranging from 32 to 529 per study. The pooled accuracy, sensitivity, and specificity of machine-learning models using eye-tracking data to distinguish children with ASD from TD children were 85 % (95 % CI, 81–89 %), 86 % (95 % CI, 82–89 %), and 86 % (95 % CI, 79–91 %), respectively. These results suggest that eye-tracking–based machine-learning approaches have good diagnostic performance for identifying ASD.</div></div><div><h3>Conclusion</h3><div>Eye-tracking–based machine-learning approaches show considerable potential for distinguishing children with ASD from TD children. However, the robustness and generalizability of these findings are limited by the lack of external validation, small sample sizes, and substantial between-study heterogeneity. To establish generalizability, future research should prioritize standardized eye-tracking paradigms and large-scale, prospective, multicenter study designs with external validation. Such efforts may facilitate the translation of these models into clinical practice as objective and efficient adjunctive screening tools.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106235"},"PeriodicalIF":4.1,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A roadmap for federated learning projects using health data to guide sustainable artificial intelligence development in the European Union 使用健康数据指导欧洲联盟可持续人工智能发展的联合学习项目路线图。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-25 DOI: 10.1016/j.ijmedinf.2025.106242
Janne Kommusaar , Silja Elunurm , Taridzo Chomutare , Mari Kangasniemi , Sanna Salanterä , Laura-Maria Peltonen
<div><h3>Background</h3><div>The rise of digital health data has expanded opportunities for data-driven innovation, yet privacy, legal and ethical barriers frame data sharing and collaborative artificial intelligence development. Federated Learning (FL) offers a privacy-preserving alternative, but current research considers mainly technical aspects. There is no end-to-end roadmap that integrates ethical, legal, technical and administrative principles tailored to FL projects in healthcare. This study addresses that gap by developing a roadmap to guide responsible and scalable FL research in the European context.</div></div><div><h3>Methods</h3><div>A multi-method participatory approach was used to develop a roadmap for scientific projects using FL on health data. The iterative process involved three phases. First, key questions were defined and existing evidence was explored through (i) a survey of domain experts (researchers, data governance specialists and infrastructure providers), (ii) a targeted literature review of FL applications in health research and (iii) systematic mapping of relevant EU-level legislation and policy frameworks. Evidence from these sources was synthesized to identify technical, organizational, legal and sustainability-related requirements for FL-based research. Second, preliminary roadmap components were refined through stakeholder engagement in an online workshop, where feasibility, scalability and sustainability considerations were explicitly discussed. Third, the roadmap was validated and iteratively refined by an expert panel through a structured group discussion, focusing on long-term sustainability, governance and transferability across research contexts. The process was carried out within a Baltic-Nordic collaboration in 2023–2025.</div></div><div><h3>Results</h3><div>The developed roadmap integrates ethical, legal, technical, administrative and sustainability-related considerations essential for applying FL to health data. It emphasizes the importance of multidisciplinary collaboration throughout the FL project lifecycle, with particular attention to long-term governance, scalability and reuse of infrastructures and practices. The process is structured into six phases: (1) Planning, (2) Execution refinement, (3) Data, (4) FL platform, (5) FL experiment and (6) Dissemination. Across these phases, sustainability is addressed through mechanisms such as regulatory alignment, shared governance models, capacity building and integration with existing research and health data infrastructures. By merging ethical, legal, technical and administrative aspects into a unified, end-to-end framework, the roadmap provides actionable, novel guidance beyond existing recommendations.</div></div><div><h3>Conclusions</h3><div>This work consolidates early lessons from FL in healthcare into a practical, step-by-step roadmap that integrates ethical, legal, technical and administrative aspects in the European context. By offering a shared
背景:数字健康数据的兴起扩大了数据驱动创新的机会,但隐私、法律和道德障碍阻碍了数据共享和协作式人工智能的发展。联邦学习(FL)提供了一种保护隐私的替代方案,但目前的研究主要考虑技术方面的问题。目前还没有一个端到端的路线图,可以整合为医疗保健领域的FL项目量身定制的道德、法律、技术和管理原则。本研究通过制定路线图来指导欧洲范围内负责任和可扩展的FL研究,从而解决了这一差距。方法:采用多方法参与式方法,为利用FL处理卫生数据的科学项目制定路线图。迭代过程包括三个阶段。首先,通过(i)对领域专家(研究人员、数据治理专家和基础设施提供商)的调查,(ii)对FL在卫生研究中的应用进行有针对性的文献综述,以及(iii)系统地绘制相关欧盟层面的立法和政策框架,定义了关键问题并探索了现有证据。综合了这些来源的证据,以确定基于fl的研究的技术、组织、法律和可持续性相关要求。其次,通过在线研讨会的利益相关者参与,对初步路线图组件进行了细化,其中明确讨论了可行性、可扩展性和可持续性考虑因素。第三,路线图由专家小组通过结构化的小组讨论进行验证和迭代完善,重点关注长期可持续性、治理和跨研究背景的可转移性。该过程是在2023-2025年的波罗的海-北欧合作中进行的。结果:制定的路线图整合了将FL应用于健康数据所必需的伦理、法律、技术、行政和可持续性相关考虑因素。它强调了在整个FL项目生命周期中多学科协作的重要性,特别关注基础结构和实践的长期治理、可伸缩性和重用。该过程分为六个阶段:(1)计划,(2)执行改进,(3)数据,(4)FL平台,(5)FL实验和(6)传播。在这些阶段,可持续性是通过监管协调、共享治理模式、能力建设以及与现有研究和卫生数据基础设施的整合等机制来解决的。通过将道德、法律、技术和管理方面合并到一个统一的端到端框架中,路线图提供了超越现有建议的可操作的新颖指导。结论:这项工作将早期FL在医疗保健方面的经验教训整合到一个实用的、逐步的路线图中,该路线图在欧洲背景下整合了伦理、法律、技术和行政方面。通过为不同的利益相关者提供共享框架,它支持跨医疗保健系统的更值得信赖、可扩展和兼容的人工智能协作。
{"title":"A roadmap for federated learning projects using health data to guide sustainable artificial intelligence development in the European Union","authors":"Janne Kommusaar ,&nbsp;Silja Elunurm ,&nbsp;Taridzo Chomutare ,&nbsp;Mari Kangasniemi ,&nbsp;Sanna Salanterä ,&nbsp;Laura-Maria Peltonen","doi":"10.1016/j.ijmedinf.2025.106242","DOIUrl":"10.1016/j.ijmedinf.2025.106242","url":null,"abstract":"&lt;div&gt;&lt;h3&gt;Background&lt;/h3&gt;&lt;div&gt;The rise of digital health data has expanded opportunities for data-driven innovation, yet privacy, legal and ethical barriers frame data sharing and collaborative artificial intelligence development. Federated Learning (FL) offers a privacy-preserving alternative, but current research considers mainly technical aspects. There is no end-to-end roadmap that integrates ethical, legal, technical and administrative principles tailored to FL projects in healthcare. This study addresses that gap by developing a roadmap to guide responsible and scalable FL research in the European context.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Methods&lt;/h3&gt;&lt;div&gt;A multi-method participatory approach was used to develop a roadmap for scientific projects using FL on health data. The iterative process involved three phases. First, key questions were defined and existing evidence was explored through (i) a survey of domain experts (researchers, data governance specialists and infrastructure providers), (ii) a targeted literature review of FL applications in health research and (iii) systematic mapping of relevant EU-level legislation and policy frameworks. Evidence from these sources was synthesized to identify technical, organizational, legal and sustainability-related requirements for FL-based research. Second, preliminary roadmap components were refined through stakeholder engagement in an online workshop, where feasibility, scalability and sustainability considerations were explicitly discussed. Third, the roadmap was validated and iteratively refined by an expert panel through a structured group discussion, focusing on long-term sustainability, governance and transferability across research contexts. The process was carried out within a Baltic-Nordic collaboration in 2023–2025.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Results&lt;/h3&gt;&lt;div&gt;The developed roadmap integrates ethical, legal, technical, administrative and sustainability-related considerations essential for applying FL to health data. It emphasizes the importance of multidisciplinary collaboration throughout the FL project lifecycle, with particular attention to long-term governance, scalability and reuse of infrastructures and practices. The process is structured into six phases: (1) Planning, (2) Execution refinement, (3) Data, (4) FL platform, (5) FL experiment and (6) Dissemination. Across these phases, sustainability is addressed through mechanisms such as regulatory alignment, shared governance models, capacity building and integration with existing research and health data infrastructures. By merging ethical, legal, technical and administrative aspects into a unified, end-to-end framework, the roadmap provides actionable, novel guidance beyond existing recommendations.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Conclusions&lt;/h3&gt;&lt;div&gt;This work consolidates early lessons from FL in healthcare into a practical, step-by-step roadmap that integrates ethical, legal, technical and administrative aspects in the European context. By offering a shared","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106242"},"PeriodicalIF":4.1,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145879368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development and validation of a machine learning model to predict functional outcomes in patients with recent small subcortical infarction 机器学习模型的开发和验证,以预测近期小皮质下梗死患者的功能结局。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-24 DOI: 10.1016/j.ijmedinf.2025.106241
Hongbing Liu , Ying Yao , Ce Zong , Ke Zhang , Haixu Zhao , Yuan Song , Yuming Xu , Yuan Gao

Objective

A substantial proportion of patients (12 %–25 %) with recent small subcortical infarction (RSSI) suffer poor functional outcomes at 3 months. Despite the identification of prognostic factors, a significant gap exists in predictive modeling. This study aimed to develop and validate machine learning models to accurately predict 3-month functional status in this patient population.

Methods

This multicenter study prospectively enrolled 1576 patients diagnosed with RSSI. The primary cohort (n = 1126) was randomly split into a training set (70 %) and an internal validation set (30 %). An independent external cohort (n = 450) was used for further validation. The primary outcome was an unfavorable functional status at 3 months, defined as a modified Rankin Scale (mRS) score ≥3. The Least Absolute Shrinkage and Selection Operator (LASSO) logistic regression model was employed for feature selection from demographic, clinical, laboratory, and imaging variables. 8 supervised machine learning models were developed and compared. Model performance was rigorously evaluated in the validation cohorts using the Area Under the Receiver Operating Characteristic Curve (AUC) for discrimination, calibration curves for consistency, and Decision Curve Analysis (DCA) for clinical utility. The optimal model was interpreted using SHapley Additive exPlanations (SHAP).

Results

LASSO regression identified 8 features with non-zero coefficients for predicting outcomes: NIHSS, proximal RSSI (pRSSI), glucose, stress hyperglycemia ratio (SHR), neutrophil-to-lymphocyte ratio (NLR), age, systolic blood pressure (SBP), and LDL-C. Among the eight developed models, the CatBoost model demonstrated the best performance. It achieved the highest AUC in the training set (0.961), the internal validation cohort (0.940), and the external validation cohort (0.875). The CatBoost model also showed excellent calibration and provided the greatest net benefit across a wide range of threshold probabilities in DCA for both validation cohorts. SHAP analysis identified the NIHSS score as the most significant predictor of unfavorable outcomes, followed by pRSSI, Glucose, SHR, and NLR. A publicly accessible web tool based on the model is available at: https:// predictrssi.streamlit.app.

Conclusion

This study successfully developed and validated a robust CatBoost machine learning model that accurately predicts 3-month functional outcomes in patients with RSSI using eight readily accessible features. This model, which outperforms seven other machine learning algorithms, is available as a user-friendly web application to aid clinicians in early risk stratification and personalized patient management.
目的:相当比例的近期小皮质下梗死(RSSI)患者(12% - 25%)在3个月时功能预后较差。尽管确定了预后因素,但在预测建模方面存在显着差距。本研究旨在开发和验证机器学习模型,以准确预测该患者群体3个月的功能状态。方法:本多中心研究前瞻性纳入1576例诊断为RSSI的患者。主要队列(n = 1126)随机分为训练集(70%)和内部验证集(30%)。一个独立的外部队列(n = 450)被用于进一步验证。主要结局是3个月时不良的功能状态,定义为修改的Rankin量表(mRS)评分≥3。最小绝对收缩和选择算子(LASSO)逻辑回归模型用于从人口统计学、临床、实验室和影像学变量中进行特征选择。开发并比较了8个监督机器学习模型。在验证队列中,使用受试者工作特征曲线下面积(AUC)进行甄别,使用校准曲线进行一致性,使用决策曲线分析(DCA)进行临床效用,严格评估模型的性能。最优模型采用SHapley加性解释(SHAP)进行解释。结果:LASSO回归确定了8个预测结果的非零系数特征:NIHSS、近端RSSI (pRSSI)、葡萄糖、应激性高血糖比(SHR)、中性粒细胞与淋巴细胞比(NLR)、年龄、收缩压(SBP)和LDL-C。在八个已开发的模型中,CatBoost模型表现出最好的性能。在训练集(0.961)、内部验证队列(0.940)和外部验证队列(0.875)中AUC最高。CatBoost模型也显示出出色的校准,并在DCA的广泛阈值概率范围内为两个验证队列提供了最大的净收益。SHAP分析发现NIHSS评分是不良结果的最重要预测因子,其次是pRSSI、葡萄糖、SHR和NLR。结论:本研究成功开发并验证了一个强大的CatBoost机器学习模型,该模型可以使用8个易于访问的特征准确预测RSSI患者3个月的功能结果。该模型优于其他7种机器学习算法,可作为用户友好的web应用程序,帮助临床医生进行早期风险分层和个性化患者管理。
{"title":"Development and validation of a machine learning model to predict functional outcomes in patients with recent small subcortical infarction","authors":"Hongbing Liu ,&nbsp;Ying Yao ,&nbsp;Ce Zong ,&nbsp;Ke Zhang ,&nbsp;Haixu Zhao ,&nbsp;Yuan Song ,&nbsp;Yuming Xu ,&nbsp;Yuan Gao","doi":"10.1016/j.ijmedinf.2025.106241","DOIUrl":"10.1016/j.ijmedinf.2025.106241","url":null,"abstract":"<div><h3>Objective</h3><div>A substantial proportion of patients (12 %–25 %) with recent small subcortical infarction (RSSI) suffer poor functional outcomes at 3 months. Despite the identification of prognostic factors, a significant gap exists in predictive modeling. This study aimed to develop and validate machine learning models to accurately predict 3-month functional status in this patient population.</div></div><div><h3>Methods</h3><div>This multicenter study prospectively enrolled 1576 patients diagnosed with RSSI. The primary cohort (n = 1126) was randomly split into a training set (70 %) and an internal validation set (30 %). An independent external cohort (n = 450) was used for further validation. The primary outcome was an unfavorable functional status at 3 months, defined as a modified Rankin Scale (mRS) score ≥3. The Least Absolute Shrinkage and Selection Operator (LASSO) logistic regression model was employed for feature selection from demographic, clinical, laboratory, and imaging variables. 8 supervised machine learning models were developed and compared. Model performance was rigorously evaluated in the validation cohorts using the Area Under the Receiver Operating Characteristic Curve (AUC) for discrimination, calibration curves for consistency, and Decision Curve Analysis (DCA) for clinical utility. The optimal model was interpreted using SHapley Additive exPlanations (SHAP).</div></div><div><h3>Results</h3><div>LASSO regression identified 8 features with non-zero coefficients for predicting outcomes: NIHSS, proximal RSSI (pRSSI), glucose, stress hyperglycemia ratio (SHR), neutrophil-to-lymphocyte ratio (NLR), age, systolic blood pressure (SBP), and LDL-C. Among the eight developed models, the CatBoost model demonstrated the best performance. It achieved the highest AUC in the training set (0.961), the internal validation cohort (0.940), and the external validation cohort (0.875). The CatBoost model also showed excellent calibration and provided the greatest net benefit across a wide range of threshold probabilities in DCA for both validation cohorts. SHAP analysis identified the NIHSS score as the most significant predictor of unfavorable outcomes, followed by pRSSI, Glucose, SHR, and NLR. A publicly accessible web tool based on the model is available at: <span><span>https:// predictrssi.streamlit.app</span><svg><path></path></svg></span>.</div></div><div><h3>Conclusion</h3><div>This study successfully developed and validated a robust CatBoost machine learning model that accurately predicts 3-month functional outcomes in patients with RSSI using eight readily accessible features. This model, which outperforms seven other machine learning algorithms, is available as a user-friendly web application to aid clinicians in early risk stratification and personalized patient management.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106241"},"PeriodicalIF":4.1,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145851363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Limitations of SHAP-based interpretability in sepsis progression models and paths to more robust feature validation 基于shap的脓毒症进展模型可解释性的局限性和更稳健的特征验证途径
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-24 DOI: 10.1016/j.ijmedinf.2025.106238
Yuto Arai , Yoshiyasu Takefuji
{"title":"Limitations of SHAP-based interpretability in sepsis progression models and paths to more robust feature validation","authors":"Yuto Arai ,&nbsp;Yoshiyasu Takefuji","doi":"10.1016/j.ijmedinf.2025.106238","DOIUrl":"10.1016/j.ijmedinf.2025.106238","url":null,"abstract":"","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106238"},"PeriodicalIF":4.1,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Physicians’ attitudes toward the patient summary in the Czech Republic: A national cross-sectional survey on awareness, use, and barriers 捷克共和国医生对病人总结的态度:一项关于意识、使用和障碍的全国性横断面调查。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-23 DOI: 10.1016/j.ijmedinf.2025.106232
Petra Hospodková , Jan Bruthans , Adéla Englová

Introduction

The Patient Summary (PS), a standardized subset of the electronic health record is designed to provide essential patient information for use in emergencies, unplanned care, and cross-border healthcare. While its technical development has progressed across Europe, little is known about real-world PS adoption and physician perceptions at the national level. This study explores the awareness, usage, and perceived barriers to the PS adoption among Czech physicians.

Methods

A cross-sectional online survey was distributed to all registered physicians in the Czech Republic between February and March 2025. The questionnaire assessed demographic characteristics, PS usage patterns, perceived benefits and barriers, and alignment with clinical practice. Descriptive statistics were calculated, and non-parametric tests (Wilcoxon rank-sum, Kruskal–Wallis) were used to examine differences by years of experience and medical specialty.

Results

A total of 1,739 responses were received (response rate: 4.14 %). Most respondents (66.4 %) reported not using the PS at all, and 72.1 % were unaware that their electronic medical record could be connected to the National Contact Point for eHealth. Only 1.7 % reported a current connection. There was no significant difference in PS use by years of clinical experience (P = 0.391), but a significant difference was observed across specialties (P < 0.001), with the highest usage reported in intensive care medicine and internal medicine.

Discussion and conclusion

Despite recognized benefits, PS usage remains low in the Czech Republic, largely due to limited awareness and system integration. Targeted policy measures, improved communication, and enhanced digital training are needed to support effective adoption.
简介:患者摘要(PS)是电子健康记录的一个标准化子集,旨在为紧急情况、计划外护理和跨境医疗保健提供必要的患者信息。虽然它的技术发展在整个欧洲都取得了进展,但人们对现实世界中PS的采用和国家层面上医生的看法知之甚少。本研究探讨了意识,使用和感知障碍的PS采用捷克医生。方法:在2025年2月至3月期间,对捷克共和国所有注册医生进行横断面在线调查。调查问卷评估了人口统计学特征、PS使用模式、可感知的益处和障碍,以及与临床实践的一致性。计算描述性统计数据,并使用非参数检验(Wilcoxon秩和,Kruskal-Wallis)来检查经验年限和医学专业的差异。结果:共收到问卷1739份,回复率4.14%。大多数受访者(66.4%)表示根本没有使用PS, 72.1%的人不知道他们的电子病历可以连接到国家电子健康联络点。只有1.7%的人报告有当前连接。不同临床年限对PS的使用差异无统计学意义(P = 0.391),但不同专科对PS的使用差异有统计学意义(P < 0.001),其中重症监护医学和内科使用率最高。讨论和结论:尽管公认的好处,PS使用率仍然很低在捷克共和国,主要是由于有限的认识和系统集成。需要有针对性的政策措施、改进的沟通和加强的数字培训来支持有效采用。
{"title":"Physicians’ attitudes toward the patient summary in the Czech Republic: A national cross-sectional survey on awareness, use, and barriers","authors":"Petra Hospodková ,&nbsp;Jan Bruthans ,&nbsp;Adéla Englová","doi":"10.1016/j.ijmedinf.2025.106232","DOIUrl":"10.1016/j.ijmedinf.2025.106232","url":null,"abstract":"<div><h3>Introduction</h3><div>The Patient Summary (PS), a standardized subset of the electronic health record is designed to provide essential patient information for use in emergencies, unplanned care, and cross-border healthcare. While its technical development has progressed across Europe, little is known about real-world PS adoption and physician perceptions at the national level. This study explores the awareness, usage, and perceived barriers to the PS adoption among Czech physicians.</div></div><div><h3>Methods</h3><div>A cross-sectional online survey was distributed to all registered physicians in the Czech Republic between February and March 2025. The questionnaire assessed demographic characteristics, PS usage patterns, perceived benefits and barriers, and alignment with clinical practice. Descriptive statistics were calculated, and non-parametric tests (Wilcoxon rank-sum, Kruskal–Wallis) were used to examine differences by years of experience and medical specialty.</div></div><div><h3>Results</h3><div>A total of 1,739 responses were received (response rate: 4.14 %). Most respondents (66.4 %) reported not using the PS at all, and 72.1 % were unaware that their electronic medical record could be connected to the National Contact Point for eHealth. Only 1.7 % reported a current connection. There was no significant difference in PS use by years of clinical experience (P = 0.391), but a significant difference was observed across specialties (P &lt; 0.001), with the highest usage reported in intensive care medicine and internal medicine.</div></div><div><h3>Discussion and conclusion</h3><div>Despite recognized benefits, PS usage remains low in the Czech Republic, largely due to limited awareness and system integration. Targeted policy measures, improved communication, and enhanced digital training are needed to support effective adoption.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106232"},"PeriodicalIF":4.1,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145829159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graph attention network with comorbidity connectivity embedding for post-traumatic epilepsy risk prediction using sparse time-series electronic health records 带共病连通性嵌入的图注意网络用于稀疏时间序列电子病历创伤后癫痫风险预测
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-23 DOI: 10.1016/j.ijmedinf.2025.106239
Priyadharsini Ramamurthy , Zheng Han , Dursun Delen , Zhuqi Miao , Andrew Gin , Xiao Luo , William Paiva

Background

Traumatic brain injury (TBI) is a major risk factor for neurological disorders, including post-traumatic epilepsy (PTE), a debilitating condition associated with significant long-term consequences. The prognosis of PTE occurrence remains challenging due to the complex pathophysiology of PTE and the impracticality of traditional blood biomarker- or imaging-based screening for large populations. This study proposes a graph-based deep learning approach that leverages electronic health records (EHR) to enhance the predictive assessment of PTE risk.

Methods

We utilized Oracle Real-World Data (ORWD) to construct a Heterogeneous Graph Attention Network (HeteroGAT) that contains patient and diagnosis nodes, with temporal information represented using patients-to-diagnosis edges, and comorbidity connectivity embedded using diagnosis-to-diagnosis edges. The HeteroGAT was trained on a cohort of 1,598,998 TBI-only patients and 102,687 individuals who developed epilepsy after TBI. Model performance was evaluated using sensitivity, specificity, macro F1-score, and area under the receiver operating characteristic curve (AUC-ROC), benchmarked against traditional machine learning models. Attention scores of nodes were used to evaluate node importance. The capabilities of the HeteroGATs trained to differentiate early vs late PTE patients following TBI were also assessed.

Results

HeteroGAT significantly outperformed conventional models in PTE prediction by effectively integrating demographic data and comorbidity profiles spanning from 20 to 500 distinct conditions. The model’s multi-head attention mechanisms, in combination with learned comorbidity connectivity, enhanced its ability to capture complex dependencies within EHR data. HeteroGAT achieved an AUC-ROC of 0.80, outperforming the best-performing traditional model, random forest (AUC-ROC = 0.77). HeteroGAT also demonstrated capabilities in differentiating early and late PTEs. Ranking of nodes based on attention scores also identified predictors of PTE that are clinically relevant.

Conclusion

By modeling sparse EHR data through patient encounter embeddings, HeteroGAT effectively captures temporal and relational patterns in comorbidities critical for PTE prediction. Our findings highlight the potential of graph-based deep learning models, synergized with large-scale EHR data, in advancing personalized risk assessment, ultimately addressing the urgent need for more precise and proactive management of PTE in TBI patients.
背景外伤性脑损伤(TBI)是神经系统疾病的主要危险因素,包括创伤后癫痫(PTE),这是一种与严重的长期后果相关的衰弱性疾病。由于PTE的病理生理复杂,以及传统的基于血液生物标志物或影像学筛查在大人群中的不实用性,PTE的预后仍然具有挑战性。本研究提出了一种基于图形的深度学习方法,该方法利用电子健康记录(EHR)来增强PTE风险的预测评估。方法利用Oracle真实世界数据(ORWD)构建了包含患者和诊断节点的异构图关注网络(HeteroGAT),其中时间信息使用患者到诊断边表示,共病连接使用诊断到诊断边嵌入。HeteroGAT在1,598,998名TBI患者和102,687名TBI后发生癫痫的患者中进行了培训。以传统机器学习模型为基准,使用灵敏度、特异性、宏观f1评分和受试者工作特征曲线下面积(AUC-ROC)来评估模型的性能。节点的注意分数用于评价节点的重要性。此外,还评估了经过训练的heterogat区分TBI后早期和晚期PTE患者的能力。结果通过有效整合人口统计数据和20 - 500种不同疾病的共病概况,sheterogat在PTE预测方面明显优于传统模型。该模型的多头注意机制与习得的共病连接相结合,增强了其捕获EHR数据中复杂依赖关系的能力。HeteroGAT的AUC-ROC为0.80,优于表现最好的传统模型随机森林(AUC-ROC = 0.77)。此外,HeteroGAT还具有区分早期和晚期pte的能力。基于注意力得分的节点排名也确定了临床相关的PTE预测因子。结论:通过患者遭遇嵌入对稀疏的EHR数据进行建模,HeteroGAT可以有效捕获对PTE预测至关重要的合并症的时间和关系模式。我们的研究结果强调了基于图的深度学习模型与大规模电子病历数据协同的潜力,在推进个性化风险评估方面,最终解决了对TBI患者PTE更精确和主动管理的迫切需求。
{"title":"Graph attention network with comorbidity connectivity embedding for post-traumatic epilepsy risk prediction using sparse time-series electronic health records","authors":"Priyadharsini Ramamurthy ,&nbsp;Zheng Han ,&nbsp;Dursun Delen ,&nbsp;Zhuqi Miao ,&nbsp;Andrew Gin ,&nbsp;Xiao Luo ,&nbsp;William Paiva","doi":"10.1016/j.ijmedinf.2025.106239","DOIUrl":"10.1016/j.ijmedinf.2025.106239","url":null,"abstract":"<div><h3>Background</h3><div>Traumatic brain injury (TBI) is a major risk factor for neurological disorders, including post-traumatic epilepsy (PTE), a debilitating condition associated with significant long-term consequences. The prognosis of PTE occurrence remains challenging due to the complex pathophysiology of PTE and the impracticality of traditional blood biomarker- or imaging-based screening for large populations. This study proposes a graph-based deep learning approach that leverages electronic health records (EHR) to enhance the predictive assessment of PTE risk.</div></div><div><h3>Methods</h3><div>We utilized Oracle Real-World Data (ORWD) to construct a Heterogeneous Graph Attention Network (HeteroGAT) that contains patient and diagnosis nodes, with temporal information represented using patients-to-diagnosis edges, and comorbidity connectivity embedded using diagnosis-to-diagnosis edges. The HeteroGAT was trained on a cohort of 1,598,998 TBI-only patients and 102,687 individuals who developed epilepsy after TBI. Model performance was evaluated using sensitivity, specificity, macro F1-score, and area under the receiver operating characteristic curve (AUC-ROC), benchmarked against traditional machine learning models. Attention scores of nodes were used to evaluate node importance. The capabilities of the HeteroGATs trained to differentiate early vs late PTE patients following TBI were also assessed.</div></div><div><h3>Results</h3><div>HeteroGAT significantly outperformed conventional models in PTE prediction by effectively integrating demographic data and comorbidity profiles spanning from 20 to 500 distinct conditions. The model’s multi-head attention mechanisms, in combination with learned comorbidity connectivity, enhanced its ability to capture complex dependencies within EHR data. HeteroGAT achieved an AUC-ROC of 0.80, outperforming the best-performing traditional model, random forest (AUC-ROC = 0.77). HeteroGAT also demonstrated capabilities in differentiating early and late PTEs. Ranking of nodes based on attention scores also identified predictors of PTE that are clinically relevant.</div></div><div><h3>Conclusion</h3><div>By modeling sparse EHR data through patient encounter embeddings, HeteroGAT effectively captures temporal and relational patterns in comorbidities critical for PTE prediction. Our findings highlight the potential of graph-based deep learning models, synergized with large-scale EHR data, in advancing personalized risk assessment, ultimately addressing the urgent need for more precise and proactive management of PTE in TBI patients.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"208 ","pages":"Article 106239"},"PeriodicalIF":4.1,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1