首页 > 最新文献

International Journal of Medical Informatics最新文献

英文 中文
When artificial intelligence guides and misguides clinicians: A critical appraisal of AI recommendation correctness and diagnostic decision-making 当人工智能引导和误导临床医生:对人工智能推荐正确性和诊断决策的批判性评估。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-14 DOI: 10.1016/j.ijmedinf.2026.106293
Hasan Nawaz Tahir , Anfal Khan , Muhammad Yousaf , Shahnila Javed , Muhammad Kamran Khan , Yousaf Ali
{"title":"When artificial intelligence guides and misguides clinicians: A critical appraisal of AI recommendation correctness and diagnostic decision-making","authors":"Hasan Nawaz Tahir , Anfal Khan , Muhammad Yousaf , Shahnila Javed , Muhammad Kamran Khan , Yousaf Ali","doi":"10.1016/j.ijmedinf.2026.106293","DOIUrl":"10.1016/j.ijmedinf.2026.106293","url":null,"abstract":"","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"209 ","pages":"Article 106293"},"PeriodicalIF":4.1,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146020816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
“Calibration or contamination?” Reassessing the evaluation of large language models for clinical mortality prediction “校准还是污染?”重新评估大型语言模型对临床死亡率预测的评价
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-14 DOI: 10.1016/j.ijmedinf.2026.106291
Zhihao Lei
{"title":"“Calibration or contamination?” Reassessing the evaluation of large language models for clinical mortality prediction","authors":"Zhihao Lei","doi":"10.1016/j.ijmedinf.2026.106291","DOIUrl":"10.1016/j.ijmedinf.2026.106291","url":null,"abstract":"","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"209 ","pages":"Article 106291"},"PeriodicalIF":4.1,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145979480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Communicable diseases platform (CDP): Real-Time clinical analytics for infections 传染病平台(CDP):感染的实时临床分析
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-12 DOI: 10.1016/j.ijmedinf.2026.106277
Manuri De Silva , Alice Voskoboynik , Sailavan Ramesh , Janice Campbell , Saravanan Satkumaran , Daryl R. Cheng

Objective

Communicable diseases, especially seasonal respiratory illnesses, contribute significantly to paediatric hospital presentations and admissions. Existing surveillance systems often require retrospective manual data collation and focus on either demographic or clinical data, not both. The Communicable Diseases Platform (CDP) is a dynamic data platform that aggregates both data types for all communicable disease presentations to The Royal Children’s Hospital Melbourne (RCH).

Methods

In the pilot phase, the CDP extracted de-identified aggregated data from hospital electronic medical records for patients with positive respiratory swabs. A dashboard displayed positivity rate and cumulative hospital admissions trends from 2016 to 2025, further filterable by pathogen, age, presentation type and interventions.

Discussion

The CDP improves understanding of clinical profiles, disease burden and seasonal patterns, supporting better outbreak control, patient flow prediction and clinical surveillance. Future developments include immunisation data integration and machine learning algorithm evaluation for real-time vaccine effectiveness estimations and communicable disease predictive modelling.
目的:传染性疾病,特别是季节性呼吸道疾病,是儿科就诊和住院的主要原因。现有的监测系统通常需要回顾性的人工数据整理,并侧重于人口统计或临床数据,而不是两者兼而有之。传染病平台(CDP)是一个动态数据平台,汇集了墨尔本皇家儿童医院(RCH)所有传染病报告的两种数据类型。方法在试点阶段,CDP从医院电子病历中提取呼吸道拭子阳性患者的去识别汇总数据。仪表板显示了2016年至2025年的阳性率和累计住院趋势,并进一步按病原体、年龄、表现类型和干预措施进行过滤。CDP提高了对临床概况、疾病负担和季节性模式的理解,支持更好的疫情控制、患者流量预测和临床监测。未来的发展包括免疫数据集成和机器学习算法评估,用于实时疫苗有效性估计和传染病预测建模。
{"title":"Communicable diseases platform (CDP): Real-Time clinical analytics for infections","authors":"Manuri De Silva ,&nbsp;Alice Voskoboynik ,&nbsp;Sailavan Ramesh ,&nbsp;Janice Campbell ,&nbsp;Saravanan Satkumaran ,&nbsp;Daryl R. Cheng","doi":"10.1016/j.ijmedinf.2026.106277","DOIUrl":"10.1016/j.ijmedinf.2026.106277","url":null,"abstract":"<div><h3>Objective</h3><div>Communicable diseases, especially seasonal respiratory illnesses, contribute significantly to paediatric hospital presentations and admissions. Existing surveillance systems often require retrospective manual data collation and focus on either demographic or clinical data, not both. The Communicable Diseases Platform (CDP) is a dynamic data platform that aggregates both data types for all communicable disease presentations to The Royal Children’s Hospital Melbourne (RCH).</div></div><div><h3>Methods</h3><div>In the pilot phase, the CDP extracted de-identified aggregated data from hospital electronic medical records for patients with positive respiratory swabs. A dashboard displayed positivity rate and cumulative hospital admissions trends from 2016 to 2025, further filterable by pathogen, age, presentation type and interventions.</div></div><div><h3>Discussion</h3><div>The CDP improves understanding of clinical profiles, disease burden and seasonal patterns, supporting better outbreak control, patient flow prediction and clinical surveillance. Future developments include immunisation data integration and machine learning algorithm evaluation for real-time vaccine effectiveness estimations and communicable disease predictive modelling.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"209 ","pages":"Article 106277"},"PeriodicalIF":4.1,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145979478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clinicians’ perspectives on electronic medical records use in diabetes outpatient Care: A qualitative study 临床医生对糖尿病门诊使用电子病历的看法:一项定性研究。
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-11 DOI: 10.1016/j.ijmedinf.2026.106275
Wenyong Wang , Mahnaz Samadbeik , Gaurav Puri , Donald S.A. McLeod , Elton Lobo , Tuan Duong , Titus Kirwa , Clair Sullivan

Background

Electronic Medical Records (EMRs) aim to improve efficiency, safety, and quality of care. However, the impact of EMR implementation, particularly in outpatient diabetes care, remains underexplored. This study explored clinicians’ perspectives on EMR use in diabetes outpatient care.

Methods

This qualitative study, conducted in line with COREQ guidelines, involved four focus groups with 22 clinicians (doctors, nurses, and allied health) at a metropolitan diabetes service in Queensland, Australia. Data were analysed using deductive content analysis, guided by the Quintuple Aim and Technology Acceptance Model/Unified Theory of Acceptance and Use of Technology frameworks.

Results

Clinicians reported mixed outcomes across the Quintuple Aim domains, shaped by technology adoption constructs. Facilitators such as improved efficiency, access to patient information, and prescribing safety reflected perceived usefulness and positive attitudes, contributing to favourable outcomes across multiple Quintuple Aim. Barriers such as navigation complexity, technical issues, alert fatigue, and overwhelming training led to negative outcomes in EMR use. Tensions around documentation practices and patient expectations of system use, resulted in mixed outcomes. Overall, clinicians viewed EMRs as essential, but sustained adoption required improved usability, tailored training, and better system integration.

Conclusion

This study concludes that while the EMRs improved safety, efficiency, and access to information, their design and implementation also introduced burdens that negatively affected clinician experience. EMRs significantly shape the healthcare workforce, influencing workflow, wellbeing, and professional engagement. In outpatient diabetes care, specific workflow challenges such as glycaemic data integration highlight that existing EMR designs may not fully support the complexity of chronic disease management. To maximise benefits, EMR initiatives should be approached as quality improvement activities, with role-specific training, reliable infrastructure, and clinician involvement in system optimisation. Future research should address usability challenges, enhance integration, and ensure that both clinician and patient perspectives guide digital health transformation.
背景:电子病历(EMRs)旨在提高医疗效率、安全性和质量。然而,EMR实施的影响,特别是在门诊糖尿病护理方面,仍未得到充分探讨。本研究探讨临床医生在糖尿病门诊医疗中使用电子病历的观点。方法:本定性研究按照COREQ指南进行,涉及澳大利亚昆士兰州一家大都市糖尿病服务中心的22名临床医生(医生、护士和专职健康人员)的四个焦点小组。在“五重目标”和“技术接受模型”/“技术接受与使用统一理论”框架的指导下,采用演绎内容分析对数据进行分析。结果:临床医生报告了五项目标领域的混合结果,这些结果受到技术采用结构的影响。提高效率、获取患者信息和处方安全等促进因素反映了感知到的有用性和积极态度,有助于在多个“五大目标”中取得有利结果。导航复杂性、技术问题、警报疲劳和压倒性的培训等障碍导致EMR使用的负面结果。文档实践和患者对系统使用的期望之间的紧张关系导致了不同的结果。总体而言,临床医生认为电子病历是必要的,但持续采用需要改进可用性、量身定制的培训和更好的系统集成。结论:本研究得出结论,虽然电子病历提高了安全性、效率和信息获取,但其设计和实施也带来了负担,对临床医生的体验产生了负面影响。电子病历极大地塑造了医疗保健人力,影响了工作流程、健康和专业参与度。在门诊糖尿病护理中,特定的工作流程挑战,如血糖数据整合,突出表明现有的电子病历设计可能无法完全支持慢性疾病管理的复杂性。为了最大限度地提高效益,电子病历计划应该作为质量改进活动,具有特定角色的培训、可靠的基础设施和临床医生参与系统优化。未来的研究应解决可用性挑战,加强整合,并确保临床医生和患者的观点都能指导数字健康转型。
{"title":"Clinicians’ perspectives on electronic medical records use in diabetes outpatient Care: A qualitative study","authors":"Wenyong Wang ,&nbsp;Mahnaz Samadbeik ,&nbsp;Gaurav Puri ,&nbsp;Donald S.A. McLeod ,&nbsp;Elton Lobo ,&nbsp;Tuan Duong ,&nbsp;Titus Kirwa ,&nbsp;Clair Sullivan","doi":"10.1016/j.ijmedinf.2026.106275","DOIUrl":"10.1016/j.ijmedinf.2026.106275","url":null,"abstract":"<div><h3>Background</h3><div>Electronic Medical Records (EMRs) aim to improve efficiency, safety, and quality of care. However, the impact of EMR implementation, particularly in outpatient diabetes care, remains underexplored. This study explored clinicians’ perspectives on EMR use in diabetes outpatient care.</div></div><div><h3>Methods</h3><div>This qualitative study, conducted in line with COREQ guidelines, involved four focus groups with 22 clinicians (doctors, nurses, and allied health) at a metropolitan diabetes service in Queensland, Australia. Data were analysed using deductive content analysis, guided by the Quintuple Aim and Technology Acceptance Model/Unified Theory of Acceptance and Use of Technology frameworks.</div></div><div><h3>Results</h3><div>Clinicians reported mixed outcomes across the Quintuple Aim domains, shaped by technology adoption constructs. Facilitators such as improved efficiency, access to patient information, and prescribing safety reflected perceived usefulness and positive attitudes, contributing to favourable outcomes across multiple Quintuple Aim. Barriers such as navigation complexity, technical issues, alert fatigue, and overwhelming training led to negative outcomes in EMR use. Tensions around documentation practices and patient expectations of system use, resulted in mixed outcomes<strong>.</strong> Overall, clinicians viewed EMRs as essential, but sustained adoption required improved usability, tailored training, and better system integration.</div></div><div><h3>Conclusion</h3><div>This study concludes that while the EMRs improved safety, efficiency, and access to information, their design and implementation also introduced burdens that negatively affected clinician experience. EMRs significantly shape the healthcare workforce, influencing workflow, wellbeing, and professional engagement. In outpatient diabetes care, specific workflow challenges such as glycaemic data integration highlight that existing EMR designs may not fully support the complexity of chronic disease management. To maximise benefits, EMR initiatives should be approached as quality improvement activities, with role-specific training, reliable infrastructure, and clinician involvement in system optimisation. Future research should address usability challenges, enhance integration, and ensure that both clinician and patient perspectives guide digital health transformation.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"209 ","pages":"Article 106275"},"PeriodicalIF":4.1,"publicationDate":"2026-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145960768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predictive value of machine learning for mortality risk in aortic dissection: a systematic review and meta-analysis 机器学习对主动脉夹层死亡风险的预测价值:系统回顾和荟萃分析
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-11 DOI: 10.1016/j.ijmedinf.2026.106271
Zhihong Han , Baixin Li , Jie Liu

Background

Aortic dissection (AD) is a critical cardiovascular disorder with substantial risks of short-term mortality. Some researchers have endeavored to utilize machine learning (ML) approaches to develop predictive models for the risk of mortality in AD. However, systematic evidence about the accuracy of these models remains scarce, which poses challenges to the development and enhancement of risk assessment tools. Therefore, this study seeks to systematically review the reliability of ML in forecasting the risk of mortality in AD.

Methods

A search was implemented through PubMed, Cochrane, Embase, and Web of Science up to September 11, 2025. The prediction model risk of bias (RoB) assessment tool (PROBAST) was leveraged to estimate the RoB of the included studies. Subgroup analyses were implemented based upon types of AD and time of death.

Results

In total, 35 studies were included, covering 19,838 patients with AD. The results showed that, within the training datasets, ML models demonstrated a sensitivity (SEN) of 0.75 (95% CI: 0.72–0.78) and specificity (SPE) of 0.77 (95% CI: 0.74–0.80) for predicting mortality in AD. Within the validation set, which mainly focused on TAAD, the SEN was 0.79 (95% CI: 0.74–0.84) and the SPE was 0.78 (95% CI: 0.68–0.85). For in-hospital mortality, the SEN was 0.78 (95% CI: 0.72–0.83) and the SPE was 0.77 (95% CI: 0.65–0.86); for out-of-hospital mortality, the SEN and SPE were 0.81–0.84 and 0.74–0.86.

Conclusion

ML models demonstrate remarkable accuracy in forecasting the risk of mortality in AD and show superior performance relative to existing scoring systems to some extent. Future research should incorporate more multi-center, multi-ethnic, and geographically varied cases to develop a more broadly applicable risk prediction tool and offer insights for the tailored prevention strategies.
主动脉夹层(AD)是一种严重的心血管疾病,具有短期死亡的重大风险。一些研究人员努力利用机器学习(ML)方法来开发阿尔茨海默病死亡风险的预测模型。然而,关于这些模型准确性的系统证据仍然很少,这对风险评估工具的开发和增强提出了挑战。因此,本研究旨在系统地回顾ML预测AD患者死亡风险的可靠性。方法检索截止到2025年9月11日的PubMed、Cochrane、Embase和Web of Science。运用预测模型偏倚风险评估工具(PROBAST)估计纳入研究的偏倚风险。根据AD类型和死亡时间进行亚组分析。结果共纳入35项研究,共19,838例AD患者。结果显示,在训练数据集中,ML模型预测AD死亡率的敏感性(SEN)为0.75 (95% CI: 0.72-0.78),特异性(SPE)为0.77 (95% CI: 0.74-0.80)。在主要关注TAAD的验证集中,SEN为0.79 (95% CI: 0.74-0.84), SPE为0.78 (95% CI: 0.68-0.85)。对于院内死亡率,SEN为0.78 (95% CI: 0.72-0.83), SPE为0.77 (95% CI: 0.65-0.86);院外死亡率的SEN和SPE分别为0.81 ~ 0.84和0.74 ~ 0.86。结论ml模型在预测AD死亡风险方面具有较好的准确性,在一定程度上优于现有评分系统。未来的研究应纳入更多的多中心、多民族和地域差异的病例,以开发更广泛适用的风险预测工具,并为量身定制的预防策略提供见解。
{"title":"Predictive value of machine learning for mortality risk in aortic dissection: a systematic review and meta-analysis","authors":"Zhihong Han ,&nbsp;Baixin Li ,&nbsp;Jie Liu","doi":"10.1016/j.ijmedinf.2026.106271","DOIUrl":"10.1016/j.ijmedinf.2026.106271","url":null,"abstract":"<div><h3>Background</h3><div>Aortic dissection (AD) is a critical cardiovascular disorder with substantial risks of short-term mortality. Some researchers have endeavored to utilize machine learning (ML) approaches to develop predictive models for the risk of mortality in AD. However, systematic evidence about the accuracy of these models remains scarce, which poses challenges to the development and enhancement of risk assessment tools. Therefore, this study seeks to systematically review the reliability of ML in forecasting the risk of mortality in AD.</div></div><div><h3>Methods</h3><div>A search was implemented through PubMed, Cochrane, Embase, and Web of Science up to September 11, 2025. The prediction model risk of bias (RoB) assessment tool (PROBAST) was leveraged to estimate the RoB of the included studies. Subgroup analyses were implemented based upon types of AD and time of death.</div></div><div><h3>Results</h3><div>In total, 35 studies were included, covering 19,838 patients with AD. The results showed that, within the training datasets, ML models demonstrated a sensitivity (SEN) of 0.75 (95% CI: 0.72–0.78) and specificity (SPE) of 0.77 (95% CI: 0.74–0.80) for predicting mortality in AD. Within the validation set, which mainly focused on TAAD, the SEN was 0.79 (95% CI: 0.74–0.84) and the SPE was 0.78 (95% CI: 0.68–0.85). For in-hospital mortality, the SEN was 0.78 (95% CI: 0.72–0.83) and the SPE was 0.77 (95% CI: 0.65–0.86); for out-of-hospital mortality, the SEN and SPE were 0.81–0.84 and 0.74–0.86.</div></div><div><h3>Conclusion</h3><div>ML models demonstrate remarkable accuracy in forecasting the risk of mortality in AD and show superior performance relative to existing scoring systems to some extent. Future research should incorporate more multi-center, multi-ethnic, and geographically varied cases to develop a more broadly applicable risk prediction tool and offer insights for the tailored prevention strategies.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"209 ","pages":"Article 106271"},"PeriodicalIF":4.1,"publicationDate":"2026-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145979460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated extraction of fluoropyrimidine treatment and treatment-related toxicities from clinical notes using natural language processing 使用自然语言处理从临床记录中自动提取氟嘧啶治疗和治疗相关毒性
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-10 DOI: 10.1016/j.ijmedinf.2026.106276
Xizhi Wu , Madeline S. Kreider , Philip E. Empey , Chenyu Li , Yanshan Wang

Objective

Fluoropyrimidines are widely prescribed for colorectal and breast cancers, but are associated with toxicities such as hand-foot syndrome and cardiotoxicity. Since toxicity documentation is often embedded in clinical notes, we aimed to develop and evaluate natural language processing (NLP) methods to extract treatment and toxicity information.

Materials and methods

We constructed a gold-standard dataset of 236 clinical notes from 204,165 adult oncology patients. Domain experts annotated categories related to treatment regimens and toxicities. We developed rule-based, machine learning-based (Random Forest [RF], Support Vector Machine [SVM], Logistic Regression [LR]), deep learning-based (BERT, ClinicalBERT), and large language models (LLM)-based NLP approaches (zero-shot and error analysis prompting). A 5-fold cross validation were conducted to validate each model.

Results

Error analysis prompting achieved optimal precision, recall, and F1 scores for treatment (F1 = 1.000) and toxicities extraction (F1 = 0.965), whereas zero-shot perform moderately (treatment F1 = 0.889, toxicities extraction F1 = 0.854) Rule-based reached F1 = 1.000 for treatment and F1 = 0.904 for toxicities extraction. LR and SVM ranked second and fourth for toxicities extraction (LR F1 = 0.914, SVM F1 = 0.903). Deep learning and RF underperformed, with performance of BERT reached F1 = 0.792 for treatment and F1 = 0.837 for toxicities extraction.,ClinicalBERT reached F1 = 0.797 for treatment and F1 = 0.884 for toxicities extraction). RF reached F1 = 0.745 for treatment and F1 = 0.853 for toxicities extraction.

Discussion

LMM-based error analysis outperformed all others, followed by machine learning methods. Machine learning and deep learning methods were limited by small training data and showed limited generalizability, particularly for rare categories.

Conclusion

LLM-based error analysis most effectively extracted fluoropyrimidine treatment and toxicity information from clinical notes, and has strong potential to support oncology research and pharmacovigilance.
目的氟嘧啶广泛用于结直肠癌和乳腺癌,但与手足综合征和心脏毒性等毒性有关。由于毒性文件通常嵌入在临床记录中,我们旨在开发和评估自然语言处理(NLP)方法来提取治疗和毒性信息。材料与方法我们构建了一个金标准数据集,包含来自204,165名成年肿瘤患者的236份临床记录。领域专家注释了与治疗方案和毒性有关的类别。我们开发了基于规则的、基于机器学习的(随机森林[RF]、支持向量机[SVM]、逻辑回归[LR])、基于深度学习的(BERT、ClinicalBERT)和基于大型语言模型(LLM)的NLP方法(零射击和错误分析提示)。对每个模型进行5次交叉验证。结果serror分析提示在治疗和毒理提取的精密度、召回率和F1得分(F1 = 1.000)均达到了最佳水平(F1 = 0.965),而zero-shot表现中等(治疗F1 = 0.889,毒理提取F1 = 0.854),基于规则的治疗F1 = 1.000,毒理提取F1 = 0.904。LR和SVM的毒性提取效果分别为2、4位(LR F1 = 0.914, SVM F1 = 0.903)。深度学习和RF表现不佳,BERT在治疗方面的表现为F1 = 0.792,在毒性提取方面的表现为F1 = 0.837。,ClinicalBERT在治疗方面达到F1 = 0.797,毒性提取方面达到F1 = 0.884)。治疗组RF为F1 = 0.745,毒副作用提取组RF为F1 = 0.853。讨论基于lmm的误差分析优于所有其他方法,其次是机器学习方法。机器学习和深度学习方法受到小型训练数据的限制,并且泛化能力有限,特别是对于罕见的类别。结论基于llm的误差分析能最有效地从临床记录中提取氟嘧啶的治疗和毒性信息,在支持肿瘤研究和药物警戒方面具有很强的潜力。
{"title":"Automated extraction of fluoropyrimidine treatment and treatment-related toxicities from clinical notes using natural language processing","authors":"Xizhi Wu ,&nbsp;Madeline S. Kreider ,&nbsp;Philip E. Empey ,&nbsp;Chenyu Li ,&nbsp;Yanshan Wang","doi":"10.1016/j.ijmedinf.2026.106276","DOIUrl":"10.1016/j.ijmedinf.2026.106276","url":null,"abstract":"<div><h3>Objective</h3><div>Fluoropyrimidines are widely prescribed for colorectal and breast cancers, but are associated with toxicities such as hand-foot syndrome and cardiotoxicity. Since toxicity documentation is often embedded in clinical notes, we aimed to develop and evaluate natural language processing (NLP) methods to extract treatment and toxicity information.</div></div><div><h3>Materials and methods</h3><div>We constructed a gold-standard dataset of 236 clinical notes from 204,165 adult oncology patients. Domain experts annotated categories related to treatment regimens and toxicities. We developed rule-based, machine learning-based (Random Forest [RF], Support Vector Machine [SVM], Logistic Regression [LR]), deep learning-based (BERT, ClinicalBERT), and large language models (LLM)-based NLP approaches (zero-shot and error analysis prompting). A 5-fold cross validation were conducted to validate each model.</div></div><div><h3>Results</h3><div>Error analysis prompting achieved optimal precision, recall, and F1 scores for treatment (F1 = 1.000) and toxicities extraction (F1 = 0.965), whereas zero-shot perform moderately (treatment F1 = 0.889, toxicities extraction F1 = 0.854) Rule-based reached F1 = 1.000 for treatment and F1 = 0.904 for toxicities extraction. LR and SVM ranked second and fourth for toxicities extraction (LR F1 = 0.914, SVM F1 = 0.903). Deep learning and RF underperformed, with performance of BERT reached F1 = 0.792 for treatment and F1 = 0.837 for toxicities extraction.,ClinicalBERT reached F1 = 0.797 for treatment and F1 = 0.884 for toxicities extraction). RF reached F1 = 0.745 for treatment and F1 = 0.853 for toxicities extraction.</div></div><div><h3>Discussion</h3><div>LMM-based error analysis outperformed all others, followed by machine learning methods. Machine learning and deep learning methods were limited by small training data and showed limited generalizability, particularly for rare categories.</div></div><div><h3>Conclusion</h3><div>LLM-based error analysis most effectively extracted fluoropyrimidine treatment and toxicity information from clinical notes, and has strong potential to support oncology research and pharmacovigilance.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"209 ","pages":"Article 106276"},"PeriodicalIF":4.1,"publicationDate":"2026-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145979481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An old disease, a new linguistic challenge for large language models: patient education on psoriasis and psoriatic arthritis in an underrepresented medical language 一种古老的疾病,对大型语言模型的新的语言挑战:在代表性不足的医学语言中对银屑病和银屑病关节炎的患者教育
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-10 DOI: 10.1016/j.ijmedinf.2025.106246
Ahmet Ugur Atilan, Niyazi Cetin

Objective

Large Language Models (LLMs) are increasingly applied to patient education, yet their performance in languages that are relatively underrepresented in medical-domain corpora and large language model training datasets remains underexplored. Psoriasis and psoriatic arthritis (PsA) are chronic, immune-mediated diseases requiring lifelong patient engagement, making them suitable conditions to evaluate the clarity, reliability, and inclusivity of AI-generated educational content. To assess the comprehensibility, scientific reliability, and patient-centered communication of Turkish patient education materials for psoriasis vulgaris and PsA generated by seven state-of-the-art LLMs.

Methods

A cross-sectional analysis compared outputs from ChatGPT-4o, Gemini 2.0 Flash, Claude 3.7 Sonnet, Grok 3, Qwen 2.5, DeepSeek R1, and Mistral Large 2. Brochures were produced using standardized zero-shot prompts and evaluated via the Ateşman readability index and the DISCERN instrument. Overall differences in DISCERN scores across the seven models were assessed using a Friedman test, followed by Bonferroni-adjusted Wilcoxon signed-rank post-hoc analyses.

Results

Readability scores ranged from 61.6 to 80.2 (mean = 71.3 ± 6.9), with ChatGPT-4o and Qwen 2.5 generating the most accessible texts. DISCERN reliability scores ranged from 38.5 to 60.5, with Claude 3.7 Sonnet and Gemini 2.0 Flash showing the highest accuracy. Models prioritizing factual precision produced denser language, while conversational models favored fluency but sacrificed depth. Notable variation was observed, with only Claude 3.7 Sonnet and Gemini 2.0 Flash consistently reflecting patient-centered perspectives.

Conclusion

LLMs showed observable differences in balancing clarity and reliability when generating health education leaflets in Turkish. Most outputs appeared to lack explicit psychosocial framing and emphasis on shared decision-making, which may suggest the need for more culturally adaptive training, clinician oversight, and locally grounded validation frameworks to support safe and inclusive AI-based patient education.
大型语言模型(llm)越来越多地应用于患者教育,但它们在医学领域语料库和大型语言模型训练数据集中相对代表性不足的语言中的表现仍未得到充分探索。银屑病和银屑病关节炎(PsA)是一种慢性、免疫介导的疾病,需要患者终身参与,因此它们是评估人工智能生成的教育内容的清晰度、可靠性和包容性的合适条件。评估由7位最先进的法学硕士生成的土耳其寻常型牛皮癣和PsA患者教育材料的可理解性、科学可靠性和以患者为中心的交流。方法横断面分析比较chatgpt - 40、Gemini 2.0 Flash、Claude 3.7 Sonnet、Grok 3、Qwen 2.5、DeepSeek R1和Mistral Large 2的输出。使用标准化的零射击提示制作小册子,并通过ate可读性指数和DISCERN仪器进行评估。七个模型中辨别得分的总体差异采用弗里德曼检验进行评估,随后采用bonferroni调整的Wilcoxon符号秩事后分析。结果可读性评分范围为61.6 ~ 80.2(平均= 71.3±6.9),其中chatgpt - 40和qwen2.5生成的文本可读性最高。DISCERN的可靠性得分从38.5到60.5不等,克劳德3.7十四行诗和双子座2.0闪光显示出最高的准确性。优先考虑事实准确性的模型产生了更密集的语言,而会话模型倾向于流利,但牺牲了深度。观察到显著的差异,只有克劳德3.7十四行诗和双子座2.0闪光一致地反映了以患者为中心的观点。结论llm在制作土耳其语健康教育宣传单的平衡清晰度和可靠性方面存在显著差异。大多数产出似乎缺乏明确的社会心理框架和对共同决策的强调,这可能表明需要更多的文化适应性培训、临床医生监督和基于当地的验证框架,以支持安全和包容的基于人工智能的患者教育。
{"title":"An old disease, a new linguistic challenge for large language models: patient education on psoriasis and psoriatic arthritis in an underrepresented medical language","authors":"Ahmet Ugur Atilan,&nbsp;Niyazi Cetin","doi":"10.1016/j.ijmedinf.2025.106246","DOIUrl":"10.1016/j.ijmedinf.2025.106246","url":null,"abstract":"<div><h3>Objective</h3><div>Large Language Models (LLMs) are increasingly applied to patient education, yet their performance in languages that are relatively underrepresented in medical-domain corpora and large language model training datasets remains underexplored. Psoriasis and psoriatic arthritis (PsA) are chronic, immune-mediated diseases requiring lifelong patient engagement, making them suitable conditions to evaluate the clarity, reliability, and inclusivity of AI-generated educational content. To assess the comprehensibility, scientific reliability, and patient-centered communication of Turkish patient education materials for psoriasis vulgaris and PsA generated by seven state-of-the-art LLMs.</div></div><div><h3>Methods</h3><div>A cross-sectional analysis compared outputs from ChatGPT-4o, Gemini 2.0 Flash, Claude 3.7 Sonnet, Grok 3, Qwen 2.5, DeepSeek R1, and Mistral Large 2. Brochures were produced using standardized zero-shot prompts and evaluated via the Ateşman readability index and the DISCERN instrument. Overall differences in DISCERN scores across the seven models were assessed using a Friedman test, followed by Bonferroni-adjusted Wilcoxon signed-rank post-hoc analyses.</div></div><div><h3>Results</h3><div>Readability scores ranged from 61.6 to 80.2 (mean = 71.3 ± 6.9), with ChatGPT-4o and Qwen 2.5 generating the most accessible texts. DISCERN reliability scores ranged from 38.5 to 60.5, with Claude 3.7 Sonnet and Gemini 2.0 Flash showing the highest accuracy. Models prioritizing factual precision produced denser language, while conversational models favored fluency but sacrificed depth. Notable variation was observed, with only Claude 3.7 Sonnet and Gemini 2.0 Flash consistently reflecting patient-centered perspectives.</div></div><div><h3>Conclusion</h3><div>LLMs showed observable differences in balancing clarity and reliability when generating health education leaflets in Turkish. Most outputs appeared to lack explicit psychosocial framing and emphasis on shared decision-making, which may suggest the need for more culturally adaptive training, clinician oversight, and locally grounded validation frameworks to support safe and inclusive AI-based patient education.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"209 ","pages":"Article 106246"},"PeriodicalIF":4.1,"publicationDate":"2026-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145979482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Empowering caregivers of individuals with autism spectrum disorder through sensor-based monitoring of emotional dysregulation: A scoping review 通过基于传感器的情绪失调监测赋予自闭症谱系障碍患者照顾者权力:范围综述
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-09 DOI: 10.1016/j.ijmedinf.2026.106262
Moid Sandhu , Siddique Latif , Andrew Bayor , Wei Lu , Mahnoosh Kholghi , Deepa Prabhu , David Silvera-Tawil
Objective: This paper critically reviews existing work in sensor-based emotional dysregulation monitoring to support caregivers of individuals diagnosed with autism spectrum disorder (ASD).
Methods: A systematic literature search was conducted across six databases (Google Scholar, IEEE Xplore, Scopus, ACM Digital Library, Web of Science, and PubMed) covering publications from January 1, 2016, to September 30, 2025.
Results: Thirty-two studies met inclusion criteria, comprising 27 focused on sensor-based emotional dysregulation detection and 5 addressing intervention or support mechanisms. These studies suggest that sensor-based technologies have potential for continuous physiological monitoring, facilitating early detection and intervention to support emotional dysregulation episodes. Critical deficiencies were identified in real-time alerting capabilities, autonomous intervention deployment, self-regulation framework integration, system reliability, long-term sustainability, user interface design, and cross-environment scalability.
Conclusion: There is a significant need to develop real-time emotion monitoring systems to empower caregivers in delivering timely, targeted interventions for individuals diagnosed with ASD. Future research should prioritise the development of real-time alert systems, autonomous intervention protocols, and solutions optimised for reliability, sustainability, usability, and adaptability across heterogeneous care settings.
目的:本文综述了基于传感器的情绪失调监测的现有工作,以支持自闭症谱系障碍(ASD)患者的护理人员。方法:对6个数据库(b谷歌Scholar、IEEE Xplore、Scopus、ACM Digital Library、Web of Science和PubMed)进行系统文献检索,检索时间为2016年1月1日至2025年9月30日。结果:32项研究符合纳入标准,其中27项关注基于传感器的情绪失调检测,5项关注干预或支持机制。这些研究表明,基于传感器的技术具有持续生理监测的潜力,有助于早期发现和干预,以支持情绪失调发作。在实时警报能力、自主干预部署、自我调节框架集成、系统可靠性、长期可持续性、用户界面设计和跨环境可扩展性方面发现了关键缺陷。结论:迫切需要开发实时情绪监测系统,使护理人员能够为ASD患者提供及时、有针对性的干预措施。未来的研究应优先发展实时警报系统、自主干预协议和解决方案,以优化可靠性、可持续性、可用性和跨异构护理环境的适应性。
{"title":"Empowering caregivers of individuals with autism spectrum disorder through sensor-based monitoring of emotional dysregulation: A scoping review","authors":"Moid Sandhu ,&nbsp;Siddique Latif ,&nbsp;Andrew Bayor ,&nbsp;Wei Lu ,&nbsp;Mahnoosh Kholghi ,&nbsp;Deepa Prabhu ,&nbsp;David Silvera-Tawil","doi":"10.1016/j.ijmedinf.2026.106262","DOIUrl":"10.1016/j.ijmedinf.2026.106262","url":null,"abstract":"<div><div><em>Objective:</em> This paper critically reviews existing work in sensor-based emotional dysregulation monitoring to support caregivers of individuals diagnosed with autism spectrum disorder (ASD).</div><div><em>Methods:</em> A systematic literature search was conducted across six databases (Google Scholar, IEEE Xplore, Scopus, ACM Digital Library, Web of Science, and PubMed) covering publications from January 1, 2016, to September 30, 2025.</div><div><em>Results:</em> Thirty-two studies met inclusion criteria, comprising 27 focused on sensor-based emotional dysregulation detection and 5 addressing intervention or support mechanisms. These studies suggest that sensor-based technologies have potential for continuous physiological monitoring, facilitating early detection and intervention to support emotional dysregulation episodes. Critical deficiencies were identified in real-time alerting capabilities, autonomous intervention deployment, self-regulation framework integration, system reliability, long-term sustainability, user interface design, and cross-environment scalability.</div><div><em>Conclusion:</em> There is a significant need to develop real-time emotion monitoring systems to empower caregivers in delivering timely, targeted interventions for individuals diagnosed with ASD. Future research should prioritise the development of real-time alert systems, autonomous intervention protocols, and solutions optimised for reliability, sustainability, usability, and adaptability across heterogeneous care settings.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"209 ","pages":"Article 106262"},"PeriodicalIF":4.1,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145979049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From promise to practice: strengthening evidence for AI conversational agents in healthcare 从承诺到实践:加强医疗保健中AI会话代理的证据
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-09 DOI: 10.1016/j.ijmedinf.2026.106264
Yang Gao, Yingjie Lu, Xiaofei Li
{"title":"From promise to practice: strengthening evidence for AI conversational agents in healthcare","authors":"Yang Gao,&nbsp;Yingjie Lu,&nbsp;Xiaofei Li","doi":"10.1016/j.ijmedinf.2026.106264","DOIUrl":"10.1016/j.ijmedinf.2026.106264","url":null,"abstract":"","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"209 ","pages":"Article 106264"},"PeriodicalIF":4.1,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145979479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interpretable machine learning-based prediction of liver metastasis risk in elderly patients with small cell lung Cancer: A study based on the SEER database and external validation in a Chinese cohort 基于可解释机器学习的老年小细胞肺癌患者肝转移风险预测:基于SEER数据库和中国队列外部验证的研究
IF 4.1 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-08 DOI: 10.1016/j.ijmedinf.2026.106274
Hang Chen , Wenchao Dai , Jun Yang , Xin Dang , Li Jiang

Purpose

Small cell lung cancer (SCLC) is a highly aggressive malignancy with a high incidence of liver metastases, particularly among elderly patients, which significantly worsens survival outcomes. However, efficient predictive tools targeting this population remain scarce. This study aimed to develop and validate an interpretable machine learning-based model to re-stratify the risk of liver metastasis in elderly patients with SCLC after completion of routine staging evaluation at initial diagnosis.

Methods

A total of 10,080 patients aged ≥60 years with histologically confirmed SCLC were included from the SEER database (2010–2017) and the Affiliated Hospital of North Sichuan Medical College, China (2010–2024). Patients from SEER were randomly assigned to a training set (n = 7719) and an internal validation set (n = 1930), while 431 patients from China comprised the external validation set. Feature selection was performed using the Boruta algorithm, identifying 11 key variables. Seven ML models, namely, Logistic Regression, Naïve Bayes, Support Vector Machine (SVM), Decision Tree, Random Forest, XGBoost, and LightGBM, were developed to compare their predictive performance. The optimal model was further interpreted using SHAP (SHapley Additive exPlanations).

Results

The incidence of liver metastasis was approximately 32.89%, 35.39%, and 32.71% in the training, internal validation, and external validation sets, respectively. Comparative analysis across models demonstrated that, in the internal validation set, XGBoost achieved the best overall discriminative performance, with an AUC of 0.820, slightly outperforming LightGBM (0.819), logistic regression (0.813), and random forest (0.811). In the external validation set, the performance of all models declined. Given its relatively superior predictive performance, XGBoost was selected as the final model for interpretability analyses. SHAP analysis indicated that LDS/EDS, tumor stage, bone metastasis, and brain metastasis were the most influential features contributing to the model predictions.

Conclusion

The XGBoost-based model exhibited moderate predictive value and satisfactory interpretability in assessing the risk of liver metastasis in patients with SCLC, suggesting its potential utility as an adjunctive decision-support tool following initial diagnostic staging. Nevertheless, its generalizability across different populations requires further validation, and localized recalibration may be necessary prior to broader clinical implementation.
小细胞肺癌(SCLC)是一种高度侵袭性的恶性肿瘤,肝转移发生率高,尤其是在老年患者中,这明显恶化了生存结果。然而,针对这一人群的有效预测工具仍然很少。本研究旨在开发和验证一个可解释的基于机器学习的模型,以重新分层老年SCLC患者在初始诊断完成常规分期评估后的肝转移风险。方法从SEER数据库(2010-2017年)和川北医学院附属医院(2010-2024年)共纳入10080例年龄≥60岁组织学证实的SCLC患者。来自SEER的患者被随机分配到训练集(n = 7719)和内部验证集(n = 1930),而来自中国的431名患者组成外部验证集。采用Boruta算法进行特征选择,识别出11个关键变量。开发了逻辑回归、Naïve贝叶斯、支持向量机(SVM)、决策树、随机森林、XGBoost和LightGBM 7种ML模型,比较它们的预测性能。使用SHapley加性解释(SHapley Additive explanation)进一步解释最优模型。结果训练组、内部验证组和外部验证组的肝转移发生率分别约为32.89%、35.39%和32.71%。跨模型对比分析表明,在内部验证集中,XGBoost的整体判别性能最好,AUC为0.820,略优于LightGBM(0.819)、logistic回归(0.813)和随机森林(0.811)。在外部验证集中,所有模型的性能都下降了。鉴于其相对优越的预测性能,我们选择XGBoost作为可解释性分析的最终模型。SHAP分析表明,LDS/EDS、肿瘤分期、骨转移和脑转移是对模型预测影响最大的特征。结论基于xgboost的模型在评估SCLC患者肝转移风险方面具有中等的预测价值和令人满意的可解释性,提示其作为初始诊断分期后辅助决策支持工具的潜在用途。然而,其在不同人群中的普遍性需要进一步验证,在更广泛的临床应用之前,可能需要进行局部重新校准。
{"title":"Interpretable machine learning-based prediction of liver metastasis risk in elderly patients with small cell lung Cancer: A study based on the SEER database and external validation in a Chinese cohort","authors":"Hang Chen ,&nbsp;Wenchao Dai ,&nbsp;Jun Yang ,&nbsp;Xin Dang ,&nbsp;Li Jiang","doi":"10.1016/j.ijmedinf.2026.106274","DOIUrl":"10.1016/j.ijmedinf.2026.106274","url":null,"abstract":"<div><h3>Purpose</h3><div>Small cell lung cancer (SCLC) is a highly aggressive malignancy with a high incidence of liver metastases, particularly among elderly patients, which significantly worsens survival outcomes. However, efficient predictive tools targeting this population remain scarce. This study aimed to develop and validate an interpretable machine learning-based model to re-stratify the risk of liver metastasis in elderly patients with SCLC after completion of routine staging evaluation at initial diagnosis.</div></div><div><h3>Methods</h3><div>A total of 10,080 patients aged ≥60 years with histologically confirmed SCLC were included from the SEER database (2010–2017) and the Affiliated Hospital of North Sichuan Medical College, China (2010–2024). Patients from SEER were randomly assigned to a training set (n = 7719) and an internal validation set (n = 1930), while 431 patients from China comprised the external validation set. Feature selection was performed using the Boruta algorithm, identifying 11 key variables. Seven ML models, namely, Logistic Regression, Naïve Bayes, Support Vector Machine (SVM), Decision Tree, Random Forest, XGBoost, and LightGBM, were developed to compare their predictive performance. The optimal model was further interpreted using SHAP (SHapley Additive exPlanations).</div></div><div><h3>Results</h3><div>The incidence of liver metastasis was approximately 32.89%, 35.39%, and 32.71% in the training, internal validation, and external validation sets, respectively. Comparative analysis across models demonstrated that, in the internal validation set, XGBoost achieved the best overall discriminative performance, with an AUC of 0.820, slightly outperforming LightGBM (0.819), logistic regression (0.813), and random forest (0.811). In the external validation set, the performance of all models declined. Given its relatively superior predictive performance, XGBoost was selected as the final model for interpretability analyses. SHAP analysis indicated that LDS/EDS, tumor stage, bone metastasis, and brain metastasis were the most influential features contributing to the model predictions.</div></div><div><h3>Conclusion</h3><div>The XGBoost-based model exhibited moderate predictive value and satisfactory interpretability in assessing the risk of liver metastasis in patients with SCLC, suggesting its potential utility as an adjunctive decision-support tool following initial diagnostic staging. Nevertheless, its generalizability across different populations requires further validation, and localized recalibration may be necessary prior to broader clinical implementation.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"209 ","pages":"Article 106274"},"PeriodicalIF":4.1,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145940478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1