Explainable time-to-progression predictions in multiple sclerosis

IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computer methods and programs in biomedicine Pub Date : 2025-05-01 Epub Date: 2025-02-06 DOI:10.1016/j.cmpb.2025.108624
Robbe D’hondt , Klest Dedja , Sofie Aerts , Bart Van Wijmeersch , Tomas Kalincik , Stephen Reddel , Eva Kubala Havrdova , Alessandra Lugaresi , Bianca Weinstock-Guttman , Saloua Mrabet , Patrice Lalive , Allan G. Kermode , Serkan Ozakbas , Francesco Patti , Alexandre Prat , Valentina Tomassini , Izanne Roos , Raed Alroughani , Oliver Gerlach , Samia J. Khoury , Celine Vens
{"title":"Explainable time-to-progression predictions in multiple sclerosis","authors":"Robbe D’hondt ,&nbsp;Klest Dedja ,&nbsp;Sofie Aerts ,&nbsp;Bart Van Wijmeersch ,&nbsp;Tomas Kalincik ,&nbsp;Stephen Reddel ,&nbsp;Eva Kubala Havrdova ,&nbsp;Alessandra Lugaresi ,&nbsp;Bianca Weinstock-Guttman ,&nbsp;Saloua Mrabet ,&nbsp;Patrice Lalive ,&nbsp;Allan G. Kermode ,&nbsp;Serkan Ozakbas ,&nbsp;Francesco Patti ,&nbsp;Alexandre Prat ,&nbsp;Valentina Tomassini ,&nbsp;Izanne Roos ,&nbsp;Raed Alroughani ,&nbsp;Oliver Gerlach ,&nbsp;Samia J. Khoury ,&nbsp;Celine Vens","doi":"10.1016/j.cmpb.2025.108624","DOIUrl":null,"url":null,"abstract":"<div><h3>Background:</h3><div>Prognostic machine learning research in multiple sclerosis has been mainly focusing on black-box models predicting whether a patients’ disability will progress in a fixed number of years. However, as this is a binary yes/no question, it cannot take individual disease severity into account. Therefore, in this work we propose to model the time to disease progression instead. Additionally, we use explainable machine learning techniques to make the model outputs more interpretable.</div></div><div><h3>Methods:</h3><div>A preprocessed subset of 29,201 patients of the international data registry MSBase was used. Disability was assessed in terms of the Expanded Disability Status Scale (EDSS). We predict the time to significant and confirmed disability progression using random survival forests, a machine learning model for survival analysis. Performance is evaluated on a time-dependent area under the receiver operating characteristic and the precision–recall curves. Importantly, predictions are then explained using SHAP and Bellatrex, two explainability toolboxes, and lead to both global (population-wide) as well as local (patient visit-specific) insights.</div></div><div><h3>Results:</h3><div>On the task of predicting progression in 2 years, the random survival forest achieves state-of-the-art performance, comparable to previous work employing a random forest. However, here the random survival forest has the added advantage of being able to predict progression over a longer time horizon, with AUROC <span><math><mrow><mo>&gt;</mo><mn>60</mn><mtext>%</mtext></mrow></math></span> for the first 10 years after baseline. Explainability techniques further validated the model by extracting clinically valid insights from the predictions made by the model. For example, a clear decline in the per-visit probability of progression is observed in more recent years since 2012, likely reflecting globally increasing use of more effective MS therapies.</div></div><div><h3>Conclusion:</h3><div>The binary classification models found in the literature can be extended to a time-to-event setting without loss of performance, thus allowing a more comprehensive prediction of patient prognosis. Furthermore, explainability techniques proved to be key to reach a better understanding of the model and increase validation of its behaviour.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"263 ","pages":"Article 108624"},"PeriodicalIF":4.8000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725000410","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/6 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Background:

Prognostic machine learning research in multiple sclerosis has been mainly focusing on black-box models predicting whether a patients’ disability will progress in a fixed number of years. However, as this is a binary yes/no question, it cannot take individual disease severity into account. Therefore, in this work we propose to model the time to disease progression instead. Additionally, we use explainable machine learning techniques to make the model outputs more interpretable.

Methods:

A preprocessed subset of 29,201 patients of the international data registry MSBase was used. Disability was assessed in terms of the Expanded Disability Status Scale (EDSS). We predict the time to significant and confirmed disability progression using random survival forests, a machine learning model for survival analysis. Performance is evaluated on a time-dependent area under the receiver operating characteristic and the precision–recall curves. Importantly, predictions are then explained using SHAP and Bellatrex, two explainability toolboxes, and lead to both global (population-wide) as well as local (patient visit-specific) insights.

Results:

On the task of predicting progression in 2 years, the random survival forest achieves state-of-the-art performance, comparable to previous work employing a random forest. However, here the random survival forest has the added advantage of being able to predict progression over a longer time horizon, with AUROC >60% for the first 10 years after baseline. Explainability techniques further validated the model by extracting clinically valid insights from the predictions made by the model. For example, a clear decline in the per-visit probability of progression is observed in more recent years since 2012, likely reflecting globally increasing use of more effective MS therapies.

Conclusion:

The binary classification models found in the literature can be extended to a time-to-event setting without loss of performance, thus allowing a more comprehensive prediction of patient prognosis. Furthermore, explainability techniques proved to be key to reach a better understanding of the model and increase validation of its behaviour.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多发性硬化症可解释的进展时间预测
背景:多发性硬化症的预后机器学习研究主要集中在黑盒模型上,预测患者的残疾是否会在固定的年限内进展。然而,由于这是一个二元的是/否问题,它不能考虑到个体疾病的严重程度。因此,在这项工作中,我们建议对疾病进展的时间进行建模。此外,我们使用可解释的机器学习技术使模型输出更具可解释性。方法:使用国际数据注册中心MSBase的29,201例患者的预处理子集。根据扩展残疾状态量表(EDSS)评估残疾。我们使用随机生存森林(一种用于生存分析的机器学习模型)来预测显著和确认残疾进展的时间。性能在接收机工作特性和精确召回曲线下的时间依赖区域上进行评估。重要的是,然后使用SHAP和Bellatrex这两个可解释性工具箱来解释预测,并导致全球(人口范围内)和当地(特定患者访问)的见解。结果:在预测2年进展的任务上,随机生存森林达到了最先进的性能,可与以前使用随机森林的工作相媲美。然而,随机生存森林有一个额外的优势,即能够预测更长的时间范围内的进展,基线后前10年的AUROC为60%。可解释性技术通过从模型所做的预测中提取临床有效的见解进一步验证了模型。例如,自2012年以来,近年来观察到每次就诊的进展概率明显下降,这可能反映了全球越来越多地使用更有效的多发性硬化症治疗。结论:文献中发现的二元分类模型可以在不损失性能的情况下扩展到事件时间设置,从而可以更全面地预测患者预后。此外,可解释性技术被证明是更好地理解模型和增加其行为验证的关键。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computer methods and programs in biomedicine
Computer methods and programs in biomedicine 工程技术-工程:生物医学
CiteScore
12.30
自引率
6.60%
发文量
601
审稿时长
135 days
期刊介绍: To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine. Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.
期刊最新文献
Machine learning classification of normal and malignant cells on the basis of their viscoelastic properties Inhalation exposure and particle deposition across 16 nonhuman primate airway models using computational fluid–particle dynamics for allometric extrapolation Robust prediction of parameterized cardiovascular hemodynamics using deep operator networks with time normalization CRMIPred: Identifying the spatial interactions among cis-regulatory modules via considering their cross-attended epigenetic profiles A robust topology optimization based biomechanical computational framework for patient-specific trabecular bone microstructure reconstruction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1