Leveraging interpretable machine learning in intensive care

IF 4.4 3区管理学 Q1 OPERATIONS RESEARCH & MANAGEMENT SCIENCE Annals of Operations Research Pub Date : 2024-09-19 DOI:10.1007/s10479-024-06226-8

Lasse Bohlen, Julian Rosenberger, Patrick Zschech, Mathias Kraus

{"title":"Leveraging interpretable machine learning in intensive care","authors":"Lasse Bohlen, Julian Rosenberger, Patrick Zschech, Mathias Kraus","doi":"10.1007/s10479-024-06226-8","DOIUrl":null,"url":null,"abstract":"<p>In healthcare, especially within intensive care units (ICU), informed decision-making by medical professionals is crucial due to the complexity of medical data. Healthcare analytics seeks to support these decisions by generating accurate predictions through advanced machine learning (ML) models, such as boosted decision trees and random forests. While these models frequently exhibit accurate predictions across various medical tasks, they often lack interpretability. To address this challenge, researchers have developed interpretable ML models that balance accuracy and interpretability. In this study, we evaluate the performance gap between interpretable and black-box models in two healthcare prediction tasks, mortality and length-of-stay prediction in ICU settings. We focus specifically on the family of generalized additive models (GAMs) as powerful interpretable ML models. Our assessment uses the publicly available Medical Information Mart for Intensive Care dataset, and we analyze the models based on (i) predictive performance, (ii) the influence of compact feature sets (i.e., only few features) on predictive performance, and (iii) interpretability and consistency with medical knowledge. Our results show that interpretable models achieve competitive performance, with a minor decrease of 0.2–0.9 percentage points in area under the receiver operating characteristic relative to state-of-the-art black-box models, while preserving complete interpretability. This remains true even for parsimonious models that use only 2.2 % of patient features. Our study highlights the potential of interpretable models to improve decision-making in ICUs by providing medical professionals with easily understandable and verifiable predictions.</p>","PeriodicalId":8215,"journal":{"name":"Annals of Operations Research","volume":"19 1","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2024-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Operations Research","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1007/s10479-024-06226-8","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPERATIONS RESEARCH & MANAGEMENT SCIENCE","Score":null,"Total":0}

引用次数: 0

Abstract

In healthcare, especially within intensive care units (ICU), informed decision-making by medical professionals is crucial due to the complexity of medical data. Healthcare analytics seeks to support these decisions by generating accurate predictions through advanced machine learning (ML) models, such as boosted decision trees and random forests. While these models frequently exhibit accurate predictions across various medical tasks, they often lack interpretability. To address this challenge, researchers have developed interpretable ML models that balance accuracy and interpretability. In this study, we evaluate the performance gap between interpretable and black-box models in two healthcare prediction tasks, mortality and length-of-stay prediction in ICU settings. We focus specifically on the family of generalized additive models (GAMs) as powerful interpretable ML models. Our assessment uses the publicly available Medical Information Mart for Intensive Care dataset, and we analyze the models based on (i) predictive performance, (ii) the influence of compact feature sets (i.e., only few features) on predictive performance, and (iii) interpretability and consistency with medical knowledge. Our results show that interpretable models achieve competitive performance, with a minor decrease of 0.2–0.9 percentage points in area under the receiver operating characteristic relative to state-of-the-art black-box models, while preserving complete interpretability. This remains true even for parsimonious models that use only 2.2 % of patient features. Our study highlights the potential of interpretable models to improve decision-making in ICUs by providing medical professionals with easily understandable and verifiable predictions.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

在重症监护中利用可解释的机器学习

在医疗保健领域，尤其是在重症监护室（ICU）内，由于医疗数据的复杂性，医疗专业人员做出明智的决策至关重要。医疗分析旨在通过先进的机器学习（ML）模型（如增强决策树和随机森林）生成准确的预测，从而为这些决策提供支持。虽然这些模型经常能对各种医疗任务做出准确预测，但它们往往缺乏可解释性。为了应对这一挑战，研究人员开发了可解释的 ML 模型，在准确性和可解释性之间取得了平衡。在本研究中，我们评估了可解释模型和黑盒模型在两项医疗预测任务（重症监护室的死亡率和住院时间预测）中的性能差距。我们特别关注作为强大的可解释 ML 模型的广义加法模型（GAM）系列。我们的评估使用了公开的重症监护医疗信息集市数据集，并根据以下几个方面对模型进行了分析：(i) 预测性能；(ii) 紧凑型特征集（即只有少数特征）对预测性能的影响；(iii) 可解释性以及与医学知识的一致性。我们的研究结果表明，可解释模型在保持完全可解释性的同时，还能获得有竞争力的性能，与最先进的黑盒模型相比，接收器操作特征下面积略微下降了 0.2-0.9 个百分点。即使是仅使用 2.2% 患者特征的简约模型，情况也是如此。我们的研究强调了可解释模型的潜力，它能为医疗专业人员提供易于理解和验证的预测，从而改善重症监护室的决策。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Annals of Operations Research 管理科学-运筹学与管理科学

CiteScore

7.90

自引率

16.70%

发文量

596

审稿时长

8.4 months

期刊介绍： The Annals of Operations Research publishes peer-reviewed original articles dealing with key aspects of operations research, including theory, practice, and computation. The journal publishes full-length research articles, short notes, expositions and surveys, reports on computational studies, and case studies that present new and innovative practical applications. In addition to regular issues, the journal publishes periodic special volumes that focus on defined fields of operations research, ranging from the highly theoretical to the algorithmic and the applied. These volumes have one or more Guest Editors who are responsible for collecting the papers and overseeing the refereeing process.