Explainable machine learning model for predicting skeletal muscle loss during surgery and adjuvant chemotherapy in ovarian cancer

IF 8.9 1区医学 Journal of Cachexia, Sarcopenia and Muscle Pub Date : 2023-07-12 DOI:10.1002/jcsm.13282

Wen-Han Hsu, Ai-Tung Ko, Chia-Sui Weng, Chih-Long Chang, Ya-Ting Jan, Jhen-Bin Lin, Hung-Ju Chien, Wan-Chun Lin, Fang-Ju Sun, Kun-Pin Wu, Jie Lee

{"title":"Explainable machine learning model for predicting skeletal muscle loss during surgery and adjuvant chemotherapy in ovarian cancer","authors":"Wen-Han Hsu, Ai-Tung Ko, Chia-Sui Weng, Chih-Long Chang, Ya-Ting Jan, Jhen-Bin Lin, Hung-Ju Chien, Wan-Chun Lin, Fang-Ju Sun, Kun-Pin Wu, Jie Lee","doi":"10.1002/jcsm.13282","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Skeletal muscle loss during treatment is associated with poor survival outcomes in patients with ovarian cancer. Although changes in muscle mass can be assessed on computed tomography (CT) scans, this labour-intensive process can impair its utility in clinical practice. This study aimed to develop a machine learning (ML) model to predict muscle loss based on clinical data and to interpret the ML model by applying SHapley Additive exPlanations (SHAP) method.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>This study included the data of 617 patients with ovarian cancer who underwent primary debulking surgery and platinum-based chemotherapy at a tertiary centre between 2010 and 2019. The cohort data were split into training and test sets based on the treatment time. External validation was performed using 140 patients from a different tertiary centre. The skeletal muscle index (SMI) was measured from pre- and post-treatment CT scans, and a decrease in SMI ≥ 5% was defined as muscle loss. We evaluated five ML models to predict muscle loss, and their performance was determined using the area under the receiver operating characteristic curve (AUC) and F1 score. The features for analysis included demographic and disease-specific characteristics and relative changes in body mass index (BMI), albumin, neutrophil-to-lymphocyte ratio (NLR), and platelet-to-lymphocyte ratio (PLR). The SHAP method was applied to determine the importance of the features and interpret the ML models.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The median (inter-quartile range) age of the cohort was 52 (46–59) years. After treatment, 204 patients (33.1%) experienced muscle loss in the training and test datasets, while 44 (31.4%) patients experienced muscle loss in the external validation dataset. Among the five evaluated ML models, the random forest model achieved the highest AUC (0.856, 95% confidence interval: 0.854–0.859) and F1 score (0.726, 95% confidence interval: 0.722–0.730). In the external validation, the random forest model outperformed all ML models with an AUC of 0.874 and an F1 score of 0.741. The results of the SHAP method showed that the albumin change, BMI change, malignant ascites, NLR change, and PLR change were the most important factors in muscle loss. At the patient level, SHAP force plots demonstrated insightful interpretation of our random forest model to predict muscle loss.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>Explainable ML model was developed using clinical data to identify patients experiencing muscle loss after treatment and provide information of feature contribution. Using the SHAP method, clinicians may better understand the contributors to muscle loss and target interventions to counteract muscle loss.</p>\n </section>\n </div>","PeriodicalId":186,"journal":{"name":"Journal of Cachexia, Sarcopenia and Muscle","volume":"14 5","pages":"2044-2053"},"PeriodicalIF":8.9000,"publicationDate":"2023-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jcsm.13282","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cachexia, Sarcopenia and Muscle","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jcsm.13282","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Background

Skeletal muscle loss during treatment is associated with poor survival outcomes in patients with ovarian cancer. Although changes in muscle mass can be assessed on computed tomography (CT) scans, this labour-intensive process can impair its utility in clinical practice. This study aimed to develop a machine learning (ML) model to predict muscle loss based on clinical data and to interpret the ML model by applying SHapley Additive exPlanations (SHAP) method.

Methods

This study included the data of 617 patients with ovarian cancer who underwent primary debulking surgery and platinum-based chemotherapy at a tertiary centre between 2010 and 2019. The cohort data were split into training and test sets based on the treatment time. External validation was performed using 140 patients from a different tertiary centre. The skeletal muscle index (SMI) was measured from pre- and post-treatment CT scans, and a decrease in SMI ≥ 5% was defined as muscle loss. We evaluated five ML models to predict muscle loss, and their performance was determined using the area under the receiver operating characteristic curve (AUC) and F1 score. The features for analysis included demographic and disease-specific characteristics and relative changes in body mass index (BMI), albumin, neutrophil-to-lymphocyte ratio (NLR), and platelet-to-lymphocyte ratio (PLR). The SHAP method was applied to determine the importance of the features and interpret the ML models.

Results

The median (inter-quartile range) age of the cohort was 52 (46–59) years. After treatment, 204 patients (33.1%) experienced muscle loss in the training and test datasets, while 44 (31.4%) patients experienced muscle loss in the external validation dataset. Among the five evaluated ML models, the random forest model achieved the highest AUC (0.856, 95% confidence interval: 0.854–0.859) and F1 score (0.726, 95% confidence interval: 0.722–0.730). In the external validation, the random forest model outperformed all ML models with an AUC of 0.874 and an F1 score of 0.741. The results of the SHAP method showed that the albumin change, BMI change, malignant ascites, NLR change, and PLR change were the most important factors in muscle loss. At the patient level, SHAP force plots demonstrated insightful interpretation of our random forest model to predict muscle loss.

Conclusions

Explainable ML model was developed using clinical data to identify patients experiencing muscle loss after treatment and provide information of feature contribution. Using the SHAP method, clinicians may better understand the contributors to muscle loss and target interventions to counteract muscle loss.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

可解释的机器学习模型用于预测卵巢癌症手术和辅助化疗期间骨骼肌损失。

背景：癌症患者在治疗过程中骨骼肌丢失与生存率低有关。尽管可以通过计算机断层扫描（CT）来评估肌肉质量的变化，但这种劳动密集型的过程可能会削弱其在临床实践中的实用性。本研究旨在开发一种基于临床数据预测肌肉损失的机器学习（ML）模型，并通过应用SHapley加性预测（SHAP）方法来解释ML模型。方法：这项研究包括了617名癌症患者的数据，他们在2010年至2019年期间在一家三级中心接受了初级减瘤手术和基于铂的化疗。根据治疗时间将队列数据分为训练集和测试集。使用来自不同三级中心的140名患者进行外部验证。通过治疗前后的CT扫描测量骨骼肌指数（SMI），SMI下降≥5%被定义为肌肉损失。我们评估了五个ML模型来预测肌肉损失，并使用受试者工作特征曲线下面积（AUC）和F1评分来确定它们的性能。分析的特征包括人口统计学和疾病特异性特征以及体重指数（BMI）、白蛋白、中性粒细胞与淋巴细胞比率（NLR）和血小板与淋巴细胞比率的相对变化。应用SHAP方法来确定特征的重要性并解释ML模型。结果：队列的中位（四分位间距）年龄为52岁（46-59岁）。治疗后，204名患者（33.1%）在训练和测试数据集中出现肌肉损失，而44名患者（31.4%）在外部验证数据集中出现了肌肉损失。在五个评估的ML模型中，随机森林模型获得了最高的AUC（0.856，95%置信区间：0.854-0.859）和F1得分（0.726，95%可信区间：0.722-0.730）。在外部验证中，随机林模型的AUC为0.874，F1得分为0.741，优于所有ML模型。SHAP法的结果表明，白蛋白变化、BMI变化、恶性腹水、NLR变化和PLR变化是肌肉损失的最重要因素。在患者层面，SHAP力图展示了对我们的随机森林模型的深刻解释，以预测肌肉损失。结论：使用临床数据开发了可解释的ML模型，以识别治疗后出现肌肉损失的患者，并提供特征贡献的信息。使用SHAP方法，临床医生可以更好地了解肌肉损失的原因，并针对性地采取干预措施来抵消肌肉损失。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Cachexia, Sarcopenia and Muscle Medicine-Orthopedics and Sports Medicine

自引率

12.40%

发文量

期刊介绍： The Journal of Cachexia, Sarcopenia, and Muscle is a prestigious, peer-reviewed international publication committed to disseminating research and clinical insights pertaining to cachexia, sarcopenia, body composition, and the physiological and pathophysiological alterations occurring throughout the lifespan and in various illnesses across the spectrum of life sciences. This journal serves as a valuable resource for physicians, biochemists, biologists, dieticians, pharmacologists, and students alike.

期刊最新文献

Issue Information Issue Information Author Index Abstracts Call for standardization in assessment and reporting of muscle and adipose change using computed tomography analysis in oncology: A scoping review