Development of a prediction model for clinically-relevant fatigue: a multi-cancer approach.

IF 3.3 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES Quality of Life Research Pub Date : 2025-01-01 Epub Date: 2024-11-09 DOI:10.1007/s11136-024-03807-9

Dhirendra Adiprakoso, Dimitris Katsimpokis, Simone Oerlemans, Nicole P M Ezendam, Marissa C van Maaren, Janine A van Til, Thijs G W van der Heijden, Floortje Mols, Katja K H Aben, Geraldine R Vink, Miriam Koopman, Lonneke V van de Poll-Franse, Belle H de Rooij

{"title":"Development of a prediction model for clinically-relevant fatigue: a multi-cancer approach.","authors":"Dhirendra Adiprakoso, Dimitris Katsimpokis, Simone Oerlemans, Nicole P M Ezendam, Marissa C van Maaren, Janine A van Til, Thijs G W van der Heijden, Floortje Mols, Katja K H Aben, Geraldine R Vink, Miriam Koopman, Lonneke V van de Poll-Franse, Belle H de Rooij","doi":"10.1007/s11136-024-03807-9","DOIUrl":null,"url":null,"abstract":"Purpose: Fatigue is the most prevalent symptom across cancer types. To support clinicians in providing fatigue-related supportive care, this study aims to develop and compare models predicting clinically relevant fatigue (CRF) occurring between two and three years after diagnosis, and to assess the validity of the best-performing model across diverse cancer populations.Methods: Patients with non-metastatic bladder, colorectal, endometrial, ovarian, or prostate cancer who completed a questionnaire within three months after diagnosis and a subsequent questionnaire between two and three years thereafter, were included. Predictor variables included clinical, socio-demographic, and patient-reported variables. The outcome was CRF (EORTC QLQC30 fatigue ≥ 39). Logistic regression using LASSO selection was compared to more advanced Machine Learning (ML) based models, including Extreme gradient boosting (XGBoost), support vector machines (SVM), and artificial neural networks (ANN). Internal-external cross-validation was conducted on the best-performing model.Results: 3160 patients were included. The logistic regression model had the highest C-statistic (0.77) and balanced accuracy (0.65), both indicating good discrimination between patients with and without CRF. However, sensitivity was low across all models (0.22-0.37). Following internal-external validation, performance across cancer types was consistent (C-statistics 0.73-0.82).Conclusion: Although the models' discrimination was good, the low balanced accuracy and poor calibration in the presence of CRF indicates a relatively high likelihood of underdiagnosis of future CRF. Yet, the clinical applicability of the model remains uncertain. The logistic regression performed better than the ML-based models and was robust across cohorts, suggesting an advantage of simpler models to predict CRF.","PeriodicalId":20748,"journal":{"name":"Quality of Life Research","volume":" ","pages":"231-245"},"PeriodicalIF":3.3000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11802682/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quality of Life Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11136-024-03807-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/9 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Fatigue is the most prevalent symptom across cancer types. To support clinicians in providing fatigue-related supportive care, this study aims to develop and compare models predicting clinically relevant fatigue (CRF) occurring between two and three years after diagnosis, and to assess the validity of the best-performing model across diverse cancer populations.

Methods: Patients with non-metastatic bladder, colorectal, endometrial, ovarian, or prostate cancer who completed a questionnaire within three months after diagnosis and a subsequent questionnaire between two and three years thereafter, were included. Predictor variables included clinical, socio-demographic, and patient-reported variables. The outcome was CRF (EORTC QLQC30 fatigue ≥ 39). Logistic regression using LASSO selection was compared to more advanced Machine Learning (ML) based models, including Extreme gradient boosting (XGBoost), support vector machines (SVM), and artificial neural networks (ANN). Internal-external cross-validation was conducted on the best-performing model.

Results: 3160 patients were included. The logistic regression model had the highest C-statistic (0.77) and balanced accuracy (0.65), both indicating good discrimination between patients with and without CRF. However, sensitivity was low across all models (0.22-0.37). Following internal-external validation, performance across cancer types was consistent (C-statistics 0.73-0.82).

Conclusion: Although the models' discrimination was good, the low balanced accuracy and poor calibration in the presence of CRF indicates a relatively high likelihood of underdiagnosis of future CRF. Yet, the clinical applicability of the model remains uncertain. The logistic regression performed better than the ML-based models and was robust across cohorts, suggesting an advantage of simpler models to predict CRF.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

开发临床相关疲劳预测模型：一种多癌症方法。

目的疲劳是各癌症类型中最常见的症状。为了支持临床医生提供与疲劳相关的支持性护理，本研究旨在开发和比较预测诊断后两到三年内发生的临床相关疲劳（CRF）的模型，并评估不同癌症人群中表现最佳模型的有效性：研究对象包括非转移性膀胱癌、结肠直肠癌、子宫内膜癌、卵巢癌或前列腺癌患者，这些患者在确诊后三个月内完成了问卷调查，并在此后两到三年内完成了后续问卷调查。预测变量包括临床、社会人口学和患者报告变量。结果为 CRF（EORTC QLQC30 疲劳度≥39）。使用 LASSO 选择的逻辑回归与更先进的基于机器学习（ML）的模型进行了比较，包括极端梯度提升（XGBoost）、支持向量机（SVM）和人工神经网络（ANN）。对表现最佳的模型进行了内部-外部交叉验证：结果：共纳入 3160 名患者。逻辑回归模型具有最高的 C 统计量（0.77）和平衡准确率（0.65），两者都表明该模型能很好地区分有 CRF 和无 CRF 的患者。然而，所有模型的灵敏度都较低（0.22-0.37）。经过内部-外部验证后，各癌症类型的表现一致（C 统计量为 0.73-0.82）：结论：虽然模型的分辨能力较好，但在存在 CRF 的情况下，平衡准确率较低，校准能力较差，这表明未来 CRF 被漏诊的可能性相对较高。然而，该模型的临床适用性仍不确定。逻辑回归的表现优于基于 ML 的模型，并且在不同队列中表现稳健，这表明较简单的模型在预测 CRF 方面具有优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Quality of Life Research 医学-公共卫生、环境卫生与职业卫生

CiteScore

6.50

自引率

8.60%

发文量

224

审稿时长

3-8 weeks

期刊介绍： Quality of Life Research is an international, multidisciplinary journal devoted to the rapid communication of original research, theoretical articles and methodological reports related to the field of quality of life, in all the health sciences. The journal also offers editorials, literature, book and software reviews, correspondence and abstracts of conferences. Quality of life has become a prominent issue in biometry, philosophy, social science, clinical medicine, health services and outcomes research. The journal''s scope reflects the wide application of quality of life assessment and research in the biological and social sciences. All original work is subject to peer review for originality, scientific quality and relevance to a broad readership. This is an official journal of the International Society of Quality of Life Research.