Dhirendra Adiprakoso, Dimitris Katsimpokis, Simone Oerlemans, Nicole P M Ezendam, Marissa C van Maaren, Janine A van Til, Thijs G W van der Heijden, Floortje Mols, Katja K H Aben, Geraldine R Vink, Miriam Koopman, Lonneke V van de Poll-Franse, Belle H de Rooij
{"title":"Development of a prediction model for clinically-relevant fatigue: a multi-cancer approach.","authors":"Dhirendra Adiprakoso, Dimitris Katsimpokis, Simone Oerlemans, Nicole P M Ezendam, Marissa C van Maaren, Janine A van Til, Thijs G W van der Heijden, Floortje Mols, Katja K H Aben, Geraldine R Vink, Miriam Koopman, Lonneke V van de Poll-Franse, Belle H de Rooij","doi":"10.1007/s11136-024-03807-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Fatigue is the most prevalent symptom across cancer types. To support clinicians in providing fatigue-related supportive care, this study aims to develop and compare models predicting clinically relevant fatigue (CRF) occurring between two and three years after diagnosis, and to assess the validity of the best-performing model across diverse cancer populations.</p><p><strong>Methods: </strong>Patients with non-metastatic bladder, colorectal, endometrial, ovarian, or prostate cancer who completed a questionnaire within three months after diagnosis and a subsequent questionnaire between two and three years thereafter, were included. Predictor variables included clinical, socio-demographic, and patient-reported variables. The outcome was CRF (EORTC QLQC30 fatigue ≥ 39). Logistic regression using LASSO selection was compared to more advanced Machine Learning (ML) based models, including Extreme gradient boosting (XGBoost), support vector machines (SVM), and artificial neural networks (ANN). Internal-external cross-validation was conducted on the best-performing model.</p><p><strong>Results: </strong>3160 patients were included. The logistic regression model had the highest C-statistic (0.77) and balanced accuracy (0.65), both indicating good discrimination between patients with and without CRF. However, sensitivity was low across all models (0.22-0.37). Following internal-external validation, performance across cancer types was consistent (C-statistics 0.73-0.82).</p><p><strong>Conclusion: </strong>Although the models' discrimination was good, the low balanced accuracy and poor calibration in the presence of CRF indicates a relatively high likelihood of underdiagnosis of future CRF. Yet, the clinical applicability of the model remains uncertain. The logistic regression performed better than the ML-based models and was robust across cohorts, suggesting an advantage of simpler models to predict CRF.</p>","PeriodicalId":20748,"journal":{"name":"Quality of Life Research","volume":" ","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2024-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quality of Life Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11136-024-03807-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: Fatigue is the most prevalent symptom across cancer types. To support clinicians in providing fatigue-related supportive care, this study aims to develop and compare models predicting clinically relevant fatigue (CRF) occurring between two and three years after diagnosis, and to assess the validity of the best-performing model across diverse cancer populations.
Methods: Patients with non-metastatic bladder, colorectal, endometrial, ovarian, or prostate cancer who completed a questionnaire within three months after diagnosis and a subsequent questionnaire between two and three years thereafter, were included. Predictor variables included clinical, socio-demographic, and patient-reported variables. The outcome was CRF (EORTC QLQC30 fatigue ≥ 39). Logistic regression using LASSO selection was compared to more advanced Machine Learning (ML) based models, including Extreme gradient boosting (XGBoost), support vector machines (SVM), and artificial neural networks (ANN). Internal-external cross-validation was conducted on the best-performing model.
Results: 3160 patients were included. The logistic regression model had the highest C-statistic (0.77) and balanced accuracy (0.65), both indicating good discrimination between patients with and without CRF. However, sensitivity was low across all models (0.22-0.37). Following internal-external validation, performance across cancer types was consistent (C-statistics 0.73-0.82).
Conclusion: Although the models' discrimination was good, the low balanced accuracy and poor calibration in the presence of CRF indicates a relatively high likelihood of underdiagnosis of future CRF. Yet, the clinical applicability of the model remains uncertain. The logistic regression performed better than the ML-based models and was robust across cohorts, suggesting an advantage of simpler models to predict CRF.
期刊介绍:
Quality of Life Research is an international, multidisciplinary journal devoted to the rapid communication of original research, theoretical articles and methodological reports related to the field of quality of life, in all the health sciences. The journal also offers editorials, literature, book and software reviews, correspondence and abstracts of conferences.
Quality of life has become a prominent issue in biometry, philosophy, social science, clinical medicine, health services and outcomes research. The journal''s scope reflects the wide application of quality of life assessment and research in the biological and social sciences. All original work is subject to peer review for originality, scientific quality and relevance to a broad readership.
This is an official journal of the International Society of Quality of Life Research.