Xu Tian, Haoyang Li, Feili Li, María F Jiménez-Herrera, Yi Ren, Hongcai Shang
Supportive Care in Cancer, 33(1):63. Published 2024-12-30. DOI: 10.1007/s00520-024-09127-5 (https://doi.org/10.1007/s00520-024-09127-5)
Development and validation of a web-based calculator for determining the risk of psychological distress based on machine learning algorithms: A cross-sectional study of 342 lung cancer patients.
Purpose: Early and accurate identification of the risk of psychological distress allows for timely intervention and improved prognosis. Current methods for predicting psychological distress among lung cancer patients using readily available data are limited. This study aimed to develop a robust machine learning (ML) model for determining the risk of psychological distress among lung cancer patients.
Methods: A cross-sectional study was designed to collect data from 342 lung cancer patients. The Least Absolute Shrinkage and Selection Operator (LASSO) was used for feature selection. Model training and validation were conducted with the bootstrap resampling method. Fivefold cross-validation was used to evaluate the model and tune its parameters. Feature importance was assessed using the SHapley Additive exPlanations (SHAP) method.
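As a rough illustration of the two resampling schemes named in the Methods, the following standard-library sketch generates bootstrap indices and fivefold cross-validation splits for a sample of 342. The abstract does not describe how the authors partitioned their data, so the helper names (`bootstrap_indices`, `kfold_indices`), the seed, and the unstratified shuffling are all assumptions made for illustration only.

```python
import random


def bootstrap_indices(n, rng):
    """Draw n indices with replacement; return (in_bag, out_of_bag)."""
    in_bag = [rng.randrange(n) for _ in range(n)]
    chosen = set(in_bag)
    # Rows never drawn form the out-of-bag set, usable for validation.
    out_of_bag = [i for i in range(n) if i not in chosen]
    return in_bag, out_of_bag


def kfold_indices(n, k, rng):
    """Shuffle 0..n-1, cut into k near-equal folds, yield (train, val) pairs."""
    order = list(range(n))
    rng.shuffle(order)
    folds = [order[i::k] for i in range(k)]
    for i in range(k):
        val = folds[i]
        train = [j for f_idx, fold in enumerate(folds) if f_idx != i for j in fold]
        yield train, val


rng = random.Random(0)   # arbitrary seed (assumption), fixed for repeatability
n_patients = 342         # sample size reported in the abstract

in_bag, oob = bootstrap_indices(n_patients, rng)
splits = list(kfold_indices(n_patients, 5, rng))
```

Each of the five splits holds out 68 or 69 patients for validation and trains on the rest; a real pipeline would typically stratify the folds by outcome, which this sketch does not do.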
Results: The model identified seven independent risk factors for psychological distress: residence (β = 0.141), diagnosis duration (β = 0.055), TNM stage (β = 0.098), pain severity (β = 0.067), perceived stigma (β = 0.052), illness perception (β = 0.100), and coping style (β = 0.097). Among the eight ML algorithms evaluated, the extreme gradient boosting (XGBoost) algorithm demonstrated the highest performance, with AUROC values of 0.988, 0.945, and 0.922 for the training, validation, and test sets, respectively. The model's results were further explained using SHAP, which revealed the importance and contribution of each risk factor to the overall distress risk. A web-based tool was developed based on this model to facilitate clinical use.
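For readers unfamiliar with the AUROC metric reported above, the sketch below computes it from its pairwise definition: the probability that a randomly chosen positive case receives a higher predicted score than a randomly chosen negative case, with ties counted as one half. The function name `auroc` and the toy labels and scores are invented for illustration; they are not data from the study, and this is not necessarily how the authors computed the metric.

```python
def auroc(labels, scores):
    """AUROC via pairwise comparison of positive and negative scores."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                wins += 1.0   # positive ranked above negative
            elif p == q:
                wins += 0.5   # tie counts as half a win
    return wins / (len(pos) * len(neg))


# Toy example (made-up values): 1 = distressed, 0 = not distressed.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2]
toy_auc = auroc(toy_labels, toy_scores)  # 8 of 9 pairs are ordered correctly
```

An AUROC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which is the scale on which the reported 0.988, 0.945, and 0.922 should be read.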
Conclusion: The XGBoost classifier demonstrated exceptional performance, and clinical implementation of the web-based risk calculator can serve as an easy-to-use tool for health practitioners to formulate early prevention and intervention strategies.
About the journal:
Supportive Care in Cancer provides members of the Multinational Association of Supportive Care in Cancer (MASCC) and all other interested individuals, groups and institutions with the most recent scientific and social information on all aspects of supportive care in cancer patients. It covers primarily medical, technical and surgical topics concerning supportive therapy and care which may supplement or substitute basic cancer treatment at all stages of the disease.
Nursing, rehabilitative, psychosocial and spiritual issues of support are also included.