Shi-Qi Liang , Yu-Tong Cui , Guang-Bing Hu , Hai-Yang Guo , Xin-Rui Chen , Ji Zuo , Zhi-Rui Qi , Xian-Fei Wang
{"title":"针对胃肠道间质瘤术前风险的机器学习模型的开发与验证。","authors":"Shi-Qi Liang , Yu-Tong Cui , Guang-Bing Hu , Hai-Yang Guo , Xin-Rui Chen , Ji Zuo , Zhi-Rui Qi , Xian-Fei Wang","doi":"10.1016/j.gassur.2024.10.019","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Gastrointestinal stromal tumors (GISTs) have malignant potential, and treatment varies according to risk. However, no specific protocols exist for preoperative assessment of the malignant potential of gastric GISTs (gGISTs). This study aimed to use machine learning (ML) to develop and validate clinically relevant preoperative models to predict the malignant potential of gGISTs.</div></div><div><h3>Methods</h3><div>This study screened patients diagnosed with gGISTs at the Affiliated Hospital of North Sichuan Medical College. Moreover, this study employed the Least Absolute Shrinkage and Selection Operator (LASSO) and logistic regression to identify risk factors. Subsequently, an ensemble of ML models was used to determine the optimal classifier. In addition, this study used SHapley Additive exPlanations (SHAP) for tailored risk profiling.</div></div><div><h3>Results</h3><div>This study included 318 patients with gGISTs. Using LASSO regression and multifactorial logistic regression, this study analyzed the training dataset, revealing that the presence of endoscopic ultrasound (EUS) high-risk features, tumor border clarity, tumor diameter, and monocyte-to-lymphocyte ratio (MLR) were significant predictors of high malignancy risk in gGIST. As determined by our ML approach, the logistic classification model demonstrated optimal performance, with area under the receiver operating characteristic curves of 0.919 for the training set and 0.925 for the test set. Furthermore, decision curve analysis confirmed the clinical relevance of the model.</div></div><div><h3>Conclusion</h3><div>High-risk EUS features, ill-defined tumor margins, larger tumor diameters, and elevated MLR independently predicted increased malignant potential in gGIST. This study developed logistic regression models based on these factors, which were further interpreted using the SHAP methodology. This analytical approach facilitated personalized therapeutic decision-making among diverse patient populations.</div></div>","PeriodicalId":15893,"journal":{"name":"Journal of Gastrointestinal Surgery","volume":"29 1","pages":"Article 101864"},"PeriodicalIF":2.2000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development and validation of a machine-learning model for preoperative risk of gastric gastrointestinal stromal tumors\",\"authors\":\"Shi-Qi Liang , Yu-Tong Cui , Guang-Bing Hu , Hai-Yang Guo , Xin-Rui Chen , Ji Zuo , Zhi-Rui Qi , Xian-Fei Wang\",\"doi\":\"10.1016/j.gassur.2024.10.019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Gastrointestinal stromal tumors (GISTs) have malignant potential, and treatment varies according to risk. However, no specific protocols exist for preoperative assessment of the malignant potential of gastric GISTs (gGISTs). This study aimed to use machine learning (ML) to develop and validate clinically relevant preoperative models to predict the malignant potential of gGISTs.</div></div><div><h3>Methods</h3><div>This study screened patients diagnosed with gGISTs at the Affiliated Hospital of North Sichuan Medical College. Moreover, this study employed the Least Absolute Shrinkage and Selection Operator (LASSO) and logistic regression to identify risk factors. Subsequently, an ensemble of ML models was used to determine the optimal classifier. In addition, this study used SHapley Additive exPlanations (SHAP) for tailored risk profiling.</div></div><div><h3>Results</h3><div>This study included 318 patients with gGISTs. Using LASSO regression and multifactorial logistic regression, this study analyzed the training dataset, revealing that the presence of endoscopic ultrasound (EUS) high-risk features, tumor border clarity, tumor diameter, and monocyte-to-lymphocyte ratio (MLR) were significant predictors of high malignancy risk in gGIST. As determined by our ML approach, the logistic classification model demonstrated optimal performance, with area under the receiver operating characteristic curves of 0.919 for the training set and 0.925 for the test set. Furthermore, decision curve analysis confirmed the clinical relevance of the model.</div></div><div><h3>Conclusion</h3><div>High-risk EUS features, ill-defined tumor margins, larger tumor diameters, and elevated MLR independently predicted increased malignant potential in gGIST. This study developed logistic regression models based on these factors, which were further interpreted using the SHAP methodology. This analytical approach facilitated personalized therapeutic decision-making among diverse patient populations.</div></div>\",\"PeriodicalId\":15893,\"journal\":{\"name\":\"Journal of Gastrointestinal Surgery\",\"volume\":\"29 1\",\"pages\":\"Article 101864\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Gastrointestinal Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1091255X2400684X\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"GASTROENTEROLOGY & HEPATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Gastrointestinal Surgery","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1091255X2400684X","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
背景:胃肠道间质瘤(GISTs)具有恶性潜能,治疗方法因风险而异。然而,目前还没有具体的方案用于术前评估胃间质瘤(gGISTs)的恶性可能性。本研究旨在利用机器学习(ML)开发并验证与临床相关的术前模型,以预测胃间质瘤的恶性可能性:方法:我们筛选了川北医学院附属医院确诊的 gGISTs 患者。我们采用最小绝对收缩和选择操作器(LASSO)和逻辑回归来识别风险因素。随后,我们部署了一组 ML 模型,以确定最佳分类器。此外,我们还利用SHapley Additive exPlanations(SHAP)进行了量身定制的风险分析:我们招募了 318 名 gGISTs 患者。我们利用 LASSO 回归和多因素逻辑回归分析了训练数据集,发现内镜超声(EUS)高风险特征、肿瘤边界清晰度、肿瘤直径和单核细胞与淋巴细胞比值(MLR)是 gGIST 高恶性风险的重要预测因素。根据我们的 ML 方法,逻辑分类模型表现出最佳性能,训练集和测试集的接收者操作特征曲线下面积分别为 0.919 和 0.925。此外,决策曲线分析证实了该模型的临床相关性:结论:高危 EUS 特征、肿瘤边缘不清晰、肿瘤直径较大和 MLR 升高可独立预测 gGIST 的恶性程度。我们根据这些因素建立了逻辑回归模型,并使用 SHAP 方法对其进行了进一步解释。这种分析方法有助于为不同的患者群体做出个性化的治疗决策。
Development and validation of a machine-learning model for preoperative risk of gastric gastrointestinal stromal tumors
Background
Gastrointestinal stromal tumors (GISTs) have malignant potential, and treatment varies according to risk. However, no specific protocols exist for preoperative assessment of the malignant potential of gastric GISTs (gGISTs). This study aimed to use machine learning (ML) to develop and validate clinically relevant preoperative models to predict the malignant potential of gGISTs.
Methods
This study screened patients diagnosed with gGISTs at the Affiliated Hospital of North Sichuan Medical College. Moreover, this study employed the Least Absolute Shrinkage and Selection Operator (LASSO) and logistic regression to identify risk factors. Subsequently, an ensemble of ML models was used to determine the optimal classifier. In addition, this study used SHapley Additive exPlanations (SHAP) for tailored risk profiling.
Results
This study included 318 patients with gGISTs. Using LASSO regression and multifactorial logistic regression, this study analyzed the training dataset, revealing that the presence of endoscopic ultrasound (EUS) high-risk features, tumor border clarity, tumor diameter, and monocyte-to-lymphocyte ratio (MLR) were significant predictors of high malignancy risk in gGIST. As determined by our ML approach, the logistic classification model demonstrated optimal performance, with area under the receiver operating characteristic curves of 0.919 for the training set and 0.925 for the test set. Furthermore, decision curve analysis confirmed the clinical relevance of the model.
Conclusion
High-risk EUS features, ill-defined tumor margins, larger tumor diameters, and elevated MLR independently predicted increased malignant potential in gGIST. This study developed logistic regression models based on these factors, which were further interpreted using the SHAP methodology. This analytical approach facilitated personalized therapeutic decision-making among diverse patient populations.
期刊介绍:
The Journal of Gastrointestinal Surgery is a scholarly, peer-reviewed journal that updates the surgeon on the latest developments in gastrointestinal surgery. The journal includes original articles on surgery of the digestive tract; gastrointestinal images; "How I Do It" articles, subject reviews, book reports, editorial columns, the SSAT Presidential Address, articles by a guest orator, symposia, letters, results of conferences and more. This is the official publication of the Society for Surgery of the Alimentary Tract. The journal functions as an outstanding forum for continuing education in surgery and diseases of the gastrointestinal tract.