基于机器学习算法的永久性结肠造口术后吻合口旁疝风险预测模型。

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS BMC Medical Informatics and Decision Making Pub Date : 2024-08-08 DOI:10.1186/s12911-024-02627-8
Tian Dai, Manzhen Bao, Miao Zhang, Zonggui Wang, JingJing Tang, Zeyan Liu
{"title":"基于机器学习算法的永久性结肠造口术后吻合口旁疝风险预测模型。","authors":"Tian Dai, Manzhen Bao, Miao Zhang, Zonggui Wang, JingJing Tang, Zeyan Liu","doi":"10.1186/s12911-024-02627-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To develop a machine learning-based risk prediction model for postoperative parastomal hernia (PSH) in colorectal cancer patients undergoing permanent colostomy, assisting nurses in identifying high-risk groups and devising preventive care strategies.</p><p><strong>Methods: </strong>A case-control study was conducted on 495 colorectal cancer patients who underwent permanent colostomy at the Second Affiliated Hospital of Anhui Medical University from June 2017 to June 2023, with a 1-year follow-up period. Patients were categorized into PSH and non-PSH groups based on PSH occurrence within 1-year post-operation. Data were split into training (70%) and testing (30%) sets. Variable selection was performed using Least Absolute Shrinkage and Selection Operator (LASSO) regression, and binary classification prediction models were established using Logistic Regression (LR), Support Vector Classification (SVC), K Nearest Neighbor (KNN), Random Forest (RF), Light Gradient Boosting Machine (LGBM), and Extreme Gradient Boosting (XgBoost). The binary classification label denoted 1 for PSH occurrence and 0 for no PSH occurrence. Parameters were optimized via 5-fold cross-validation. Model performance was evaluated using Area Under Curve (AUC), specificity, sensitivity, accuracy, positive predictive value, negative predictive value, and F1-score. Clinical utility was evaluated using decision curve analysis (DCA), model explanation was enhanced using shapley additive explanation (SHAP), and model visualization was achieved using a nomogram.</p><p><strong>Results: </strong>The incidence of PSH within 1 year was 29.1% (144 patients). Among the models tested, the RF model demonstrated the highest discrimination capability with an AUC of 0.888 (95% CI: 0.881-0.935), along with superior specificity, accuracy, sensitivity, and F1 score. It also showed the highest clinical net benefit on the DCA curve. SHAP analysis identified the top 10 influential variables associated with PSH risk: body mass index (BMI), operation duration, history and status of chronic obstructive pulmonary disease (COPD), prealbumin, tumor node metastasis (TNM) staging, stoma site, thickness of rectus abdominis muscle (TRAM), C-reactive protein CRP, american society of anesthesiologists physical status classification (ASA), and stoma diameter. These insights from SHAP plots illustrated how these factors influence individual PSH outcomes. The nomogram was used for model visualization.</p><p><strong>Conclusion: </strong>The Random Forest model demonstrated robust predictive performance and clinical relevance in forecasting colonic PSH. This model aids in early identification of high-risk patients and guides preventive care.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11308496/pdf/","citationCount":"0","resultStr":"{\"title\":\"A risk prediction model based on machine learning algorithm for parastomal hernia after permanent colostomy.\",\"authors\":\"Tian Dai, Manzhen Bao, Miao Zhang, Zonggui Wang, JingJing Tang, Zeyan Liu\",\"doi\":\"10.1186/s12911-024-02627-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>To develop a machine learning-based risk prediction model for postoperative parastomal hernia (PSH) in colorectal cancer patients undergoing permanent colostomy, assisting nurses in identifying high-risk groups and devising preventive care strategies.</p><p><strong>Methods: </strong>A case-control study was conducted on 495 colorectal cancer patients who underwent permanent colostomy at the Second Affiliated Hospital of Anhui Medical University from June 2017 to June 2023, with a 1-year follow-up period. Patients were categorized into PSH and non-PSH groups based on PSH occurrence within 1-year post-operation. Data were split into training (70%) and testing (30%) sets. Variable selection was performed using Least Absolute Shrinkage and Selection Operator (LASSO) regression, and binary classification prediction models were established using Logistic Regression (LR), Support Vector Classification (SVC), K Nearest Neighbor (KNN), Random Forest (RF), Light Gradient Boosting Machine (LGBM), and Extreme Gradient Boosting (XgBoost). The binary classification label denoted 1 for PSH occurrence and 0 for no PSH occurrence. Parameters were optimized via 5-fold cross-validation. Model performance was evaluated using Area Under Curve (AUC), specificity, sensitivity, accuracy, positive predictive value, negative predictive value, and F1-score. Clinical utility was evaluated using decision curve analysis (DCA), model explanation was enhanced using shapley additive explanation (SHAP), and model visualization was achieved using a nomogram.</p><p><strong>Results: </strong>The incidence of PSH within 1 year was 29.1% (144 patients). Among the models tested, the RF model demonstrated the highest discrimination capability with an AUC of 0.888 (95% CI: 0.881-0.935), along with superior specificity, accuracy, sensitivity, and F1 score. It also showed the highest clinical net benefit on the DCA curve. SHAP analysis identified the top 10 influential variables associated with PSH risk: body mass index (BMI), operation duration, history and status of chronic obstructive pulmonary disease (COPD), prealbumin, tumor node metastasis (TNM) staging, stoma site, thickness of rectus abdominis muscle (TRAM), C-reactive protein CRP, american society of anesthesiologists physical status classification (ASA), and stoma diameter. These insights from SHAP plots illustrated how these factors influence individual PSH outcomes. The nomogram was used for model visualization.</p><p><strong>Conclusion: </strong>The Random Forest model demonstrated robust predictive performance and clinical relevance in forecasting colonic PSH. This model aids in early identification of high-risk patients and guides preventive care.</p>\",\"PeriodicalId\":9340,\"journal\":{\"name\":\"BMC Medical Informatics and Decision Making\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2024-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11308496/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Informatics and Decision Making\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12911-024-02627-8\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-024-02627-8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

摘要

目的建立基于机器学习的结直肠癌永久性结肠造口术患者术后吻合口旁疝(PSH)风险预测模型,协助护士识别高危人群并制定预防性护理策略:对2017年6月至2023年6月在安徽医科大学第二附属医院接受永久性结肠造口术的495例结直肠癌患者进行病例对照研究,随访1年。根据术后1年内PSH的发生率将患者分为PSH组和非PSH组。数据分为训练集(70%)和测试集(30%)。使用最小绝对收缩和选择操作器(LASSO)回归进行变量选择,并使用逻辑回归(LR)、支持向量分类(SVC)、K 最近邻(KNN)、随机森林(RF)、轻梯度提升机(LGBM)和极端梯度提升(XgBoost)建立二元分类预测模型。二元分类标签表示发生 PSH 为 1,未发生 PSH 为 0。参数通过 5 倍交叉验证进行优化。使用曲线下面积(AUC)、特异性、灵敏度、准确性、阳性预测值、阴性预测值和 F1 分数评估模型性能。使用决策曲线分析(DCA)评估临床实用性,使用沙普利加法解释(SHAP)加强模型解释,使用提名图实现模型可视化:一年内 PSH 的发生率为 29.1%(144 名患者)。在所测试的模型中,RF 模型的分辨能力最高,AUC 为 0.888(95% CI:0.881-0.935),特异性、准确性、灵敏度和 F1 评分均优于其他模型。它还显示出 DCA 曲线上最高的临床净效益。SHAP 分析确定了与 PSH 风险相关的 10 大影响变量:体重指数 (BMI)、手术持续时间、慢性阻塞性肺病 (COPD) 病史和状态、前白蛋白、肿瘤结节转移 (TNM) 分期、造口部位、腹直肌厚度 (TRAM)、C 反应蛋白 CRP、美国麻醉医师协会身体状况分类 (ASA) 和造口直径。这些从 SHAP 图中得出的见解说明了这些因素如何影响个体 PSH 结果。提名图用于模型的可视化:随机森林模型在预测结肠 PSH 方面表现出强大的预测性能和临床相关性。该模型有助于早期识别高危患者并指导预防性护理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A risk prediction model based on machine learning algorithm for parastomal hernia after permanent colostomy.

Objective: To develop a machine learning-based risk prediction model for postoperative parastomal hernia (PSH) in colorectal cancer patients undergoing permanent colostomy, assisting nurses in identifying high-risk groups and devising preventive care strategies.

Methods: A case-control study was conducted on 495 colorectal cancer patients who underwent permanent colostomy at the Second Affiliated Hospital of Anhui Medical University from June 2017 to June 2023, with a 1-year follow-up period. Patients were categorized into PSH and non-PSH groups based on PSH occurrence within 1-year post-operation. Data were split into training (70%) and testing (30%) sets. Variable selection was performed using Least Absolute Shrinkage and Selection Operator (LASSO) regression, and binary classification prediction models were established using Logistic Regression (LR), Support Vector Classification (SVC), K Nearest Neighbor (KNN), Random Forest (RF), Light Gradient Boosting Machine (LGBM), and Extreme Gradient Boosting (XgBoost). The binary classification label denoted 1 for PSH occurrence and 0 for no PSH occurrence. Parameters were optimized via 5-fold cross-validation. Model performance was evaluated using Area Under Curve (AUC), specificity, sensitivity, accuracy, positive predictive value, negative predictive value, and F1-score. Clinical utility was evaluated using decision curve analysis (DCA), model explanation was enhanced using shapley additive explanation (SHAP), and model visualization was achieved using a nomogram.

Results: The incidence of PSH within 1 year was 29.1% (144 patients). Among the models tested, the RF model demonstrated the highest discrimination capability with an AUC of 0.888 (95% CI: 0.881-0.935), along with superior specificity, accuracy, sensitivity, and F1 score. It also showed the highest clinical net benefit on the DCA curve. SHAP analysis identified the top 10 influential variables associated with PSH risk: body mass index (BMI), operation duration, history and status of chronic obstructive pulmonary disease (COPD), prealbumin, tumor node metastasis (TNM) staging, stoma site, thickness of rectus abdominis muscle (TRAM), C-reactive protein CRP, american society of anesthesiologists physical status classification (ASA), and stoma diameter. These insights from SHAP plots illustrated how these factors influence individual PSH outcomes. The nomogram was used for model visualization.

Conclusion: The Random Forest model demonstrated robust predictive performance and clinical relevance in forecasting colonic PSH. This model aids in early identification of high-risk patients and guides preventive care.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
期刊最新文献
Real-world data to support post-market safety and performance of embolization coils: evidence generation from a medical device manufacturer and data institute partnership. Development of message passing-based graph convolutional networks for classifying cancer pathology reports Machine learning-based evaluation of prognostic factors for mortality and relapse in patients with acute lymphoblastic leukemia: a comparative simulation study A cross domain access control model for medical consortium based on DBSCAN and penalty function RCC-Supporter: supporting renal cell carcinoma treatment decision-making using machine learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1