Ji Won Seo, Ki Bum Park, Seung Taek Lim, Kyong Hwa Jun, Hyung Min Chin
{"title":"Machine learning models for prediction of lymph node metastasis in patients with T1b gastric cancer.","authors":"Ji Won Seo, Ki Bum Park, Seung Taek Lim, Kyong Hwa Jun, Hyung Min Chin","doi":"10.62347/KREL8138","DOIUrl":null,"url":null,"abstract":"<p><p>The prognosis of early gastric cancer (EGC) patients is associated with lymph node metastasis (LNM). Considering the relatively high rate of LNM in T1b EGC patients, it is crucial to determine the factors associated with LNM. In this study, we constructed and validated predictive models based on machine learning (ML) algorithms for LNM in patients with T1b EGC. Data from patients with T1b gastric cancer were extracted from the Korean Gastric Cancer Association database. ML algorithms such as logistic regression (LR), random forest (RF), extreme gradient boosting (XGBoost), and support vector machine (SVM) were applied for model construction utilizing five-fold cross-validation. The performances of these models were assessed in terms of discrimination, calibration, and clinical applicability. Moreover, external validation of XGBoost models was performed using the T1b gastric cancer database of The Catholic University Medical Center. In total, 3,468 T1b EGC patients were included in the analysis, whom 550 (15.9%) had LNM. Eleven variables were selected to construct the models. The LR, RF, XGBoost, and SVM models were established, revealing area under the receiver operating characteristic curve values of 0.8284, 0.7921, 0.8776, and 0.8323, respectively. Among the models, the XGBoost model exhibited the best predictive performance in terms of discrimination, calibration, and clinical applicability. ML models are reliable for predicting LNM in T1b EGC patients. The XGBoost model exhibited the best predictive performance and can be used by surgeons for the identification of EGC patients with a high-risk of LNM, thereby facilitating treatment selection.</p>","PeriodicalId":7437,"journal":{"name":"American journal of cancer research","volume":"14 8","pages":"3842-3851"},"PeriodicalIF":3.6000,"publicationDate":"2024-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11387857/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.62347/KREL8138","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The prognosis of early gastric cancer (EGC) patients is associated with lymph node metastasis (LNM). Considering the relatively high rate of LNM in T1b EGC patients, it is crucial to determine the factors associated with LNM. In this study, we constructed and validated predictive models based on machine learning (ML) algorithms for LNM in patients with T1b EGC. Data from patients with T1b gastric cancer were extracted from the Korean Gastric Cancer Association database. ML algorithms such as logistic regression (LR), random forest (RF), extreme gradient boosting (XGBoost), and support vector machine (SVM) were applied for model construction utilizing five-fold cross-validation. The performances of these models were assessed in terms of discrimination, calibration, and clinical applicability. Moreover, external validation of XGBoost models was performed using the T1b gastric cancer database of The Catholic University Medical Center. In total, 3,468 T1b EGC patients were included in the analysis, whom 550 (15.9%) had LNM. Eleven variables were selected to construct the models. The LR, RF, XGBoost, and SVM models were established, revealing area under the receiver operating characteristic curve values of 0.8284, 0.7921, 0.8776, and 0.8323, respectively. Among the models, the XGBoost model exhibited the best predictive performance in terms of discrimination, calibration, and clinical applicability. ML models are reliable for predicting LNM in T1b EGC patients. The XGBoost model exhibited the best predictive performance and can be used by surgeons for the identification of EGC patients with a high-risk of LNM, thereby facilitating treatment selection.
期刊介绍:
The American Journal of Cancer Research (AJCR) (ISSN 2156-6976), is an independent open access, online only journal to facilitate rapid dissemination of novel discoveries in basic science and treatment of cancer. It was founded by a group of scientists for cancer research and clinical academic oncologists from around the world, who are devoted to the promotion and advancement of our understanding of the cancer and its treatment. The scope of AJCR is intended to encompass that of multi-disciplinary researchers from any scientific discipline where the primary focus of the research is to increase and integrate knowledge about etiology and molecular mechanisms of carcinogenesis with the ultimate aim of advancing the cure and prevention of this increasingly devastating disease. To achieve these aims AJCR will publish review articles, original articles and new techniques in cancer research and therapy. It will also publish hypothesis, case reports and letter to the editor. Unlike most other open access online journals, AJCR will keep most of the traditional features of paper print that we are all familiar with, such as continuous volume, issue numbers, as well as continuous page numbers to retain our comfortable familiarity towards an academic journal.