Application of Survival Quilts for prognosis prediction of gastrectomy patients based on the Surveillance, Epidemiology, and End Results database and China National Cancer Center Gastric Cancer database
{"title":"Application of Survival Quilts for prognosis prediction of gastrectomy patients based on the Surveillance, Epidemiology, and End Results database and China National Cancer Center Gastric Cancer database","authors":"Lulu Zhao , Penghui Niu , Wanqing Wang , Xue Han , Xiaoyi Luan , Huang Huang , Yawei Zhang , Dongbing Zhao , Jidong Gao , Yingtai Chen","doi":"10.1016/j.jncc.2024.01.007","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><p>Accurate prognosis prediction is critical for individualized-therapy making of gastric cancer patients. We aimed to develop and test 6-month, 1-, 2-, 3-, 5-, and 10-year overall survival (OS) and cancer-specific survival (CSS) prediction models for gastric cancer patients following gastrectomy.</p></div><div><h3>Methods</h3><p>We derived and tested Survival Quilts, a machine learning-based model, to develop 6-month, 1-, 2-, 3-, 5-, and 10-year OS and CSS prediction models. Gastrectomy patients in the development set (<em>n</em> = 20,583) and the internal validation set (<em>n</em> = 5,106) were recruited from the Surveillance, Epidemiology, and End Results (SEER) database, while those in the external validation set (<em>n</em> = 6,352) were recruited from the China National Cancer Center Gastric Cancer (NCCGC) database. Furthermore, we selected gastrectomy patients without neoadjuvant therapy as a subgroup to train and test the prognostic models in order to keep the accuracy of tumor-node-metastasis (TNM) stage. Prognostic performances of these OS and CSS models were assessed using the Concordance Index (C-index) and area under the curve (AUC) values.</p></div><div><h3>Results</h3><p>The machine learning model had a consistently high accuracy in predicting 6-month, 1-, 2-, 3-, 5-, and 10-year OS in the SEER development set (C-index = 0.861, 0.832, 0.789, 0.766, 0.740, and 0.709; AUC = 0.784, 0.828, 0.840, 0.849, 0.869, and 0.902, respectively), SEER validation set (C-index = 0.782, 0.739, 0.712, 0.698, 0.681, and 0.660; AUC = 0.751, 0.772, 0.767, 0.762, 0.766, and 0.787, respectively), and NCCGC set (C-index = 0.691, 0.756, 0.751, 0.737, 0.722, and 0.701; AUC = 0.769, 0.788, 0.790, 0.790, 0.787, and 0.788, respectively). The model was able to predict 6-month, 1-, 2-, 3-, 5-, and 10-year CSS in the SEER development set (C-index = 0.879, 0.858, 0.820, 0.802, 0.784, and 0.774; AUC = 0.756, 0.827, 0.852, 0.863, 0.874, and 0.884, respectively) and SEER validation set (C-index = 0.790, 0.763, 0.741, 0.729, 0.718, and 0.708; AUC = 0.706, 0.758, 0.767, 0.766, 0.766, and 0.764, respectively). In multivariate analysis, the high-risk group with risk score output by 5-year OS model was proved to be a strong survival predictor both in the SEER development set (hazard ratio [HR] = 14.59, 95% confidence interval [CI]: 1.872–2.774, <em>P</em> < 0.001), SEER validation set (HR = 2.28, 95% CI: 13.089–16.293, <em>P</em> < 0.001), and NCCGC set (HR = 1.98, 95% CI: 1.617–2.437, <em>P</em> <em><</em> 0.001). We further explored the prognostic value of risk score resulted 5-year CSS model of gastrectomy patients, and found that high-risk group remained as an independent CSS factor in the SEER development set (HR = 12.81, 95% CI: 11.568–14.194, <em>P</em> < 0.001) and SEER validation set (HR = 1.61, 95% CI: 1.338–1.935, <em>P</em> < 0.001).</p></div><div><h3>Conclusion</h3><p>Survival Quilts could allow accurate prediction of 6-month, 1-, 2-, 3-, 5-, and 10-year OS and CSS in gastric cancer patients following gastrectomy.</p></div>","PeriodicalId":73987,"journal":{"name":"Journal of the National Cancer Center","volume":"4 2","pages":"Pages 142-152"},"PeriodicalIF":7.6000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266700542400019X/pdfft?md5=03ed90ba842276e402ae3a9c6f81bd5f&pid=1-s2.0-S266700542400019X-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the National Cancer Center","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266700542400019X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
Accurate prognosis prediction is critical for individualized-therapy making of gastric cancer patients. We aimed to develop and test 6-month, 1-, 2-, 3-, 5-, and 10-year overall survival (OS) and cancer-specific survival (CSS) prediction models for gastric cancer patients following gastrectomy.
Methods
We derived and tested Survival Quilts, a machine learning-based model, to develop 6-month, 1-, 2-, 3-, 5-, and 10-year OS and CSS prediction models. Gastrectomy patients in the development set (n = 20,583) and the internal validation set (n = 5,106) were recruited from the Surveillance, Epidemiology, and End Results (SEER) database, while those in the external validation set (n = 6,352) were recruited from the China National Cancer Center Gastric Cancer (NCCGC) database. Furthermore, we selected gastrectomy patients without neoadjuvant therapy as a subgroup to train and test the prognostic models in order to keep the accuracy of tumor-node-metastasis (TNM) stage. Prognostic performances of these OS and CSS models were assessed using the Concordance Index (C-index) and area under the curve (AUC) values.
Results
The machine learning model had a consistently high accuracy in predicting 6-month, 1-, 2-, 3-, 5-, and 10-year OS in the SEER development set (C-index = 0.861, 0.832, 0.789, 0.766, 0.740, and 0.709; AUC = 0.784, 0.828, 0.840, 0.849, 0.869, and 0.902, respectively), SEER validation set (C-index = 0.782, 0.739, 0.712, 0.698, 0.681, and 0.660; AUC = 0.751, 0.772, 0.767, 0.762, 0.766, and 0.787, respectively), and NCCGC set (C-index = 0.691, 0.756, 0.751, 0.737, 0.722, and 0.701; AUC = 0.769, 0.788, 0.790, 0.790, 0.787, and 0.788, respectively). The model was able to predict 6-month, 1-, 2-, 3-, 5-, and 10-year CSS in the SEER development set (C-index = 0.879, 0.858, 0.820, 0.802, 0.784, and 0.774; AUC = 0.756, 0.827, 0.852, 0.863, 0.874, and 0.884, respectively) and SEER validation set (C-index = 0.790, 0.763, 0.741, 0.729, 0.718, and 0.708; AUC = 0.706, 0.758, 0.767, 0.766, 0.766, and 0.764, respectively). In multivariate analysis, the high-risk group with risk score output by 5-year OS model was proved to be a strong survival predictor both in the SEER development set (hazard ratio [HR] = 14.59, 95% confidence interval [CI]: 1.872–2.774, P < 0.001), SEER validation set (HR = 2.28, 95% CI: 13.089–16.293, P < 0.001), and NCCGC set (HR = 1.98, 95% CI: 1.617–2.437, P< 0.001). We further explored the prognostic value of risk score resulted 5-year CSS model of gastrectomy patients, and found that high-risk group remained as an independent CSS factor in the SEER development set (HR = 12.81, 95% CI: 11.568–14.194, P < 0.001) and SEER validation set (HR = 1.61, 95% CI: 1.338–1.935, P < 0.001).
Conclusion
Survival Quilts could allow accurate prediction of 6-month, 1-, 2-, 3-, 5-, and 10-year OS and CSS in gastric cancer patients following gastrectomy.