Development and External Validation of a Machine Learning Model to Predict Pathological Complete Response After Neoadjuvant Chemotherapy in Breast Cancer.

IF 2.4 4区医学 Q3 ONCOLOGY Journal of Breast Cancer Pub Date : 2023-08-01 DOI:10.4048/jbc.2023.26.e14

Ji-Jung Jung, Eun-Kyu Kim, Eunyoung Kang, Jee Hyun Kim, Se Hyun Kim, Koung Jin Suh, Sun Mi Kim, Mijung Jang, Bo La Yun, So Yeon Park, Changjin Lim, Wonshik Han, Hee-Chul Shin

{"title":"Development and External Validation of a Machine Learning Model to Predict Pathological Complete Response After Neoadjuvant Chemotherapy in Breast Cancer.","authors":"Ji-Jung Jung, Eun-Kyu Kim, Eunyoung Kang, Jee Hyun Kim, Se Hyun Kim, Koung Jin Suh, Sun Mi Kim, Mijung Jang, Bo La Yun, So Yeon Park, Changjin Lim, Wonshik Han, Hee-Chul Shin","doi":"10.4048/jbc.2023.26.e14","DOIUrl":null,"url":null,"abstract":"Purpose: Several predictive models have been developed to predict the pathological complete response (pCR) after neoadjuvant chemotherapy (NAC); however, few are broadly applicable owing to radiologic complexity and institution-specific clinical variables, and none have been externally validated. This study aimed to develop and externally validate a machine learning model that predicts pCR after NAC in patients with breast cancer using routinely collected clinical and demographic variables.Methods: The electronic medical records of patients with advanced breast cancer who underwent NAC before surgical resection between January 2017 and December 2020 were reviewed. Patient data from Seoul National University Bundang Hospital were divided into training and internal validation cohorts. Five machine learning techniques, including gradient boosting machine (GBM), support vector machine, random forest, decision tree, and neural network, were used to build predictive models, and the area under the receiver operating characteristic curve (AUC) was compared to select the best model. Finally, the model was validated using an independent cohort from Seoul National University Hospital.Results: A total of 1,003 patients were included in the study: 287, 71, and 645 in the training, internal validation, and external validation cohorts, respectively. Overall, 36.3% of the patients achieved pCR. Among the five machine learning models, the GBM showed the highest AUC for pCR prediction (AUC, 0.903; 95% confidence interval [CI], 0.833-0.972). External validation confirmed an AUC of 0.833 (95% CI, 0.800-0.865).Conclusion: Commonly available clinical and demographic variables were used to develop a machine learning model for predicting pCR following NAC. External validation of the model demonstrated good discrimination power, indicating that routinely collected variables were sufficient to build a good prediction model.","PeriodicalId":15206,"journal":{"name":"Journal of Breast Cancer","volume":"26 4","pages":"353-362"},"PeriodicalIF":2.4000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/8b/15/jbc-26-353.PMC10475713.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Breast Cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.4048/jbc.2023.26.e14","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Several predictive models have been developed to predict the pathological complete response (pCR) after neoadjuvant chemotherapy (NAC); however, few are broadly applicable owing to radiologic complexity and institution-specific clinical variables, and none have been externally validated. This study aimed to develop and externally validate a machine learning model that predicts pCR after NAC in patients with breast cancer using routinely collected clinical and demographic variables.

Methods: The electronic medical records of patients with advanced breast cancer who underwent NAC before surgical resection between January 2017 and December 2020 were reviewed. Patient data from Seoul National University Bundang Hospital were divided into training and internal validation cohorts. Five machine learning techniques, including gradient boosting machine (GBM), support vector machine, random forest, decision tree, and neural network, were used to build predictive models, and the area under the receiver operating characteristic curve (AUC) was compared to select the best model. Finally, the model was validated using an independent cohort from Seoul National University Hospital.

Results: A total of 1,003 patients were included in the study: 287, 71, and 645 in the training, internal validation, and external validation cohorts, respectively. Overall, 36.3% of the patients achieved pCR. Among the five machine learning models, the GBM showed the highest AUC for pCR prediction (AUC, 0.903; 95% confidence interval [CI], 0.833-0.972). External validation confirmed an AUC of 0.833 (95% CI, 0.800-0.865).

Conclusion: Commonly available clinical and demographic variables were used to develop a machine learning model for predicting pCR following NAC. External validation of the model demonstrated good discrimination power, indicating that routinely collected variables were sufficient to build a good prediction model.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

预测乳腺癌新辅助化疗后病理完全缓解的机器学习模型的开发和外部验证。

目的:建立了几种预测模型来预测新辅助化疗(NAC)后的病理完全缓解(pCR);然而，由于放射学的复杂性和机构特定的临床变量，很少有广泛适用的，而且没有一个得到外部验证。本研究旨在开发并外部验证一种机器学习模型，该模型使用常规收集的临床和人口统计学变量预测乳腺癌患者NAC后的pCR。方法:回顾2017年1月至2020年12月行NAC手术前晚期乳腺癌患者的电子病历。来自首尔国立大学盆唐医院的患者数据被分为培训组和内部验证组。采用梯度增强机(GBM)、支持向量机、随机森林、决策树和神经网络等5种机器学习技术构建预测模型，并通过比较受者工作特征曲线下面积(AUC)选择最佳模型。最后，使用来自首尔国立大学医院的独立队列验证该模型。结果:研究共纳入1003例患者，其中培训组287例，内部验证组71例，外部验证组645例。总体而言，36.3%的患者实现了pCR。在5种机器学习模型中，GBM的pCR预测AUC最高(AUC, 0.903;95%可信区间[CI]， 0.833-0.972)。外部验证证实AUC为0.833 (95% CI, 0.800-0.865)。结论:使用常用的临床和人口统计学变量来开发预测NAC后pCR的机器学习模型。模型的外部验证显示了良好的判别能力，表明常规收集的变量足以建立良好的预测模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Breast Cancer 医学-肿瘤学

CiteScore

3.80

自引率

4.20%

发文量

审稿时长

6-12 weeks

期刊介绍： The Journal of Breast Cancer (abbreviated as ''J Breast Cancer'') is the official journal of the Korean Breast Cancer Society, which is issued quarterly in the last day of March, June, September, and December each year since 1998. All the contents of the Journal is available online at the official journal website (http://ejbc.kr) under open access policy. The journal aims to provide a forum for the academic communication between medical doctors, basic science researchers, and health care professionals to be interested in breast cancer. To get this aim, we publish original investigations, review articles, brief communications including case reports, editorial opinions on the topics of importance to breast cancer, and welcome new research findings and epidemiological studies, especially when they contain a regional data to grab the international reader''s interest. Although the journal is mainly dealing with the issues of breast cancer, rare cases among benign breast diseases or evidence-based scientifically written articles providing useful information for clinical practice can be published as well.