Development and External Validation of a Machine Learning Model to Predict Pathological Complete Response After Neoadjuvant Chemotherapy in Breast Cancer.
Ji-Jung Jung, Eun-Kyu Kim, Eunyoung Kang, Jee Hyun Kim, Se Hyun Kim, Koung Jin Suh, Sun Mi Kim, Mijung Jang, Bo La Yun, So Yeon Park, Changjin Lim, Wonshik Han, Hee-Chul Shin
{"title":"Development and External Validation of a Machine Learning Model to Predict Pathological Complete Response After Neoadjuvant Chemotherapy in Breast Cancer.","authors":"Ji-Jung Jung, Eun-Kyu Kim, Eunyoung Kang, Jee Hyun Kim, Se Hyun Kim, Koung Jin Suh, Sun Mi Kim, Mijung Jang, Bo La Yun, So Yeon Park, Changjin Lim, Wonshik Han, Hee-Chul Shin","doi":"10.4048/jbc.2023.26.e14","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Several predictive models have been developed to predict the pathological complete response (pCR) after neoadjuvant chemotherapy (NAC); however, few are broadly applicable owing to radiologic complexity and institution-specific clinical variables, and none have been externally validated. This study aimed to develop and externally validate a machine learning model that predicts pCR after NAC in patients with breast cancer using routinely collected clinical and demographic variables.</p><p><strong>Methods: </strong>The electronic medical records of patients with advanced breast cancer who underwent NAC before surgical resection between January 2017 and December 2020 were reviewed. Patient data from Seoul National University Bundang Hospital were divided into training and internal validation cohorts. Five machine learning techniques, including gradient boosting machine (GBM), support vector machine, random forest, decision tree, and neural network, were used to build predictive models, and the area under the receiver operating characteristic curve (AUC) was compared to select the best model. Finally, the model was validated using an independent cohort from Seoul National University Hospital.</p><p><strong>Results: </strong>A total of 1,003 patients were included in the study: 287, 71, and 645 in the training, internal validation, and external validation cohorts, respectively. Overall, 36.3% of the patients achieved pCR. Among the five machine learning models, the GBM showed the highest AUC for pCR prediction (AUC, 0.903; 95% confidence interval [CI], 0.833-0.972). External validation confirmed an AUC of 0.833 (95% CI, 0.800-0.865).</p><p><strong>Conclusion: </strong>Commonly available clinical and demographic variables were used to develop a machine learning model for predicting pCR following NAC. External validation of the model demonstrated good discrimination power, indicating that routinely collected variables were sufficient to build a good prediction model.</p>","PeriodicalId":15206,"journal":{"name":"Journal of Breast Cancer","volume":"26 4","pages":"353-362"},"PeriodicalIF":2.2000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/8b/15/jbc-26-353.PMC10475713.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Breast Cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.4048/jbc.2023.26.e14","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: Several predictive models have been developed to predict the pathological complete response (pCR) after neoadjuvant chemotherapy (NAC); however, few are broadly applicable owing to radiologic complexity and institution-specific clinical variables, and none have been externally validated. This study aimed to develop and externally validate a machine learning model that predicts pCR after NAC in patients with breast cancer using routinely collected clinical and demographic variables.
Methods: The electronic medical records of patients with advanced breast cancer who underwent NAC before surgical resection between January 2017 and December 2020 were reviewed. Patient data from Seoul National University Bundang Hospital were divided into training and internal validation cohorts. Five machine learning techniques, including gradient boosting machine (GBM), support vector machine, random forest, decision tree, and neural network, were used to build predictive models, and the area under the receiver operating characteristic curve (AUC) was compared to select the best model. Finally, the model was validated using an independent cohort from Seoul National University Hospital.
Results: A total of 1,003 patients were included in the study: 287, 71, and 645 in the training, internal validation, and external validation cohorts, respectively. Overall, 36.3% of the patients achieved pCR. Among the five machine learning models, the GBM showed the highest AUC for pCR prediction (AUC, 0.903; 95% confidence interval [CI], 0.833-0.972). External validation confirmed an AUC of 0.833 (95% CI, 0.800-0.865).
Conclusion: Commonly available clinical and demographic variables were used to develop a machine learning model for predicting pCR following NAC. External validation of the model demonstrated good discrimination power, indicating that routinely collected variables were sufficient to build a good prediction model.
期刊介绍:
The Journal of Breast Cancer (abbreviated as ''J Breast Cancer'') is the official journal of the Korean Breast Cancer Society, which is issued quarterly in the last day of March, June, September, and December each year since 1998. All the contents of the Journal is available online at the official journal website (http://ejbc.kr) under open access policy. The journal aims to provide a forum for the academic communication between medical doctors, basic science researchers, and health care professionals to be interested in breast cancer. To get this aim, we publish original investigations, review articles, brief communications including case reports, editorial opinions on the topics of importance to breast cancer, and welcome new research findings and epidemiological studies, especially when they contain a regional data to grab the international reader''s interest. Although the journal is mainly dealing with the issues of breast cancer, rare cases among benign breast diseases or evidence-based scientifically written articles providing useful information for clinical practice can be published as well.