Mostafa A Arafa, Karim H Farhat, Sherin F Aly, Farrukh K Khan, Alaa Mokhtar, Abdulaziz M Althunayan, Waleed Al-Taweel, Sultan S Al-Khateeb, Sami Azhari, Danny M Rabah
{"title":"Prediction of prostate biopsy outcomes at different cut-offs of prostate-specific antigen using machine learning: a multicenter study.","authors":"Mostafa A Arafa, Karim H Farhat, Sherin F Aly, Farrukh K Khan, Alaa Mokhtar, Abdulaziz M Althunayan, Waleed Al-Taweel, Sultan S Al-Khateeb, Sami Azhari, Danny M Rabah","doi":"10.1186/s43046-025-00265-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Machine learning (ML) is a significant area of artificial intelligence, which can improve the accuracy of predictive or diagnostic models for differentiating between prostate biopsy outcomes. This study aims to develop a novel decision-support ML model for classifying patients with biopsy-negative (cancer-free), clinically significant, and non-clinically significant prostate cancer across two prostate-specific antigen (PSA) cut-offs ≤ 10 ng/ml and > 10 ng/ml.</p><p><strong>Methods: </strong>The data for the current study were retrieved from the records of two main hospitals in Riyadh, Saudi Arabia from July 2018 through July 2024. Six machine learning algorithms were employed, and the dataset was randomly divided into a training set and a validation set at a ratio of 8:2. The following metrics were used as performance indicators across the six algorithms: Accuracy, Precision, Recall, F1-score, and area under the curve. Recent data from the two hospitals was utilized for external validation.</p><p><strong>Results: </strong>The metrics for Random Forest, Extra Tree, and Decision Tree algorithms showed excellent capability in classifying the outcomes of prostate biopsy for the two PSA cut-offs. However, the metrics for the PSA cut-off > 10 ng/ml are higher than those for PSA ≤ 10 ng/ml. For the three-class classification, the accuracy and area under the curve for the cut-off > 10 ng/ml were 0.96 and 0.99, respectively. While for the cut-off ≤ 10 ng/ml they were 0.92 and 0.94 for Random Forest and 0.94 and 0.95 for the Extra Tree algorithm. The metrics of non-clinically significant and biopsy-negative cases outperformed those of clinically significant cases.</p><p><strong>Conclusion: </strong>ML models are proving to be effective tools in differentiating between prostate biopsy outcomes, enhancing diagnostic accuracy, and potentially transforming clinical practices in prostate cancer management.</p>","PeriodicalId":17301,"journal":{"name":"Journal of the Egyptian National Cancer Institute","volume":"37 1","pages":"8"},"PeriodicalIF":2.1000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Egyptian National Cancer Institute","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s43046-025-00265-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Machine learning (ML) is a significant area of artificial intelligence, which can improve the accuracy of predictive or diagnostic models for differentiating between prostate biopsy outcomes. This study aims to develop a novel decision-support ML model for classifying patients with biopsy-negative (cancer-free), clinically significant, and non-clinically significant prostate cancer across two prostate-specific antigen (PSA) cut-offs ≤ 10 ng/ml and > 10 ng/ml.
Methods: The data for the current study were retrieved from the records of two main hospitals in Riyadh, Saudi Arabia from July 2018 through July 2024. Six machine learning algorithms were employed, and the dataset was randomly divided into a training set and a validation set at a ratio of 8:2. The following metrics were used as performance indicators across the six algorithms: Accuracy, Precision, Recall, F1-score, and area under the curve. Recent data from the two hospitals was utilized for external validation.
Results: The metrics for Random Forest, Extra Tree, and Decision Tree algorithms showed excellent capability in classifying the outcomes of prostate biopsy for the two PSA cut-offs. However, the metrics for the PSA cut-off > 10 ng/ml are higher than those for PSA ≤ 10 ng/ml. For the three-class classification, the accuracy and area under the curve for the cut-off > 10 ng/ml were 0.96 and 0.99, respectively. While for the cut-off ≤ 10 ng/ml they were 0.92 and 0.94 for Random Forest and 0.94 and 0.95 for the Extra Tree algorithm. The metrics of non-clinically significant and biopsy-negative cases outperformed those of clinically significant cases.
Conclusion: ML models are proving to be effective tools in differentiating between prostate biopsy outcomes, enhancing diagnostic accuracy, and potentially transforming clinical practices in prostate cancer management.
期刊介绍:
As the official publication of the National Cancer Institute, Cairo University, the Journal of the Egyptian National Cancer Institute (JENCI) is an open access peer-reviewed journal that publishes on the latest innovations in oncology and thereby, providing academics and clinicians a leading research platform. JENCI welcomes submissions pertaining to all fields of basic, applied and clinical cancer research. Main topics of interest include: local and systemic anticancer therapy (with specific interest on applied cancer research from developing countries); experimental oncology; early cancer detection; randomized trials (including negatives ones); and key emerging fields of personalized medicine, such as molecular pathology, bioinformatics, and biotechnologies.