Flavio Vasconcelos Ordones , Paulo Roberto Kawano , Lodewikus Vermeulen , Ali Hooshyari , David Scholtz , Peter John Gilling , Darren Foreman , Basil Kaufmann , Cedric Poyet , Michael Gorin , Abner Macola Pacheco Barbosa , Naila Camila da Rocha , Luis Gustavo Modelli de Andrade
{"title":"A Novel Machine Learning-based Predictive Model of Clinically Significant Prostate Cancer and Online Risk Calculator","authors":"Flavio Vasconcelos Ordones , Paulo Roberto Kawano , Lodewikus Vermeulen , Ali Hooshyari , David Scholtz , Peter John Gilling , Darren Foreman , Basil Kaufmann , Cedric Poyet , Michael Gorin , Abner Macola Pacheco Barbosa , Naila Camila da Rocha , Luis Gustavo Modelli de Andrade","doi":"10.1016/j.urology.2024.11.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>To create a machine-learning predictive model combining prostate imaging-reporting and data system (PI-RADS) score, PSA density, and clinical variables to predict clinically significant prostate cancer (csPCa).</div></div><div><h3>Methods</h3><div>We evaluated a cohort of patients who underwent prostate biopsy for suspected prostate cancer (PCa) in New Zealand, Australia, and Switzerland. We collected data on age, body mass index (BMI), PSA level, prostate volume, PSA density (PSAD), PI-RADS scores, previous biopsy, and corresponding histology results. The dataset was divided into derivation (training) and validation (test) sets using random splits. An independent dataset was obtained from the Harvard Dataverse for external validation. A cohort of 1272 patients was analyzed. We fitted a Lasso model, XGBoost, and LightGBM to the training set and assessed their accuracy.</div></div><div><h3>Results</h3><div>All models demonstrated ROC-AUC values ranging from 0.830 to 0.851. LightGBM was considered the superior model, with an ROC of 0.851 (95%CI: 0.804-0.897) in the test set and 0.818 (95% CI: 0.798-0.831) in the external dataset. The most important variable was PI-RADS, followed by PSA density, history of previous biopsy, age, and BMI.</div></div><div><h3>Conclusion</h3><div>We developed a predictive model for detecting csPCa that exhibited a high ROC-AUC value for internal and external validations. This suggests that the integration of the clinical parameters outperformed each individual predictor. Additionally, the model demonstrated good calibration metrics, indicative of a more balanced model than the existing models.</div></div>","PeriodicalId":23415,"journal":{"name":"Urology","volume":"196 ","pages":"Pages 20-26"},"PeriodicalIF":2.0000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Urology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0090429524010574","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/12 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"UROLOGY & NEPHROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
To create a machine-learning predictive model combining prostate imaging-reporting and data system (PI-RADS) score, PSA density, and clinical variables to predict clinically significant prostate cancer (csPCa).
Methods
We evaluated a cohort of patients who underwent prostate biopsy for suspected prostate cancer (PCa) in New Zealand, Australia, and Switzerland. We collected data on age, body mass index (BMI), PSA level, prostate volume, PSA density (PSAD), PI-RADS scores, previous biopsy, and corresponding histology results. The dataset was divided into derivation (training) and validation (test) sets using random splits. An independent dataset was obtained from the Harvard Dataverse for external validation. A cohort of 1272 patients was analyzed. We fitted a Lasso model, XGBoost, and LightGBM to the training set and assessed their accuracy.
Results
All models demonstrated ROC-AUC values ranging from 0.830 to 0.851. LightGBM was considered the superior model, with an ROC of 0.851 (95%CI: 0.804-0.897) in the test set and 0.818 (95% CI: 0.798-0.831) in the external dataset. The most important variable was PI-RADS, followed by PSA density, history of previous biopsy, age, and BMI.
Conclusion
We developed a predictive model for detecting csPCa that exhibited a high ROC-AUC value for internal and external validations. This suggests that the integration of the clinical parameters outperformed each individual predictor. Additionally, the model demonstrated good calibration metrics, indicative of a more balanced model than the existing models.
期刊介绍:
Urology is a monthly, peer–reviewed journal primarily for urologists, residents, interns, nephrologists, and other specialists interested in urology
The mission of Urology®, the "Gold Journal," is to provide practical, timely, and relevant clinical and basic science information to physicians and researchers practicing the art of urology worldwide. Urology® publishes original articles relating to adult and pediatric clinical urology as well as to clinical and basic science research. Topics in Urology® include pediatrics, surgical oncology, radiology, pathology, erectile dysfunction, infertility, incontinence, transplantation, endourology, andrology, female urology, reconstructive surgery, and medical oncology, as well as relevant basic science issues. Special features include rapid communication of important timely issues, surgeon''s workshops, interesting case reports, surgical techniques, clinical and basic science review articles, guest editorials, letters to the editor, book reviews, and historical articles in urology.