Flavio Vasconcelos Ordones, Paulo Roberto Kawano, Lodewikus Vermeulen, Ali Hooshyari, David Scholtz, Peter John Gilling, Darren Foreman, Basil Kaufmann, Cedric Poyet, Michael Gorin, Abner Macola Pacheco Barbosa, Naila Camila da Rocha, Luis Gustavo Modelli de Andrade
{"title":"A Novel Machine Learning-Based Predictive Model of Clinically Significant Prostate Cancer and Online Risk Calculator.","authors":"Flavio Vasconcelos Ordones, Paulo Roberto Kawano, Lodewikus Vermeulen, Ali Hooshyari, David Scholtz, Peter John Gilling, Darren Foreman, Basil Kaufmann, Cedric Poyet, Michael Gorin, Abner Macola Pacheco Barbosa, Naila Camila da Rocha, Luis Gustavo Modelli de Andrade","doi":"10.1016/j.urology.2024.11.001","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>To create a machine learning predictive model combining PI-RADS score, PSA density, and clinical variables to predict clinically significant prostate cancer (csPCa).</p><p><strong>Methods: </strong>We evaluated a cohort of patients who underwent prostate biopsy for suspected prostate cancer (PCa) in New Zealand, Australia, and Switzerland. We collected data on age, body mass index (BMI), PSA level, prostate volume, PSA density (PSAD), PI-RADS scores, previous biopsy, and corresponding histology results. The dataset was divided into derivation (training) and validation (test) sets using random splits. An independent dataset was obtained from the Harvard Dataverse for external validation. A cohort of 1272 patients was analyzed. We fitted a Lasso model, XGBoost, and LightGBM to the training set and assessed their accuracy.</p><p><strong>Results: </strong>All models demonstrated ROC AUC values ranging from 0.830 to 0.851. LightGBM was considered the superior model, with an ROC of 0.851 [95%CI: 0.804 - 0.897] in the test set and 0.818 [95% CI: 0.798 - 0.831] in the external dataset. The most important variable was PI-RADS, followed by PSA density, history of previous biopsy, age, and BMI.</p><p><strong>Conclusions: </strong>We developed a predictive model for detecting csPCa that exhibited a high ROC-AUC value for internal and external validations. This suggests that the integration of the clinical parameters outperformed each individual predictor. Additionally, the model demonstrated good calibration metrics, indicative of a more balanced model than the existing models.</p>","PeriodicalId":23415,"journal":{"name":"Urology","volume":" ","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Urology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.urology.2024.11.001","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"UROLOGY & NEPHROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: To create a machine learning predictive model combining PI-RADS score, PSA density, and clinical variables to predict clinically significant prostate cancer (csPCa).
Methods: We evaluated a cohort of patients who underwent prostate biopsy for suspected prostate cancer (PCa) in New Zealand, Australia, and Switzerland. We collected data on age, body mass index (BMI), PSA level, prostate volume, PSA density (PSAD), PI-RADS scores, previous biopsy, and corresponding histology results. The dataset was divided into derivation (training) and validation (test) sets using random splits. An independent dataset was obtained from the Harvard Dataverse for external validation. A cohort of 1272 patients was analyzed. We fitted a Lasso model, XGBoost, and LightGBM to the training set and assessed their accuracy.
Results: All models demonstrated ROC AUC values ranging from 0.830 to 0.851. LightGBM was considered the superior model, with an ROC of 0.851 [95%CI: 0.804 - 0.897] in the test set and 0.818 [95% CI: 0.798 - 0.831] in the external dataset. The most important variable was PI-RADS, followed by PSA density, history of previous biopsy, age, and BMI.
Conclusions: We developed a predictive model for detecting csPCa that exhibited a high ROC-AUC value for internal and external validations. This suggests that the integration of the clinical parameters outperformed each individual predictor. Additionally, the model demonstrated good calibration metrics, indicative of a more balanced model than the existing models.
期刊介绍:
Urology is a monthly, peer–reviewed journal primarily for urologists, residents, interns, nephrologists, and other specialists interested in urology
The mission of Urology®, the "Gold Journal," is to provide practical, timely, and relevant clinical and basic science information to physicians and researchers practicing the art of urology worldwide. Urology® publishes original articles relating to adult and pediatric clinical urology as well as to clinical and basic science research. Topics in Urology® include pediatrics, surgical oncology, radiology, pathology, erectile dysfunction, infertility, incontinence, transplantation, endourology, andrology, female urology, reconstructive surgery, and medical oncology, as well as relevant basic science issues. Special features include rapid communication of important timely issues, surgeon''s workshops, interesting case reports, surgical techniques, clinical and basic science review articles, guest editorials, letters to the editor, book reviews, and historical articles in urology.