Prediction of prognosis in lung cancer using machine learning with inter-institutional generalizability: A multicenter cohort study (WJOG15121L: REAL-WIND)
{"title":"Prediction of prognosis in lung cancer using machine learning with inter-institutional generalizability: A multicenter cohort study (WJOG15121L: REAL-WIND)","authors":"Daichi Fujimoto , Hidetoshi Hayashi , Kenta Murotani , Yukihiro Toi , Toshihide Yokoyama , Terufumi Kato , Teppei Yamaguchi , Kaoru Tanaka , Satoru Miura , Motohiro Tamiya , Motoko Tachihara , Takehito Shukuya , Yuko Tsuchiya-Kawano , Yuki Sato , Satoshi Ikeda , Shinya Sakata , Takeshi Masuda , Shinnosuke Takemoto , Kohei Otsubo , Ryota Shibaki , Nobuyuki Yamamoto","doi":"10.1016/j.lungcan.2024.107896","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><p>Predicting the prognosis of lung cancer is crucial for providing optimal medical care. However, a method to accurately predict the overall prognosis in patients with stage IV lung cancer, even with the use of machine learning, has not been established. Moreover, the inter-institutional generalizability of such algorithms remains unexplored. This study aimed to establish machine learning-based algorithms with inter-institutional generalizability to predict prognosis.</p></div><div><h3>Materials and Methods</h3><p>This multicenter, retrospective, hospital-based cohort study included consecutive patients with stage IV lung cancer who were randomly categorized into the training and independent test cohorts with a 2:1 ratio, respectively. The primary metric to assess algorithm performance was the area under the receiver operating characteristic curve in the independent test cohort. To assess the inter-institutional generalizability of the algorithms, we investigated their ability to predict patient outcomes in the remaining facility after being trained using data from 15 other facilities.</p></div><div><h3>Results</h3><p>Overall, 6,751 patients (median age, 70 years) were enrolled, and 1,515 (22 %) showed mutated epidermal growth factor receptor expression. The median overall survival was 16.6 (95 % confidence interval, 15.9–17.5) months. Algorithm performance metrics in the test cohort showed that the areas under the curves were 0.90 (95 % confidence interval, 0.88–0.91), 0.85 (0.84–0.87), 0.83 (0.81–0.85), and 0.85 (0.82–0.87) at 180, 360, 720, and 1,080 predicted survival days, respectively. The performance test of 16 algorithms for investigating inter-institutional generalizability showed median areas under the curves of 0.87 (range, 0.84–0.92), 0.84 (0.78–0.88), 0.84 (0.76–0.89), and 0.84 (0.75–0.90) at 180, 360, 720, and 1,080 days, respectively.</p></div><div><h3>Conclusion</h3><p>This study developed machine learning algorithms that could accurately predict the prognosis in patients with stage IV lung cancer with high inter-institutional generalizability. This can enhance the accuracy of prognosis prediction and support informed and shared decision-making in clinical settings.</p></div>","PeriodicalId":18129,"journal":{"name":"Lung Cancer","volume":"194 ","pages":"Article 107896"},"PeriodicalIF":4.5000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lung Cancer","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169500224004306","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives
Predicting the prognosis of lung cancer is crucial for providing optimal medical care. However, a method to accurately predict the overall prognosis in patients with stage IV lung cancer, even with the use of machine learning, has not been established. Moreover, the inter-institutional generalizability of such algorithms remains unexplored. This study aimed to establish machine learning-based algorithms with inter-institutional generalizability to predict prognosis.
Materials and Methods
This multicenter, retrospective, hospital-based cohort study included consecutive patients with stage IV lung cancer who were randomly categorized into the training and independent test cohorts with a 2:1 ratio, respectively. The primary metric to assess algorithm performance was the area under the receiver operating characteristic curve in the independent test cohort. To assess the inter-institutional generalizability of the algorithms, we investigated their ability to predict patient outcomes in the remaining facility after being trained using data from 15 other facilities.
Results
Overall, 6,751 patients (median age, 70 years) were enrolled, and 1,515 (22 %) showed mutated epidermal growth factor receptor expression. The median overall survival was 16.6 (95 % confidence interval, 15.9–17.5) months. Algorithm performance metrics in the test cohort showed that the areas under the curves were 0.90 (95 % confidence interval, 0.88–0.91), 0.85 (0.84–0.87), 0.83 (0.81–0.85), and 0.85 (0.82–0.87) at 180, 360, 720, and 1,080 predicted survival days, respectively. The performance test of 16 algorithms for investigating inter-institutional generalizability showed median areas under the curves of 0.87 (range, 0.84–0.92), 0.84 (0.78–0.88), 0.84 (0.76–0.89), and 0.84 (0.75–0.90) at 180, 360, 720, and 1,080 days, respectively.
Conclusion
This study developed machine learning algorithms that could accurately predict the prognosis in patients with stage IV lung cancer with high inter-institutional generalizability. This can enhance the accuracy of prognosis prediction and support informed and shared decision-making in clinical settings.
期刊介绍:
Lung Cancer is an international publication covering the clinical, translational and basic science of malignancies of the lung and chest region.Original research articles, early reports, review articles, editorials and correspondence covering the prevention, epidemiology and etiology, basic biology, pathology, clinical assessment, surgery, chemotherapy, radiotherapy, combined treatment modalities, other treatment modalities and outcomes of lung cancer are welcome.