Evaluating deep learning auto-contouring for lung radiation therapy: A review of accuracy, variability, efficiency and dose, in target volumes and organs at risk
{"title":"Evaluating deep learning auto-contouring for lung radiation therapy: A review of accuracy, variability, efficiency and dose, in target volumes and organs at risk","authors":"Keeva Moran, Claire Poole, Sarah Barrett","doi":"10.1016/j.phro.2025.100736","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and purpose</h3><div>Delineation of target volumes (TVs) and organs at risk (OARs) is a resource intensive process in lung radiation therapy and, despite the introduction of some auto-contouring, inter-observer variability remains a challenge. Deep learning algorithms may prove an efficient alternative and this review aims to map the evidence base on the use of deep learning algorithms for TV and OAR delineation in the radiation therapy planning process for lung cancer patients.</div></div><div><h3>Materials and methods</h3><div>A literature search identified studies relating to deep learning. Manual contouring and deep learning auto-contours were evaluated against one another for accuracy, inter-observer variability, contouring time and dose-volume effects. A total of 40 studies were included for review.</div></div><div><h3>Results</h3><div>Thirty nine out of 40 studies investigated the accuracy of deep learning auto-contours and determined that they were of a comparable accuracy to manual contours. Inter-observer variability outcomes were heterogeneous in the seven relevant studies identified. Twenty-four studies analysed the time saving associated with deep learning auto-contours and reported a significant time reduction in comparison to manual contours. The eight studies that conducted a dose-volume metric evaluation on deep learning auto-contours identified negligible effect on treatment plans.</div></div><div><h3>Conclusion</h3><div>The accuracy and time-saving capacity of deep learning auto-contours in comparison to manual contours has been extensively studied. However, additional research is required in the areas of inter-observer variability and dose-volume metric evaluation to further substantiate its clinical use.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"33 ","pages":"Article 100736"},"PeriodicalIF":3.4000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics and Imaging in Radiation Oncology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405631625000417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background and purpose
Delineation of target volumes (TVs) and organs at risk (OARs) is a resource intensive process in lung radiation therapy and, despite the introduction of some auto-contouring, inter-observer variability remains a challenge. Deep learning algorithms may prove an efficient alternative and this review aims to map the evidence base on the use of deep learning algorithms for TV and OAR delineation in the radiation therapy planning process for lung cancer patients.
Materials and methods
A literature search identified studies relating to deep learning. Manual contouring and deep learning auto-contours were evaluated against one another for accuracy, inter-observer variability, contouring time and dose-volume effects. A total of 40 studies were included for review.
Results
Thirty nine out of 40 studies investigated the accuracy of deep learning auto-contours and determined that they were of a comparable accuracy to manual contours. Inter-observer variability outcomes were heterogeneous in the seven relevant studies identified. Twenty-four studies analysed the time saving associated with deep learning auto-contours and reported a significant time reduction in comparison to manual contours. The eight studies that conducted a dose-volume metric evaluation on deep learning auto-contours identified negligible effect on treatment plans.
Conclusion
The accuracy and time-saving capacity of deep learning auto-contours in comparison to manual contours has been extensively studied. However, additional research is required in the areas of inter-observer variability and dose-volume metric evaluation to further substantiate its clinical use.