Julia Geppert, Asra Asgharzadeh, Anna Brown, Chris Stinton, Emma J Helm, Surangi Jayakody, Daniel Todkill, Daniel Gallacher, Hesam Ghiasvand, Mubarak Patel, Peter Auguste, Alexander Tsertsvadze, Yen-Fu Chen, Amy Grove, Bethany Shinkins, Aileen Clarke, Sian Taylor-Phillips
{"title":"Software using artificial intelligence for nodule and cancer detection in CT lung cancer screening: systematic review of test accuracy studies","authors":"Julia Geppert, Asra Asgharzadeh, Anna Brown, Chris Stinton, Emma J Helm, Surangi Jayakody, Daniel Todkill, Daniel Gallacher, Hesam Ghiasvand, Mubarak Patel, Peter Auguste, Alexander Tsertsvadze, Yen-Fu Chen, Amy Grove, Bethany Shinkins, Aileen Clarke, Sian Taylor-Phillips","doi":"10.1136/thorax-2024-221662","DOIUrl":null,"url":null,"abstract":"Objectives To examine the accuracy and impact of artificial intelligence (AI) software assistance in lung cancer screening using CT. Methods A systematic review of CE-marked, AI-based software for automated detection and analysis of nodules in CT lung cancer screening was conducted. Multiple databases including Medline, Embase and Cochrane CENTRAL were searched from 2012 to March 2023. Primary research reporting test accuracy or impact on reading time or clinical management was included. QUADAS-2 and QUADAS-C were used to assess risk of bias. We undertook narrative synthesis. Results Eleven studies evaluating six different AI-based software and reporting on 19 770 patients were eligible. All were at high risk of bias with multiple applicability concerns. Compared with unaided reading, AI-assisted reading was faster and generally improved sensitivity (+5% to +20% for detecting/categorising actionable nodules; +3% to +15% for detecting/categorising malignant nodules), with lower specificity (−7% to −3% for correctly detecting/categorising people without actionable nodules; −8% to −6% for correctly detecting/categorising people without malignant nodules). AI assistance tended to increase the proportion of nodules allocated to higher risk categories. Assuming 0.5% cancer prevalence, these results would translate into additional 150–750 cancers detected per million people attending screening but lead to an additional 59 700 to 79 600 people attending screening without cancer receiving unnecessary CT surveillance. Conclusions AI assistance in lung cancer screening may improve sensitivity but increases the number of false-positive results and unnecessary surveillance. Future research needs to increase the specificity of AI-assisted reading and minimise risk of bias and applicability concerns through improved study design. PROSPERO registration number CRD42021298449. All data relevant to the study are included in the article or uploaded as supplementary information.","PeriodicalId":23284,"journal":{"name":"Thorax","volume":"2 1","pages":""},"PeriodicalIF":9.0000,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Thorax","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1136/thorax-2024-221662","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives To examine the accuracy and impact of artificial intelligence (AI) software assistance in lung cancer screening using CT. Methods A systematic review of CE-marked, AI-based software for automated detection and analysis of nodules in CT lung cancer screening was conducted. Multiple databases including Medline, Embase and Cochrane CENTRAL were searched from 2012 to March 2023. Primary research reporting test accuracy or impact on reading time or clinical management was included. QUADAS-2 and QUADAS-C were used to assess risk of bias. We undertook narrative synthesis. Results Eleven studies evaluating six different AI-based software and reporting on 19 770 patients were eligible. All were at high risk of bias with multiple applicability concerns. Compared with unaided reading, AI-assisted reading was faster and generally improved sensitivity (+5% to +20% for detecting/categorising actionable nodules; +3% to +15% for detecting/categorising malignant nodules), with lower specificity (−7% to −3% for correctly detecting/categorising people without actionable nodules; −8% to −6% for correctly detecting/categorising people without malignant nodules). AI assistance tended to increase the proportion of nodules allocated to higher risk categories. Assuming 0.5% cancer prevalence, these results would translate into additional 150–750 cancers detected per million people attending screening but lead to an additional 59 700 to 79 600 people attending screening without cancer receiving unnecessary CT surveillance. Conclusions AI assistance in lung cancer screening may improve sensitivity but increases the number of false-positive results and unnecessary surveillance. Future research needs to increase the specificity of AI-assisted reading and minimise risk of bias and applicability concerns through improved study design. PROSPERO registration number CRD42021298449. All data relevant to the study are included in the article or uploaded as supplementary information.
期刊介绍:
Thorax stands as one of the premier respiratory medicine journals globally, featuring clinical and experimental research articles spanning respiratory medicine, pediatrics, immunology, pharmacology, pathology, and surgery. The journal's mission is to publish noteworthy advancements in scientific understanding that are poised to influence clinical practice significantly. This encompasses articles delving into basic and translational mechanisms applicable to clinical material, covering areas such as cell and molecular biology, genetics, epidemiology, and immunology.