Evaluation of the impact of artificial intelligence-assisted image interpretation on the diagnostic performance of clinicians in identifying pneumothoraces on plain chest X-ray: a multi-case multi-reader study.
Alex Novak, Sarim Ather, Avneet Gill, Peter Aylward, Giles Maskell, Gordon W Cowell, Abdala Trinidad Espinosa Morgado, Tom Duggan, Melissa Keevill, Olivia Gamble, Osama Akrama, Elizabeth Belcher, Rhona Taberham, Rob Hallifax, Jasdeep Bahra, Abhishek Banerji, Jon Bailey, Antonia James, Ali Ansaripour, Nathan Spence, John Wrightson, Waqas Jarral, Steven Barry, Saher Bhatti, Kerry Astley, Amied Shadmaan, Sharon Ghelman, Alec Baenen, Jason Oke, Claire Bloomfield, Hilal Johnson, Mark Beggs, Fergus Gleeson
{"title":"Evaluation of the impact of artificial intelligence-assisted image interpretation on the diagnostic performance of clinicians in identifying pneumothoraces on plain chest X-ray: a multi-case multi-reader study.","authors":"Alex Novak, Sarim Ather, Avneet Gill, Peter Aylward, Giles Maskell, Gordon W Cowell, Abdala Trinidad Espinosa Morgado, Tom Duggan, Melissa Keevill, Olivia Gamble, Osama Akrama, Elizabeth Belcher, Rhona Taberham, Rob Hallifax, Jasdeep Bahra, Abhishek Banerji, Jon Bailey, Antonia James, Ali Ansaripour, Nathan Spence, John Wrightson, Waqas Jarral, Steven Barry, Saher Bhatti, Kerry Astley, Amied Shadmaan, Sharon Ghelman, Alec Baenen, Jason Oke, Claire Bloomfield, Hilal Johnson, Mark Beggs, Fergus Gleeson","doi":"10.1136/emermed-2023-213620","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Artificial intelligence (AI)-assisted image interpretation is a fast-developing area of clinical innovation. Most research to date has focused on the performance of AI-assisted algorithms in comparison with that of radiologists rather than evaluating the algorithms' impact on the clinicians who often undertake initial image interpretation in routine clinical practice. This study assessed the impact of AI-assisted image interpretation on the diagnostic performance of frontline acute care clinicians for the detection of pneumothoraces (PTX).</p><p><strong>Methods: </strong>A multicentre blinded multi-case multi-reader study was conducted between October 2021 and January 2022. The online study recruited 18 clinician readers from six different clinical specialties, with differing levels of seniority, across four English hospitals. The study included 395 plain CXR images, 189 positive for PTX and 206 negative. The reference standard was the consensus opinion of two thoracic radiologists with a third acting as arbitrator. General Electric Healthcare Critical Care Suite (GEHC CCS) PTX algorithm was applied to the final dataset. Readers individually interpreted the dataset without AI assistance, recording the presence or absence of a PTX and a confidence rating. Following a 'washout' period, this process was repeated including the AI output.</p><p><strong>Results: </strong>Analysis of the performance of the algorithm for detecting or ruling out a PTX revealed an overall AUROC of 0.939. Overall reader sensitivity increased by 11.4% (95% CI 4.8, 18.0, p=0.002) from 66.8% (95% CI 57.3, 76.2) unaided to 78.1% aided (95% CI 72.2, 84.0, p=0.002), specificity 93.9% (95% CI 90.9, 97.0) without AI to 95.8% (95% CI 93.7, 97.9, p=0.247). The junior reader subgroup showed the largest improvement at 21.7% (95% CI 10.9, 32.6), increasing from 56.0% (95% CI 37.7, 74.3) to 77.7% (95% CI 65.8, 89.7, p<0.01).</p><p><strong>Conclusion: </strong>The study indicates that AI-assisted image interpretation significantly enhances the diagnostic accuracy of clinicians in detecting PTX, particularly benefiting less experienced practitioners. While overall interpretation time remained unchanged, the use of AI improved diagnostic confidence and sensitivity, especially among junior clinicians. These findings underscore the potential of AI to support less skilled clinicians in acute care settings.</p>","PeriodicalId":11532,"journal":{"name":"Emergency Medicine Journal","volume":" ","pages":"602-609"},"PeriodicalIF":2.7000,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11503157/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Emergency Medicine Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1136/emermed-2023-213620","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EMERGENCY MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Artificial intelligence (AI)-assisted image interpretation is a fast-developing area of clinical innovation. Most research to date has focused on the performance of AI-assisted algorithms in comparison with that of radiologists rather than evaluating the algorithms' impact on the clinicians who often undertake initial image interpretation in routine clinical practice. This study assessed the impact of AI-assisted image interpretation on the diagnostic performance of frontline acute care clinicians for the detection of pneumothoraces (PTX).
Methods: A multicentre blinded multi-case multi-reader study was conducted between October 2021 and January 2022. The online study recruited 18 clinician readers from six different clinical specialties, with differing levels of seniority, across four English hospitals. The study included 395 plain CXR images, 189 positive for PTX and 206 negative. The reference standard was the consensus opinion of two thoracic radiologists with a third acting as arbitrator. General Electric Healthcare Critical Care Suite (GEHC CCS) PTX algorithm was applied to the final dataset. Readers individually interpreted the dataset without AI assistance, recording the presence or absence of a PTX and a confidence rating. Following a 'washout' period, this process was repeated including the AI output.
Results: Analysis of the performance of the algorithm for detecting or ruling out a PTX revealed an overall AUROC of 0.939. Overall reader sensitivity increased by 11.4% (95% CI 4.8, 18.0, p=0.002) from 66.8% (95% CI 57.3, 76.2) unaided to 78.1% aided (95% CI 72.2, 84.0, p=0.002), specificity 93.9% (95% CI 90.9, 97.0) without AI to 95.8% (95% CI 93.7, 97.9, p=0.247). The junior reader subgroup showed the largest improvement at 21.7% (95% CI 10.9, 32.6), increasing from 56.0% (95% CI 37.7, 74.3) to 77.7% (95% CI 65.8, 89.7, p<0.01).
Conclusion: The study indicates that AI-assisted image interpretation significantly enhances the diagnostic accuracy of clinicians in detecting PTX, particularly benefiting less experienced practitioners. While overall interpretation time remained unchanged, the use of AI improved diagnostic confidence and sensitivity, especially among junior clinicians. These findings underscore the potential of AI to support less skilled clinicians in acute care settings.
期刊介绍:
The Emergency Medicine Journal is a leading international journal reporting developments and advances in emergency medicine and acute care. It has relevance to all specialties involved in the management of emergencies in the hospital and prehospital environment. Each issue contains editorials, reviews, original research, evidence based reviews, letters and more.