Comparative effectiveness of standard vs. AI-assisted PET/CT reading workflow for pre-treatment lymphoma staging: a multi-institutional reader study evaluation

Frontiers in Nuclear Medicine Pub Date : 2024-01-11 DOI:10.3389/fnume.2023.1327186

R. Frood, Julien M. Y. Willaime, Brad Miles, Greg Chambers, H’ssein Al-Chalabi, Tamir Ali, Natasha Hougham, Naomi Brooks, George Petrides, Matthew Naylor, Daniel Ward, Tom Sulkin, Richard Chaytor, Peter Strouhal, Chirag Patel, A. F. Scarsbrook

{"title":"Comparative effectiveness of standard vs. AI-assisted PET/CT reading workflow for pre-treatment lymphoma staging: a multi-institutional reader study evaluation","authors":"R. Frood, Julien M. Y. Willaime, Brad Miles, Greg Chambers, H’ssein Al-Chalabi, Tamir Ali, Natasha Hougham, Naomi Brooks, George Petrides, Matthew Naylor, Daniel Ward, Tom Sulkin, Richard Chaytor, Peter Strouhal, Chirag Patel, A. F. Scarsbrook","doi":"10.3389/fnume.2023.1327186","DOIUrl":null,"url":null,"abstract":"Fluorine-18 fluorodeoxyglucose (FDG)-positron emission tomography/computed tomography (PET/CT) is widely used for staging high-grade lymphoma, with the time to evaluate such studies varying depending on the complexity of the case. Integrating artificial intelligence (AI) within the reporting workflow has the potential to improve quality and efficiency. The aims of the present study were to evaluate the influence of an integrated research prototype segmentation tool implemented within diagnostic PET/CT reading software on the speed and quality of reporting with variable levels of experience, and to assess the effect of the AI-assisted workflow on reader confidence and whether this tool influenced reporting behaviour.Nine blinded reporters (three trainees, three junior consultants and three senior consultants) from three UK centres participated in a two-part reader study. A total of 15 lymphoma staging PET/CT scans were evaluated twice: first, using a standard PET/CT reporting workflow; then, after a 6-week gap, with AI assistance incorporating pre-segmentation of disease sites within the reading software. An even split of PET/CT segmentations with gold standard (GS), false-positive (FP) over-contour or false-negative (FN) under-contour were provided. The read duration was calculated using file logs, while the report quality was independently assessed by two radiologists with >15 years of experience. Confidence in AI assistance and identification of disease was assessed via online questionnaires for each case.There was a significant decrease in time between non-AI and AI-assisted reads (median 15.0 vs. 13.3 min, p < 0.001). Sub-analysis confirmed this was true for both junior (14.5 vs. 12.7 min, p = 0.03) and senior consultants (15.1 vs. 12.2 min, p = 0.03) but not for trainees (18.1 vs. 18.0 min, p = 0.2). There was no significant difference between report quality between reads. AI assistance provided a significant increase in confidence of disease identification (p < 0.001). This held true when splitting the data into FN, GS and FP. In 19/88 cases, participants did not identify either FP (31.8%) or FN (11.4%) segmentations. This was significantly greater for trainees (13/30, 43.3%) than for junior (3/28, 10.7%, p = 0.05) and senior consultants (3/30, 10.0%, p = 0.05).The study findings indicate that an AI-assisted workflow achieves comparable performance to humans, demonstrating a marginal enhancement in reporting speed. Less experienced readers were more influenced by segmentation errors. An AI-assisted PET/CT reading workflow has the potential to increase reporting efficiency without adversely affecting quality, which could reduce costs and report turnaround times. These preliminary findings need to be confirmed in larger studies.","PeriodicalId":505895,"journal":{"name":"Frontiers in Nuclear Medicine","volume":" 12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Nuclear Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fnume.2023.1327186","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Fluorine-18 fluorodeoxyglucose (FDG)-positron emission tomography/computed tomography (PET/CT) is widely used for staging high-grade lymphoma, with the time to evaluate such studies varying depending on the complexity of the case. Integrating artificial intelligence (AI) within the reporting workflow has the potential to improve quality and efficiency. The aims of the present study were to evaluate the influence of an integrated research prototype segmentation tool implemented within diagnostic PET/CT reading software on the speed and quality of reporting with variable levels of experience, and to assess the effect of the AI-assisted workflow on reader confidence and whether this tool influenced reporting behaviour.Nine blinded reporters (three trainees, three junior consultants and three senior consultants) from three UK centres participated in a two-part reader study. A total of 15 lymphoma staging PET/CT scans were evaluated twice: first, using a standard PET/CT reporting workflow; then, after a 6-week gap, with AI assistance incorporating pre-segmentation of disease sites within the reading software. An even split of PET/CT segmentations with gold standard (GS), false-positive (FP) over-contour or false-negative (FN) under-contour were provided. The read duration was calculated using file logs, while the report quality was independently assessed by two radiologists with >15 years of experience. Confidence in AI assistance and identification of disease was assessed via online questionnaires for each case.There was a significant decrease in time between non-AI and AI-assisted reads (median 15.0 vs. 13.3 min, p < 0.001). Sub-analysis confirmed this was true for both junior (14.5 vs. 12.7 min, p = 0.03) and senior consultants (15.1 vs. 12.2 min, p = 0.03) but not for trainees (18.1 vs. 18.0 min, p = 0.2). There was no significant difference between report quality between reads. AI assistance provided a significant increase in confidence of disease identification (p < 0.001). This held true when splitting the data into FN, GS and FP. In 19/88 cases, participants did not identify either FP (31.8%) or FN (11.4%) segmentations. This was significantly greater for trainees (13/30, 43.3%) than for junior (3/28, 10.7%, p = 0.05) and senior consultants (3/30, 10.0%, p = 0.05).The study findings indicate that an AI-assisted workflow achieves comparable performance to humans, demonstrating a marginal enhancement in reporting speed. Less experienced readers were more influenced by segmentation errors. An AI-assisted PET/CT reading workflow has the potential to increase reporting efficiency without adversely affecting quality, which could reduce costs and report turnaround times. These preliminary findings need to be confirmed in larger studies.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

治疗前淋巴瘤分期的标准 PET/CT 阅读工作流程与 AI 辅助阅读工作流程的效果比较：多机构阅读研究评估

氟-18脱氧葡萄糖（FDG）-正电子发射断层扫描/计算机断层扫描（PET/CT）被广泛用于高级别淋巴瘤的分期，评估此类研究的时间因病例的复杂程度而异。在报告工作流程中整合人工智能（AI）有可能提高质量和效率。本研究的目的是评估在 PET/CT 诊断阅片软件中实施的集成研究原型分割工具对不同经验水平的报告速度和质量的影响，并评估人工智能辅助工作流程对读者信心的影响以及该工具是否会影响报告行为。来自英国三个中心的九名盲法报告人员（三名实习生、三名初级顾问和三名高级顾问）参加了一项由两部分组成的读者研究。共对 15 份淋巴瘤分期 PET/CT 扫描进行了两次评估：首先，使用标准 PET/CT 报告工作流程；然后，在间隔 6 周后，在人工智能的辅助下，将疾病部位的预分割纳入阅片软件。提供的 PET/CT 分段平均分为金标准（GS）、假阳性（FP）过度轮廓或假阴性（FN）轮廓不足。读取时间是通过文件日志计算得出的，而报告质量则由两名具有 15 年以上经验的放射科医生独立评估。非人工智能和人工智能辅助读片的时间显著缩短（中位 15.0 分钟对 13.3 分钟，p < 0.001）。子分析表明，初级顾问（14.5 分钟 vs. 12.7 分钟，p = 0.03）和高级顾问（15.1 分钟 vs. 12.2 分钟，p = 0.03）都是如此，但实习生（18.1 分钟 vs. 18.0 分钟，p = 0.2）不是这样。不同读数的报告质量没有明显差异。人工智能辅助可显著提高疾病识别的可信度（p < 0.001）。将数据拆分为 FN、GS 和 FP 时，情况也是如此。在 19/88 个病例中，参与者既没有识别出 FP（31.8%），也没有识别出 FN（11.4%）分割。研究结果表明，人工智能辅助工作流程的性能与人类相当，在报告速度方面略有提高。经验不足的读者受分割错误的影响更大。人工智能辅助 PET/CT 阅读工作流程有可能在不影响质量的情况下提高报告效率，从而降低成本并缩短报告周转时间。这些初步研究结果还需要更大规模的研究来证实。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Frontiers in Nuclear Medicine

自引率

0.00%

发文量