Guidance on selecting and evaluating AI auto-segmentation systems in clinical radiotherapy: insights from a six-vendor analysis.

IF 2.4 4区 医学 Q3 ENGINEERING, BIOMEDICAL Physical and Engineering Sciences in Medicine Pub Date : 2025-01-13 DOI:10.1007/s13246-024-01513-x
Branimir Rusanov, Martin A Ebert, Mahsheed Sabet, Pejman Rowshanfarzad, Nathaniel Barry, Jake Kendrick, Zaid Alkhatib, Suki Gill, Joshua Dass, Nicholas Bucknell, Jeremy Croker, Colin Tang, Rohen White, Sean Bydder, Mandy Taylor, Luke Slama, Godfrey Mukwada
{"title":"Guidance on selecting and evaluating AI auto-segmentation systems in clinical radiotherapy: insights from a six-vendor analysis.","authors":"Branimir Rusanov, Martin A Ebert, Mahsheed Sabet, Pejman Rowshanfarzad, Nathaniel Barry, Jake Kendrick, Zaid Alkhatib, Suki Gill, Joshua Dass, Nicholas Bucknell, Jeremy Croker, Colin Tang, Rohen White, Sean Bydder, Mandy Taylor, Luke Slama, Godfrey Mukwada","doi":"10.1007/s13246-024-01513-x","DOIUrl":null,"url":null,"abstract":"<p><p>Artificial Intelligence (AI) based auto-segmentation has demonstrated numerous benefits to clinical radiotherapy workflows. However, the rapidly changing regulatory, research, and market environment presents challenges around selecting and evaluating the most suitable solution. To support the clinical adoption of AI auto-segmentation systems, Selection Criteria recommendations were developed to enable a holistic evaluation of vendors, considering not only raw performance but associated risks uniquely related to the clinical deployment of AI. In-house experience and key bodies of work on ethics, standards, and best practices for AI in Radiation Oncology were reviewed to inform selection criteria and evaluation strategies. A retrospective analysis using the criteria was performed across six vendors, including a quantitative assessment using five metrics (Dice, Hausdorff Distance, Average Surface Distance, Surface Dice, Added Path Length) across 20 head and neck, 20 thoracic, and 19 male pelvis patients for AI models as of March 2023. A total of 47 selection criteria were identified across seven categories. A retrospective analysis showed that overall no vendor performed exceedingly well, with systematically poor performance in Data Security & Responsibility, Vendor Support Tools, and Transparency & Ethics. In terms of raw performance, vendors varied widely from excellent to poor. As new regulations come into force and the scope of AI auto-segmentation systems adapt to clinical needs, continued interest in ensuring safe, fair, and transparent AI will persist. The selection and evaluation framework provided herein aims to promote user confidence by exploring the breadth of clinically relevant factors to support informed decision-making.</p>","PeriodicalId":48490,"journal":{"name":"Physical and Engineering Sciences in Medicine","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical and Engineering Sciences in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13246-024-01513-x","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Artificial Intelligence (AI) based auto-segmentation has demonstrated numerous benefits to clinical radiotherapy workflows. However, the rapidly changing regulatory, research, and market environment presents challenges around selecting and evaluating the most suitable solution. To support the clinical adoption of AI auto-segmentation systems, Selection Criteria recommendations were developed to enable a holistic evaluation of vendors, considering not only raw performance but associated risks uniquely related to the clinical deployment of AI. In-house experience and key bodies of work on ethics, standards, and best practices for AI in Radiation Oncology were reviewed to inform selection criteria and evaluation strategies. A retrospective analysis using the criteria was performed across six vendors, including a quantitative assessment using five metrics (Dice, Hausdorff Distance, Average Surface Distance, Surface Dice, Added Path Length) across 20 head and neck, 20 thoracic, and 19 male pelvis patients for AI models as of March 2023. A total of 47 selection criteria were identified across seven categories. A retrospective analysis showed that overall no vendor performed exceedingly well, with systematically poor performance in Data Security & Responsibility, Vendor Support Tools, and Transparency & Ethics. In terms of raw performance, vendors varied widely from excellent to poor. As new regulations come into force and the scope of AI auto-segmentation systems adapt to clinical needs, continued interest in ensuring safe, fair, and transparent AI will persist. The selection and evaluation framework provided herein aims to promote user confidence by exploring the breadth of clinically relevant factors to support informed decision-making.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
临床放疗中人工智能自动分割系统的选择和评估指南:来自六家供应商分析的见解。
基于人工智能(AI)的自动分割已经证明了临床放疗工作流程的许多好处。然而,快速变化的监管、研究和市场环境在选择和评估最合适的解决方案方面提出了挑战。为了支持临床采用人工智能自动分割系统,制定了选择标准建议,以便对供应商进行全面评估,不仅考虑原始性能,还考虑与人工智能临床部署独特相关的相关风险。对放射肿瘤学人工智能的伦理、标准和最佳实践方面的内部经验和关键工作机构进行了审查,以告知选择标准和评估策略。使用标准对六家供应商进行了回顾性分析,包括使用五个指标(Dice、Hausdorff距离、平均表面距离、表面Dice、添加路径长度)进行定量评估,涉及20名头颈部、20名胸部和19名男性骨盆患者,用于人工智能模型截至2023年3月。在7个类别中共确定了47项选择标准。回顾性分析显示,总体而言,没有一家供应商表现得非常好,在数据安全和责任、供应商支持工具以及透明度和道德方面的表现都很差。在原始性能方面,供应商差异很大,从优秀到差。随着新法规的生效以及人工智能自动分割系统的范围适应临床需求,确保安全、公平和透明的人工智能将持续存在。本文提供的选择和评估框架旨在通过探索临床相关因素的广度来促进用户的信心,以支持知情决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
8.40
自引率
4.50%
发文量
110
期刊最新文献
A new HCM heart sound classification method based on weighted bispectrum features. Estimation of dose to a bystander from F-18 FDG patients using Monte Carlo simulation in clinical exposure scenarios. Measurement and spectral analysis of medical shock wave parameters based on flexible PVDF sensors. Significance of gender, brain region and EEG band complexity analysis for Parkinson's disease classification using recurrence plots and machine learning algorithms. Autoencoder based data clustering for identifying anomalous repetitive hand movements, and behavioral transition patterns in children.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1