在预先确定的变更控制计划下优化全新的基于人工智能的医疗设备:提高检测或排除儿童自闭症的能力

Dennis P. Wall , Stuart Liu-Mayo , Carmela Salomon , Jennifer Shannon , Sharief Taraman
{"title":"在预先确定的变更控制计划下优化全新的基于人工智能的医疗设备:提高检测或排除儿童自闭症的能力","authors":"Dennis P. Wall ,&nbsp;Stuart Liu-Mayo ,&nbsp;Carmela Salomon ,&nbsp;Jennifer Shannon ,&nbsp;Sharief Taraman","doi":"10.1016/j.ibmed.2023.100102","DOIUrl":null,"url":null,"abstract":"<div><p>A growing number of artificial intelligence-based medical devices are receiving clearance from the Food and Drug Administration (FDA). Debate has arisen about best practices for the regulation and safe oversight of such devices whose capabilities, if “unlocked”, include iterative learning and adaptation with exposure to new data. One regulatory mechanism proposed by the FDA is the predetermined change control plan (PCCP). This analysis provides what we believe would be the first example of how a PCCP has been leveraged to improve the performance of a de novo autism diagnostic device in practice. Following the PCCP's model update procedures included in the marketing authorization of the first generation of the device (“algorithm V1”), we conducted an algorithmic threshold optimization procedure to improve the device's ability to detect or rule out autism in children ages 18–72 months without changing the accuracy or intended use of the device. Decision threshold optimization was achieved using a repeated train/test validation procedure on a dataset of 722 children with concern for developmental delay, aged 18–72 months (28% autism, 22% neurotypical, 50% other developmental delay, mean age 3.6 years, 39% female). In 1000 repeats, 70% of samples were selected for threshold optimization and 30% for evaluation, ensuring that no training data appeared in the test set. Out-of-sample performance was estimated by evaluating the selected threshold pair on the test set and comparing the performance metrics of the new pair to the corresponding V1 metrics on the same test set. The device, with optimized decision thresholds, produced a determinate output for 66.5% (95% CI, 62.5–71.0) of children. Positive Predictive Value (PPV) and Negative Predictive Value (PPV) were 87.5% (95% CI, 82.5–96.7) and 95.6% (95% CI, 93.7–97.9) respectively. Threshold optimization improved the device's ability to accurately detect or rule out autism in a greater proportion of children. Given the current waitlist crisis for autism evaluations in the United States, the potential increase in coverage offered by the optimized thresholds is promising and emphasizes the value of regulatory mechanisms that allow software as medical devices to adapt safely and appropriately given real world data.</p></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"8 ","pages":"Article 100102"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing a de novo artificial intelligence-based medical device under a predetermined change control plan: Improved ability to detect or rule out pediatric autism\",\"authors\":\"Dennis P. Wall ,&nbsp;Stuart Liu-Mayo ,&nbsp;Carmela Salomon ,&nbsp;Jennifer Shannon ,&nbsp;Sharief Taraman\",\"doi\":\"10.1016/j.ibmed.2023.100102\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>A growing number of artificial intelligence-based medical devices are receiving clearance from the Food and Drug Administration (FDA). Debate has arisen about best practices for the regulation and safe oversight of such devices whose capabilities, if “unlocked”, include iterative learning and adaptation with exposure to new data. One regulatory mechanism proposed by the FDA is the predetermined change control plan (PCCP). This analysis provides what we believe would be the first example of how a PCCP has been leveraged to improve the performance of a de novo autism diagnostic device in practice. Following the PCCP's model update procedures included in the marketing authorization of the first generation of the device (“algorithm V1”), we conducted an algorithmic threshold optimization procedure to improve the device's ability to detect or rule out autism in children ages 18–72 months without changing the accuracy or intended use of the device. Decision threshold optimization was achieved using a repeated train/test validation procedure on a dataset of 722 children with concern for developmental delay, aged 18–72 months (28% autism, 22% neurotypical, 50% other developmental delay, mean age 3.6 years, 39% female). In 1000 repeats, 70% of samples were selected for threshold optimization and 30% for evaluation, ensuring that no training data appeared in the test set. Out-of-sample performance was estimated by evaluating the selected threshold pair on the test set and comparing the performance metrics of the new pair to the corresponding V1 metrics on the same test set. The device, with optimized decision thresholds, produced a determinate output for 66.5% (95% CI, 62.5–71.0) of children. Positive Predictive Value (PPV) and Negative Predictive Value (PPV) were 87.5% (95% CI, 82.5–96.7) and 95.6% (95% CI, 93.7–97.9) respectively. Threshold optimization improved the device's ability to accurately detect or rule out autism in a greater proportion of children. Given the current waitlist crisis for autism evaluations in the United States, the potential increase in coverage offered by the optimized thresholds is promising and emphasizes the value of regulatory mechanisms that allow software as medical devices to adapt safely and appropriately given real world data.</p></div>\",\"PeriodicalId\":73399,\"journal\":{\"name\":\"Intelligence-based medicine\",\"volume\":\"8 \",\"pages\":\"Article 100102\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Intelligence-based medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666521223000169\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligence-based medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666521223000169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

越来越多的基于人工智能的医疗设备正在获得美国食品药品监督管理局(FDA)的批准。关于监管和安全监督此类设备的最佳实践,人们展开了争论,这些设备的功能如果“解锁”,包括迭代学习和适应新数据。美国食品药品监督管理局提出的一种监管机制是预先确定的变更控制计划(PCCP)。这项分析提供了我们认为是第一个在实践中如何利用PCCP来提高新自闭症诊断设备性能的例子。根据第一代设备营销授权中包含的PCCP模型更新程序(“算法V1”),我们进行了算法阈值优化程序,以提高设备在不改变设备准确性或预期用途的情况下检测或排除18-72个月儿童自闭症的能力。决策阈值优化是在722名18-22个月的发育迟缓儿童(28%为自闭症,22%为神经典型,50%为其他发育迟缓,平均年龄3.6岁,39%为女性)的数据集上使用重复训练/测试验证程序实现的。在1000次重复中,选择70%的样本进行阈值优化,30%进行评估,确保测试集中没有出现训练数据。通过评估测试集上选择的阈值对并将新对的性能度量与同一测试集上对应的V1度量进行比较来估计样本外性能。该设备具有优化的决策阈值,为66.5%(95%置信区间,62.5–71.0)的儿童产生了确定的输出。阳性预测值(PPV)和阴性预测值(PPV)分别为87.5%(95%CI,82.5–96.7)和95.6%(95%CI,93.7–97.9)。阈值优化提高了该设备在更大比例的儿童中准确检测或排除自闭症的能力。鉴于目前美国自闭症评估的等待名单危机,优化阈值提供的覆盖范围的潜在增加是有希望的,并强调了监管机制的价值,该机制允许作为医疗设备的软件在给定真实世界数据的情况下安全、适当地适应。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Optimizing a de novo artificial intelligence-based medical device under a predetermined change control plan: Improved ability to detect or rule out pediatric autism

A growing number of artificial intelligence-based medical devices are receiving clearance from the Food and Drug Administration (FDA). Debate has arisen about best practices for the regulation and safe oversight of such devices whose capabilities, if “unlocked”, include iterative learning and adaptation with exposure to new data. One regulatory mechanism proposed by the FDA is the predetermined change control plan (PCCP). This analysis provides what we believe would be the first example of how a PCCP has been leveraged to improve the performance of a de novo autism diagnostic device in practice. Following the PCCP's model update procedures included in the marketing authorization of the first generation of the device (“algorithm V1”), we conducted an algorithmic threshold optimization procedure to improve the device's ability to detect or rule out autism in children ages 18–72 months without changing the accuracy or intended use of the device. Decision threshold optimization was achieved using a repeated train/test validation procedure on a dataset of 722 children with concern for developmental delay, aged 18–72 months (28% autism, 22% neurotypical, 50% other developmental delay, mean age 3.6 years, 39% female). In 1000 repeats, 70% of samples were selected for threshold optimization and 30% for evaluation, ensuring that no training data appeared in the test set. Out-of-sample performance was estimated by evaluating the selected threshold pair on the test set and comparing the performance metrics of the new pair to the corresponding V1 metrics on the same test set. The device, with optimized decision thresholds, produced a determinate output for 66.5% (95% CI, 62.5–71.0) of children. Positive Predictive Value (PPV) and Negative Predictive Value (PPV) were 87.5% (95% CI, 82.5–96.7) and 95.6% (95% CI, 93.7–97.9) respectively. Threshold optimization improved the device's ability to accurately detect or rule out autism in a greater proportion of children. Given the current waitlist crisis for autism evaluations in the United States, the potential increase in coverage offered by the optimized thresholds is promising and emphasizes the value of regulatory mechanisms that allow software as medical devices to adapt safely and appropriately given real world data.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Intelligence-based medicine
Intelligence-based medicine Health Informatics
CiteScore
5.00
自引率
0.00%
发文量
0
审稿时长
187 days
期刊最新文献
Artificial intelligence in child development monitoring: A systematic review on usage, outcomes and acceptance Automatic characterization of cerebral MRI images for the detection of autism spectrum disorders DOTnet 2.0: Deep learning network for diffuse optical tomography image reconstruction Artificial intelligence in child development monitoring: A systematic review on usage, outcomes and acceptance Clustering polycystic ovary syndrome laboratory results extracted from a large internet forum with machine learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1