Alaa T Youssef, David Fronk, John Nicholas Grimes, Lina Cheuy, David B. Larson
{"title":"超越黑箱:通过 FDA 510(k) 途径实现放射人工智能/人工智能辅助医疗设备监管透明度的途径","authors":"Alaa T Youssef, David Fronk, John Nicholas Grimes, Lina Cheuy, David B. Larson","doi":"10.1101/2024.07.12.24309602","DOIUrl":null,"url":null,"abstract":"Background: The majority of AI/M-enabled software as a medical device (SaMD) has been cleared through the FDA 510(k) pathway, but with limited transparency on algorithm development details. Because algorithm quality depends on the quality of the training data and algorithmic input, this study aimed to assess the availability of algorithm development details in the 510(k) summaries of AI/ML-enabled SaMD. Then, clinical and/or technical equivalence between predicate generations was assessed by mapping the predicate lineages of all cleared computer-assisted detection (CAD) devices, to ensure equivalence in diagnostic function. Methods: The FDA public database was searched for CAD devices cleared through the 510(k) pathway. Details on algorithmic input, including annotation instructions and definition of ground truth, were extracted from summary statements, product webpages, and relevant publications. These findings were cross-referenced with the American College of Radiology, Data Science Institute AI Central database. Predicate lineages were also manually mapped through product numbers included within the 510(k) summaries.\nResults: In total, 98 CAD devices had been cleared at the time of this study, with the majority being computer-assisted triage (CADt) devices (67/98). Notably, none of the cleared CAD devices provided image annotation instructions in their summaries, and only one provided access to its training data. Similarly, more than half of the devices did not disclose how the ground truth was defined. Only 13 CAD devices were reported in peer-reviewed publications, and only two were evaluated in prospective studies. Significant deviations in clinical function were seen between cleared devices and their claimed predicate. Conclusion: The lack of imaging annotation instructions and signicant mismatches in clinical function between predicate generations raise concerns about whether substantial equivalence in the 510(k) pathway truly equates to equivalent diagnostic function. Avenues for greater transparency are needed to enable independent evaluations of safety and performance and promote trust in AI/ML-enabled devices.","PeriodicalId":501386,"journal":{"name":"medRxiv - Health Policy","volume":"6 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Beyond the Black Box: Avenues to Transparency in Regulating Radiological AI/ML-enabled SaMD via the FDA 510(k) Pathway\",\"authors\":\"Alaa T Youssef, David Fronk, John Nicholas Grimes, Lina Cheuy, David B. Larson\",\"doi\":\"10.1101/2024.07.12.24309602\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: The majority of AI/M-enabled software as a medical device (SaMD) has been cleared through the FDA 510(k) pathway, but with limited transparency on algorithm development details. Because algorithm quality depends on the quality of the training data and algorithmic input, this study aimed to assess the availability of algorithm development details in the 510(k) summaries of AI/ML-enabled SaMD. Then, clinical and/or technical equivalence between predicate generations was assessed by mapping the predicate lineages of all cleared computer-assisted detection (CAD) devices, to ensure equivalence in diagnostic function. Methods: The FDA public database was searched for CAD devices cleared through the 510(k) pathway. Details on algorithmic input, including annotation instructions and definition of ground truth, were extracted from summary statements, product webpages, and relevant publications. These findings were cross-referenced with the American College of Radiology, Data Science Institute AI Central database. Predicate lineages were also manually mapped through product numbers included within the 510(k) summaries.\\nResults: In total, 98 CAD devices had been cleared at the time of this study, with the majority being computer-assisted triage (CADt) devices (67/98). Notably, none of the cleared CAD devices provided image annotation instructions in their summaries, and only one provided access to its training data. Similarly, more than half of the devices did not disclose how the ground truth was defined. Only 13 CAD devices were reported in peer-reviewed publications, and only two were evaluated in prospective studies. Significant deviations in clinical function were seen between cleared devices and their claimed predicate. Conclusion: The lack of imaging annotation instructions and signicant mismatches in clinical function between predicate generations raise concerns about whether substantial equivalence in the 510(k) pathway truly equates to equivalent diagnostic function. Avenues for greater transparency are needed to enable independent evaluations of safety and performance and promote trust in AI/ML-enabled devices.\",\"PeriodicalId\":501386,\"journal\":{\"name\":\"medRxiv - Health Policy\",\"volume\":\"6 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv - Health Policy\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.07.12.24309602\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Health Policy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.07.12.24309602","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Beyond the Black Box: Avenues to Transparency in Regulating Radiological AI/ML-enabled SaMD via the FDA 510(k) Pathway
Background: The majority of AI/M-enabled software as a medical device (SaMD) has been cleared through the FDA 510(k) pathway, but with limited transparency on algorithm development details. Because algorithm quality depends on the quality of the training data and algorithmic input, this study aimed to assess the availability of algorithm development details in the 510(k) summaries of AI/ML-enabled SaMD. Then, clinical and/or technical equivalence between predicate generations was assessed by mapping the predicate lineages of all cleared computer-assisted detection (CAD) devices, to ensure equivalence in diagnostic function. Methods: The FDA public database was searched for CAD devices cleared through the 510(k) pathway. Details on algorithmic input, including annotation instructions and definition of ground truth, were extracted from summary statements, product webpages, and relevant publications. These findings were cross-referenced with the American College of Radiology, Data Science Institute AI Central database. Predicate lineages were also manually mapped through product numbers included within the 510(k) summaries.
Results: In total, 98 CAD devices had been cleared at the time of this study, with the majority being computer-assisted triage (CADt) devices (67/98). Notably, none of the cleared CAD devices provided image annotation instructions in their summaries, and only one provided access to its training data. Similarly, more than half of the devices did not disclose how the ground truth was defined. Only 13 CAD devices were reported in peer-reviewed publications, and only two were evaluated in prospective studies. Significant deviations in clinical function were seen between cleared devices and their claimed predicate. Conclusion: The lack of imaging annotation instructions and signicant mismatches in clinical function between predicate generations raise concerns about whether substantial equivalence in the 510(k) pathway truly equates to equivalent diagnostic function. Avenues for greater transparency are needed to enable independent evaluations of safety and performance and promote trust in AI/ML-enabled devices.