Investigation and evaluation of randomized controlled trials for interventions involving artificial intelligence

IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Intelligent medicine Pub Date : 2021-08-01 DOI:10.1016/j.imed.2021.04.006

Jianjian Wang , Shouyuan Wu , Qiangqiang Guo , Hui Lan , Estill Janne , Ling Wang , Juanjuan Zhang , Qi Wang , Yang Song , Nan Yang , Xufei Luo , Qi Zhou , Qianling Shi , Xuan Yu , Yanfang Ma , Joseph L. Mathew , Hyeong Sik Ahn , Myeong Soo Lee , Yaolong Chen

{"title":"Investigation and evaluation of randomized controlled trials for interventions involving artificial intelligence","authors":"Jianjian Wang , Shouyuan Wu , Qiangqiang Guo , Hui Lan , Estill Janne , Ling Wang , Juanjuan Zhang , Qi Wang , Yang Song , Nan Yang , Xufei Luo , Qi Zhou , Qianling Shi , Xuan Yu , Yanfang Ma , Joseph L. Mathew , Hyeong Sik Ahn , Myeong Soo Lee , Yaolong Chen","doi":"10.1016/j.imed.2021.04.006","DOIUrl":null,"url":null,"abstract":"<div>Objective Complete and transparent reporting is of critical importance for randomized controlled trials (RCTs). The present study aimed to determine the reporting quality and methodological quality of RCTs for interventions involving artificial intelligence (AI) and their protocols.Methods We searched MEDLINE (via PubMed), Embase, Web of Science, CBMdisc, Wanfang Data, and CNKI from January 1, 2016, to November 11, 2020, to collect RCTs involving AI. We also extracted the protocol of each included RCT if it could be obtained. CONSORT-AI (Consolidated Standards of Reporting Trials–Artificial Intelligence) statement and Cochrane Collaboration's tool for assessing risk of bias (ROB) were used to evaluate the reporting quality and methodological quality, respectively, and SPIRIT-AI (The Standard Protocol Items: Recommendations for Interventional Trials–Artificial Intelligence) statement was used to evaluate the reporting quality of the protocols. The associations of the reporting rate of CONSORT-AI with the publication year, journal's impact factor (IF), number of authors, sample size, and first author's country were analyzed univariately using Pearson's chi-squared test, or Fisher's exact test if the expected values in any of the cells were below 5. The compliance of the retrieved protocols to SPIRIT-AI was presented descriptively.Results Overall, 29 RCTs and three protocols were considered eligible. The CONSORT-AI items “title and abstract” and “interpretation of results” were reported by all RCTs, with the items with the lowest reporting rates being “funding” (0), “implementation” (3.5%), and “harms” (3.5%). The risk of bias was high in 13 (44.8%) RCTs and not clear in 15 (51.7%) RCTs. Only one RCT (3.5%) had a low risk of bias. The compliance was not significantly different in terms of the publication year, journal's IF, number of authors, sample size, or first author's country. Ten of the 35 SPIRIT-AI items (funding, participant timeline, allocation concealment mechanism, implementation, data management, auditing, declaration of interests, access to data, informed consent materials and biological specimens) were not reported by any of the three protocols.Conclusions The reporting and methodological quality of RCTs involving AI need to be improved. Because of the limited availability of protocols, their quality could not be fully judged. Following the CONSORT-AI and SPIRIT-AI statements and with appropriate guidance on the risk of bias when designing and reporting AI-related RCTs can promote standardization and transparency.</div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"1 2","pages":"Pages 61-69"},"PeriodicalIF":4.4000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.imed.2021.04.006","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent medicine","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266710262100019X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 2

Abstract

Objective Complete and transparent reporting is of critical importance for randomized controlled trials (RCTs). The present study aimed to determine the reporting quality and methodological quality of RCTs for interventions involving artificial intelligence (AI) and their protocols.

Methods We searched MEDLINE (via PubMed), Embase, Web of Science, CBMdisc, Wanfang Data, and CNKI from January 1, 2016, to November 11, 2020, to collect RCTs involving AI. We also extracted the protocol of each included RCT if it could be obtained. CONSORT-AI (Consolidated Standards of Reporting Trials–Artificial Intelligence) statement and Cochrane Collaboration's tool for assessing risk of bias (ROB) were used to evaluate the reporting quality and methodological quality, respectively, and SPIRIT-AI (The Standard Protocol Items: Recommendations for Interventional Trials–Artificial Intelligence) statement was used to evaluate the reporting quality of the protocols. The associations of the reporting rate of CONSORT-AI with the publication year, journal's impact factor (IF), number of authors, sample size, and first author's country were analyzed univariately using Pearson's chi-squared test, or Fisher's exact test if the expected values in any of the cells were below 5. The compliance of the retrieved protocols to SPIRIT-AI was presented descriptively.

Results Overall, 29 RCTs and three protocols were considered eligible. The CONSORT-AI items “title and abstract” and “interpretation of results” were reported by all RCTs, with the items with the lowest reporting rates being “funding” (0), “implementation” (3.5%), and “harms” (3.5%). The risk of bias was high in 13 (44.8%) RCTs and not clear in 15 (51.7%) RCTs. Only one RCT (3.5%) had a low risk of bias. The compliance was not significantly different in terms of the publication year, journal's IF, number of authors, sample size, or first author's country. Ten of the 35 SPIRIT-AI items (funding, participant timeline, allocation concealment mechanism, implementation, data management, auditing, declaration of interests, access to data, informed consent materials and biological specimens) were not reported by any of the three protocols.

Conclusions The reporting and methodological quality of RCTs involving AI need to be improved. Because of the limited availability of protocols, their quality could not be fully judged. Following the CONSORT-AI and SPIRIT-AI statements and with appropriate guidance on the risk of bias when designing and reporting AI-related RCTs can promote standardization and transparency.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

涉及人工智能干预的随机对照试验的调查与评价

目的完整、透明的报告对随机对照试验(RCTs)至关重要。本研究旨在确定涉及人工智能(AI)及其协议的干预措施的随机对照试验的报告质量和方法质量。方法从2016年1月1日至2020年11月11日检索MEDLINE (PubMed)、Embase、Web of Science、CBMdisc、万方数据、中国知网，收集涉及人工智能的随机对照试验。我们还提取了每个纳入的RCT的方案，如果可以得到的话。采用consortium - ai(综合试验报告标准-人工智能)声明和Cochrane协作的偏倚风险评估工具(ROB)分别评估报告质量和方法质量，采用SPIRIT-AI(标准方案项目:介入性试验建议-人工智能)声明评估方案的报告质量。concont - ai报告率与出版年份、期刊影响因子(IF)、作者数量、样本量和第一作者所在国家的关系采用单因素Pearson卡方检验，如果任何单元格中的预测值低于5，则使用Fisher精确检验。描述了检索到的协议对SPIRIT-AI的遵从性。总的来说，29项随机对照试验和3项方案被认为是合格的。所有rct均报告了congo - ai项目“标题和摘要”和“结果解释”，报告率最低的项目是“资助”(0)、“实施”(3.5%)和“危害”(3.5%)。13项(44.8%)rct偏倚风险高，15项(51.7%)rct偏倚风险不明确。只有一项RCT(3.5%)具有低偏倚风险。合规性在出版年份、期刊的影响因子、作者数量、样本量或第一作者的国家方面没有显著差异。在35个SPIRIT-AI项目中，有10个项目(经费、参与者时间表、拨款隐藏机制、实施、数据管理、审计、利益申报、数据获取、知情同意材料和生物标本)未被三个方案中的任何一个报告。结论人工智能随机对照试验的报告质量和方法学质量有待提高。由于方案的可得性有限，因此无法充分判断其质量。在设计和报告与人工智能相关的随机对照试验时，遵循consortium - ai和SPIRIT-AI声明，并对偏倚风险进行适当的指导，可以促进标准化和透明度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Intelligent medicine Surgery, Radiology and Imaging, Artificial Intelligence, Biomedical Engineering

CiteScore

5.20

自引率

0.00%

发文量