Investigation and evaluation of randomized controlled trials for interventions involving artificial intelligence

IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Intelligent medicine Pub Date : 2021-08-01 DOI:10.1016/j.imed.2021.04.006
Jianjian Wang , Shouyuan Wu , Qiangqiang Guo , Hui Lan , Estill Janne , Ling Wang , Juanjuan Zhang , Qi Wang , Yang Song , Nan Yang , Xufei Luo , Qi Zhou , Qianling Shi , Xuan Yu , Yanfang Ma , Joseph L. Mathew , Hyeong Sik Ahn , Myeong Soo Lee , Yaolong Chen
{"title":"Investigation and evaluation of randomized controlled trials for interventions involving artificial intelligence","authors":"Jianjian Wang ,&nbsp;Shouyuan Wu ,&nbsp;Qiangqiang Guo ,&nbsp;Hui Lan ,&nbsp;Estill Janne ,&nbsp;Ling Wang ,&nbsp;Juanjuan Zhang ,&nbsp;Qi Wang ,&nbsp;Yang Song ,&nbsp;Nan Yang ,&nbsp;Xufei Luo ,&nbsp;Qi Zhou ,&nbsp;Qianling Shi ,&nbsp;Xuan Yu ,&nbsp;Yanfang Ma ,&nbsp;Joseph L. Mathew ,&nbsp;Hyeong Sik Ahn ,&nbsp;Myeong Soo Lee ,&nbsp;Yaolong Chen","doi":"10.1016/j.imed.2021.04.006","DOIUrl":null,"url":null,"abstract":"<div><p><strong>Objective</strong> Complete and transparent reporting is of critical importance for randomized controlled trials (RCTs). The present study aimed to determine the reporting quality and methodological quality of RCTs for interventions involving artificial intelligence (AI) and their protocols.</p><p><strong>Methods</strong> We searched MEDLINE (via PubMed), Embase, Web of Science, CBMdisc, Wanfang Data, and CNKI from January 1, 2016, to November 11, 2020, to collect RCTs involving AI. We also extracted the protocol of each included RCT if it could be obtained. CONSORT-AI (Consolidated Standards of Reporting Trials–Artificial Intelligence) statement and Cochrane Collaboration's tool for assessing risk of bias (ROB) were used to evaluate the reporting quality and methodological quality, respectively, and SPIRIT-AI (The Standard Protocol Items: Recommendations for Interventional Trials–Artificial Intelligence) statement was used to evaluate the reporting quality of the protocols. The associations of the reporting rate of CONSORT-AI with the publication year, journal's impact factor (IF), number of authors, sample size, and first author's country were analyzed univariately using Pearson's chi-squared test, or Fisher's exact test if the expected values in any of the cells were below 5. The compliance of the retrieved protocols to SPIRIT-AI was presented descriptively.</p><p><strong>Results</strong> Overall, 29 RCTs and three protocols were considered eligible. The CONSORT-AI items “title and abstract” and “interpretation of results” were reported by all RCTs, with the items with the lowest reporting rates being “funding” (0), “implementation” (3.5%), and “harms” (3.5%). The risk of bias was high in 13 (44.8%) RCTs and not clear in 15 (51.7%) RCTs. Only one RCT (3.5%) had a low risk of bias. The compliance was not significantly different in terms of the publication year, journal's IF, number of authors, sample size, or first author's country. Ten of the 35 SPIRIT-AI items (funding, participant timeline, allocation concealment mechanism, implementation, data management, auditing, declaration of interests, access to data, informed consent materials and biological specimens) were not reported by any of the three protocols.</p><p><strong>Conclusions</strong> The reporting and methodological quality of RCTs involving AI need to be improved. Because of the limited availability of protocols, their quality could not be fully judged. Following the CONSORT-AI and SPIRIT-AI statements and with appropriate guidance on the risk of bias when designing and reporting AI-related RCTs can promote standardization and transparency.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"1 2","pages":"Pages 61-69"},"PeriodicalIF":4.4000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.imed.2021.04.006","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent medicine","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266710262100019X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 2

Abstract

Objective Complete and transparent reporting is of critical importance for randomized controlled trials (RCTs). The present study aimed to determine the reporting quality and methodological quality of RCTs for interventions involving artificial intelligence (AI) and their protocols.

Methods We searched MEDLINE (via PubMed), Embase, Web of Science, CBMdisc, Wanfang Data, and CNKI from January 1, 2016, to November 11, 2020, to collect RCTs involving AI. We also extracted the protocol of each included RCT if it could be obtained. CONSORT-AI (Consolidated Standards of Reporting Trials–Artificial Intelligence) statement and Cochrane Collaboration's tool for assessing risk of bias (ROB) were used to evaluate the reporting quality and methodological quality, respectively, and SPIRIT-AI (The Standard Protocol Items: Recommendations for Interventional Trials–Artificial Intelligence) statement was used to evaluate the reporting quality of the protocols. The associations of the reporting rate of CONSORT-AI with the publication year, journal's impact factor (IF), number of authors, sample size, and first author's country were analyzed univariately using Pearson's chi-squared test, or Fisher's exact test if the expected values in any of the cells were below 5. The compliance of the retrieved protocols to SPIRIT-AI was presented descriptively.

Results Overall, 29 RCTs and three protocols were considered eligible. The CONSORT-AI items “title and abstract” and “interpretation of results” were reported by all RCTs, with the items with the lowest reporting rates being “funding” (0), “implementation” (3.5%), and “harms” (3.5%). The risk of bias was high in 13 (44.8%) RCTs and not clear in 15 (51.7%) RCTs. Only one RCT (3.5%) had a low risk of bias. The compliance was not significantly different in terms of the publication year, journal's IF, number of authors, sample size, or first author's country. Ten of the 35 SPIRIT-AI items (funding, participant timeline, allocation concealment mechanism, implementation, data management, auditing, declaration of interests, access to data, informed consent materials and biological specimens) were not reported by any of the three protocols.

Conclusions The reporting and methodological quality of RCTs involving AI need to be improved. Because of the limited availability of protocols, their quality could not be fully judged. Following the CONSORT-AI and SPIRIT-AI statements and with appropriate guidance on the risk of bias when designing and reporting AI-related RCTs can promote standardization and transparency.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
涉及人工智能干预的随机对照试验的调查与评价
目的完整、透明的报告对随机对照试验(RCTs)至关重要。本研究旨在确定涉及人工智能(AI)及其协议的干预措施的随机对照试验的报告质量和方法质量。方法从2016年1月1日至2020年11月11日检索MEDLINE (PubMed)、Embase、Web of Science、CBMdisc、万方数据、中国知网,收集涉及人工智能的随机对照试验。我们还提取了每个纳入的RCT的方案,如果可以得到的话。采用consortium - ai(综合试验报告标准-人工智能)声明和Cochrane协作的偏倚风险评估工具(ROB)分别评估报告质量和方法质量,采用SPIRIT-AI(标准方案项目:介入性试验建议-人工智能)声明评估方案的报告质量。concont - ai报告率与出版年份、期刊影响因子(IF)、作者数量、样本量和第一作者所在国家的关系采用单因素Pearson卡方检验,如果任何单元格中的预测值低于5,则使用Fisher精确检验。描述了检索到的协议对SPIRIT-AI的遵从性。总的来说,29项随机对照试验和3项方案被认为是合格的。所有rct均报告了congo - ai项目“标题和摘要”和“结果解释”,报告率最低的项目是“资助”(0)、“实施”(3.5%)和“危害”(3.5%)。13项(44.8%)rct偏倚风险高,15项(51.7%)rct偏倚风险不明确。只有一项RCT(3.5%)具有低偏倚风险。合规性在出版年份、期刊的影响因子、作者数量、样本量或第一作者的国家方面没有显著差异。在35个SPIRIT-AI项目中,有10个项目(经费、参与者时间表、拨款隐藏机制、实施、数据管理、审计、利益申报、数据获取、知情同意材料和生物标本)未被三个方案中的任何一个报告。结论人工智能随机对照试验的报告质量和方法学质量有待提高。由于方案的可得性有限,因此无法充分判断其质量。在设计和报告与人工智能相关的随机对照试验时,遵循consortium - ai和SPIRIT-AI声明,并对偏倚风险进行适当的指导,可以促进标准化和透明度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Intelligent medicine
Intelligent medicine Surgery, Radiology and Imaging, Artificial Intelligence, Biomedical Engineering
CiteScore
5.20
自引率
0.00%
发文量
19
期刊最新文献
Impact of data balancing a multiclass dataset before the creation of association rules to study bacterial vaginosis Neuropsychological detection and prediction using machine learning algorithms: a comprehensive review Improved neurological diagnoses and treatment strategies via automated human brain tissue segmentation from clinical magnetic resonance imaging Increasing the accuracy and reproducibility of positron emission tomography radiomics for predicting pelvic lymph node metastasis in patients with cervical cancer using 3D local binary pattern-based texture features A clinical decision support system using rough set theory and machine learning for disease prediction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1