Artificial Intelligence to Automate Network Meta-Analyses: Four Case Studies to Evaluate the Potential Application of Large Language Models.

IF 2 Q2 ECONOMICS PharmacoEconomics Open Pub Date : 2024-03-01 Epub Date: 2024-02-10 DOI:10.1007/s41669-024-00476-9
Tim Reason, Emma Benbow, Julia Langham, Andy Gimblett, Sven L Klijn, Bill Malcolm
{"title":"Artificial Intelligence to Automate Network Meta-Analyses: Four Case Studies to Evaluate the Potential Application of Large Language Models.","authors":"Tim Reason, Emma Benbow, Julia Langham, Andy Gimblett, Sven L Klijn, Bill Malcolm","doi":"10.1007/s41669-024-00476-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The emergence of artificial intelligence, capable of human-level performance on some tasks, presents an opportunity to revolutionise development of systematic reviews and network meta-analyses (NMAs). In this pilot study, we aim to assess use of a large-language model (LLM, Generative Pre-trained Transformer 4 [GPT-4]) to automatically extract data from publications, write an R script to conduct an NMA and interpret the results.</p><p><strong>Methods: </strong>We considered four case studies involving binary and time-to-event outcomes in two disease areas, for which an NMA had previously been conducted manually. For each case study, a Python script was developed that communicated with the LLM via application programming interface (API) calls. The LLM was prompted to extract relevant data from publications, to create an R script to be used to run the NMA and then to produce a small report describing the analysis.</p><p><strong>Results: </strong>The LLM had a > 99% success rate of accurately extracting data across 20 runs for each case study and could generate R scripts that could be run end-to-end without human input. It also produced good quality reports describing the disease area, analysis conducted, results obtained and a correct interpretation of the results.</p><p><strong>Conclusions: </strong>This study provides a promising indication of the feasibility of using current generation LLMs to automate data extraction, code generation and NMA result interpretation, which could result in significant time savings and reduce human error. This is provided that routine technical checks are performed, as recommend for human-conducted analyses. Whilst not currently 100% consistent, LLMs are likely to improve with time.</p>","PeriodicalId":19770,"journal":{"name":"PharmacoEconomics Open","volume":null,"pages":null},"PeriodicalIF":2.0000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10884375/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PharmacoEconomics Open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s41669-024-00476-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/10 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The emergence of artificial intelligence, capable of human-level performance on some tasks, presents an opportunity to revolutionise development of systematic reviews and network meta-analyses (NMAs). In this pilot study, we aim to assess use of a large-language model (LLM, Generative Pre-trained Transformer 4 [GPT-4]) to automatically extract data from publications, write an R script to conduct an NMA and interpret the results.

Methods: We considered four case studies involving binary and time-to-event outcomes in two disease areas, for which an NMA had previously been conducted manually. For each case study, a Python script was developed that communicated with the LLM via application programming interface (API) calls. The LLM was prompted to extract relevant data from publications, to create an R script to be used to run the NMA and then to produce a small report describing the analysis.

Results: The LLM had a > 99% success rate of accurately extracting data across 20 runs for each case study and could generate R scripts that could be run end-to-end without human input. It also produced good quality reports describing the disease area, analysis conducted, results obtained and a correct interpretation of the results.

Conclusions: This study provides a promising indication of the feasibility of using current generation LLMs to automate data extraction, code generation and NMA result interpretation, which could result in significant time savings and reduce human error. This is provided that routine technical checks are performed, as recommend for human-conducted analyses. Whilst not currently 100% consistent, LLMs are likely to improve with time.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
人工智能自动化网络元分析:评估大型语言模型潜在应用的四项案例研究》。
背景:人工智能的出现为系统综述和网络荟萃分析(NMA)的发展带来了革命性的机遇,因为人工智能能够在某些任务上达到人类的水平。在这项试验性研究中,我们旨在评估使用大型语言模型(LLM,生成预训练转换器 4 [GPT-4])自动从出版物中提取数据、编写 R 脚本以进行 NMA 并解释结果的情况:我们考虑了两个疾病领域中涉及二元和时间到事件结果的四项案例研究,之前已对这些案例研究进行了人工NMA分析。针对每个案例研究,我们都开发了一个 Python 脚本,通过调用应用编程接口 (API) 与 LLM 通信。提示 LLM 从出版物中提取相关数据,创建用于运行 NMA 的 R 脚本,然后生成一份描述分析结果的小报告:LLM 在每个案例研究的 20 次运行中准确提取数据的成功率大于 99%,并能生成无需人工输入即可端到端运行的 R 脚本。它还能生成高质量的报告,描述疾病领域、进行的分析、获得的结果以及对结果的正确解释:这项研究很好地说明了使用当前一代 LLM 自动进行数据提取、代码生成和 NMA 结果解释的可行性,这将大大节省时间并减少人为错误。前提是按照人工分析的建议进行常规技术检查。尽管目前 LLMs 还没有达到 100% 的一致性,但随着时间的推移,LLMs 很可能会得到改善。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
3.50
自引率
0.00%
发文量
64
审稿时长
8 weeks
期刊介绍: PharmacoEconomics - Open focuses on applied research on the economic implications and health outcomes associated with drugs, devices and other healthcare interventions. The journal includes, but is not limited to, the following research areas:Economic analysis of healthcare interventionsHealth outcomes researchCost-of-illness studiesQuality-of-life studiesAdditional digital features (including animated abstracts, video abstracts, slide decks, audio slides, instructional videos, infographics, podcasts and animations) can be published with articles; these are designed to increase the visibility, readership and educational value of the journal’s content. In addition, articles published in PharmacoEconomics -Open may be accompanied by plain language summaries to assist readers who have some knowledge of, but not in-depth expertise in, the area to understand important medical advances.All manuscripts are subject to peer review by international experts. Letters to the Editor are welcomed and will be considered for publication.
期刊最新文献
Costs of Adverse Events in Patients with Advanced or Metastatic Renal Cell Carcinoma with First-Line Treatment. Digital Versus Paper-Based Consent from the UK NHS Perspective: A Micro-costing Analysis. Correction: Cost-Effectiveness of Dupilumab and Oral Janus Kinase Inhibitors for the Treatment of Moderate-to-Severe Atopic Dermatitis in Singapore. Publisher Correction: Health Technology Assessment Reports for Non-Oncology Medications in Canada from 2018 to 2022: Methodological Critiques on Manufacturers' Submissions and a Comparison Between Manufacturer and Canadian Agency for Drugs and Technologies in Health (CADTH) Analyses. Comparison of Two Financial Incentives to Encourage the Use of Adalimumab Biosimilars: Results of a French Experiment Close to Clinicians.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1