Is ChatGPT Ready for Public Use in Organ-Specific Drug Toxicity Research?

IF 7.5 2区 医学 Q1 PHARMACOLOGY & PHARMACY Drug Discovery Today Pub Date : 2025-02-01 Epub Date: 2025-01-20 DOI:10.1016/j.drudis.2025.104297
Skylar Connor , Leihong Wu , Ruth A. Roberts , Weida Tong
{"title":"Is ChatGPT Ready for Public Use in Organ-Specific Drug Toxicity Research?","authors":"Skylar Connor ,&nbsp;Leihong Wu ,&nbsp;Ruth A. Roberts ,&nbsp;Weida Tong","doi":"10.1016/j.drudis.2025.104297","DOIUrl":null,"url":null,"abstract":"<div><div>The growing impact of large language models (LLMs), such as ChatGPT, prompts questions about the reliability of their application in public health. We compared drug toxicity assessments by GPT-4 for liver, heart, and kidney against expert assessments using US Food and Drug Administration (FDA) drug-labeling documents. Two approaches were assessed: a ‘General prompt’, mimicking the conversational style used by the general public, and an ‘Expert prompt’ engineered to represent an approach of an expert. The Expert prompt achieved higher accuracy (64–75%) compared with the General prompt (48–72%), but the overall performance was moderate, indicating that caution is needed when using GPT-4 for public health. To improve reliability, an advanced framework,such as Retrieval Augmented Generation (RAG), might be required to leverage knowledge embedded in GPT-4.</div></div>","PeriodicalId":301,"journal":{"name":"Drug Discovery Today","volume":"30 2","pages":"Article 104297"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Drug Discovery Today","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1359644625000108","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/20 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0

Abstract

The growing impact of large language models (LLMs), such as ChatGPT, prompts questions about the reliability of their application in public health. We compared drug toxicity assessments by GPT-4 for liver, heart, and kidney against expert assessments using US Food and Drug Administration (FDA) drug-labeling documents. Two approaches were assessed: a ‘General prompt’, mimicking the conversational style used by the general public, and an ‘Expert prompt’ engineered to represent an approach of an expert. The Expert prompt achieved higher accuracy (64–75%) compared with the General prompt (48–72%), but the overall performance was moderate, indicating that caution is needed when using GPT-4 for public health. To improve reliability, an advanced framework,such as Retrieval Augmented Generation (RAG), might be required to leverage knowledge embedded in GPT-4.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ChatGPT是否已准备好在器官特异性药物毒性研究中公开使用?
大型语言模型(llm)的影响越来越大,比如ChatGPT,这引发了人们对它们在公共卫生领域应用的可靠性的质疑。我们比较了GPT-4对肝脏、心脏和肾脏的药物毒性评估与使用美国食品和药物管理局(FDA)药物标签文件的专家评估。评估了两种方法:一种是“一般提示”,模仿公众使用的对话风格,另一种是“专家提示”,旨在代表专家的方法。与一般提示(48-72%)相比,专家提示实现了更高的准确性(64-75%),但总体表现一般,表明在将GPT-4用于公共卫生时需要谨慎。为了提高可靠性,可能需要一个高级框架,例如检索增强生成(Retrieval Augmented Generation, RAG)来利用GPT-4中嵌入的知识。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Drug Discovery Today
Drug Discovery Today 医学-药学
CiteScore
14.80
自引率
2.70%
发文量
293
审稿时长
6 months
期刊介绍: Drug Discovery Today delivers informed and highly current reviews for the discovery community. The magazine addresses not only the rapid scientific developments in drug discovery associated technologies but also the management, commercial and regulatory issues that increasingly play a part in how R&D is planned, structured and executed. Features include comment by international experts, news and analysis of important developments, reviews of key scientific and strategic issues, overviews of recent progress in specific therapeutic areas and conference reports.
期刊最新文献
Interference of long noncoding RNAs in glioblastoma multiforme: key targets for next generation therapeutics Ferroptosis inhibitors: latest advances and therapeutic potential for pulmonary arterial hypertension Artificial-intelligence-guided autophagy modulation and nanomedicine design for precision photodynamic cancer therapy Antibody–drug conjugates: from efficacy testing to bioassay evaluation in modern drug development Digital twin approaches for interpretable side effect prediction in drug discovery
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1