Is ChatGPT Ready for Public Use in Organ-Specific Drug Toxicity Research?

IF 7.5 2区医学 Q1 PHARMACOLOGY & PHARMACY Drug Discovery Today Pub Date : 2025-02-01 Epub Date: 2025-01-20 DOI:10.1016/j.drudis.2025.104297

Skylar Connor , Leihong Wu , Ruth A. Roberts , Weida Tong

引用次数: 0

Abstract

The growing impact of large language models (LLMs), such as ChatGPT, prompts questions about the reliability of their application in public health. We compared drug toxicity assessments by GPT-4 for liver, heart, and kidney against expert assessments using US Food and Drug Administration (FDA) drug-labeling documents. Two approaches were assessed: a ‘General prompt’, mimicking the conversational style used by the general public, and an ‘Expert prompt’ engineered to represent an approach of an expert. The Expert prompt achieved higher accuracy (64–75%) compared with the General prompt (48–72%), but the overall performance was moderate, indicating that caution is needed when using GPT-4 for public health. To improve reliability, an advanced framework,such as Retrieval Augmented Generation (RAG), might be required to leverage knowledge embedded in GPT-4.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

ChatGPT是否已准备好在器官特异性药物毒性研究中公开使用？

大型语言模型（llm）的影响越来越大，比如ChatGPT，这引发了人们对它们在公共卫生领域应用的可靠性的质疑。我们比较了GPT-4对肝脏、心脏和肾脏的药物毒性评估与使用美国食品和药物管理局（FDA）药物标签文件的专家评估。评估了两种方法：一种是“一般提示”，模仿公众使用的对话风格，另一种是“专家提示”，旨在代表专家的方法。与一般提示（48-72%）相比，专家提示实现了更高的准确性（64-75%），但总体表现一般，表明在将GPT-4用于公共卫生时需要谨慎。为了提高可靠性，可能需要一个高级框架，例如检索增强生成（Retrieval Augmented Generation， RAG）来利用GPT-4中嵌入的知识。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Drug Discovery Today 医学-药学

CiteScore

14.80

自引率

2.70%

发文量

293

审稿时长

6 months

期刊介绍： Drug Discovery Today delivers informed and highly current reviews for the discovery community. The magazine addresses not only the rapid scientific developments in drug discovery associated technologies but also the management, commercial and regulatory issues that increasingly play a part in how R&D is planned, structured and executed. Features include comment by international experts, news and analysis of important developments, reviews of key scientific and strategic issues, overviews of recent progress in specific therapeutic areas and conference reports.