基于agent的大型语言模型框架在慢性丙型肝炎病毒感染患者中的自动治疗处方

IF 3.8 3区医学 Q1 GASTROENTEROLOGY & HEPATOLOGY Digestive and Liver Disease Pub Date : 2025-02-01 Epub Date: 2025-03-10 DOI:10.1016/j.dld.2025.01.088

M. Giuffre , S. Kresevic , M. Ajcevic , L. Crocè , D. Shung

{"title":"基于agent的大型语言模型框架在慢性丙型肝炎病毒感染患者中的自动治疗处方","authors":"M. Giuffre , S. Kresevic , M. Ajcevic , L. Crocè , D. Shung","doi":"10.1016/j.dld.2025.01.088","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Large Language Models (LLMs) may be useful for clinical tasks that require reasoning over different information sources. LLMs could be regarded as intelligent \"agents\" with internal planning abilities, enabling them to engage in multi-step reasoning and interact with other agents or external users. Hepatitis C Virus (HCV) management is a potential area where LLM-enabled agents could be useful, since treatment decisions require consideration of genotype, treatment history, liver fibrosis extent, and drug-drug interactions.</div></div><div><h3>Aim</h3><div>To evaluate the performance of different LLM agent-based configurations in automating HCV treatment decisions and to determine whether specialized multi-agent architectures improve prescription accuracy compared to single-agent approaches.</div></div><div><h3>Material and Methods</h3><div>We developed 50 clinical cases focusing on therapeutic regimen prescription. Cases included genotype, prior treatment history, fibrosis status, and concurrent medications. We compared multiple configurations using OpenAI's GPT-3.5 and GPT-4o, fine-tuned with HCV treatment guidelines. Different agent architectures were tested: single agent (one LLM extracting all data), multi-agent (three specialized LLMs for data extraction plus prescriber), and specialized multi-agent (four specialized extraction agents plus prescriber). Each agent accessed specific guideline sections relevant to its task. Performance was compared to baseline fine-tuned models.</div></div><div><h3>Results</h3><div>Using GPT-3.5, the baseline model achieved 24% prescription accuracy. The single agent configuration reached 50% (p=0.007), multi-agent 76% (p<0.001), and specialized multi-agent 89% (p<0.001). With GPT-4, performance improved significantly: baseline 35% accuracy, single agent 65% (p=0.005), multi-agent 88% (p<0.001), and specialized multi-agent 94% (p<0.001).</div></div><div><h3>Conclusions</h3><div>Specialized multi-agent LLM frameworks significantly improve HCV treatment recommendation accuracy, with GPT-4 showing superior performance. The agent-based approach demonstrates the potential for complex clinical decision-making. Future work should validate these findings in real-world settings and explore integration with clinical decision-support systems.</div></div>","PeriodicalId":11268,"journal":{"name":"Digestive and Liver Disease","volume":"57 ","pages":"Pages S46-S47"},"PeriodicalIF":3.8000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Large Language Model Agent-Based Framework for automated Treatment Prescription in Patients with Chronic Hepatitis C Virus Infection\",\"authors\":\"M. Giuffre , S. Kresevic , M. Ajcevic , L. Crocè , D. Shung\",\"doi\":\"10.1016/j.dld.2025.01.088\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Large Language Models (LLMs) may be useful for clinical tasks that require reasoning over different information sources. LLMs could be regarded as intelligent \\\"agents\\\" with internal planning abilities, enabling them to engage in multi-step reasoning and interact with other agents or external users. Hepatitis C Virus (HCV) management is a potential area where LLM-enabled agents could be useful, since treatment decisions require consideration of genotype, treatment history, liver fibrosis extent, and drug-drug interactions.</div></div><div><h3>Aim</h3><div>To evaluate the performance of different LLM agent-based configurations in automating HCV treatment decisions and to determine whether specialized multi-agent architectures improve prescription accuracy compared to single-agent approaches.</div></div><div><h3>Material and Methods</h3><div>We developed 50 clinical cases focusing on therapeutic regimen prescription. Cases included genotype, prior treatment history, fibrosis status, and concurrent medications. We compared multiple configurations using OpenAI's GPT-3.5 and GPT-4o, fine-tuned with HCV treatment guidelines. Different agent architectures were tested: single agent (one LLM extracting all data), multi-agent (three specialized LLMs for data extraction plus prescriber), and specialized multi-agent (four specialized extraction agents plus prescriber). Each agent accessed specific guideline sections relevant to its task. Performance was compared to baseline fine-tuned models.</div></div><div><h3>Results</h3><div>Using GPT-3.5, the baseline model achieved 24% prescription accuracy. The single agent configuration reached 50% (p=0.007), multi-agent 76% (p<0.001), and specialized multi-agent 89% (p<0.001). With GPT-4, performance improved significantly: baseline 35% accuracy, single agent 65% (p=0.005), multi-agent 88% (p<0.001), and specialized multi-agent 94% (p<0.001).</div></div><div><h3>Conclusions</h3><div>Specialized multi-agent LLM frameworks significantly improve HCV treatment recommendation accuracy, with GPT-4 showing superior performance. The agent-based approach demonstrates the potential for complex clinical decision-making. Future work should validate these findings in real-world settings and explore integration with clinical decision-support systems.</div></div>\",\"PeriodicalId\":11268,\"journal\":{\"name\":\"Digestive and Liver Disease\",\"volume\":\"57 \",\"pages\":\"Pages S46-S47\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digestive and Liver Disease\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1590865825000891\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/10 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"GASTROENTEROLOGY & HEPATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digestive and Liver Disease","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1590865825000891","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/10 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

大型语言模型（llm）可能对需要对不同信息源进行推理的临床任务有用。法学硕士可以被视为具有内部规划能力的智能“代理”，使其能够进行多步推理，并与其他代理或外部用户交互。丙型肝炎病毒（HCV）管理是llm启用药物可能有用的潜在领域，因为治疗决策需要考虑基因型、治疗史、肝纤维化程度和药物-药物相互作用。目的评估不同基于LLM代理的配置在自动化HCV治疗决策中的性能，并确定与单代理方法相比，专门的多代理架构是否提高了处方准确性。材料与方法以治疗方案处方为重点，对50例临床病例进行分析。病例包括基因型、既往治疗史、纤维化状况和同时使用的药物。我们使用OpenAI的GPT-3.5和gpt - 40比较了多种配置，并根据HCV治疗指南进行了微调。测试了不同的代理体系结构：单个代理（一个LLM提取所有数据）、多代理（三个专门的LLM用于数据提取和处方）和专门的多代理（四个专门的提取代理和处方）。每个代理访问与其任务相关的特定指南部分。将性能与基线微调模型进行比较。结果使用GPT-3.5，基线模型的处方准确率达到24%。单代理配置达到50% (p=0.007)，多代理达到76% (p<0.001)，专用多代理达到89% （p<0.001）。使用GPT-4，性能显著提高：基线准确率为35%，单剂准确率为65% (p=0.005)，多剂准确率为88% (p<0.001)，专用多剂准确率为94% （p<0.001）。结论专门的多药物LLM框架显著提高了HCV治疗推荐的准确性，其中GPT-4表现更佳。基于药物的方法显示了复杂临床决策的潜力。未来的工作应该在现实环境中验证这些发现，并探索与临床决策支持系统的整合。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Large Language Model Agent-Based Framework for automated Treatment Prescription in Patients with Chronic Hepatitis C Virus Infection

Background

Large Language Models (LLMs) may be useful for clinical tasks that require reasoning over different information sources. LLMs could be regarded as intelligent "agents" with internal planning abilities, enabling them to engage in multi-step reasoning and interact with other agents or external users. Hepatitis C Virus (HCV) management is a potential area where LLM-enabled agents could be useful, since treatment decisions require consideration of genotype, treatment history, liver fibrosis extent, and drug-drug interactions.

Aim

To evaluate the performance of different LLM agent-based configurations in automating HCV treatment decisions and to determine whether specialized multi-agent architectures improve prescription accuracy compared to single-agent approaches.

Material and Methods

We developed 50 clinical cases focusing on therapeutic regimen prescription. Cases included genotype, prior treatment history, fibrosis status, and concurrent medications. We compared multiple configurations using OpenAI's GPT-3.5 and GPT-4o, fine-tuned with HCV treatment guidelines. Different agent architectures were tested: single agent (one LLM extracting all data), multi-agent (three specialized LLMs for data extraction plus prescriber), and specialized multi-agent (four specialized extraction agents plus prescriber). Each agent accessed specific guideline sections relevant to its task. Performance was compared to baseline fine-tuned models.

Results

Using GPT-3.5, the baseline model achieved 24% prescription accuracy. The single agent configuration reached 50% (p=0.007), multi-agent 76% (p<0.001), and specialized multi-agent 89% (p<0.001). With GPT-4, performance improved significantly: baseline 35% accuracy, single agent 65% (p=0.005), multi-agent 88% (p<0.001), and specialized multi-agent 94% (p<0.001).

Conclusions

Specialized multi-agent LLM frameworks significantly improve HCV treatment recommendation accuracy, with GPT-4 showing superior performance. The agent-based approach demonstrates the potential for complex clinical decision-making. Future work should validate these findings in real-world settings and explore integration with clinical decision-support systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Digestive and Liver Disease 医学-胃肠肝病学

CiteScore

6.10

自引率

2.20%

发文量

632

审稿时长

19 days

期刊介绍： Digestive and Liver Disease is an international journal of Gastroenterology and Hepatology. It is the official journal of Italian Association for the Study of the Liver (AISF); Italian Association for the Study of the Pancreas (AISP); Italian Association for Digestive Endoscopy (SIED); Italian Association for Hospital Gastroenterologists and Digestive Endoscopists (AIGO); Italian Society of Gastroenterology (SIGE); Italian Society of Pediatric Gastroenterology and Hepatology (SIGENP) and Italian Group for the Study of Inflammatory Bowel Disease (IG-IBD). Digestive and Liver Disease publishes papers on basic and clinical research in the field of gastroenterology and hepatology. Contributions consist of: Original Papers Correspondence to the Editor Editorials, Reviews and Special Articles Progress Reports Image of the Month Congress Proceedings Symposia and Mini-symposia.