使用大型语言模型优化中毒中心图表制作。

IF 3.3 3区医学 Q2 TOXICOLOGY Clinical Toxicology Pub Date : 2024-06-01 Epub Date: 2024-06-12 DOI:10.1080/15563650.2024.2348107

Nikolaus Matsler, Lesley Pepin, Shireen Banerji, Christopher Hoyte, Kennon Heard

{"title":"使用大型语言模型优化中毒中心图表制作。","authors":"Nikolaus Matsler, Lesley Pepin, Shireen Banerji, Christopher Hoyte, Kennon Heard","doi":"10.1080/15563650.2024.2348107","DOIUrl":null,"url":null,"abstract":"Introduction: Efficient and complete medical charting is essential for patient care and research purposes. In this study, we sought to determine if Chat Generative Pre-Trained Transformer could generate cogent, suitable charts from recorded, real-world poison center calls and abstract and tabulate data.Methods: De-identified transcripts of real-world hospital-initiated poison center consults were summarized by Chat Generative Pre-Trained Transformer 4.0. Additionally, Chat Generative Pre-Trained Transformer organized tables for data points, including vital signs, test results, therapies, and recommendations. Seven trained reviewers, including certified specialists in poison information and board-certified medical toxicologists, graded summaries using a 1 to 5 scale to determine appropriateness for entry into the medical record. Intra-rater reliability was calculated. Tabulated data was quantitatively evaluated for accuracy. Finally, reviewers selected preferred documentation: original or Chat Generative Pre-Trained Transformer organized.Results: Eighty percent of summaries had a median score high enough to be deemed appropriate for entry into the medical record. In three duplicate cases, reviewers did change scores, leading to moderate intra-rater reliability (kappa = 0.6). Among all cases, 91 percent of data points were correctly abstracted into table format.Discussion: By utilizing a large language model with a unified prompt, charts can be generated directly from conversations in seconds without the need for additional training. Charts generated by Chat Generative Pre-Trained Transformer were preferred over extant charts, even when they were deemed unacceptable for entry into the medical record prior to the correction of errors. However, there were several limitations to our study, including poor intra-rater-reliability and a limited number of cases examined.Conclusions: In this study, we demonstrate that large language models can generate coherent summaries of real-world poison center calls that are often acceptable for entry to the medical record as is. When errors were present, these were often fixed with the addition or deletion of a word or phrase, presenting an enormous opportunity for efficiency gains. Our future work will focus on implementing this process in a prospective fashion.","PeriodicalId":10430,"journal":{"name":"Clinical Toxicology","volume":" ","pages":"385-390"},"PeriodicalIF":3.3000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Use of large language models to optimize poison center charting.\",\"authors\":\"Nikolaus Matsler, Lesley Pepin, Shireen Banerji, Christopher Hoyte, Kennon Heard\",\"doi\":\"10.1080/15563650.2024.2348107\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Efficient and complete medical charting is essential for patient care and research purposes. In this study, we sought to determine if Chat Generative Pre-Trained Transformer could generate cogent, suitable charts from recorded, real-world poison center calls and abstract and tabulate data.Methods: De-identified transcripts of real-world hospital-initiated poison center consults were summarized by Chat Generative Pre-Trained Transformer 4.0. Additionally, Chat Generative Pre-Trained Transformer organized tables for data points, including vital signs, test results, therapies, and recommendations. Seven trained reviewers, including certified specialists in poison information and board-certified medical toxicologists, graded summaries using a 1 to 5 scale to determine appropriateness for entry into the medical record. Intra-rater reliability was calculated. Tabulated data was quantitatively evaluated for accuracy. Finally, reviewers selected preferred documentation: original or Chat Generative Pre-Trained Transformer organized.Results: Eighty percent of summaries had a median score high enough to be deemed appropriate for entry into the medical record. In three duplicate cases, reviewers did change scores, leading to moderate intra-rater reliability (kappa = 0.6). Among all cases, 91 percent of data points were correctly abstracted into table format.Discussion: By utilizing a large language model with a unified prompt, charts can be generated directly from conversations in seconds without the need for additional training. Charts generated by Chat Generative Pre-Trained Transformer were preferred over extant charts, even when they were deemed unacceptable for entry into the medical record prior to the correction of errors. However, there were several limitations to our study, including poor intra-rater-reliability and a limited number of cases examined.Conclusions: In this study, we demonstrate that large language models can generate coherent summaries of real-world poison center calls that are often acceptable for entry to the medical record as is. When errors were present, these were often fixed with the addition or deletion of a word or phrase, presenting an enormous opportunity for efficiency gains. Our future work will focus on implementing this process in a prospective fashion.\",\"PeriodicalId\":10430,\"journal\":{\"name\":\"Clinical Toxicology\",\"volume\":\" \",\"pages\":\"385-390\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical Toxicology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1080/15563650.2024.2348107\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/6/12 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"TOXICOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Toxicology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/15563650.2024.2348107","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/6/12 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"TOXICOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

介绍：高效、完整的医疗图表对于病人护理和研究工作至关重要。在这项研究中，我们试图确定 Chat Generative Pre-Trained Transformer 能否从真实世界毒物中心的通话录音中生成有说服力的合适图表，并对数据进行摘要和制表：方法： Chat Generative Pre-Trained Transformer 4.0 对真实世界中由医院发起的毒物中心咨询的去身份化记录进行了总结。此外，Chat Generative Pre-Trained Transformer 还整理了数据点表格，包括生命体征、检查结果、疗法和建议。包括毒物信息认证专家和委员会认证的医学毒理学家在内的七位训练有素的审阅者采用 1 到 5 级评分法对摘要进行评分，以确定是否适合录入病历。计算了评分者内部的可靠性。对制表数据的准确性进行量化评估。最后，审稿人选择了首选文档：原始文档或经 Chat Generative Pre-Trained Transformer 整理的文档：结果：80%的摘要中位数得分较高，足以被认为适合录入病历。在三个重复病例中，审阅者确实改变了评分，这导致了中等程度的评分者内部可靠性（kappa = 0.6）。在所有病例中，有 91% 的数据点被正确摘录为表格格式：通过利用带有统一提示的大型语言模型，可以在几秒钟内直接从对话中生成图表，而无需额外的培训。与现有的图表相比，聊天生成式预训练转换器生成的图表更受青睐，即使在纠正错误之前，这些图表被认为无法录入病历。不过，我们的研究也存在一些局限性，包括评分者内部的可靠性较差以及研究的病例数量有限：在这项研究中，我们证明了大型语言模型可以生成真实世界中毒中心呼叫的连贯摘要，这些摘要通常可以原封不动地输入医疗记录。当出现错误时，通常只需添加或删除一个单词或短语即可解决，这为提高效率提供了巨大的机会。我们今后的工作重点是以前瞻性的方式实施这一流程。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Use of large language models to optimize poison center charting.

Introduction: Efficient and complete medical charting is essential for patient care and research purposes. In this study, we sought to determine if Chat Generative Pre-Trained Transformer could generate cogent, suitable charts from recorded, real-world poison center calls and abstract and tabulate data.

Methods: De-identified transcripts of real-world hospital-initiated poison center consults were summarized by Chat Generative Pre-Trained Transformer 4.0. Additionally, Chat Generative Pre-Trained Transformer organized tables for data points, including vital signs, test results, therapies, and recommendations. Seven trained reviewers, including certified specialists in poison information and board-certified medical toxicologists, graded summaries using a 1 to 5 scale to determine appropriateness for entry into the medical record. Intra-rater reliability was calculated. Tabulated data was quantitatively evaluated for accuracy. Finally, reviewers selected preferred documentation: original or Chat Generative Pre-Trained Transformer organized.

Results: Eighty percent of summaries had a median score high enough to be deemed appropriate for entry into the medical record. In three duplicate cases, reviewers did change scores, leading to moderate intra-rater reliability (kappa = 0.6). Among all cases, 91 percent of data points were correctly abstracted into table format.

Discussion: By utilizing a large language model with a unified prompt, charts can be generated directly from conversations in seconds without the need for additional training. Charts generated by Chat Generative Pre-Trained Transformer were preferred over extant charts, even when they were deemed unacceptable for entry into the medical record prior to the correction of errors. However, there were several limitations to our study, including poor intra-rater-reliability and a limited number of cases examined.

Conclusions: In this study, we demonstrate that large language models can generate coherent summaries of real-world poison center calls that are often acceptable for entry to the medical record as is. When errors were present, these were often fixed with the addition or deletion of a word or phrase, presenting an enormous opportunity for efficiency gains. Our future work will focus on implementing this process in a prospective fashion.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Clinical Toxicology 医学-毒理学

CiteScore

5.70

自引率

12.10%

发文量

148

审稿时长

4-8 weeks

期刊介绍： clinical Toxicology publishes peer-reviewed scientific research and clinical advances in clinical toxicology. The journal reflects the professional concerns and best scientific judgment of its sponsors, the American Academy of Clinical Toxicology, the European Association of Poisons Centres and Clinical Toxicologists, the American Association of Poison Control Centers and the Asia Pacific Association of Medical Toxicology and, as such, is the leading international journal in the specialty.