Empowering PET imaging reporting with retrieval-augmented large language models and reading reports database: a pilot single center study

IF 8.6 1区医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING European Journal of Nuclear Medicine and Molecular Imaging Pub Date : 2025-01-23 DOI:10.1007/s00259-025-07101-9

Hongyoon Choi, Dongjoo Lee, Yeon-koo Kang, Minseok Suh

{"title":"Empowering PET imaging reporting with retrieval-augmented large language models and reading reports database: a pilot single center study","authors":"Hongyoon Choi, Dongjoo Lee, Yeon-koo Kang, Minseok Suh","doi":"10.1007/s00259-025-07101-9","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Purpose</h3><p>The potential of Large Language Models (LLMs) in enhancing a variety of natural language tasks in clinical fields includes medical imaging reporting. This pilot study examines the efficacy of a retrieval-augmented generation (RAG) LLM system considering zero-shot learning capability of LLMs, integrated with a comprehensive database of PET reading reports, in improving reference to prior reports and decision making.</p><h3 data-test=\"abstract-sub-heading\">Methods</h3><p>We developed a custom LLM framework with retrieval capabilities, leveraging a database of over 10 years of PET imaging reports from a single center. The system uses vector space embedding to facilitate similarity-based retrieval. Queries prompt the system to generate context-based answers and identify similar cases or differential diagnoses. From routine clinical PET readings, experienced nuclear medicine physicians evaluated the performance of system in terms of the relevance of queried similar cases and the appropriateness score of suggested potential diagnoses.</p><h3 data-test=\"abstract-sub-heading\">Results</h3><p>The system efficiently organized embedded vectors from PET reports, showing that imaging reports were accurately clustered within the embedded vector space according to the diagnosis or PET study type. Based on this system, a proof-of-concept chatbot was developed and showed the framework’s potential in referencing reports of previous similar cases and identifying exemplary cases for various purposes. From routine clinical PET readings, 84.1% of the cases retrieved relevant similar cases, as agreed upon by all three readers. Using the RAG system, the appropriateness score of the suggested potential diagnoses was significantly better than that of the LLM without RAG. Additionally, it demonstrated the capability to offer differential diagnoses, leveraging the vast database to enhance the completeness and precision of generated reports.</p><h3 data-test=\"abstract-sub-heading\">Conclusion</h3><p>The integration of RAG LLM with a large database of PET imaging reports suggests the potential to support clinical practice of nuclear medicine imaging reading by various tasks of AI including finding similar cases and deriving potential diagnoses from them. This study underscores the potential of advanced AI tools in transforming medical imaging reporting practices.</p>","PeriodicalId":11909,"journal":{"name":"European Journal of Nuclear Medicine and Molecular Imaging","volume":"74 1","pages":""},"PeriodicalIF":8.6000,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Nuclear Medicine and Molecular Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00259-025-07101-9","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose

The potential of Large Language Models (LLMs) in enhancing a variety of natural language tasks in clinical fields includes medical imaging reporting. This pilot study examines the efficacy of a retrieval-augmented generation (RAG) LLM system considering zero-shot learning capability of LLMs, integrated with a comprehensive database of PET reading reports, in improving reference to prior reports and decision making.

Methods

We developed a custom LLM framework with retrieval capabilities, leveraging a database of over 10 years of PET imaging reports from a single center. The system uses vector space embedding to facilitate similarity-based retrieval. Queries prompt the system to generate context-based answers and identify similar cases or differential diagnoses. From routine clinical PET readings, experienced nuclear medicine physicians evaluated the performance of system in terms of the relevance of queried similar cases and the appropriateness score of suggested potential diagnoses.

Results

The system efficiently organized embedded vectors from PET reports, showing that imaging reports were accurately clustered within the embedded vector space according to the diagnosis or PET study type. Based on this system, a proof-of-concept chatbot was developed and showed the framework’s potential in referencing reports of previous similar cases and identifying exemplary cases for various purposes. From routine clinical PET readings, 84.1% of the cases retrieved relevant similar cases, as agreed upon by all three readers. Using the RAG system, the appropriateness score of the suggested potential diagnoses was significantly better than that of the LLM without RAG. Additionally, it demonstrated the capability to offer differential diagnoses, leveraging the vast database to enhance the completeness and precision of generated reports.

Conclusion

The integration of RAG LLM with a large database of PET imaging reports suggests the potential to support clinical practice of nuclear medicine imaging reading by various tasks of AI including finding similar cases and deriving potential diagnoses from them. This study underscores the potential of advanced AI tools in transforming medical imaging reporting practices.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

增强PET成像报告检索增强大型语言模型和阅读报告数据库：试点单中心研究

目的大语言模型（LLMs）在增强临床领域的各种自然语言任务方面的潜力，包括医学成像报告。本初步研究考察了考虑LLM零射击学习能力的检索增强生成（RAG） LLM系统的有效性，该系统与PET阅读报告的综合数据库相结合，可以改善对先前报告的参考和决策。方法我们开发了一个具有检索功能的定制LLM框架，利用来自单个中心的超过10年的PET成像报告数据库。该系统使用向量空间嵌入来促进基于相似性的检索。查询会提示系统生成基于上下文的答案，并识别类似病例或鉴别诊断。根据常规临床PET读数，经验丰富的核医学医生根据查询的类似病例的相关性和建议的潜在诊断的适当性评分来评估系统的性能。结果该系统有效地组织了PET报告中的嵌入向量，显示成像报告能够根据诊断或PET研究类型在嵌入向量空间内准确聚类。基于该系统，开发了一个概念验证聊天机器人，并展示了该框架在参考先前类似案例报告和识别各种用途的示例案例方面的潜力。从常规临床PET读数中，84.1%的病例检索到相关的类似病例，这是三位读者一致同意的。使用RAG系统，建议的潜在诊断的适当性评分明显优于未使用RAG的LLM。此外，它还展示了提供鉴别诊断的能力，利用庞大的数据库来增强生成报告的完整性和准确性。结论RAG LLM与大型PET影像学报告数据库的整合，表明人工智能的各种任务，包括发现相似病例并从中得出潜在诊断，可能支持核医学影像学阅读的临床实践。这项研究强调了先进的人工智能工具在改变医学成像报告实践方面的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

European Journal of Nuclear Medicine and Molecular Imaging 医学-核医学

CiteScore

15.60

自引率

9.90%

发文量

392

审稿时长

3 months

期刊介绍： The European Journal of Nuclear Medicine and Molecular Imaging serves as a platform for the exchange of clinical and scientific information within nuclear medicine and related professions. It welcomes international submissions from professionals involved in the functional, metabolic, and molecular investigation of diseases. The journal's coverage spans physics, dosimetry, radiation biology, radiochemistry, and pharmacy, providing high-quality peer review by experts in the field. Known for highly cited and downloaded articles, it ensures global visibility for research work and is part of the EJNMMI journal family.