ChatGPT Responses to Clinical Questions in the Japan Atherosclerosis Society Guidelines for Prevention of Atherosclerotic Cardiovascular Disease 2022.

IF 3 2区 医学 Q2 PERIPHERAL VASCULAR DISEASE Journal of atherosclerosis and thrombosis Pub Date : 2024-10-30 DOI:10.5551/jat.65240
Takashi Hisamatsu, Mari Fukuda, Minako Kinuta, Hideyuki Kanda
{"title":"ChatGPT Responses to Clinical Questions in the Japan Atherosclerosis Society Guidelines for Prevention of Atherosclerotic Cardiovascular Disease 2022.","authors":"Takashi Hisamatsu, Mari Fukuda, Minako Kinuta, Hideyuki Kanda","doi":"10.5551/jat.65240","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>Artificial intelligence is increasingly used in the medical field. We assessed the accuracy and reproducibility of responses by ChatGPT to clinical questions (CQs) in the Japan Atherosclerosis Society Guidelines for Prevention Atherosclerotic Cardiovascular Diseases 2022 (JAS Guidelines 2022).</p><p><strong>Methods: </strong>In June 2024, we assessed responses by ChatGPT (version 3.5) to CQs, including background questions (BQs) and foreground questions (FQs). Accuracy was assessed independently by three researchers using six-point Likert scales ranging from 1 (\"completely incorrect\") to 6 (\"completely correct\") by evaluating responses to CQs in Japanese or translated into English. For reproducibility assessment, responses to each CQ asked five times separately in a new chat were scored using six-point Likert scales, and Fleiss kappa coefficients were calculated.</p><p><strong>Results: </strong>The median (25th-75th percentile) score for ChatGPT's responses to BQs and FQs was 4 (3-5) and 5 (5-6) for Japanese CQs and 5 (3-6) and 6 (5-6) for English CQs, respectively. Response scores were higher for FQs than those for BQs (P values <0.001 for Japanese and English). Similar response accuracy levels were observed between Japanese and English CQs (P value 0.139 for BQs and 0.586 for FQs). Kappa coefficients for reproducibility were 0.76 for BQs and 0.90 for FQs.</p><p><strong>Conclusions: </strong>ChatGPT showed high accuracy and reproducibility in responding to JAS Guidelines 2022 CQs, especially FQs. While ChatGPT primarily reflects existing guidelines, its strength could lie in rapidly organizing and presenting relevant information, thus supporting instant and more efficient guideline interpretation and aiding in medical decision-making.</p>","PeriodicalId":15128,"journal":{"name":"Journal of atherosclerosis and thrombosis","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of atherosclerosis and thrombosis","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.5551/jat.65240","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PERIPHERAL VASCULAR DISEASE","Score":null,"Total":0}
引用次数: 0

Abstract

Aims: Artificial intelligence is increasingly used in the medical field. We assessed the accuracy and reproducibility of responses by ChatGPT to clinical questions (CQs) in the Japan Atherosclerosis Society Guidelines for Prevention Atherosclerotic Cardiovascular Diseases 2022 (JAS Guidelines 2022).

Methods: In June 2024, we assessed responses by ChatGPT (version 3.5) to CQs, including background questions (BQs) and foreground questions (FQs). Accuracy was assessed independently by three researchers using six-point Likert scales ranging from 1 ("completely incorrect") to 6 ("completely correct") by evaluating responses to CQs in Japanese or translated into English. For reproducibility assessment, responses to each CQ asked five times separately in a new chat were scored using six-point Likert scales, and Fleiss kappa coefficients were calculated.

Results: The median (25th-75th percentile) score for ChatGPT's responses to BQs and FQs was 4 (3-5) and 5 (5-6) for Japanese CQs and 5 (3-6) and 6 (5-6) for English CQs, respectively. Response scores were higher for FQs than those for BQs (P values <0.001 for Japanese and English). Similar response accuracy levels were observed between Japanese and English CQs (P value 0.139 for BQs and 0.586 for FQs). Kappa coefficients for reproducibility were 0.76 for BQs and 0.90 for FQs.

Conclusions: ChatGPT showed high accuracy and reproducibility in responding to JAS Guidelines 2022 CQs, especially FQs. While ChatGPT primarily reflects existing guidelines, its strength could lie in rapidly organizing and presenting relevant information, thus supporting instant and more efficient guideline interpretation and aiding in medical decision-making.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ChatGPT 对《日本动脉粥样硬化学会 2022 年动脉粥样硬化性心血管疾病预防指南》中临床问题的答复。
目的:人工智能在医学领域的应用越来越广泛。我们评估了 ChatGPT 对《日本动脉粥样硬化学会 2022 年预防动脉粥样硬化性心血管疾病指南》(JAS Guidelines 2022)中临床问题(CQs)回答的准确性和可重复性:2024 年 6 月,我们评估了 ChatGPT(3.5 版)对 CQ(包括背景问题 (BQ) 和前景问题 (FQ))的回答。准确性由三位研究人员通过评估日语或翻译成英语的 CQ 回答,使用从 1("完全错误")到 6("完全正确")的六点李克特量表进行独立评估。为了评估可重复性,在新的聊天中对每个 CQ 分别提问五次,采用六点李克特量表进行评分,并计算弗莱斯卡帕系数:ChatGPT 对日语 CQ 和英语 CQ 的 BQ 和 FQ 回答的中位数(第 25-75 百分位数)分别为 4(3-5)分和 5(5-6)分,英语 CQ 的中位数分别为 5(3-6)分和 6(5-6)分。对 FQs 的回答得分高于对 BQs 的回答得分(日语和英语的 P 值均<0.001)。日语和英语 CQs 的应答准确率水平相似(BQs 的 P 值为 0.139,FQs 的 P 值为 0.586)。BQs和FQs的重复性卡帕系数分别为0.76和0.90:ChatGPT 在响应 JAS 准则 2022 CQs(尤其是 FQs)方面表现出较高的准确性和可重复性。虽然 ChatGPT 主要反映的是现有指南,但其优势在于快速组织和呈现相关信息,从而支持即时、更高效地解读指南,帮助医疗决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.60
自引率
15.90%
发文量
271
审稿时长
1 months
期刊介绍: JAT publishes articles focused on all aspects of research on atherosclerosis, vascular biology, thrombosis, lipid and metabolism.
期刊最新文献
Atherosclerotic Diseases in Chronic Kidney Disease. Non-high-density Lipoprotein Cholesterol for Secondary Prevention after Minor Stroke. Chronic Disturbed Flow Induces Superficial Erosion-Prone Lesion via Endothelial-to-Mesenchymal Transition in a DNA Methyltransferase-Dependent Manner. The Clinical Implication of Pemafibrate, a Novel Selective PPARα Modulator. Dairy Intake and All-Cause, Cancer, and Cardiovascular Disease Mortality Risk in A Large Japanese Population: A 12-Year Follow-Up of the J-MICC Study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1