Precision of artificial intelligence in paediatric cardiology multimodal image interpretation.

IF 0.9 4区 医学 Q4 CARDIAC & CARDIOVASCULAR SYSTEMS Cardiology in the Young Pub Date : 2024-11-11 DOI:10.1017/S1047951124036035
Michael N Gritti, Rahil Prajapati, Dolev Yissar, Conall T Morgan
{"title":"Precision of artificial intelligence in paediatric cardiology multimodal image interpretation.","authors":"Michael N Gritti, Rahil Prajapati, Dolev Yissar, Conall T Morgan","doi":"10.1017/S1047951124036035","DOIUrl":null,"url":null,"abstract":"<p><p>Multimodal imaging is crucial for diagnosis and treatment in paediatric cardiology. However, the proficiency of artificial intelligence chatbots, like ChatGPT-4, in interpreting these images has not been assessed. This cross-sectional study evaluates the precision of ChatGPT-4 in interpreting multimodal images for paediatric cardiology knowledge assessment, including echocardiograms, angiograms, X-rays, and electrocardiograms. One hundred multiple-choice questions with accompanying images from the textbook <i>Pediatric Cardiology Board Review</i> were randomly selected. The chatbot was prompted to answer these questions with and without the accompanying images. Statistical analysis was done using <i>X</i><sup>2</sup>, Fisher's exact, and McNemar tests. Results showed that ChatGPT-4 answered 41% of questions with images correctly, performing best on those with electrocardiograms (54%) and worst on those with angiograms (29%). Without the images, ChatGPT-4's performance was similar at 37% (difference = 4%, 95% confidence interval (CI) -9.4% to 17.2%, <i>p</i> = 0.56). The chatbot performed significantly better when provided the image of an electrocardiogram than without (difference = 18, 95% CI 4.0% to 31.9%, <i>p</i> < 0.04). In cases of incorrect answers, ChatGPT-4 was more inconsistent with an image than without (difference = 21%, 95% CI 3.5% to 36.9%, <i>p</i> < 0.02). In conclusion, ChatGPT-4 performed poorly in answering image-based multiple-choice questions in paediatric cardiology. Its accuracy in answering questions with images was similar to without, indicating limited multimodal image interpretation capabilities. Substantial training is required before clinical integration can be considered. Further research is needed to assess the clinical reasoning skills and progression of ChatGPT in paediatric cardiology for clinical and academic utility.</p>","PeriodicalId":9435,"journal":{"name":"Cardiology in the Young","volume":" ","pages":"1-6"},"PeriodicalIF":0.9000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cardiology in the Young","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1017/S1047951124036035","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Multimodal imaging is crucial for diagnosis and treatment in paediatric cardiology. However, the proficiency of artificial intelligence chatbots, like ChatGPT-4, in interpreting these images has not been assessed. This cross-sectional study evaluates the precision of ChatGPT-4 in interpreting multimodal images for paediatric cardiology knowledge assessment, including echocardiograms, angiograms, X-rays, and electrocardiograms. One hundred multiple-choice questions with accompanying images from the textbook Pediatric Cardiology Board Review were randomly selected. The chatbot was prompted to answer these questions with and without the accompanying images. Statistical analysis was done using X2, Fisher's exact, and McNemar tests. Results showed that ChatGPT-4 answered 41% of questions with images correctly, performing best on those with electrocardiograms (54%) and worst on those with angiograms (29%). Without the images, ChatGPT-4's performance was similar at 37% (difference = 4%, 95% confidence interval (CI) -9.4% to 17.2%, p = 0.56). The chatbot performed significantly better when provided the image of an electrocardiogram than without (difference = 18, 95% CI 4.0% to 31.9%, p < 0.04). In cases of incorrect answers, ChatGPT-4 was more inconsistent with an image than without (difference = 21%, 95% CI 3.5% to 36.9%, p < 0.02). In conclusion, ChatGPT-4 performed poorly in answering image-based multiple-choice questions in paediatric cardiology. Its accuracy in answering questions with images was similar to without, indicating limited multimodal image interpretation capabilities. Substantial training is required before clinical integration can be considered. Further research is needed to assess the clinical reasoning skills and progression of ChatGPT in paediatric cardiology for clinical and academic utility.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
人工智能在儿科心脏病学多模态图像解读中的精确性。
多模态成像对儿科心脏病学的诊断和治疗至关重要。然而,人工智能聊天机器人(如 ChatGPT-4)解读这些图像的能力尚未得到评估。这项横向研究评估了 ChatGPT-4 在儿科心脏病学知识评估中解读多模态图像的精确度,包括超声心动图、血管造影、X 光和心电图。我们从教科书《儿科心脏病学 Board Review》中随机抽取了 100 道多选题,并配有相应的图像。聊天机器人会被提示在有或没有配图的情况下回答这些问题。使用 X2、费雪精确检验和 McNemar 检验进行了统计分析。结果显示,ChatGPT-4 能正确回答 41% 带图片的问题,在心电图问题上表现最好(54%),在血管造影问题上表现最差(29%)。在没有图像的情况下,ChatGPT-4 的表现类似,为 37%(差异 = 4%,95% 置信区间 (CI) -9.4% 到 17.2%,p = 0.56)。在提供心电图图像的情况下,聊天机器人的表现明显优于未提供图像的情况(差异 = 18,95% 置信区间为 4.0% 到 31.9%,p < 0.04)。在回答错误的情况下,有图像时 ChatGPT-4 的表现比没有图像时更不一致(差异 = 21%,95% CI 3.5% 至 36.9%,p < 0.02)。总之,ChatGPT-4 在回答儿科心脏病学基于图像的选择题时表现不佳。在回答有图像的问题时,它的准确性与没有图像的问题相似,这表明它的多模态图像解读能力有限。在考虑临床整合之前,需要进行大量的培训。需要进一步开展研究,以评估 ChatGPT 在儿科心脏病学中的临床推理技能和进展,从而在临床和学术上发挥作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Cardiology in the Young
Cardiology in the Young 医学-小儿科
CiteScore
1.70
自引率
10.00%
发文量
715
审稿时长
4-8 weeks
期刊介绍: Cardiology in the Young is devoted to cardiovascular issues affecting the young, and the older patient suffering the sequels of congenital heart disease, or other cardiac diseases acquired in childhood. The journal serves the interests of all professionals concerned with these topics. By design, the journal is international and multidisciplinary in its approach, and members of the editorial board take an active role in the its mission, helping to make it the essential journal in paediatric cardiology. All aspects of paediatric cardiology are covered within the journal. The content includes original articles, brief reports, editorials, reviews, and papers devoted to continuing professional development.
期刊最新文献
The intersection of allergy and acute coronary syndrome: a type II Kounis syndrome case report. Case presentation: successful occlusion of congenital left ventricle to coronary sinus fistula. Anatomic and non-anatomic substrates in infants with two ventricles undergoing aortic arch repair. Early dehiscence of a tricuspid valve annuloplasty ring in an adolescent with hypoplastic left heart syndrome presenting with unconjugated hyperbilirubinemia. Elevating diversity, inclusion, and health equity in Pediatric Heart Network Scholars grant funding: unique opportunities and lessons learned.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1