Michael N Gritti, Rahil Prajapati, Dolev Yissar, Conall T Morgan
{"title":"Precision of artificial intelligence in paediatric cardiology multimodal image interpretation.","authors":"Michael N Gritti, Rahil Prajapati, Dolev Yissar, Conall T Morgan","doi":"10.1017/S1047951124036035","DOIUrl":null,"url":null,"abstract":"<p><p>Multimodal imaging is crucial for diagnosis and treatment in paediatric cardiology. However, the proficiency of artificial intelligence chatbots, like ChatGPT-4, in interpreting these images has not been assessed. This cross-sectional study evaluates the precision of ChatGPT-4 in interpreting multimodal images for paediatric cardiology knowledge assessment, including echocardiograms, angiograms, X-rays, and electrocardiograms. One hundred multiple-choice questions with accompanying images from the textbook <i>Pediatric Cardiology Board Review</i> were randomly selected. The chatbot was prompted to answer these questions with and without the accompanying images. Statistical analysis was done using <i>X</i><sup>2</sup>, Fisher's exact, and McNemar tests. Results showed that ChatGPT-4 answered 41% of questions with images correctly, performing best on those with electrocardiograms (54%) and worst on those with angiograms (29%). Without the images, ChatGPT-4's performance was similar at 37% (difference = 4%, 95% confidence interval (CI) -9.4% to 17.2%, <i>p</i> = 0.56). The chatbot performed significantly better when provided the image of an electrocardiogram than without (difference = 18, 95% CI 4.0% to 31.9%, <i>p</i> < 0.04). In cases of incorrect answers, ChatGPT-4 was more inconsistent with an image than without (difference = 21%, 95% CI 3.5% to 36.9%, <i>p</i> < 0.02). In conclusion, ChatGPT-4 performed poorly in answering image-based multiple-choice questions in paediatric cardiology. Its accuracy in answering questions with images was similar to without, indicating limited multimodal image interpretation capabilities. Substantial training is required before clinical integration can be considered. Further research is needed to assess the clinical reasoning skills and progression of ChatGPT in paediatric cardiology for clinical and academic utility.</p>","PeriodicalId":9435,"journal":{"name":"Cardiology in the Young","volume":" ","pages":"1-6"},"PeriodicalIF":0.9000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cardiology in the Young","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1017/S1047951124036035","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Multimodal imaging is crucial for diagnosis and treatment in paediatric cardiology. However, the proficiency of artificial intelligence chatbots, such as ChatGPT-4, in interpreting these images has not been assessed. This cross-sectional study evaluates the precision of ChatGPT-4 in interpreting multimodal images for paediatric cardiology knowledge assessment, including echocardiograms, angiograms, X-rays, and electrocardiograms. One hundred multiple-choice questions with accompanying images from the textbook Pediatric Cardiology Board Review were randomly selected. The chatbot was prompted to answer these questions with and without the accompanying images. Statistical analysis was done using chi-squared, Fisher's exact, and McNemar tests. Results showed that ChatGPT-4 answered 41% of questions with images correctly, performing best on those with electrocardiograms (54%) and worst on those with angiograms (29%). Without the images, ChatGPT-4's performance was similar at 37% (difference = 4%, 95% confidence interval (CI) -9.4% to 17.2%, p = 0.56). The chatbot performed significantly better when provided with the image of an electrocardiogram than without it (difference = 18%, 95% CI 4.0% to 31.9%, p < 0.04). In cases of incorrect answers, ChatGPT-4 was more inconsistent with an image than without (difference = 21%, 95% CI 3.5% to 36.9%, p < 0.02). In conclusion, ChatGPT-4 performed poorly in answering image-based multiple-choice questions in paediatric cardiology. Its accuracy in answering questions with images was similar to that without, indicating limited multimodal image interpretation capabilities. Substantial training is required before clinical integration can be considered. Further research is needed to assess the clinical reasoning skills and progression of ChatGPT in paediatric cardiology for clinical and academic utility.
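Because each question was answered twice by the same model (with and without its image), the with-image versus without-image accuracy comparison is a paired-proportions problem, which is what the McNemar test addresses. The sketch below shows how such a paired comparison might be run in Python with statsmodels; the 2x2 cell counts are hypothetical, chosen only so the margins match the reported accuracies (41% with images, 37% without, n = 100), since the abstract does not report the discordant-pair counts, and statsmodels is an assumed tooling choice not named in the study.

```python
# Minimal sketch of a McNemar test for paired accuracy, assuming hypothetical
# discordant-pair counts consistent with the reported marginals (41 vs 37 of 100).
from statsmodels.stats.contingency_tables import mcnemar

# Rows: with-image answer (correct, incorrect)
# Columns: without-image answer (correct, incorrect)
table = [
    [30, 11],  # hypothetical: 30 correct both ways, 11 correct only with the image
    [7, 52],   # hypothetical: 7 correct only without the image, 52 incorrect both ways
]

# exact=True applies an exact binomial test to the discordant pairs (11 vs 7),
# which is appropriate when discordant counts are small.
result = mcnemar(table, exact=True)
print(f"McNemar statistic = {result.statistic}, p = {result.pvalue:.3f}")
```

With these illustrative counts the test is driven entirely by the 18 discordant questions, which is why a 4% marginal difference over 100 items can easily fail to reach significance, as reported.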
Journal introduction:
Cardiology in the Young is devoted to cardiovascular issues affecting the young, and the older patient suffering the sequelae of congenital heart disease or other cardiac diseases acquired in childhood. The journal serves the interests of all professionals concerned with these topics. By design, the journal is international and multidisciplinary in its approach, and members of the editorial board take an active role in its mission, helping to make it the essential journal in paediatric cardiology. All aspects of paediatric cardiology are covered within the journal. The content includes original articles, brief reports, editorials, reviews, and papers devoted to continuing professional development.