A large language model in solving primary healthcare issues: A potential implication for remote healthcare and medical education.

IF 1.4 Q3 EDUCATION, SCIENTIFIC DISCIPLINES Journal of Education and Health Promotion Pub Date : 2024-09-28 eCollection Date: 2024-01-01 DOI:10.4103/jehp.jehp_688_23
Himel Mondal, Rajesh De, Shaikat Mondal, Ayesha Juhi
{"title":"A large language model in solving primary healthcare issues: A potential implication for remote healthcare and medical education.","authors":"Himel Mondal, Rajesh De, Shaikat Mondal, Ayesha Juhi","doi":"10.4103/jehp.jehp_688_23","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and aim: </strong>Access to quality health care is essential, particularly in remote areas where the availability of healthcare professionals may be limited. The advancement of artificial intelligence (AI) and natural language processing (NLP) has led to the development of large language models (LLMs) that exhibit capabilities in understanding and generating human-like text. This study aimed to evaluate the performance of a LLM, ChatGPT, in addressing primary healthcare issues.</p><p><strong>Materials and methods: </strong>This study was conducted in May 2023 with ChatGPT May 12 version. A total of 30 multiple-choice questions (MCQs) related to primary health care were selected to test the proficiency of ChatGPT. These MCQs covered various topics commonly encountered in primary healthcare practice. ChatGPT answered the questions in two segments-one is choosing the single best answer of MCQ and another is supporting text for the answer. The answers to MCQs were compared with the predefined answer keys. The justifications of the answers were checked by two primary healthcare professionals on a 5-point Likert-type scale. The data were presented as number and percentage.</p><p><strong>Results: </strong>Among the 30 questions, ChatGPT provided correct responses for 28 yielding an accuracy of 93.33%. The mean score for explanation in supporting the answer was 4.58 ± 0.85. There was an inter-item correlation of 0.896, and the average measure intraclass correlation coefficient (ICC) was 0.94 (95% confidence interval 0.88-0.97) indicating a high level of interobserver agreement.</p><p><strong>Conclusion: </strong>LLMs, such as ChatGPT, show promising potential in addressing primary healthcare issues. The high accuracy rate achieved by ChatGPT in answering primary healthcare-related MCQs underscores the value of these models as resources for patients and healthcare providers in remote healthcare settings. This can also help in self-directed learning by medical students.</p>","PeriodicalId":15581,"journal":{"name":"Journal of Education and Health Promotion","volume":"13 ","pages":"362"},"PeriodicalIF":1.4000,"publicationDate":"2024-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639534/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Education and Health Promotion","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4103/jehp.jehp_688_23","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"EDUCATION, SCIENTIFIC DISCIPLINES","Score":null,"Total":0}
引用次数: 0

Abstract

Background and aim: Access to quality health care is essential, particularly in remote areas where the availability of healthcare professionals may be limited. The advancement of artificial intelligence (AI) and natural language processing (NLP) has led to the development of large language models (LLMs) that exhibit capabilities in understanding and generating human-like text. This study aimed to evaluate the performance of a LLM, ChatGPT, in addressing primary healthcare issues.

Materials and methods: This study was conducted in May 2023 with ChatGPT May 12 version. A total of 30 multiple-choice questions (MCQs) related to primary health care were selected to test the proficiency of ChatGPT. These MCQs covered various topics commonly encountered in primary healthcare practice. ChatGPT answered the questions in two segments-one is choosing the single best answer of MCQ and another is supporting text for the answer. The answers to MCQs were compared with the predefined answer keys. The justifications of the answers were checked by two primary healthcare professionals on a 5-point Likert-type scale. The data were presented as number and percentage.

Results: Among the 30 questions, ChatGPT provided correct responses for 28 yielding an accuracy of 93.33%. The mean score for explanation in supporting the answer was 4.58 ± 0.85. There was an inter-item correlation of 0.896, and the average measure intraclass correlation coefficient (ICC) was 0.94 (95% confidence interval 0.88-0.97) indicating a high level of interobserver agreement.

Conclusion: LLMs, such as ChatGPT, show promising potential in addressing primary healthcare issues. The high accuracy rate achieved by ChatGPT in answering primary healthcare-related MCQs underscores the value of these models as resources for patients and healthcare providers in remote healthcare settings. This can also help in self-directed learning by medical students.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
解决初级医疗保健问题的大型语言模型:对远程医疗和医学教育的潜在影响。
背景和目的:获得高质量的医疗保健服务至关重要,尤其是在医疗保健专业人员有限的偏远地区。随着人工智能(AI)和自然语言处理(NLP)技术的进步,大型语言模型(LLM)也随之发展起来,这些模型具有理解和生成类人文本的能力。本研究旨在评估大型语言模型 ChatGPT 在解决初级医疗保健问题方面的性能:本研究于 2023 年 5 月进行,使用的是 ChatGPT May 12 版本。共选择了 30 道与初级医疗保健相关的选择题(MCQ)来测试 ChatGPT 的熟练程度。这些 MCQ 涵盖了基层医疗实践中常见的各种主题。ChatGPT 分两部分回答问题,一部分是选择 MCQ 的最佳答案,另一部分是答案的辅助文本。MCQ 的答案与预定义答案进行了比较。答案的合理性由两名基层医护人员以 5 分李克特量表进行核对。数据以数量和百分比表示:在 30 个问题中,ChatGPT 提供了 28 个正确答案,准确率为 93.33%。支持答案的解释平均得分为 4.58 ± 0.85。项目间相关性为 0.896,平均测量类内相关系数 (ICC) 为 0.94(95% 置信区间为 0.88-0.97),表明观察者之间的一致性很高:结论:ChatGPT 等 LLM 在解决初级医疗保健问题方面显示出了巨大的潜力。ChatGPT 在回答初级医疗保健相关 MCQ 时所达到的高准确率凸显了这些模型作为远程医疗保健环境中患者和医疗保健提供者资源的价值。这也有助于医学生的自主学习。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.60
自引率
21.40%
发文量
218
审稿时长
34 weeks
期刊最新文献
Development of a minimum data set for a blockchain-based personal health records (PHRs), for patient/physician interaction in family medicine. Development of occupational well-being measurement model in the employee of Abadan University of Medical Sciences. Flipping the script: The benefits of inverted classrooms in oncology education. Health literacy and tobacco cessation among hypertensive individuals: A mixed method study. Implementation and evaluation of the impact of virtual workshops of voluntary educational assistance among occupational health and safety students: Case study, Ardabil, Iran.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1