首页 > 最新文献

JMIR Medical Education最新文献

英文 中文
Resilience Training Web App for National Health Service Keyworkers: Pilot Usability Study. 国家卫生服务核心工作者弹性训练网络应用程序:试点可用性研究。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2025-01-06 DOI: 10.2196/51101
Joanna Burrell, Felicity Baker, Matthew Russell Bennion

Background: It is well established that frontline health care staff are particularly at risk of stress. Resilience is important to help staff to manage daily challenges and to protect against burnout.

Objective: This study aimed to assess the usability and user perceptions of a resilience training web app developed to support health care keyworkers in understanding their own stress response and to help them put into place strategies to manage stress and to build resilience.

Methods: Nurses (n=7) and other keyworkers (n=1), the target users for the resilience training web app, participated in the usability evaluation. Participants completed a pretraining questionnaire capturing basic demographic information and then used the training before completing a posttraining feedback questionnaire exploring the impact and usability of the web app.

Results: From a sample of 8 keyworkers, 6 (75%) rated their current role as "sometimes" stressful. All 8 (100%) keyworkers found the training easy to understand, and 5 of 7 (71%) agreed that the training increased their understanding of both stress and resilience. Further, 6 of 8 (75%) agreed that the resilience model had helped them to understand what resilience is. Many of the keyworkers (6/8, 75%) agreed that the content was relevant to them. Furthermore, 6 of 8 (75%) agreed that they were likely to act to develop their resilience following completion of the training.

Conclusions: This study tested the usability of a web app for resilience training specifically targeting National Health Service keyworkers. This work preceded a larger scale usability study, and it is hoped this study will help guide other studies to develop similar programs in clinical settings.

背景:前线医护人员尤其容易受到压力的影响,这一点已得到公认。抗压能力对于帮助医护人员应对日常挑战和防止职业倦怠非常重要:本研究旨在评估一款抗压能力培训网络应用程序的可用性和用户感知,该应用程序旨在帮助医疗保健关键工作者了解自身的压力反应,并帮助他们制定管理压力和建立抗压能力的策略:护士(人数=7)和其他关键工作者(人数=1)是复原力培训网络应用程序的目标用户,他们参与了可用性评估。参与者填写了一份培训前调查问卷,收集了基本的人口信息,然后使用了培训,最后填写了一份培训后反馈问卷,探讨网络应用程序的影响和可用性:在抽样调查的 8 名关键工作者中,有 6 人(75%)认为他们目前的工作 "有时 "有压力。所有 8 名(100%)关键员工都认为培训内容通俗易懂,7 人中有 5 人(71%)认为培训加深了他们对压力和抗压能力的理解。此外,8 位关键工作者中有 6 位(75%)认为抗逆力模型帮助他们理解了什么是抗逆力。许多关键工作者(6/8,75%)都认为培训内容与他们息息相关。此外,8 人中有 6 人(75%)认为,在完成培训后,他们可能会采取行动来提高自己的抗逆力:本研究测试了专门针对国民健康服务关键工作者的抗逆力培训网络应用程序的可用性。这项工作是更大规模可用性研究的前奏,希望这项研究能为其他研究提供指导,以便在临床环境中开发类似的项目。
{"title":"Resilience Training Web App for National Health Service Keyworkers: Pilot Usability Study.","authors":"Joanna Burrell, Felicity Baker, Matthew Russell Bennion","doi":"10.2196/51101","DOIUrl":"10.2196/51101","url":null,"abstract":"<p><strong>Background: </strong>It is well established that frontline health care staff are particularly at risk of stress. Resilience is important to help staff to manage daily challenges and to protect against burnout.</p><p><strong>Objective: </strong>This study aimed to assess the usability and user perceptions of a resilience training web app developed to support health care keyworkers in understanding their own stress response and to help them put into place strategies to manage stress and to build resilience.</p><p><strong>Methods: </strong>Nurses (n=7) and other keyworkers (n=1), the target users for the resilience training web app, participated in the usability evaluation. Participants completed a pretraining questionnaire capturing basic demographic information and then used the training before completing a posttraining feedback questionnaire exploring the impact and usability of the web app.</p><p><strong>Results: </strong>From a sample of 8 keyworkers, 6 (75%) rated their current role as \"sometimes\" stressful. All 8 (100%) keyworkers found the training easy to understand, and 5 of 7 (71%) agreed that the training increased their understanding of both stress and resilience. Further, 6 of 8 (75%) agreed that the resilience model had helped them to understand what resilience is. Many of the keyworkers (6/8, 75%) agreed that the content was relevant to them. Furthermore, 6 of 8 (75%) agreed that they were likely to act to develop their resilience following completion of the training.</p><p><strong>Conclusions: </strong>This study tested the usability of a web app for resilience training specifically targeting National Health Service keyworkers. This work preceded a larger scale usability study, and it is hoped this study will help guide other studies to develop similar programs in clinical settings.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"11 ","pages":"e51101"},"PeriodicalIF":3.2,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11728195/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142980234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging Generative AI To Improve Motivation and Retrieval in Higher Education Learners.
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2025-01-02 DOI: 10.2196/59210
Noahlana Monzon, Franklin Alan Hays

Unstructured: Generative artificial intelligence (GAI) presents novel approaches to enhance motivation, curriculum structure and development, and learning and retrieval processes for both learners and instructors. Though a focus for this emerging technology is academic misconduct, we sought to leverage GAI in curriculum structure to facilitate educational outcomes. For instructors, GAI offers new opportunities in course design and management while reducing time requirements to evaluate outcomes and personalizing learner feedback. These include innovative instructional designs such as flipped classrooms and gamification, enriching teaching methodologies with focused and interactive approaches, and team-based exercise development, among others. For learners, GAI offers unprecedented self-directed learning opportunities, improved cognitive engagement, and effective retrieval practices, leading to enhanced autonomy, motivation, and knowledge retention. Though empowering, this evolving landscape has integration challenges and ethical considerations, including accuracy, technological evolution, loss of learner's voice, and socio-economic disparities. Our experience demonstrates that the responsible application of GAI's in educational settings will revolutionize learning practices, making education more accessible and tailored - producing positive motivational outcomes for both learners and instructors. Thus, we argue that leveraging GAI in educational settings will improve outcomes with implications extending from primary through higher and continuing education paradigms.

{"title":"Leveraging Generative AI To Improve Motivation and Retrieval in Higher Education Learners.","authors":"Noahlana Monzon, Franklin Alan Hays","doi":"10.2196/59210","DOIUrl":"https://doi.org/10.2196/59210","url":null,"abstract":"<p><strong>Unstructured: </strong>Generative artificial intelligence (GAI) presents novel approaches to enhance motivation, curriculum structure and development, and learning and retrieval processes for both learners and instructors. Though a focus for this emerging technology is academic misconduct, we sought to leverage GAI in curriculum structure to facilitate educational outcomes. For instructors, GAI offers new opportunities in course design and management while reducing time requirements to evaluate outcomes and personalizing learner feedback. These include innovative instructional designs such as flipped classrooms and gamification, enriching teaching methodologies with focused and interactive approaches, and team-based exercise development, among others. For learners, GAI offers unprecedented self-directed learning opportunities, improved cognitive engagement, and effective retrieval practices, leading to enhanced autonomy, motivation, and knowledge retention. Though empowering, this evolving landscape has integration challenges and ethical considerations, including accuracy, technological evolution, loss of learner's voice, and socio-economic disparities. Our experience demonstrates that the responsible application of GAI's in educational settings will revolutionize learning practices, making education more accessible and tailored - producing positive motivational outcomes for both learners and instructors. Thus, we argue that leveraging GAI in educational settings will improve outcomes with implications extending from primary through higher and continuing education paradigms.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":" ","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143256918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance of ChatGPT-4o on the Japanese Medical Licensing Examination: Evalution of Accuracy in Text-Only and Image-Based Questions. chatgpt - 40在日本医疗执照考试中的表现:纯文本和基于图像问题的准确性评估。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2024-12-24 DOI: 10.2196/63129
Yuki Miyazaki, Masahiro Hata, Hisaki Omori, Atsuya Hirashima, Yuta Nakagawa, Mitsuhiro Eto, Shun Takahashi, Manabu Ikeda

Unlabelled: This study evaluated the performance of ChatGPT with GPT-4 Omni (GPT-4o) on the 118th Japanese Medical Licensing Examination. The study focused on both text-only and image-based questions. The model demonstrated a high level of accuracy overall, with no significant difference in performance between text-only and image-based questions. Common errors included clinical judgment mistakes and prioritization issues, underscoring the need for further improvement in the integration of artificial intelligence into medical education and practice.

未标记:本研究评估了ChatGPT与GPT-4 Omni (gpt - 40)在第118届日本医疗执照考试中的表现。这项研究主要集中在纯文本和基于图像的问题上。该模型总体上显示出很高的准确性,纯文本和基于图像的问题在性能上没有显著差异。常见的错误包括临床判断错误和优先顺序问题,强调需要进一步改进将人工智能融入医学教育和实践。
{"title":"Performance of ChatGPT-4o on the Japanese Medical Licensing Examination: Evalution of Accuracy in Text-Only and Image-Based Questions.","authors":"Yuki Miyazaki, Masahiro Hata, Hisaki Omori, Atsuya Hirashima, Yuta Nakagawa, Mitsuhiro Eto, Shun Takahashi, Manabu Ikeda","doi":"10.2196/63129","DOIUrl":"10.2196/63129","url":null,"abstract":"<p><strong>Unlabelled: </strong>This study evaluated the performance of ChatGPT with GPT-4 Omni (GPT-4o) on the 118th Japanese Medical Licensing Examination. The study focused on both text-only and image-based questions. The model demonstrated a high level of accuracy overall, with no significant difference in performance between text-only and image-based questions. Common errors included clinical judgment mistakes and prioritization issues, underscoring the need for further improvement in the integration of artificial intelligence into medical education and practice.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"10 ","pages":"e63129"},"PeriodicalIF":3.2,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11687171/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142883112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acceptance of Virtual Reality in Trainees Using a Technology Acceptance Model: Survey Study. 基于技术接受模型的学员对虚拟现实的接受:调查研究。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2024-12-23 DOI: 10.2196/60767
Ellen Y Wang, Daniel Qian, Lijin Zhang, Brian S-K Li, Brian Ko, Michael Khoury, Meghana Renavikar, Avani Ganesan, Thomas J Caruso

Background: Virtual reality (VR) technologies have demonstrated therapeutic usefulness across a variety of health care settings. However, graduate medical education (GME) trainee perspectives on VR acceptability and usability are limited. The behavioral intentions of GME trainees with regard to VR as an anxiolytic tool have not been characterized through a theoretical framework of technology adoption.

Objective: The primary aim of this study was to apply a hybrid Technology Acceptance Model (TAM) and a United Theory of Acceptance and Use of Technology (UTAUT) model to evaluate factors that predict the behavioral intentions of GME trainees to use VR for patient anxiolysis. The secondary aim was to assess the reliability of the TAM-UTAUT.

Methods: Participants were surveyed in June 2023. GME trainees participated in a VR experience used to reduce perioperative anxiety. Participants then completed a survey evaluating demographics, perceptions, attitudes, environmental factors, and behavioral intentions that influence the adoption of new technologies.

Results: In total, 202 of 1540 GME trainees participated. Only 198 participants were included in the final analysis (12.9% participation rate). Perceptions of usefulness, ease of use, and enjoyment; social influence; and facilitating conditions predicted intention to use VR. Age, past use, price willing to pay, and curiosity were less strong predictors of intention to use. All confirmatory factor analysis models demonstrated a good fit. All domain measurements demonstrated acceptable reliability.

Conclusions: This TAM-UTAUT demonstrated validity and reliability for predicting the behavioral intentions of GME trainees to use VR as a therapeutic anxiolytic in clinical practice. Social influence and facilitating conditions are modifiable factors that present opportunities to advance VR adoption, such as fostering exposure to new technologies and offering relevant training and social encouragement. Future investigations should study the model's reliability within specialties in different geographic locations.

背景:虚拟现实(VR)技术已经在各种卫生保健环境中证明了治疗的有效性。然而,研究生医学教育(GME)实习生对虚拟现实可接受性和可用性的看法是有限的。GME受训者将虚拟现实作为一种焦虑缓解工具的行为意图尚未通过技术采用的理论框架来表征。目的:本研究的主要目的是应用混合技术接受模型(TAM)和技术接受与使用联合理论(UTAUT)模型来评估预测GME受训者使用VR进行患者焦虑缓解的行为意图的因素。第二个目的是评估TAM-UTAUT的可靠性。方法:于2023年6月对参与者进行调查。GME学员参加了一项VR体验,用于减少围手术期焦虑。然后,参与者完成了一项调查,评估影响新技术采用的人口统计、观念、态度、环境因素和行为意图。结果:1540名GME学员中,共有202人参加。最终分析纳入198人(参与率12.9%)。对有用性、易用性和乐趣的认知;社会影响;以及预测使用VR意愿的便利条件。年龄、过去的使用情况、愿意支付的价格和好奇心对使用意图的预测作用较弱。所有验证性因子分析模型均显示出良好的拟合性。所有的领域测量显示可接受的可靠性。结论:该TAM-UTAUT在预测GME受训者在临床实践中使用VR作为治疗性抗焦虑药的行为意图方面具有有效性和可靠性。社会影响和便利条件是可改变的因素,为推动虚拟现实的采用提供了机会,例如促进对新技术的接触,提供相关培训和社会鼓励。未来的调查应该研究模型在不同地理位置的专业范围内的可靠性。
{"title":"Acceptance of Virtual Reality in Trainees Using a Technology Acceptance Model: Survey Study.","authors":"Ellen Y Wang, Daniel Qian, Lijin Zhang, Brian S-K Li, Brian Ko, Michael Khoury, Meghana Renavikar, Avani Ganesan, Thomas J Caruso","doi":"10.2196/60767","DOIUrl":"10.2196/60767","url":null,"abstract":"<p><strong>Background: </strong>Virtual reality (VR) technologies have demonstrated therapeutic usefulness across a variety of health care settings. However, graduate medical education (GME) trainee perspectives on VR acceptability and usability are limited. The behavioral intentions of GME trainees with regard to VR as an anxiolytic tool have not been characterized through a theoretical framework of technology adoption.</p><p><strong>Objective: </strong>The primary aim of this study was to apply a hybrid Technology Acceptance Model (TAM) and a United Theory of Acceptance and Use of Technology (UTAUT) model to evaluate factors that predict the behavioral intentions of GME trainees to use VR for patient anxiolysis. The secondary aim was to assess the reliability of the TAM-UTAUT.</p><p><strong>Methods: </strong>Participants were surveyed in June 2023. GME trainees participated in a VR experience used to reduce perioperative anxiety. Participants then completed a survey evaluating demographics, perceptions, attitudes, environmental factors, and behavioral intentions that influence the adoption of new technologies.</p><p><strong>Results: </strong>In total, 202 of 1540 GME trainees participated. Only 198 participants were included in the final analysis (12.9% participation rate). Perceptions of usefulness, ease of use, and enjoyment; social influence; and facilitating conditions predicted intention to use VR. Age, past use, price willing to pay, and curiosity were less strong predictors of intention to use. All confirmatory factor analysis models demonstrated a good fit. All domain measurements demonstrated acceptable reliability.</p><p><strong>Conclusions: </strong>This TAM-UTAUT demonstrated validity and reliability for predicting the behavioral intentions of GME trainees to use VR as a therapeutic anxiolytic in clinical practice. Social influence and facilitating conditions are modifiable factors that present opportunities to advance VR adoption, such as fostering exposure to new technologies and offering relevant training and social encouragement. Future investigations should study the model's reliability within specialties in different geographic locations.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"10 ","pages":"e60767"},"PeriodicalIF":3.2,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11693781/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142899066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Influence of Training With Corrective Feedback Devices on Cardiopulmonary Resuscitation Skills Acquisition and Retention: Systematic Review and Meta-Analysis. 矫正反馈装置训练对心肺复苏技能习得和保留的影响:系统回顾和荟萃分析。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2024-12-19 DOI: 10.2196/59720
Abel Nicolau, Inês Jorge, Pedro Vieira-Marques, Carla Sa-Couto

Background: Several studies related to the use of corrective feedback devices in cardiopulmonary resuscitation training, with different populations, training methodologies, and equipment, present distinct results regarding the influence of this technology.

Objective: This systematic review and meta-analysis aimed to examine the impact of corrective feedback devices in cardiopulmonary resuscitation skills acquisition and retention for laypeople and health care professionals. Training duration was also studied.

Methods: The search was conducted in PubMed, Web of Science, and Scopus from January 2015 to December 2023. Eligible randomized controlled trials compared technology-based training incorporating corrective feedback with standard training. Outcomes of interest were the quality of chest compression-related components. The risk of bias was assessed using the Cochrane tool. A meta-analysis was used to explore the heterogeneity of the selected studies.

Results: In total, 20 studies were included. Overall, it was reported that corrective feedback devices used during training had a positive impact on both skills acquisition and retention. Medium to high heterogeneity was observed.

Conclusions: This systematic review and meta-analysis suggest that corrective feedback devices enhance skills acquisition and retention over time. Considering the medium to high heterogeneity observed, these findings should be interpreted with caution. More standardized, high-quality studies are needed.

Trial registration: PROSPERO CRD42021240953; https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=240953.

背景:几项关于在心肺复苏训练中使用纠正反馈装置的研究,针对不同的人群、训练方法和设备,得出了关于该技术影响的不同结果。目的:本系统综述和荟萃分析旨在研究纠正反馈装置对外行人和卫生保健专业人员心肺复苏技能习得和保留的影响。训练时间也进行了研究。方法:2015年1月- 2023年12月在PubMed、Web of Science和Scopus中进行检索。符合条件的随机对照试验比较了基于技术的结合纠正反馈的培训与标准培训。结果感兴趣的是胸按压相关成分的质量。使用Cochrane工具评估偏倚风险。荟萃分析用于探讨所选研究的异质性。结果:共纳入20项研究。总的来说,据报道,在培训期间使用的纠正反馈装置对技能的获得和保留都有积极的影响。观察到中度至高度异质性。结论:本系统综述和荟萃分析表明,随着时间的推移,纠正反馈装置可以提高技能的习得和保留。考虑到观察到的中等到高度异质性,这些发现应该谨慎解释。需要更多标准化、高质量的研究。试验注册:PROSPERO CRD42021240953;https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=240953。
{"title":"Influence of Training With Corrective Feedback Devices on Cardiopulmonary Resuscitation Skills Acquisition and Retention: Systematic Review and Meta-Analysis.","authors":"Abel Nicolau, Inês Jorge, Pedro Vieira-Marques, Carla Sa-Couto","doi":"10.2196/59720","DOIUrl":"10.2196/59720","url":null,"abstract":"<p><strong>Background: </strong>Several studies related to the use of corrective feedback devices in cardiopulmonary resuscitation training, with different populations, training methodologies, and equipment, present distinct results regarding the influence of this technology.</p><p><strong>Objective: </strong>This systematic review and meta-analysis aimed to examine the impact of corrective feedback devices in cardiopulmonary resuscitation skills acquisition and retention for laypeople and health care professionals. Training duration was also studied.</p><p><strong>Methods: </strong>The search was conducted in PubMed, Web of Science, and Scopus from January 2015 to December 2023. Eligible randomized controlled trials compared technology-based training incorporating corrective feedback with standard training. Outcomes of interest were the quality of chest compression-related components. The risk of bias was assessed using the Cochrane tool. A meta-analysis was used to explore the heterogeneity of the selected studies.</p><p><strong>Results: </strong>In total, 20 studies were included. Overall, it was reported that corrective feedback devices used during training had a positive impact on both skills acquisition and retention. Medium to high heterogeneity was observed.</p><p><strong>Conclusions: </strong>This systematic review and meta-analysis suggest that corrective feedback devices enhance skills acquisition and retention over time. Considering the medium to high heterogeneity observed, these findings should be interpreted with caution. More standardized, high-quality studies are needed.</p><p><strong>Trial registration: </strong>PROSPERO CRD42021240953; https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=240953.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"10 ","pages":"e59720"},"PeriodicalIF":3.2,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11695954/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142855615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Long-Term Knowledge Retention of Biochemistry Among Medical Students in Riyadh, Saudi Arabia: Cross-Sectional Survey. 沙特阿拉伯利雅得医学生对生物化学知识的长期保留:横断面调查。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2024-12-16 DOI: 10.2196/56132
Nimer Mehyar, Mohammed Awawdeh, Aamir Omair, Adi Aldawsari, Abdullah Alshudukhi, Ahmed Alzeer, Khaled Almutairi, Sultan Alsultan
<p><strong>Background: </strong>Biochemistry is a cornerstone of medical education. Its knowledge is integral to the understanding of complex biological processes and how they are applied in several areas in health care. Also, its significance is reflected in the way it informs the practice of medicine, which can guide and help in both diagnosis and treatment. However, the retention of biochemistry knowledge over time remains a dilemma. Long-term retention of such crucial information is extremely important, as it forms the foundation upon which clinical skills are developed and refined. The effectiveness of biochemistry education, and consequently its long-term retention, is influenced by several factors. Educational methods play a critical role; interactional and integrative teaching approaches have been suggested to enhance retention compared with traditional didactic methods. The frequency and context in which biochemistry knowledge is applied in clinical settings can significantly impact its retention. Practical application reinforces theoretical understanding, making the knowledge more accessible in the long term. Prior knowledge (familiarity) of information suggests that it is stored in long-term memory, which makes its retention in the long term easier to recall.</p><p><strong>Objectives: </strong>This investigation was conducted at King Saud bin Abdulaziz University for Health Sciences in Riyadh, Saudi Arabia. The aim of the study is to understand the dynamics of long-term retention of biochemistry among medical students. Specifically, it looks for the association between students' familiarity with biochemistry content and actual knowledge retention levels.</p><p><strong>Methods: </strong>A cross-sectional correlational survey involving 240 students from King Saud bin Abdulaziz University for Health Sciences was conducted. Participants were recruited via nonprobability convenience sampling. A validated biochemistry assessment tool with 20 questions was used to gauge students' retention in biomolecules, catalysis, bioenergetics, and metabolism. To assess students' familiarity with the knowledge content of test questions, each question is accompanied by options that indicate students' prior knowledge of the content of the question. Statistical analyses tests such as Mann-Whitney U test, Kruskal-Wallis test, and chi-square tests were used.</p><p><strong>Results: </strong>Our findings revealed a significant correlation between students' familiarity of the content with their knowledge retention in the biomolecules (r=0.491; P<.001), catalysis (r=0.500; P<.001), bioenergetics (r=0.528; P<.001), and metabolism (r=0.564; P<.001) biochemistry knowledge domains.</p><p><strong>Conclusions: </strong>This study highlights the significance of familiarity (prior knowledge) in evaluating the retention of biochemistry knowledge. Although limited in terms of generalizability and inherent biases, the research highlights the crucial significance of student's
背景:生物化学是医学教育的基石。它的知识对于理解复杂的生物过程以及如何将其应用于卫生保健的几个领域是不可或缺的。此外,它的重要性还体现在它为医学实践提供信息的方式上,这可以指导和帮助诊断和治疗。然而,随着时间的推移,生物化学知识的保留仍然是一个难题。长期保留这些关键信息是极其重要的,因为它构成了临床技能发展和完善的基础。生物化学教育的有效性,从而使其长期保持,受到几个因素的影响。教育方法起着关键作用;与传统的教学方法相比,互动式和一体化的教学方法可以提高学生的记忆力。生物化学知识应用于临床环境的频率和背景可以显著影响其保留。实际应用强化了理论理解,使知识在长期内更容易获得。对信息的先验知识(熟悉程度)表明它被储存在长期记忆中,这使得它在长期内更容易被回忆起来。目的:本调查在沙特阿拉伯利雅得的沙特国王本阿卜杜勒阿齐兹健康科学大学进行。本研究的目的是了解医学生生物化学长期记忆的动态。具体来说,它寻找学生对生物化学内容的熟悉程度与实际知识保留水平之间的关系。方法:采用横断面相关调查方法,对沙特国王本阿卜杜勒阿齐兹健康科学大学240名学生进行调查。参与者通过非概率方便抽样招募。一个经过验证的包含20个问题的生物化学评估工具被用来衡量学生在生物分子、催化、生物能量学和代谢方面的记忆。为了评估学生对试题知识内容的熟悉程度,每个问题都附有选项,表明学生对问题内容的先验知识。采用Mann-Whitney U检验、Kruskal-Wallis检验、卡方检验等统计分析检验。结果:学生对内容的熟悉程度与生物分子知识的保留程度之间存在显著的相关关系(r=0.491;结论:本研究强调了熟悉度(先验知识)在评价生物化学知识保留中的重要性。本研究虽然在概括性和固有偏差方面存在局限性,但强调了学生对生物化学几个领域的实际知识记忆的熟悉程度的重要意义。这些结果可能被教育工作者用来定制教学方法,以提高学生对生物化学信息的长期记忆,提高他们的临床表现。
{"title":"Long-Term Knowledge Retention of Biochemistry Among Medical Students in Riyadh, Saudi Arabia: Cross-Sectional Survey.","authors":"Nimer Mehyar, Mohammed Awawdeh, Aamir Omair, Adi Aldawsari, Abdullah Alshudukhi, Ahmed Alzeer, Khaled Almutairi, Sultan Alsultan","doi":"10.2196/56132","DOIUrl":"10.2196/56132","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Biochemistry is a cornerstone of medical education. Its knowledge is integral to the understanding of complex biological processes and how they are applied in several areas in health care. Also, its significance is reflected in the way it informs the practice of medicine, which can guide and help in both diagnosis and treatment. However, the retention of biochemistry knowledge over time remains a dilemma. Long-term retention of such crucial information is extremely important, as it forms the foundation upon which clinical skills are developed and refined. The effectiveness of biochemistry education, and consequently its long-term retention, is influenced by several factors. Educational methods play a critical role; interactional and integrative teaching approaches have been suggested to enhance retention compared with traditional didactic methods. The frequency and context in which biochemistry knowledge is applied in clinical settings can significantly impact its retention. Practical application reinforces theoretical understanding, making the knowledge more accessible in the long term. Prior knowledge (familiarity) of information suggests that it is stored in long-term memory, which makes its retention in the long term easier to recall.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objectives: &lt;/strong&gt;This investigation was conducted at King Saud bin Abdulaziz University for Health Sciences in Riyadh, Saudi Arabia. The aim of the study is to understand the dynamics of long-term retention of biochemistry among medical students. Specifically, it looks for the association between students' familiarity with biochemistry content and actual knowledge retention levels.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;A cross-sectional correlational survey involving 240 students from King Saud bin Abdulaziz University for Health Sciences was conducted. Participants were recruited via nonprobability convenience sampling. A validated biochemistry assessment tool with 20 questions was used to gauge students' retention in biomolecules, catalysis, bioenergetics, and metabolism. To assess students' familiarity with the knowledge content of test questions, each question is accompanied by options that indicate students' prior knowledge of the content of the question. Statistical analyses tests such as Mann-Whitney U test, Kruskal-Wallis test, and chi-square tests were used.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Our findings revealed a significant correlation between students' familiarity of the content with their knowledge retention in the biomolecules (r=0.491; P&lt;.001), catalysis (r=0.500; P&lt;.001), bioenergetics (r=0.528; P&lt;.001), and metabolism (r=0.564; P&lt;.001) biochemistry knowledge domains.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;This study highlights the significance of familiarity (prior knowledge) in evaluating the retention of biochemistry knowledge. Although limited in terms of generalizability and inherent biases, the research highlights the crucial significance of student's","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"10 ","pages":"e56132"},"PeriodicalIF":3.2,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11665479/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142829567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Topics and Trends of Health Informatics Education Research: Scientometric Analysis. 健康资讯教育研究的主题与趋势:科学计量分析。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2024-12-11 DOI: 10.2196/58165
Qing Han

Background: Academic and educational institutions are making significant contributions toward training health informatics professionals. As research in health informatics education (HIE) continues to grow, it is useful to have a clearer understanding of this research field.

Objective: This study aims to comprehensively explore the research topics and trends of HIE from 2014 to 2023. Specifically, it aims to explore (1) the trends of annual articles, (2) the prolific countries/regions, institutions, and publication sources, (3) the scientific collaborations of countries/regions and institutions, and (4) the major research themes and their developmental tendencies.

Methods: Using publications in Web of Science Core Collection, a scientometric analysis of 575 articles related to the field of HIE was conducted. The structural topic model was used to identify topics discussed in the literature and to reveal the topic structure and evolutionary trends of HIE research.

Results: Research interest in HIE has clearly increased from 2014 to 2023, and is continually expanding. The United States was found to be the most prolific country in this field. Harvard University was found to be the leading institution with the highest publication productivity. Journal of Medical Internet Research, Journal of The American Medical Informatics Association, and Applied Clinical Informatics were the top 3 journals with the highest articles in this field. Countries/regions and institutions having higher levels of international collaboration were more impactful. Research on HIE could be modeled into 7 topics related to the following areas: clinical (130/575, 22.6%), mobile application (123/575, 21.4%), consumer (99/575, 17.2%), teaching (61/575, 10.6%), public health (56/575, 9.7%), discipline (55/575, 9.6%), and nursing (51/575, 8.9%). The results clearly indicate the unique foci for each year, depicting the process of development for health informatics research.

Conclusions: This is believed to be the first scientometric analysis exploring the research topics and trends in HIE. This study provides useful insights and implications, and the findings could be used as a guide for HIE contributors.

背景:学术和教育机构在培养卫生信息学专业人员方面做出了重大贡献。随着健康信息教育(HIE)研究的不断发展,对这一研究领域有一个更清晰的认识是有益的。目的:本研究旨在全面探讨2014 - 2023年HIE的研究课题及趋势。具体而言,它旨在探索(1)年度文章趋势,(2)高产国家/地区、机构和出版来源,(3)国家/地区和机构的科学合作情况,(4)主要研究主题及其发展趋势。方法:利用Web of Science核心馆藏的出版物,对与HIE领域相关的575篇文献进行科学计量学分析。结构主题模型用于识别文献中讨论的主题,揭示HIE研究的主题结构和演变趋势。结果:从2014年到2023年,HIE的研究兴趣明显增加,并不断扩大。美国被认为是这一领域最多产的国家。哈佛大学被发现是出版效率最高的领先机构。《医学互联网研究杂志》、《美国医学信息学协会杂志》和《应用临床信息学》是该领域文章数量最多的前3大期刊。国际合作水平较高的国家/地区和机构更具影响力。HIE研究可分为7个主题:临床(130/575,22.6%)、移动应用(123/575,21.4%)、消费者(99/575,17.2%)、教学(61/575,10.6%)、公共卫生(56/575,9.7%)、学科(55/575,9.6%)和护理(51/575,8.9%)。结果清楚地显示了每年的独特重点,描绘了卫生信息学研究的发展过程。结论:这是首次对HIE的研究主题和趋势进行科学计量分析。本研究提供了有用的见解和启示,研究结果可作为HIE贡献者的指南。
{"title":"Topics and Trends of Health Informatics Education Research: Scientometric Analysis.","authors":"Qing Han","doi":"10.2196/58165","DOIUrl":"10.2196/58165","url":null,"abstract":"<p><strong>Background: </strong>Academic and educational institutions are making significant contributions toward training health informatics professionals. As research in health informatics education (HIE) continues to grow, it is useful to have a clearer understanding of this research field.</p><p><strong>Objective: </strong>This study aims to comprehensively explore the research topics and trends of HIE from 2014 to 2023. Specifically, it aims to explore (1) the trends of annual articles, (2) the prolific countries/regions, institutions, and publication sources, (3) the scientific collaborations of countries/regions and institutions, and (4) the major research themes and their developmental tendencies.</p><p><strong>Methods: </strong>Using publications in Web of Science Core Collection, a scientometric analysis of 575 articles related to the field of HIE was conducted. The structural topic model was used to identify topics discussed in the literature and to reveal the topic structure and evolutionary trends of HIE research.</p><p><strong>Results: </strong>Research interest in HIE has clearly increased from 2014 to 2023, and is continually expanding. The United States was found to be the most prolific country in this field. Harvard University was found to be the leading institution with the highest publication productivity. Journal of Medical Internet Research, Journal of The American Medical Informatics Association, and Applied Clinical Informatics were the top 3 journals with the highest articles in this field. Countries/regions and institutions having higher levels of international collaboration were more impactful. Research on HIE could be modeled into 7 topics related to the following areas: clinical (130/575, 22.6%), mobile application (123/575, 21.4%), consumer (99/575, 17.2%), teaching (61/575, 10.6%), public health (56/575, 9.7%), discipline (55/575, 9.6%), and nursing (51/575, 8.9%). The results clearly indicate the unique foci for each year, depicting the process of development for health informatics research.</p><p><strong>Conclusions: </strong>This is believed to be the first scientometric analysis exploring the research topics and trends in HIE. This study provides useful insights and implications, and the findings could be used as a guide for HIE contributors.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"10 ","pages":"e58165"},"PeriodicalIF":3.2,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11669873/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142814292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ChatGPT May Improve Access to Language-Concordant Care for Patients With Non-English Language Preferences. ChatGPT 可改善非英语语言偏好患者获得语言协调护理的机会。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2024-12-10 DOI: 10.2196/51435
Fiatsogbe Dzuali, Kira Seiger, Roberto Novoa, Maria Aleshin, Joyce Teng, Jenna Lester, Roxana Daneshjou

Unlabelled: This study evaluated the accuracy of ChatGPT in translating English patient education materials into Spanish, Mandarin, and Russian. While ChatGPT shows promise for translating Spanish and Russian medical information, Mandarin translations require further refinement, highlighting the need for careful review of AI-generated translations before clinical use.

未标记:本研究评估ChatGPT将英语患者教育材料翻译成西班牙语、普通话和俄语的准确性。虽然ChatGPT有望翻译西班牙语和俄语医疗信息,但中文翻译需要进一步完善,这突出了在临床使用之前仔细审查人工智能生成的翻译的必要性。
{"title":"ChatGPT May Improve Access to Language-Concordant Care for Patients With Non-English Language Preferences.","authors":"Fiatsogbe Dzuali, Kira Seiger, Roberto Novoa, Maria Aleshin, Joyce Teng, Jenna Lester, Roxana Daneshjou","doi":"10.2196/51435","DOIUrl":"10.2196/51435","url":null,"abstract":"<p><strong>Unlabelled: </strong>This study evaluated the accuracy of ChatGPT in translating English patient education materials into Spanish, Mandarin, and Russian. While ChatGPT shows promise for translating Spanish and Russian medical information, Mandarin translations require further refinement, highlighting the need for careful review of AI-generated translations before clinical use.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"10 ","pages":"e51435"},"PeriodicalIF":3.2,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11651640/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142829563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of a Computer-Based Morphological Analysis Method for Free-Text Responses in the General Medicine In-Training Examination: Algorithm Validation Study. 全科医学在职考试中基于计算机的自由文本响应形态分析方法的评价:算法验证研究。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2024-12-05 DOI: 10.2196/52068
Daiki Yokokawa, Kiyoshi Shikino, Yuji Nishizaki, Sho Fukui, Yasuharu Tokuda

Background: The General Medicine In-Training Examination (GM-ITE) tests clinical knowledge in a 2-year postgraduate residency program in Japan. In the academic year 2021, as a domain of medical safety, the GM-ITE included questions regarding the diagnosis from medical history and physical findings through video viewing and the skills in presenting a case. Examinees watched a video or audio recording of a patient examination and provided free-text responses. However, the human cost of scoring free-text answers may limit the implementation of GM-ITE. A simple morphological analysis and word-matching model, thus, can be used to score free-text responses.

Objective: This study aimed to compare human versus computer scoring of free-text responses and qualitatively evaluate the discrepancies between human- and machine-generated scores to assess the efficacy of machine scoring.

Methods: After obtaining consent for participation in the study, the authors used text data from residents who voluntarily answered the GM-ITE patient reproduction video-based questions involving simulated patients. The GM-ITE used video-based questions to simulate a patient's consultation in the emergency room with a diagnosis of pulmonary embolism following a fracture. Residents provided statements for the case presentation. We obtained human-generated scores by collating the results of 2 independent scorers and machine-generated scores by converting the free-text responses into a word sequence through segmentation and morphological analysis and matching them with a prepared list of correct answers in 2022.

Results: Of the 104 responses collected-63 for postgraduate year 1 and 41 for postgraduate year 2-39 cases remained for final analysis after excluding invalid responses. The authors found discrepancies between human and machine scoring in 14 questions (7.2%); some were due to shortcomings in machine scoring that could be resolved by maintaining a list of correct words and dictionaries, whereas others were due to human error.

Conclusions: Machine scoring is comparable to human scoring. It requires a simple program and calibration but can potentially reduce the cost of scoring free-text responses.

背景:在日本,全科医学培训考试(GM-ITE)测试为期两年的研究生住院医师项目的临床知识。在2021学年,作为医疗安全的一个领域,GM-ITE包括有关通过视频观看从病史和身体检查结果进行诊断的问题,以及介绍病例的技能。受试者观看患者检查的视频或录音,并提供自由文本回答。然而,为自由文本答案打分的人力成本可能会限制GM-ITE的实施。因此,一个简单的词形分析和单词匹配模型可以用来对自由文本响应进行评分。目的:本研究旨在比较人类和计算机对自由文本回答的评分,并定性地评估人类和机器生成的评分之间的差异,以评估机器评分的有效性。方法:在获得参与研究的同意后,作者使用居民自愿回答GM-ITE患者再现视频问题的文本数据,这些问题涉及模拟患者。GM-ITE使用基于视频的问题来模拟骨折后诊断为肺栓塞的患者在急诊室的咨询。居民们为案件陈述提供了陈述。我们将2个独立评分者的结果与机器生成的分数进行比对,通过分词和形态分析将自由文本回答转换成单词序列,并与事先准备好的2022年正确答案列表进行匹配,得到人工生成的分数。结果:在收集到的104份回复中(研究生一年级63份,研究生二年级41份),剔除无效回复后,还剩下39份用于最终分析。作者发现,人类和机器在14个问题上的得分存在差异(7.2%);一些是由于机器评分的缺点,可以通过维护正确的单词和字典列表来解决,而另一些是由于人为错误。结论:机器评分与人类评分相当。它需要一个简单的程序和校准,但可以潜在地降低对自由文本响应评分的成本。
{"title":"Evaluation of a Computer-Based Morphological Analysis Method for Free-Text Responses in the General Medicine In-Training Examination: Algorithm Validation Study.","authors":"Daiki Yokokawa, Kiyoshi Shikino, Yuji Nishizaki, Sho Fukui, Yasuharu Tokuda","doi":"10.2196/52068","DOIUrl":"10.2196/52068","url":null,"abstract":"<p><strong>Background: </strong>The General Medicine In-Training Examination (GM-ITE) tests clinical knowledge in a 2-year postgraduate residency program in Japan. In the academic year 2021, as a domain of medical safety, the GM-ITE included questions regarding the diagnosis from medical history and physical findings through video viewing and the skills in presenting a case. Examinees watched a video or audio recording of a patient examination and provided free-text responses. However, the human cost of scoring free-text answers may limit the implementation of GM-ITE. A simple morphological analysis and word-matching model, thus, can be used to score free-text responses.</p><p><strong>Objective: </strong>This study aimed to compare human versus computer scoring of free-text responses and qualitatively evaluate the discrepancies between human- and machine-generated scores to assess the efficacy of machine scoring.</p><p><strong>Methods: </strong>After obtaining consent for participation in the study, the authors used text data from residents who voluntarily answered the GM-ITE patient reproduction video-based questions involving simulated patients. The GM-ITE used video-based questions to simulate a patient's consultation in the emergency room with a diagnosis of pulmonary embolism following a fracture. Residents provided statements for the case presentation. We obtained human-generated scores by collating the results of 2 independent scorers and machine-generated scores by converting the free-text responses into a word sequence through segmentation and morphological analysis and matching them with a prepared list of correct answers in 2022.</p><p><strong>Results: </strong>Of the 104 responses collected-63 for postgraduate year 1 and 41 for postgraduate year 2-39 cases remained for final analysis after excluding invalid responses. The authors found discrepancies between human and machine scoring in 14 questions (7.2%); some were due to shortcomings in machine scoring that could be resolved by maintaining a list of correct words and dictionaries, whereas others were due to human error.</p><p><strong>Conclusions: </strong>Machine scoring is comparable to human scoring. It requires a simple program and calibration but can potentially reduce the cost of scoring free-text responses.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"10 ","pages":"e52068"},"PeriodicalIF":3.2,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11637224/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142787214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance of GPT-3.5 and GPT-4 on the Korean Pharmacist Licensing Examination: Comparison Study. GPT-3.5与GPT-4在韩国药师资格考试中的表现:比较研究
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2024-12-04 DOI: 10.2196/57451
Hye Kyung Jin, EunYoung Kim

Background: ChatGPT, a recently developed artificial intelligence chatbot and a notable large language model, has demonstrated improved performance on medical field examinations. However, there is currently little research on its efficacy in languages other than English or in pharmacy-related examinations.

Objective: This study aimed to evaluate the performance of GPT models on the Korean Pharmacist Licensing Examination (KPLE).

Methods: We evaluated the percentage of correct answers provided by 2 different versions of ChatGPT (GPT-3.5 and GPT-4) for all multiple-choice single-answer KPLE questions, excluding image-based questions. In total, 320, 317, and 323 questions from the 2021, 2022, and 2023 KPLEs, respectively, were included in the final analysis, which consisted of 4 units: Biopharmacy, Industrial Pharmacy, Clinical and Practical Pharmacy, and Medical Health Legislation.

Results: The 3-year average percentage of correct answers was 86.5% (830/960) for GPT-4 and 60.7% (583/960) for GPT-3.5. GPT model accuracy was highest in Biopharmacy (GPT-3.5 77/96, 80.2% in 2022; GPT-4 87/90, 96.7% in 2021) and lowest in Medical Health Legislation (GPT-3.5 8/20, 40% in 2022; GPT-4 12/20, 60% in 2022). Additionally, when comparing the performance of artificial intelligence with that of human participants, pharmacy students outperformed GPT-3.5 but not GPT-4.

Conclusions: In the last 3 years, GPT models have performed very close to or exceeded the passing threshold for the KPLE. This study demonstrates the potential of large language models in the pharmacy domain; however, extensive research is needed to evaluate their reliability and ensure their secure application in pharmacy contexts due to several inherent challenges. Addressing these limitations could make GPT models more effective auxiliary tools for pharmacy education.

背景:ChatGPT是最近发展起来的人工智能聊天机器人,也是一个著名的大型语言模型,在医学现场检查中表现出了提高的性能。然而,目前对其在英语以外的语言或药学相关考试中的有效性的研究很少。目的:评价GPT模型在韩国药师资格考试(KPLE)中的表现。方法:我们评估了两个不同版本的ChatGPT (GPT-3.5和GPT-4)对所有选择单答案的KPLE问题提供的正确答案百分比,不包括基于图像的问题。最终分析的问题分别来自2021年、2022年和2023年的kple,题目分别为320、317和323个,包括4个单元:生物药剂学、工业药剂学、临床与实用药剂学和医疗卫生立法。结果:GPT-4和GPT-3.5的3年平均正确率分别为86.5%(830/960)和60.7%(583/960)。GPT模型准确率最高的是生物药剂学(GPT-3.5 77/96, 2022年为80.2%;GPT-4为87/90,2021年为96.7%),医疗卫生立法最低(GPT-3.5为8/20,2022年为40%;GPT-4 12/20, 2022年60%)。此外,当将人工智能的表现与人类参与者的表现进行比较时,药学专业学生的表现优于GPT-3.5,而不是GPT-4。结论:在过去3年中,GPT模型的表现非常接近或超过了KPLE的通过阈值。本研究展示了大型语言模型在药学领域的潜力;然而,由于一些固有的挑战,需要广泛的研究来评估它们的可靠性,并确保它们在药学环境中的安全应用。解决这些局限性可以使GPT模型成为药学教育更有效的辅助工具。
{"title":"Performance of GPT-3.5 and GPT-4 on the Korean Pharmacist Licensing Examination: Comparison Study.","authors":"Hye Kyung Jin, EunYoung Kim","doi":"10.2196/57451","DOIUrl":"10.2196/57451","url":null,"abstract":"<p><strong>Background: </strong>ChatGPT, a recently developed artificial intelligence chatbot and a notable large language model, has demonstrated improved performance on medical field examinations. However, there is currently little research on its efficacy in languages other than English or in pharmacy-related examinations.</p><p><strong>Objective: </strong>This study aimed to evaluate the performance of GPT models on the Korean Pharmacist Licensing Examination (KPLE).</p><p><strong>Methods: </strong>We evaluated the percentage of correct answers provided by 2 different versions of ChatGPT (GPT-3.5 and GPT-4) for all multiple-choice single-answer KPLE questions, excluding image-based questions. In total, 320, 317, and 323 questions from the 2021, 2022, and 2023 KPLEs, respectively, were included in the final analysis, which consisted of 4 units: Biopharmacy, Industrial Pharmacy, Clinical and Practical Pharmacy, and Medical Health Legislation.</p><p><strong>Results: </strong>The 3-year average percentage of correct answers was 86.5% (830/960) for GPT-4 and 60.7% (583/960) for GPT-3.5. GPT model accuracy was highest in Biopharmacy (GPT-3.5 77/96, 80.2% in 2022; GPT-4 87/90, 96.7% in 2021) and lowest in Medical Health Legislation (GPT-3.5 8/20, 40% in 2022; GPT-4 12/20, 60% in 2022). Additionally, when comparing the performance of artificial intelligence with that of human participants, pharmacy students outperformed GPT-3.5 but not GPT-4.</p><p><strong>Conclusions: </strong>In the last 3 years, GPT models have performed very close to or exceeded the passing threshold for the KPLE. This study demonstrates the potential of large language models in the pharmacy domain; however, extensive research is needed to evaluate their reliability and ensure their secure application in pharmacy contexts due to several inherent challenges. Addressing these limitations could make GPT models more effective auxiliary tools for pharmacy education.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"10 ","pages":"e57451"},"PeriodicalIF":3.2,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11633516/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142773237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Education
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1