首页 > 最新文献

JMIR Medical Education最新文献

英文 中文
AI- vs Human-Based Assessment of Medical Interview Transcripts in a Generative AI-Simulated Patient System: Cross-Sectional Validation Study. 生成式人工智能模拟患者系统中医疗访谈记录的人工智能与人类评估:横断面验证研究。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2026-02-17 DOI: 10.2196/81673
Hiromizu Takahashi, Kiyoshi Shikino, Takeshi Kondo, Yuji Yamada, Yoshitaka Tomoda, Minoru Kishi, Yuki Aiyama, Sho Nagai, Akiko Enomoto, Yoshinori Tokushima, Takahiro Shinohara, Fumiaki Sano, Takeshi Matsuura, Rikiya Watanabe, Toshio Naito
<p><strong>Background: </strong>Generative artificial intelligence (AI) is increasingly used in medical education, including AI-based virtual patients to improve interview skills. However, how much AI-based assessment (ABA) differs from human-based assessment (HBA) remains unclear.</p><p><strong>Objective: </strong>This study aimed to compare the quality of clinical interview assessments generated via an ABA (GPT-o1 Pro [ABA-o1] and GPT-5 Pro [ABA-5]) with those generated via an HBA conducted by clinical instructors in an AI-based virtual patient setting. We also examined whether AI reduced evaluation time and assessed agreement across participants with different levels of clinical experience.</p><p><strong>Methods: </strong>A standardized case of leg weakness was implemented in an AI-based virtual patient. Seven participants (2 medical students, 3 residents, and 2 attending physicians) each conducted an interview with the AI patient, and transcripts were scored using the 25-item Master Interview Rating Scale (0-125). Three evaluation strategies were compared. First, GPT-o1 Pro and GPT-5 Pro scored each transcript 5 times with different random seeds to test case specificity. Processing time was logged automatically. Second, 5 blinded clinical instructors independently rated each transcript once using the same rubric. Third, reliability metrics were applied. For AI, intraclass correlation coefficients (ICCs) quantified repeatability. For humans, the ICC(2,1) was calculated. Agreement was quantified using the Pearson r, Lin concordance correlation coefficient, Bland-Altman limits of agreement, Cronbach α, and ICC. Time efficiency was expressed as mean minutes per transcript and relative percentage reduction.</p><p><strong>Results: </strong>Mean interview scores were similar across methods (ABA-o1: mean 52.1, SD 6.9; ABA-5: mean 53.2, SD 6.8; HBA: mean 53.7, SD 6.8). Agreement between ABA and HBA was strong (r=0.90; concordance correlation coefficient=0.88) with minimal bias (ABA-o1: mean 0.4, SD 2.7; ABA-5: mean 1.5, SD 5.2; limits of agreement: -4.9 to 5.7 for ABA-o1 and -8.6 to 11.7 for ABA-5). The Cronbach α was 0.81 (ABA-o1), 0.86 (ABA-5), and 0.80 (HBA); the ICC(3,1) was 0.77 (ABA-o1) and 0.82 (ABA-5); and the ICC(2,1) was 0.38 (HBA). The coefficient of variation for ABA was approximately half that of HBA (6.6% vs 13.9%). Processing time for 5 runs was 4 minutes, 19 seconds for ABA-o1 and 3 minutes, 20 seconds for ABA-5 vs 10 minutes, 16 seconds for physicians, corresponding to 58% and 67.6% reductions, respectively.</p><p><strong>Conclusions: </strong>ABA-o1 and ABA-5 produced scores closely matching HBA while demonstrating superior consistency and reliability. In the setting of virtual interview transcripts, these findings suggest that ABA may serve as a valid, rapid, and scalable alternative to HBA, reducing per-assessment time by over half. Applied strategically, AI-based scoring could enable timely feedback, improve efficiency, and reduce
背景:生成式人工智能(AI)越来越多地应用于医学教育,包括基于AI的虚拟患者来提高面试技巧。然而,基于人工智能的评估(ABA)与基于人类的评估(HBA)有多大不同尚不清楚。目的:本研究旨在比较通过ABA (gpt - 01 Pro [ABA- 01]和GPT-5 Pro [ABA-5])生成的临床访谈评估的质量与临床教师在基于人工智能的虚拟患者环境中通过HBA生成的临床访谈评估的质量。我们还研究了人工智能是否缩短了评估时间,并评估了具有不同临床经验水平的参与者的一致性。方法:在一个基于人工智能的虚拟患者中实施一个标准化的腿部无力病例。7名参与者(2名医学生、3名住院医生和2名主治医生)分别对AI患者进行了访谈,并使用25项主访谈量表(0-125)对成绩单进行评分。比较了三种评价策略。首先,gpt - 01 Pro和GPT-5 Pro用不同的随机种子对每个转录本进行5次评分,以测试病例特异性。自动记录处理时间。其次,5名盲法临床讲师使用相同的评分标准对每份成绩单进行独立评分。第三,应用可靠性指标。对于人工智能,类内相关系数(ICCs)量化了重复性。对于人类,计算了ICC(2,1)。使用Pearson r、Lin一致性相关系数、Bland-Altman一致性极限、Cronbach α和ICC对一致性进行量化。时间效率表示为每个转录本的平均分钟数和相对减少百分比。结果:不同方法的平均访谈得分相似(aba - 01:平均52.1,SD 6.9; ABA-5:平均53.2,SD 6.8; HBA:平均53.7,SD 6.8)。ABA和HBA之间的一致性很强(r=0.90;一致性相关系数=0.88),偏差最小(ABA- 01:平均0.4,SD 2.7; ABA-5:平均1.5,SD 5.2; ABA- 01的一致性限为-4.9至5.7,ABA-5的一致性限为-8.6至11.7)。Cronbach α分别为0.81 (aba - 01)、0.86 (ABA-5)和0.80 (HBA);ICC(3,1)分别为0.77 (aba - 01)和0.82 (ABA-5);ICC(2,1)为0.38 (HBA)。ABA的变异系数大约是HBA的一半(6.6% vs 13.9%)。5组处理时间aba - 01组为4分19秒,ABA-5组为3分20秒,医生组为10分16秒,分别减少58%和67.6%。结论:aba - 01和ABA-5的评分与HBA非常匹配,且具有较好的一致性和可靠性。在虚拟访谈记录的设置中,这些发现表明ABA可以作为HBA的有效、快速和可扩展的替代方案,将每次评估时间减少一半以上。策略性地应用,基于人工智能的评分可以实现及时反馈,提高效率,减少教师的工作量。需要进一步的研究来证实在更广泛的情况下的普遍性。
{"title":"AI- vs Human-Based Assessment of Medical Interview Transcripts in a Generative AI-Simulated Patient System: Cross-Sectional Validation Study.","authors":"Hiromizu Takahashi, Kiyoshi Shikino, Takeshi Kondo, Yuji Yamada, Yoshitaka Tomoda, Minoru Kishi, Yuki Aiyama, Sho Nagai, Akiko Enomoto, Yoshinori Tokushima, Takahiro Shinohara, Fumiaki Sano, Takeshi Matsuura, Rikiya Watanabe, Toshio Naito","doi":"10.2196/81673","DOIUrl":"10.2196/81673","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Generative artificial intelligence (AI) is increasingly used in medical education, including AI-based virtual patients to improve interview skills. However, how much AI-based assessment (ABA) differs from human-based assessment (HBA) remains unclear.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aimed to compare the quality of clinical interview assessments generated via an ABA (GPT-o1 Pro [ABA-o1] and GPT-5 Pro [ABA-5]) with those generated via an HBA conducted by clinical instructors in an AI-based virtual patient setting. We also examined whether AI reduced evaluation time and assessed agreement across participants with different levels of clinical experience.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;A standardized case of leg weakness was implemented in an AI-based virtual patient. Seven participants (2 medical students, 3 residents, and 2 attending physicians) each conducted an interview with the AI patient, and transcripts were scored using the 25-item Master Interview Rating Scale (0-125). Three evaluation strategies were compared. First, GPT-o1 Pro and GPT-5 Pro scored each transcript 5 times with different random seeds to test case specificity. Processing time was logged automatically. Second, 5 blinded clinical instructors independently rated each transcript once using the same rubric. Third, reliability metrics were applied. For AI, intraclass correlation coefficients (ICCs) quantified repeatability. For humans, the ICC(2,1) was calculated. Agreement was quantified using the Pearson r, Lin concordance correlation coefficient, Bland-Altman limits of agreement, Cronbach α, and ICC. Time efficiency was expressed as mean minutes per transcript and relative percentage reduction.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Mean interview scores were similar across methods (ABA-o1: mean 52.1, SD 6.9; ABA-5: mean 53.2, SD 6.8; HBA: mean 53.7, SD 6.8). Agreement between ABA and HBA was strong (r=0.90; concordance correlation coefficient=0.88) with minimal bias (ABA-o1: mean 0.4, SD 2.7; ABA-5: mean 1.5, SD 5.2; limits of agreement: -4.9 to 5.7 for ABA-o1 and -8.6 to 11.7 for ABA-5). The Cronbach α was 0.81 (ABA-o1), 0.86 (ABA-5), and 0.80 (HBA); the ICC(3,1) was 0.77 (ABA-o1) and 0.82 (ABA-5); and the ICC(2,1) was 0.38 (HBA). The coefficient of variation for ABA was approximately half that of HBA (6.6% vs 13.9%). Processing time for 5 runs was 4 minutes, 19 seconds for ABA-o1 and 3 minutes, 20 seconds for ABA-5 vs 10 minutes, 16 seconds for physicians, corresponding to 58% and 67.6% reductions, respectively.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;ABA-o1 and ABA-5 produced scores closely matching HBA while demonstrating superior consistency and reliability. In the setting of virtual interview transcripts, these findings suggest that ABA may serve as a valid, rapid, and scalable alternative to HBA, reducing per-assessment time by over half. Applied strategically, AI-based scoring could enable timely feedback, improve efficiency, and reduce","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"12 ","pages":"e81673"},"PeriodicalIF":3.2,"publicationDate":"2026-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12912650/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146214342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-Enhanced Continuing Professional Development as an Evolving Sociotechnical System: Multimethod Theoretical Framework Development Study. 人工智能促进的持续专业发展作为一个不断发展的社会技术系统:多方法理论框架发展研究。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2026-02-12 DOI: 10.2196/69156
Vjekoslav Hlede, Sofia Valanci, G Robert D'Antuono, Heather Dow, Ronan O'Beirne, Richard Wiggins

Background: Artificial intelligence (AI) is changing continuing professional development (CPD) in health care and its interactions with the broader health care system. However, current scholarship lacks an integrated theoretical model that explains how AI impacts CPD as a complex sociotechnical system. Existing frameworks usually focus on isolated phenomena, such as ethics, literacy, or learning theory, leaving unaddressed the dynamics of how those phenomena interact in the complex sociotechnical AI-enhanced CPD system, as well as the new roles that AI-empowered patients and society play.

Objective: The objective of this study is to propose a comprehensive, theory-driven framework that provides insight into how AI transforms CPD systems. The goal was to integrate established AI constructs with Complexity Theory (CT) and Actor-Network Theory (ANT) to develop a model that guides practice, research, and policy.

Methods: We conducted a multimethod theory construction. The process started with identifying the AI-enhanced CPD as an established yet evolving phenomenon. Through a structured literature review, the main building blocks of AI-enhanced CPD were identified, as well as the ontological base (CT and ANT). The model was developed through iterative human-led and AI-assisted abductive analysis. The final model was abductively validated on a case study of a national organization pioneering AI use, demonstrating the theoretical model makes sense in practice. All conceptual decisions were reviewed collaboratively by the author group.

Results: The ALEERRT-CA framework is made of 6 pillars: AI literacy, explainability, ethics, readiness, reliability, and learning theories, and 2 theoretical lenses: CT and ANT. CT elucidates macro-level system behaviors in the AI-enhanced CPD system. Those behaviors include emergence, feedback loops, adaptation, and reality made of nested complex systems. ANT explains how localized interactions among human and nonhuman actors shape AI-enhanced CPD. Together, these lenses illustrate how AI redistributes agency, amplifies tensions, and generates emergent learning dynamics within CPD and the broader health care system.

Conclusions: This study presents a novel conceptual model of AI-enhanced CPD as a sociotechnical system. The integration of CT and ANT with AI constructs improves explanatory power of the ALEERRT-CA framework. Educators, program leaders, and policymakers can use the framework as a structured toolset to evaluate AI readiness, design responsible AI-enhanced CPD practices, and plan future empirical research. The framework provides a theoretical lens for observing the rapidly evolving field of AI-enhanced CPD and health care practice.

背景:人工智能(AI)正在改变卫生保健的持续专业发展(CPD)及其与更广泛的卫生保健系统的相互作用。然而,目前的学术研究缺乏一个完整的理论模型来解释人工智能如何影响CPD作为一个复杂的社会技术系统。现有的框架通常侧重于孤立的现象,如伦理、识字或学习理论,而没有解决这些现象在复杂的社会技术人工智能增强CPD系统中如何相互作用的动态,以及人工智能赋予患者和社会的新角色。目的:本研究的目的是提出一个全面的、理论驱动的框架,为人工智能如何改变CPD系统提供见解。目标是将已建立的人工智能结构与复杂性理论(CT)和行为者网络理论(ANT)相结合,以开发一个指导实践、研究和政策的模型。方法:进行多方法理论构建。这一过程首先将人工智能增强的CPD确定为一种已确立但仍在发展的现象。通过结构化的文献综述,确定了人工智能增强CPD的主要构建模块,以及本体基础(CT和ANT)。该模型是通过迭代的人类主导和人工智能辅助溯因分析开发的。最后的模型在一个开创性使用人工智能的国家组织的案例研究中得到了溯因性验证,证明了理论模型在实践中是有意义的。所有的概念性决定都由作者小组共同审查。结果:ALEERRT-CA框架由6个支柱组成:人工智能素养、可解释性、伦理、准备、可靠性和学习理论,以及2个理论透镜:CT和ANT。CT在人工智能增强CPD系统中阐明宏观层面的系统行为。这些行为包括涌现、反馈循环、适应和由嵌套的复杂系统构成的现实。ANT解释了人类和非人类参与者之间的局部交互如何塑造人工智能增强的CPD。总之,这些镜头说明了人工智能如何重新分配代理,放大紧张关系,并在CPD和更广泛的医疗保健系统中产生紧急学习动态。结论:本研究提出了人工智能增强CPD作为社会技术系统的新概念模型。CT和ANT与AI结构的集成提高了ALEERRT-CA框架的解释力。教育工作者、项目负责人和政策制定者可以使用该框架作为结构化工具集来评估人工智能准备情况,设计负责任的人工智能增强CPD实践,并规划未来的实证研究。该框架为观察快速发展的人工智能增强CPD和医疗保健实践领域提供了一个理论视角。
{"title":"AI-Enhanced Continuing Professional Development as an Evolving Sociotechnical System: Multimethod Theoretical Framework Development Study.","authors":"Vjekoslav Hlede, Sofia Valanci, G Robert D'Antuono, Heather Dow, Ronan O'Beirne, Richard Wiggins","doi":"10.2196/69156","DOIUrl":"10.2196/69156","url":null,"abstract":"<p><strong>Background: </strong>Artificial intelligence (AI) is changing continuing professional development (CPD) in health care and its interactions with the broader health care system. However, current scholarship lacks an integrated theoretical model that explains how AI impacts CPD as a complex sociotechnical system. Existing frameworks usually focus on isolated phenomena, such as ethics, literacy, or learning theory, leaving unaddressed the dynamics of how those phenomena interact in the complex sociotechnical AI-enhanced CPD system, as well as the new roles that AI-empowered patients and society play.</p><p><strong>Objective: </strong>The objective of this study is to propose a comprehensive, theory-driven framework that provides insight into how AI transforms CPD systems. The goal was to integrate established AI constructs with Complexity Theory (CT) and Actor-Network Theory (ANT) to develop a model that guides practice, research, and policy.</p><p><strong>Methods: </strong>We conducted a multimethod theory construction. The process started with identifying the AI-enhanced CPD as an established yet evolving phenomenon. Through a structured literature review, the main building blocks of AI-enhanced CPD were identified, as well as the ontological base (CT and ANT). The model was developed through iterative human-led and AI-assisted abductive analysis. The final model was abductively validated on a case study of a national organization pioneering AI use, demonstrating the theoretical model makes sense in practice. All conceptual decisions were reviewed collaboratively by the author group.</p><p><strong>Results: </strong>The ALEERRT-CA framework is made of 6 pillars: AI literacy, explainability, ethics, readiness, reliability, and learning theories, and 2 theoretical lenses: CT and ANT. CT elucidates macro-level system behaviors in the AI-enhanced CPD system. Those behaviors include emergence, feedback loops, adaptation, and reality made of nested complex systems. ANT explains how localized interactions among human and nonhuman actors shape AI-enhanced CPD. Together, these lenses illustrate how AI redistributes agency, amplifies tensions, and generates emergent learning dynamics within CPD and the broader health care system.</p><p><strong>Conclusions: </strong>This study presents a novel conceptual model of AI-enhanced CPD as a sociotechnical system. The integration of CT and ANT with AI constructs improves explanatory power of the ALEERRT-CA framework. Educators, program leaders, and policymakers can use the framework as a structured toolset to evaluate AI readiness, design responsible AI-enhanced CPD practices, and plan future empirical research. The framework provides a theoretical lens for observing the rapidly evolving field of AI-enhanced CPD and health care practice.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"12 ","pages":"e69156"},"PeriodicalIF":3.2,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12900278/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146182884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Developing a Realistic and Cost-Effective Training Model (MaiSurge) for Laparoscopic Hysterectomies to Train and Assess Surgical Skill: Prospective Nonrandomized Controlled Trial. 为腹腔镜子宫切除术培养和评估手术技能开发一种现实且具有成本效益的培训模式(MaiSurge):前瞻性非随机对照试验。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2026-02-12 DOI: 10.2196/66369
Anna Maria Brechter, Roxana Schwab, Christoph Dold, Christine Skala, Maria Schröder, Lina Schiestl, Katharina Gillen, Walburgis Brenner, Annette Hasenburg, Mona Wanda Schmidt

Background: Laparoscopic surgery has a flatter learning curve compared to traditional open surgery. Therefore, structured programs and realistic training models are imperative to ensure patients' safety. However, commercially available models are often too expensive or technically unrealistic for continuous surgical training.

Objective: The aim of this trial was to develop a cost-efficient and highly realistic uterus model to perform a total laparoscopic hysterectomy (TLH) and evaluate its applicability.

Methods: A training model (MaiSurge) for a TLH with salpingectomy or adenectomy was developed using a 3D printer and different cast materials. Polyvinyl alcohol was used to allow for the use of electrosurgery. To gather the first validity evidence, novice and expert gynecologists performed a TLH on the model. Operative time as well as surgical performance scores (Hysterectomy-Objective Structured Assessment of Technical Skills) were compared between both groups.

Results: A total of 12 participants in the novice group and 18 participants in the expert group completed the simulation. The experts obtained significantly better modified Hysterectomy-Objective Structured Assessment of Technical Skills scores (mean 74.0, SD 12.9 vs mean 60.3, SD 14.9; P=.049) and performed significantly faster (median 69.5, IQR 49.5-74.3 minutes vs median 37.5, IQR 30.5-38.8 minutes; P<.001). An excellent interrater reliability was observed (intraclass correlation coefficient=0.91). Approximately 92% (11/12) of novices felt that they had improved their surgical performance after training on the MaiSurge uterus model. Overall, all participants agreed that the new MaiSurge uterus model should be integrated into training curricula to improve the performance of residents on TLHs.

Conclusions: A new highly realistic and cost-effective training model (MaiSurge) to perform a TLH was developed. The model distinguishes between good and poor laparoscopic performances and, thus, can be used in training as well as assessment of surgical skills. The possibility of simulating even complex laparoscopic procedures in a realistic environment may be an opportunity to train a future generation of gynecologists without compromising patient safety or exhausting the limited availability of operating room time.

背景:与传统的开放手术相比,腹腔镜手术具有更平坦的学习曲线。因此,结构化的程序和现实的培训模式是确保患者安全的必要条件。然而,商业上可用的模型往往过于昂贵或技术上不现实的持续手术训练。目的:本试验的目的是开发一种成本效益高、高度逼真的子宫模型,用于腹腔镜全子宫切除术(TLH),并评估其适用性。方法:使用3D打印机和不同的铸造材料制作TLH伴输卵管或腺切除术的训练模型(MaiSurge)。聚乙烯醇被用于电手术。为了收集第一个效度证据,新手和专家妇科医生对模型进行了TLH。比较两组患者的手术时间和手术表现评分(子宫切除术-技术技能客观结构化评估)。结果:新手组共12人,专家组共18人完成模拟。专家们获得了明显更好的改良子宫切除术-客观结构化技术技能评估评分(平均74.0,SD 12.9, SD 14.9, P= 0.049),执行速度明显加快(中位数69.5,IQR 49.5-74.3分钟vs中位数37.5,IQR 30.5-38.8分钟)。结论:开发了一种新的高度现实和具有成本效益的TLH培训模式(MaiSurge)。该模型可以区分好与差的腹腔镜性能,因此可以用于培训和评估手术技能。在现实环境中模拟甚至复杂的腹腔镜手术的可能性可能是培训下一代妇科医生的机会,而不会危及患者的安全或耗尽有限的手术室时间。
{"title":"Developing a Realistic and Cost-Effective Training Model (MaiSurge) for Laparoscopic Hysterectomies to Train and Assess Surgical Skill: Prospective Nonrandomized Controlled Trial.","authors":"Anna Maria Brechter, Roxana Schwab, Christoph Dold, Christine Skala, Maria Schröder, Lina Schiestl, Katharina Gillen, Walburgis Brenner, Annette Hasenburg, Mona Wanda Schmidt","doi":"10.2196/66369","DOIUrl":"10.2196/66369","url":null,"abstract":"<p><strong>Background: </strong>Laparoscopic surgery has a flatter learning curve compared to traditional open surgery. Therefore, structured programs and realistic training models are imperative to ensure patients' safety. However, commercially available models are often too expensive or technically unrealistic for continuous surgical training.</p><p><strong>Objective: </strong>The aim of this trial was to develop a cost-efficient and highly realistic uterus model to perform a total laparoscopic hysterectomy (TLH) and evaluate its applicability.</p><p><strong>Methods: </strong>A training model (MaiSurge) for a TLH with salpingectomy or adenectomy was developed using a 3D printer and different cast materials. Polyvinyl alcohol was used to allow for the use of electrosurgery. To gather the first validity evidence, novice and expert gynecologists performed a TLH on the model. Operative time as well as surgical performance scores (Hysterectomy-Objective Structured Assessment of Technical Skills) were compared between both groups.</p><p><strong>Results: </strong>A total of 12 participants in the novice group and 18 participants in the expert group completed the simulation. The experts obtained significantly better modified Hysterectomy-Objective Structured Assessment of Technical Skills scores (mean 74.0, SD 12.9 vs mean 60.3, SD 14.9; P=.049) and performed significantly faster (median 69.5, IQR 49.5-74.3 minutes vs median 37.5, IQR 30.5-38.8 minutes; P<.001). An excellent interrater reliability was observed (intraclass correlation coefficient=0.91). Approximately 92% (11/12) of novices felt that they had improved their surgical performance after training on the MaiSurge uterus model. Overall, all participants agreed that the new MaiSurge uterus model should be integrated into training curricula to improve the performance of residents on TLHs.</p><p><strong>Conclusions: </strong>A new highly realistic and cost-effective training model (MaiSurge) to perform a TLH was developed. The model distinguishes between good and poor laparoscopic performances and, thus, can be used in training as well as assessment of surgical skills. The possibility of simulating even complex laparoscopic procedures in a realistic environment may be an opportunity to train a future generation of gynecologists without compromising patient safety or exhausting the limited availability of operating room time.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"12 ","pages":"e66369"},"PeriodicalIF":3.2,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12900277/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146182833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using AI to Train Future Clinicians in Depression Assessment: Feasibility Study. 利用人工智能培训未来临床医生进行抑郁症评估:可行性研究。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2026-02-12 DOI: 10.2196/87102
Friederike Holderried, Alessandra Sonanini, Annika Philipps, Christian Stegemann-Philipps, Lea Herschbach, Teresa Festl-Wietek, Stephan Zipfel, Rebecca Erschens, Anne Herrmann-Werner
<p><strong>Background: </strong>Depression is a major global health care challenge, causing significant individual distress but also contributing to a substantial global burden. Timely and accurate diagnosis is crucial. To help future clinicians develop these essential skills, we trained a generative pretrained transformer (GPT)-powered chatbot to simulate patients with varying degrees of depression and suicidality.</p><p><strong>Objective: </strong>This study aims to evaluate the applicability and transferability of our GPT-4-powered chatbot for psychosomatic cases. Specifically, we aim to investigate how accurately the chatbot can simulate patients exhibiting various stages of depression and phases of suicidal ideation, while adhering to a predefined role script and maintaining a sufficient level of authenticity. Additionally, we want to analyze to what level the chatbot is suitable for practicing correctly diagnosing depressive disorders in patients, as well as assessing suicidality stages.</p><p><strong>Methods: </strong>We developed 3 virtual patient role scripts depicting complex, realistic cases of depression and varying degrees of suicidality collaboratively with field experts and aligned with mental health assessment guidelines. These cases were integrated into a GPT-4-powered chatbot for practicing clinical history-taking. A total of 148 medical students, with an average age of 22.71 years and mostly in their sixth semester, interacted individually with one of the randomly assigned virtual patients through chat. Following this, they completed a questionnaire assessing their demographics and user experience. Chats were analyzed descriptively to assess diagnostic accuracy and suicidality assessments, as well as the role script adherence and authenticity of the artificial intelligence (AI). This was done to gain further insight into the chatbot's behavior and the students' diagnostic accuracy.</p><p><strong>Results: </strong>In over 90% (725/778) of the answers, the chatbot maintained its assigned role. On average, students correctly identified the severity of depression in 60% (81/135) and the phase of suicidality in 67% (91/135) of the cases. Notably, the majority either failed to address or insufficiently explored the topic of suicidality despite explicit instructions beforehand.</p><p><strong>Conclusions: </strong>This study demonstrates that a GPT-powered chatbot can simulate patients with depression fairly accurately. More than two-thirds of participants perceived the AI-simulated patients with depression as authentic, and nearly 80% (106/135) indicated they would like to use the application for further practice, highlighting its potential as a training tool. While a small proportion of students expressed reservations, and the overall diagnostic accuracy varied depending on the severity of the case, the findings overall support the feasibility and educational value of AI-based role-playing in clinical training. AI-supported virtual p
背景:抑郁症是一项重大的全球卫生保健挑战,造成重大的个人痛苦,但也造成重大的全球负担。及时准确的诊断至关重要。为了帮助未来的临床医生发展这些基本技能,我们训练了一个生成式预训练变压器(GPT)驱动的聊天机器人来模拟不同程度的抑郁症和自杀倾向的患者。目的:本研究旨在评估我们的gpt -4动力聊天机器人在心身疾病中的适用性和可移植性。具体来说,我们的目标是研究聊天机器人在遵守预定义的角色脚本并保持足够的真实性的同时,如何准确地模拟表现出不同阶段抑郁和自杀意念的患者。此外,我们想要分析聊天机器人在什么程度上适合练习正确诊断患者的抑郁症,以及评估自杀阶段。方法:我们与现场专家合作,根据心理健康评估指南,开发了3个虚拟患者角色脚本,描述复杂、现实的抑郁症和不同程度的自杀病例。这些病例被整合到gpt -4驱动的聊天机器人中,用于实践临床病史记录。共有148名平均年龄为22.71岁的医学生,他们大多在第六学期,通过聊天的方式与随机分配的一位虚拟病人进行了单独的互动。在此之后,他们完成了一份调查问卷,评估他们的人口统计和用户体验。对聊天记录进行描述性分析,以评估诊断准确性和自杀倾向评估,以及人工智能(AI)的角色脚本依从性和真实性。这样做是为了进一步了解聊天机器人的行为和学生的诊断准确性。结果:在超过90%(725/778)的答案中,聊天机器人保持了其指定的角色。平均而言,学生正确识别抑郁症的严重程度为60%(81/135),自杀阶段为67%(91/135)。值得注意的是,尽管事先有明确的指示,但大多数人要么没有解决自杀问题,要么没有充分探讨自杀问题。结论:这项研究表明,gpt驱动的聊天机器人可以相当准确地模拟抑郁症患者。超过三分之二的参与者认为人工智能模拟的抑郁症患者是真实的,近80%(106/135)的参与者表示他们希望使用该应用程序进行进一步的实践,突出了其作为培训工具的潜力。虽然一小部分学生表示保留意见,并且总体诊断准确性因病例的严重程度而异,但研究结果总体上支持了基于人工智能的角色扮演在临床培训中的可行性和教育价值。人工智能支持的虚拟患者提供了高度灵活、标准化和随时可用的培训工具,不受现实生活的限制。
{"title":"Using AI to Train Future Clinicians in Depression Assessment: Feasibility Study.","authors":"Friederike Holderried, Alessandra Sonanini, Annika Philipps, Christian Stegemann-Philipps, Lea Herschbach, Teresa Festl-Wietek, Stephan Zipfel, Rebecca Erschens, Anne Herrmann-Werner","doi":"10.2196/87102","DOIUrl":"10.2196/87102","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Depression is a major global health care challenge, causing significant individual distress but also contributing to a substantial global burden. Timely and accurate diagnosis is crucial. To help future clinicians develop these essential skills, we trained a generative pretrained transformer (GPT)-powered chatbot to simulate patients with varying degrees of depression and suicidality.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aims to evaluate the applicability and transferability of our GPT-4-powered chatbot for psychosomatic cases. Specifically, we aim to investigate how accurately the chatbot can simulate patients exhibiting various stages of depression and phases of suicidal ideation, while adhering to a predefined role script and maintaining a sufficient level of authenticity. Additionally, we want to analyze to what level the chatbot is suitable for practicing correctly diagnosing depressive disorders in patients, as well as assessing suicidality stages.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We developed 3 virtual patient role scripts depicting complex, realistic cases of depression and varying degrees of suicidality collaboratively with field experts and aligned with mental health assessment guidelines. These cases were integrated into a GPT-4-powered chatbot for practicing clinical history-taking. A total of 148 medical students, with an average age of 22.71 years and mostly in their sixth semester, interacted individually with one of the randomly assigned virtual patients through chat. Following this, they completed a questionnaire assessing their demographics and user experience. Chats were analyzed descriptively to assess diagnostic accuracy and suicidality assessments, as well as the role script adherence and authenticity of the artificial intelligence (AI). This was done to gain further insight into the chatbot's behavior and the students' diagnostic accuracy.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;In over 90% (725/778) of the answers, the chatbot maintained its assigned role. On average, students correctly identified the severity of depression in 60% (81/135) and the phase of suicidality in 67% (91/135) of the cases. Notably, the majority either failed to address or insufficiently explored the topic of suicidality despite explicit instructions beforehand.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;This study demonstrates that a GPT-powered chatbot can simulate patients with depression fairly accurately. More than two-thirds of participants perceived the AI-simulated patients with depression as authentic, and nearly 80% (106/135) indicated they would like to use the application for further practice, highlighting its potential as a training tool. While a small proportion of students expressed reservations, and the overall diagnostic accuracy varied depending on the severity of the case, the findings overall support the feasibility and educational value of AI-based role-playing in clinical training. AI-supported virtual p","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"12 ","pages":"e87102"},"PeriodicalIF":3.2,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12946781/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146182850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Medical Students' Experiences With Virtual Reality Simulation Training: Qualitative Study. 医学生虚拟现实模拟训练经验的定性研究。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2026-02-11 DOI: 10.2196/74301
Vanshika Sharma, Sohee Park, Alexandra Voinescu, Chris Jacobs

Background: Beyond its applications in other settings, virtual reality (VR) technology has gained attention in medical education, offering immersive learning experiences. Previous research has demonstrated its potential as an educational tool in medical settings, highlighting enhanced educational outcomes, skill acquisition and retention, standardized training experiences, and the promotion of active learning. However, there is still a dearth of research exploring various aspects of VR user experiences, with most studies focusing on its effect on skill acquisition. Limited qualitative research further hinders an in-depth understanding of user experiences, restricting a comprehensive overview of VR's potential in medical education.

Objective: This study explored subjective experiences with VR simulation training and its perceived benefits and challenges among medical students in the United Kingdom, using the 5 domains of the Immersive Technology Evaluation Measure (ITEM).

Methods: In July 2024, 15- to 20-minute in-person interviews were conducted with 11 medical students who had completed the immersive VR training consisting of the assessment and treatment of a virtual patient. Guided by the 5 domains of the ITEM as preconceived themes, a deductive thematic analysis was used to explore individual experiences with the training, embedded within narrative responses.

Results: Findings aligned with the 5 a priori ITEM domains of system usability, immersion, motivation, cognitive load, and debriefing. Within these predefined domains, new subthemes emerged that enhanced the understanding of user experience. Participants reported usability barriers involving accessibility, technical issues, and limited variability in scenarios. Immersion was generally strong due to realistic environments, although reduced interactivity constrained authenticity. Motivation was reflected in active engagement and a greater sense of preparedness for clinical practice. Cognitive load was associated with divided attention, physical effects, and a need for clearer guidance and familiarization. Ultimately, participants valued debriefing sessions as valuable opportunities for reflection and reinforcing knowledge.

Conclusions: VR training fosters immersion and motivation, but its effectiveness depends on balancing technical usability with cognitive demands. Future integration should prioritize design variability and structured debriefing to optimize learning outcomes. Refinement of immersive VR training in clinical education is also warranted, alongside further research in broader contexts and longitudinal use.

背景:除了在其他环境中的应用,虚拟现实(VR)技术在医学教育中也得到了关注,提供了沉浸式的学习体验。先前的研究已经证明了它在医疗环境中作为一种教育工具的潜力,强调了提高教育成果、技能获得和保留、标准化培训经验和促进主动学习。然而,关于VR用户体验的各个方面的研究仍然缺乏,大多数研究都集中在它对技能习得的影响上。有限的定性研究进一步阻碍了对用户体验的深入理解,限制了对VR在医学教育中的潜力的全面概述。目的:本研究利用沉浸式技术评估量表(ITEM)的5个领域,探讨英国医学生对VR模拟训练的主观体验及其感知的好处和挑战。方法:于2024年7月,对11名完成虚拟患者评估和治疗沉浸式VR培训的医学生进行15- 20分钟的面对面访谈。以项目的5个领域为先入为主的主题为指导,采用演绎主题分析来探索个人与培训的经历,并将其嵌入叙事反应中。结果:研究结果与系统可用性、沉浸感、动机、认知负荷和汇报的5个先验项目领域一致。在这些预定义的领域中,新的子主题出现了,增强了对用户体验的理解。参与者报告了可用性障碍,包括可访问性、技术问题和场景中有限的可变性。由于现实的环境,沉浸感通常很强,尽管减少了交互性限制了真实性。动机反映在积极参与和更大的临床实践准备意识。认知负荷与注意力分散、身体影响以及对更清晰的指导和熟悉的需求有关。最后,与会者认为汇报会议是进行反思和加强知识的宝贵机会。结论:VR培训培养了沉浸感和动机,但其有效性取决于技术可用性与认知需求的平衡。未来的集成应该优先考虑设计可变性和结构化的汇报,以优化学习结果。在临床教育中改进沉浸式VR培训也是必要的,同时在更广泛的背景和纵向应用中进行进一步研究。
{"title":"Medical Students' Experiences With Virtual Reality Simulation Training: Qualitative Study.","authors":"Vanshika Sharma, Sohee Park, Alexandra Voinescu, Chris Jacobs","doi":"10.2196/74301","DOIUrl":"10.2196/74301","url":null,"abstract":"<p><strong>Background: </strong>Beyond its applications in other settings, virtual reality (VR) technology has gained attention in medical education, offering immersive learning experiences. Previous research has demonstrated its potential as an educational tool in medical settings, highlighting enhanced educational outcomes, skill acquisition and retention, standardized training experiences, and the promotion of active learning. However, there is still a dearth of research exploring various aspects of VR user experiences, with most studies focusing on its effect on skill acquisition. Limited qualitative research further hinders an in-depth understanding of user experiences, restricting a comprehensive overview of VR's potential in medical education.</p><p><strong>Objective: </strong>This study explored subjective experiences with VR simulation training and its perceived benefits and challenges among medical students in the United Kingdom, using the 5 domains of the Immersive Technology Evaluation Measure (ITEM).</p><p><strong>Methods: </strong>In July 2024, 15- to 20-minute in-person interviews were conducted with 11 medical students who had completed the immersive VR training consisting of the assessment and treatment of a virtual patient. Guided by the 5 domains of the ITEM as preconceived themes, a deductive thematic analysis was used to explore individual experiences with the training, embedded within narrative responses.</p><p><strong>Results: </strong>Findings aligned with the 5 a priori ITEM domains of system usability, immersion, motivation, cognitive load, and debriefing. Within these predefined domains, new subthemes emerged that enhanced the understanding of user experience. Participants reported usability barriers involving accessibility, technical issues, and limited variability in scenarios. Immersion was generally strong due to realistic environments, although reduced interactivity constrained authenticity. Motivation was reflected in active engagement and a greater sense of preparedness for clinical practice. Cognitive load was associated with divided attention, physical effects, and a need for clearer guidance and familiarization. Ultimately, participants valued debriefing sessions as valuable opportunities for reflection and reinforcing knowledge.</p><p><strong>Conclusions: </strong>VR training fosters immersion and motivation, but its effectiveness depends on balancing technical usability with cognitive demands. Future integration should prioritize design variability and structured debriefing to optimize learning outcomes. Refinement of immersive VR training in clinical education is also warranted, alongside further research in broader contexts and longitudinal use.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"12 ","pages":"e74301"},"PeriodicalIF":3.2,"publicationDate":"2026-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12936657/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146167225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital Choice Architecture in Medical Education: Applying Behavioral Economics to Online Learning Environments. 医学教育中的数字选择架构:将行为经济学应用于在线学习环境。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2026-02-06 DOI: 10.2196/86497
Victoria Ekstrom

Unlabelled: Health care has widely adopted behavioral economics to influence clinical practice, with documented success using defaults and social comparison feedback in electronic health records. However, online medical education, now the dominant modality for continuing professional development, remains designed on assumptions of rational learning that behavioral science has disproven in clinical contexts. This viewpoint examines the paradox of applying sophisticated behavioral insights to clinical work while designing digital learning environments as if learners are immune to cognitive limitations. We propose digital choice architecture for medical education: intentional integration of behavioral design principles into learning management systems and online platforms. Drawing from clinical nudge units and implementation science, we demonstrate how defaults, social norms, and commitment devices can be systematically applied to digital continuing education. As medical education becomes increasingly technology-mediated, behavioral science provides the theoretical foundation and practical tools for designing online learning environments that align with how clinicians actually make decisions.

未标记:医疗保健已广泛采用行为经济学来影响临床实践,在电子健康记录中使用默认值和社会比较反馈取得了成功。然而,在线医学教育,现在是继续专业发展的主要方式,仍然是基于理性学习的假设,行为科学在临床环境中已经证明了这一点。这一观点审视了在设计数字学习环境的同时,将复杂的行为见解应用于临床工作的悖论,就好像学习者不受认知限制一样。我们提出了医学教育的数字选择架构:有意将行为设计原则整合到学习管理系统和在线平台中。从临床推动单位和实施科学中,我们展示了默认值、社会规范和承诺设备如何系统地应用于数字继续教育。随着医学教育越来越多地以技术为媒介,行为科学为设计符合临床医生实际决策方式的在线学习环境提供了理论基础和实用工具。
{"title":"Digital Choice Architecture in Medical Education: Applying Behavioral Economics to Online Learning Environments.","authors":"Victoria Ekstrom","doi":"10.2196/86497","DOIUrl":"10.2196/86497","url":null,"abstract":"<p><strong>Unlabelled: </strong>Health care has widely adopted behavioral economics to influence clinical practice, with documented success using defaults and social comparison feedback in electronic health records. However, online medical education, now the dominant modality for continuing professional development, remains designed on assumptions of rational learning that behavioral science has disproven in clinical contexts. This viewpoint examines the paradox of applying sophisticated behavioral insights to clinical work while designing digital learning environments as if learners are immune to cognitive limitations. We propose digital choice architecture for medical education: intentional integration of behavioral design principles into learning management systems and online platforms. Drawing from clinical nudge units and implementation science, we demonstrate how defaults, social norms, and commitment devices can be systematically applied to digital continuing education. As medical education becomes increasingly technology-mediated, behavioral science provides the theoretical foundation and practical tools for designing online learning environments that align with how clinicians actually make decisions.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"12 ","pages":"e86497"},"PeriodicalIF":3.2,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12880849/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146133284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effect of an Online Continuing Professional Development Course on Physicians' Intention to Approach a Colleague in Difficulty: Mixed Methods Convergent Study. 在线持续专业发展课程对医生接近困难同事意愿的影响:一项混合方法趋同研究。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2026-02-05 DOI: 10.2196/80199
Florence Lizotte, Martin Tremblay, Caroline Biron, Éloi Lachance, Souleymane Gadio, Roberta de Carvalho Corôa, Claude Bernard Uwizeye, Sam J Daniel, France Légaré
<p><strong>Background: </strong>Burnout and psychological distress are prevalent among physicians. Peer support appears to play a protective role, yet little is known about training interventions that motivate physicians to approach peers in difficulty, as such effects are often overlooked or assessed using nonvalidated tools.</p><p><strong>Objective: </strong>We evaluated the effects of an online continuing professional development (CPD) course designed to increase physicians' intention to approach a colleague in difficulty.</p><p><strong>Methods: </strong>Physicians who completed a 1-hour asynchronous online CPD course between March 2022 and May 2024 were invited to participate in this mixed methods convergent study. The e-learning course aimed to increase physicians' confidence in approaching colleagues in difficulty by recognizing signs of psychological distress, offering support, and referring them to appropriate resources. Participant characteristics were collected, and behavioral intention to approach a colleague in difficulty along with its determinants were measured pre- and postcourse using the validated CPD-REACTION tool. Differences in mean pre-post intention scores were assessed using 2-tailed paired t tests (n=466) and generalized estimating equations. Factors associated with postcourse intention were examined using multivariate analysis (n=466). Four months later, the proportion of physicians reporting adoption of the behavior was calculated (n=61). Qualitative responses to open-ended questions were analyzed thematically using behavior change models, and behavior change techniques used in the course were identified. Quantitative and qualitative results were triangulated. We reported results following STROBE (Strengthening the Reporting of Observational Studies in Epidemiology) and SRQR (Standards for Reporting Qualitative Research) guidelines for quantitative and qualitative analyses, respectively.</p><p><strong>Results: </strong>Among 792 participating physicians, 466 (58.8%) completed online questionnaires pre- and postcourse. The average participant age was 48 (SD 12.4) years; 43.5% (332/762) were women, and 86% (655/762) were specialists. The average precourse intention score was 3.88 (SD 1.73) and average postcourse intention score was 4.92 (SD 1.40), for an adjusted mean difference of 1.06 (95% CI 0.93-1.20; P<.001). Factors associated with postcourse intention were beliefs about capabilities (β=0.52; P<.001), social influences (β=0.27; P<.001), and moral norm (β=0.26; P=.03; R<sup>2</sup>=0.22). Four months later, 41% (25/61; 95% CI 28.6%-54.3%) of participants reported having approached a colleague in difficulty. Frequently reported reasons for intention to adopt behavior were beliefs about capabilities, beliefs about consequences, and knowledge. Quantitative and qualitative results converged on beliefs about capabilities but diverged regarding beliefs about consequences. A total of 7 behavioral change techniques were ident
背景:医师普遍存在职业倦怠和心理困扰。同伴支持似乎对心理困扰起着保护作用。然而,人们对激励医生接触困难同伴的培训知之甚少,因为其效果经常被忽视或用未经验证的工具进行评估。目的:我们评估在线持续专业发展(CPD)课程的效果,该课程旨在提高医生接触困难同事的意愿。方法:邀请在2022年3月至2024年5月期间完成了一小时异步在线持续专业发展课程的医生参加这项混合方法趋同研究。这个在线学习课程旨在通过识别心理困扰的迹象,提供支持,并将他们转介到适当的资源,提高医生接近困难同事的信心。我们收集了参与者特征的数据,并使用经过验证的CPD-REACTION工具测量了他们在课程前后接近困难同事的行为意愿及其决定因素。我们使用配对t检验(n=466)和广义估计方程来评估前后平均意图得分的差异。我们通过多变量分析确定了与课程后意向相关的因素(n=466)。四个月后,我们计算了报告采取这种行为的医生的比例(n=61)。使用行为改变模型对开放式问题的定性回答进行主题分析。我们还确定了课程中使用的行为改变技巧。定性结果与定量结果进行三角剖分。我们按照STROBE(定量)和SRQR(定性)指南报告结果。结果:792名参与的医生中,466名(58.8%)完成了课程前后的在线问卷调查。平均年龄48±12.4岁,女性43.5%,专科医师86.0%。平均课程前意向为3.88 (SD=1.73),平均课程后意向为4.92 (SD=1.40),调整后平均差异为1.06 (95% CI: 0.93; 1.20)。结论:该在线CPD课程增加了医生接近困难同事的意向。结果强调了对能力的信念是这种行为意图的关键决定因素。该研究表明,在线学习在提高对同伴支持的认识,并最终在医护工作者中建立一种关怀文化方面具有强大的潜力。临床试验:不适用。
{"title":"Effect of an Online Continuing Professional Development Course on Physicians' Intention to Approach a Colleague in Difficulty: Mixed Methods Convergent Study.","authors":"Florence Lizotte, Martin Tremblay, Caroline Biron, Éloi Lachance, Souleymane Gadio, Roberta de Carvalho Corôa, Claude Bernard Uwizeye, Sam J Daniel, France Légaré","doi":"10.2196/80199","DOIUrl":"10.2196/80199","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Burnout and psychological distress are prevalent among physicians. Peer support appears to play a protective role, yet little is known about training interventions that motivate physicians to approach peers in difficulty, as such effects are often overlooked or assessed using nonvalidated tools.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;We evaluated the effects of an online continuing professional development (CPD) course designed to increase physicians' intention to approach a colleague in difficulty.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;Physicians who completed a 1-hour asynchronous online CPD course between March 2022 and May 2024 were invited to participate in this mixed methods convergent study. The e-learning course aimed to increase physicians' confidence in approaching colleagues in difficulty by recognizing signs of psychological distress, offering support, and referring them to appropriate resources. Participant characteristics were collected, and behavioral intention to approach a colleague in difficulty along with its determinants were measured pre- and postcourse using the validated CPD-REACTION tool. Differences in mean pre-post intention scores were assessed using 2-tailed paired t tests (n=466) and generalized estimating equations. Factors associated with postcourse intention were examined using multivariate analysis (n=466). Four months later, the proportion of physicians reporting adoption of the behavior was calculated (n=61). Qualitative responses to open-ended questions were analyzed thematically using behavior change models, and behavior change techniques used in the course were identified. Quantitative and qualitative results were triangulated. We reported results following STROBE (Strengthening the Reporting of Observational Studies in Epidemiology) and SRQR (Standards for Reporting Qualitative Research) guidelines for quantitative and qualitative analyses, respectively.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Among 792 participating physicians, 466 (58.8%) completed online questionnaires pre- and postcourse. The average participant age was 48 (SD 12.4) years; 43.5% (332/762) were women, and 86% (655/762) were specialists. The average precourse intention score was 3.88 (SD 1.73) and average postcourse intention score was 4.92 (SD 1.40), for an adjusted mean difference of 1.06 (95% CI 0.93-1.20; P&lt;.001). Factors associated with postcourse intention were beliefs about capabilities (β=0.52; P&lt;.001), social influences (β=0.27; P&lt;.001), and moral norm (β=0.26; P=.03; R&lt;sup&gt;2&lt;/sup&gt;=0.22). Four months later, 41% (25/61; 95% CI 28.6%-54.3%) of participants reported having approached a colleague in difficulty. Frequently reported reasons for intention to adopt behavior were beliefs about capabilities, beliefs about consequences, and knowledge. Quantitative and qualitative results converged on beliefs about capabilities but diverged regarding beliefs about consequences. A total of 7 behavioral change techniques were ident","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":" ","pages":"e80199"},"PeriodicalIF":3.2,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12921432/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145865338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effectiveness of Informed AI Use on Clinical Competence of General Practitioners and Internists: Pre-Post Intervention Study. 知情人工智能对全科医生和内科医生临床能力的影响:干预前后研究
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2026-02-05 DOI: 10.2196/75534
Eyad A Qunaibi, Ayman M Al-Qaaneh, Baraa F Ismail, Hussam I Muhidat, Farhia S Rageh, Najwa A Musallam, Alaa K Fawzy

Background: Artificial intelligence (AI) shows promise in clinical diagnosis, treatment support, and health care efficiency. However, its adoption in real-world practice remains limited due to insufficient clinical validation and an unclear impact on practitioners' competence. Addressing these gaps is essential for effective, confident, and ethical integration of AI into modern health care settings.

Objective: This study aimed to evaluate the effectiveness of informed AI use, following a tailored AI training course, on the performance of general practitioners (GPs) and internists in test-based clinical competence assessments and their attitudes toward clinical AI applications.

Methods: A pre-post intervention study was conducted with 326 physicians from 39 countries. Participants completed a baseline test of clinical decision-making skills, covering diagnosis, treatment planning, and patient counseling; attended a 1.5-hour online training on effective AI use; and then took a similar postcourse test with AI assistance permitted (GPT-4.0). Test performance and time per question were compared before and after the training. Participants also rated AI accuracy, efficiency, perceived need for structured AI training, and their willingness to use AI in clinical practice before and after the course.

Results: The average test scores improved from 56.9% (SD 15.7%) to 77.6% (SD 12.7%; P<.001), and the pass rate increased from 6.4% (21/326) to 58.6% (191/326), with larger gains observed among GPs and younger physicians. All skill domains (diagnosis, treatment planning, and patient counseling) improved significantly (all P<.001), while time taken to complete the test increased slightly from before to after the course (mean 40.25, SD 16.14 min vs 42.29, SD 14.02 min; P=.03). By the end of the intervention, physicians viewed AI more favorably, reporting increased confidence in its accuracy and time efficiency, greater appreciation for the need for structured AI training, and increased confidence and willingness to integrate AI into patient care.

Conclusions: Informed use of AI, based on tailored training, was associated with higher performance in test-based clinical decision-making assessments and greater confidence in using AI among GPs and internists. Building on previous research that often lacked structured training, focused primarily on model performance, or was limited in clinical scope, this study provides empirical evidence of both competence and perceptual improvement following informed AI use in a large, multinational cohort, enhancing the generalizability. These findings support the integration of structured AI training into medical education and continuing professional development to improve clinical performance and promote competent use of AI in clinical practice.

背景:人工智能(AI)在临床诊断、治疗支持和卫生保健效率方面显示出前景。然而,由于缺乏临床验证和对从业者能力的不明确影响,其在现实世界实践中的采用仍然有限。解决这些差距对于将人工智能有效、自信和合乎道德地融入现代卫生保健环境至关重要。目的:本研究旨在评估在量身定制的人工智能培训课程之后,对全科医生(gp)和内科医生在基于测试的临床能力评估中的表现以及他们对临床人工智能应用的态度的有效性。方法:对来自39个国家的326名医生进行干预前后研究。参与者完成了临床决策技能的基线测试,包括诊断、治疗计划和患者咨询;参加1.5小时有效使用人工智能的在线培训;然后在人工智能辅助下进行类似的课后测试(GPT-4.0)。比较训练前后的测试成绩和每个问题的时间。参与者还评估了人工智能的准确性、效率、对结构化人工智能培训的感知需求,以及他们在课程前后在临床实践中使用人工智能的意愿。结果:平均测试分数从56.9% (SD 15.7%)提高到77.6% (SD 12.7%)。结论:基于定制培训的人工智能的知情使用与基于测试的临床决策评估的更高表现以及全科医生和内科医生对使用人工智能的更大信心有关。以往的研究往往缺乏结构化的训练,主要关注模型的性能,或者在临床范围上受到限制,在此基础上,本研究提供了在大型跨国队列中使用知情人工智能后能力和感知能力提高的经验证据,增强了可推广性。这些发现支持将结构化人工智能培训整合到医学教育和持续专业发展中,以提高临床表现并促进人工智能在临床实践中的有效应用。
{"title":"Effectiveness of Informed AI Use on Clinical Competence of General Practitioners and Internists: Pre-Post Intervention Study.","authors":"Eyad A Qunaibi, Ayman M Al-Qaaneh, Baraa F Ismail, Hussam I Muhidat, Farhia S Rageh, Najwa A Musallam, Alaa K Fawzy","doi":"10.2196/75534","DOIUrl":"10.2196/75534","url":null,"abstract":"<p><strong>Background: </strong>Artificial intelligence (AI) shows promise in clinical diagnosis, treatment support, and health care efficiency. However, its adoption in real-world practice remains limited due to insufficient clinical validation and an unclear impact on practitioners' competence. Addressing these gaps is essential for effective, confident, and ethical integration of AI into modern health care settings.</p><p><strong>Objective: </strong>This study aimed to evaluate the effectiveness of informed AI use, following a tailored AI training course, on the performance of general practitioners (GPs) and internists in test-based clinical competence assessments and their attitudes toward clinical AI applications.</p><p><strong>Methods: </strong>A pre-post intervention study was conducted with 326 physicians from 39 countries. Participants completed a baseline test of clinical decision-making skills, covering diagnosis, treatment planning, and patient counseling; attended a 1.5-hour online training on effective AI use; and then took a similar postcourse test with AI assistance permitted (GPT-4.0). Test performance and time per question were compared before and after the training. Participants also rated AI accuracy, efficiency, perceived need for structured AI training, and their willingness to use AI in clinical practice before and after the course.</p><p><strong>Results: </strong>The average test scores improved from 56.9% (SD 15.7%) to 77.6% (SD 12.7%; P<.001), and the pass rate increased from 6.4% (21/326) to 58.6% (191/326), with larger gains observed among GPs and younger physicians. All skill domains (diagnosis, treatment planning, and patient counseling) improved significantly (all P<.001), while time taken to complete the test increased slightly from before to after the course (mean 40.25, SD 16.14 min vs 42.29, SD 14.02 min; P=.03). By the end of the intervention, physicians viewed AI more favorably, reporting increased confidence in its accuracy and time efficiency, greater appreciation for the need for structured AI training, and increased confidence and willingness to integrate AI into patient care.</p><p><strong>Conclusions: </strong>Informed use of AI, based on tailored training, was associated with higher performance in test-based clinical decision-making assessments and greater confidence in using AI among GPs and internists. Building on previous research that often lacked structured training, focused primarily on model performance, or was limited in clinical scope, this study provides empirical evidence of both competence and perceptual improvement following informed AI use in a large, multinational cohort, enhancing the generalizability. These findings support the integration of structured AI training into medical education and continuing professional development to improve clinical performance and promote competent use of AI in clinical practice.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"12 ","pages":"e75534"},"PeriodicalIF":3.2,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12921430/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146126620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating the Impact of a Virtual Reality Experience on Medical Student Empathy: Mixed Methods Study. 虚拟现实体验对医学生共情的影响:混合方法研究。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2026-02-04 DOI: 10.2196/76504
Allen G Mundok, Vivian N Ho, Lauren A Fowler, Ann Blair Kennedy, Shannon Stark-Taylor

Background: Physician empathy is important not only for improving patient satisfaction and health outcomes but also for increasing physician job satisfaction and protecting against burnout. However, amid concerns over declining empathy levels in medical education, there is a need for innovative teaching approaches that address the empathy gap, a critical element in patient-centered care.

Objective: This study aimed to use a mixed-methods analysis to explore the effectiveness of a virtual reality (VR) intervention versus traditional lecture methods in enhancing empathy among medical students.

Methods: Overall, 50 first- and second-year medical students were randomized to either a VR intervention, which simulated patient experiences, or a control group receiving traditional empathy lectures. Both groups watch 2 videos with reflections gathered after each video to capture students' experiential learning. Empathy was measured using the Jefferson Scale of Empathy-Student Version before and after the intervention.

Results: Quantitative analysis revealed significant increases in empathy scores post intervention for both groups (lecture group: mean increase 4.71, SD 11.01; VR group: mean increase 5.6, SD 10.02; P<.001), indicating that both interventions enhanced empathy. The VR group exhibited a significant difference in qualitative empathy coding after the second video (U=165.5; P<.001) compared to the lecture group. Qualitative feedback from the VR group emphasized a more profound emotional and cognitive engagement with the patient perspective than the lecture group.

Conclusions: This study supports the integration of VR into medical education as a complementary approach to traditional teaching methods for empathy training. VR immersion provides a valuable platform for students to develop a deeper, more nuanced understanding of empathy. These findings advocate for further exploration into VR's long-term impact on empathy in clinical practice.

背景:医生共情不仅对提高患者满意度和健康结果很重要,而且对提高医生工作满意度和防止倦怠也很重要。然而,在对医学教育中移情水平下降的担忧中,需要创新的教学方法来解决移情差距,这是以患者为中心的护理的关键因素。目的:本研究旨在采用混合方法分析,探讨虚拟现实(VR)干预与传统讲座方法在增强医学生共情能力方面的有效性。方法:总共有50名一年级和二年级的医学生被随机分为两组,一组接受虚拟现实干预,模拟病人的经历,另一组接受传统的移情讲座。两组分别观看两个视频,每个视频后都收集了学生的反思,以捕捉学生的体验式学习。干预前后采用杰弗逊共情-学生版量表测量共情。结果:定量分析显示,干预后两组学生共情得分均显著提高(讲课组平均提高4.71分,SD值为11.01;VR组平均提高5.6分,SD值为10.02)。结论:本研究支持将VR融入医学教育,作为传统共情训练教学方法的补充。VR沉浸为学生提供了一个有价值的平台,让他们对同理心有更深入、更细致的理解。这些发现支持在临床实践中进一步探索VR对移情的长期影响。
{"title":"Investigating the Impact of a Virtual Reality Experience on Medical Student Empathy: Mixed Methods Study.","authors":"Allen G Mundok, Vivian N Ho, Lauren A Fowler, Ann Blair Kennedy, Shannon Stark-Taylor","doi":"10.2196/76504","DOIUrl":"10.2196/76504","url":null,"abstract":"<p><strong>Background: </strong>Physician empathy is important not only for improving patient satisfaction and health outcomes but also for increasing physician job satisfaction and protecting against burnout. However, amid concerns over declining empathy levels in medical education, there is a need for innovative teaching approaches that address the empathy gap, a critical element in patient-centered care.</p><p><strong>Objective: </strong>This study aimed to use a mixed-methods analysis to explore the effectiveness of a virtual reality (VR) intervention versus traditional lecture methods in enhancing empathy among medical students.</p><p><strong>Methods: </strong>Overall, 50 first- and second-year medical students were randomized to either a VR intervention, which simulated patient experiences, or a control group receiving traditional empathy lectures. Both groups watch 2 videos with reflections gathered after each video to capture students' experiential learning. Empathy was measured using the Jefferson Scale of Empathy-Student Version before and after the intervention.</p><p><strong>Results: </strong>Quantitative analysis revealed significant increases in empathy scores post intervention for both groups (lecture group: mean increase 4.71, SD 11.01; VR group: mean increase 5.6, SD 10.02; P<.001), indicating that both interventions enhanced empathy. The VR group exhibited a significant difference in qualitative empathy coding after the second video (U=165.5; P<.001) compared to the lecture group. Qualitative feedback from the VR group emphasized a more profound emotional and cognitive engagement with the patient perspective than the lecture group.</p><p><strong>Conclusions: </strong>This study supports the integration of VR into medical education as a complementary approach to traditional teaching methods for empathy training. VR immersion provides a valuable platform for students to develop a deeper, more nuanced understanding of empathy. These findings advocate for further exploration into VR's long-term impact on empathy in clinical practice.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"12 ","pages":"e76504"},"PeriodicalIF":3.2,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12871579/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146119636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Blended Learning Compared With Face-to-Face Learning Among Family Medicine Residents: Randomized Controlled Trial. 家庭医学住院医师混合式学习与面对面学习之比较:随机对照试验。
IF 3.2 Q1 EDUCATION, SCIENTIFIC DISCIPLINES Pub Date : 2026-02-04 DOI: 10.2196/86387
Pierre-Yves Meunier, Sophie Schlatter, Juliette Macabrey, Frédéric Zorzi, Thomas Colleony, Rémy Boussageon, Hubert Maisonneuve, Marion Lamort-Bouché

Background: The medical education of French family medicine residents involves active, socioconstructivist-inspired small-group courses useful for skill acquisition. This is challenged by the increasing gap between the growing number of residents and the limited number of teachers. Blended courses have the potential to address this issue by reducing the duration of face-to-face sessions while preserving small-group courses.

Objective: This study aimed to compare the effects of blended vs traditional, face-to-face, active, socioconstructivist learning on the acquisition of knowledge and skills by family medicine residents.

Methods: We conducted a randomized controlled trial to compare a blended course and a traditional course. The blended course involved 2.5 hours of asynchronous e-learning and a 3-hour face-to-face session. The traditional course involved 5.5 hours of face-to-face teaching. Both courses were grounded in socioconstructivist principles and actively engaged residents. The primary outcome was residents' self-assessment of knowledge and skills. Secondary outcomes included satisfaction with knowledge- or skill-related learning objectives and academic achievement at 6 months.

Results: We included 155 family medicine residents (n=78, 50.3% in the blended course and n=77, 49.7% in the traditional course). There was no significant difference between groups regarding the primary outcome (mean difference 0.40 [maximum mean difference 20] points, 95% CI -0.21 to 1.02; P=.19; Cohen d=0.21). No significant differences were observed for the secondary outcomes except for knowledge self-assessment, which was higher in the blended course but not educationally meaningful (mean difference 0.40 [maximum possible 10] points, 95% CI 0.07-0.71; P=.02; Cohen d=0.39).

Conclusions: Blended courses can help sustain socioconstructivist small-group teaching methods while accommodating a growing family medicine resident population, with no deleterious impact on knowledge and skill self-assessments.

背景:法国家庭医学住院医师的医学教育包括积极的,社会建构主义启发的小组课程,有助于技能习得。不断增长的居民数量和有限的教师数量之间的差距越来越大,这对这一目标提出了挑战。混合课程有可能通过减少面对面会议的持续时间,同时保留小组课程来解决这个问题。目的:比较混合学习与传统学习、面对面学习、主动学习和社会建构主义学习对家庭医学住院医师知识和技能习得的影响。方法:我们进行了一项随机对照试验,比较混合疗程和传统疗程。混合课程包括2.5小时的异步电子学习和3小时的面对面课程。传统课程包括5.5小时的面对面教学。这两门课程都以社会建构主义原则为基础,并积极参与居民活动。主要结果是居民对知识和技能的自我评价。次要结果包括6个月时对知识或技能相关学习目标的满意度和学业成绩。结果:纳入155名家庭医学住院医师,其中混合治疗组为78人,50.3%;传统治疗组为77人,49.7%。各组间主要转归无显著差异(平均差0.40[最大平均差20]点,95% CI -0.21 ~ 1.02; P= 0.19; Cohen d=0.21)。除知识自我评估外,其他次要结局无显著差异,混合课程的知识自我评估更高,但没有教育意义(平均差异0.40[最大可能10]分,95% CI 0.07-0.71; P= 0.02; Cohen d=0.39)。结论:混合课程有助于维持社会建构主义的小组教学方法,同时适应不断增长的家庭医学住院居民,对知识和技能自我评估没有有害影响。
{"title":"Blended Learning Compared With Face-to-Face Learning Among Family Medicine Residents: Randomized Controlled Trial.","authors":"Pierre-Yves Meunier, Sophie Schlatter, Juliette Macabrey, Frédéric Zorzi, Thomas Colleony, Rémy Boussageon, Hubert Maisonneuve, Marion Lamort-Bouché","doi":"10.2196/86387","DOIUrl":"10.2196/86387","url":null,"abstract":"<p><strong>Background: </strong>The medical education of French family medicine residents involves active, socioconstructivist-inspired small-group courses useful for skill acquisition. This is challenged by the increasing gap between the growing number of residents and the limited number of teachers. Blended courses have the potential to address this issue by reducing the duration of face-to-face sessions while preserving small-group courses.</p><p><strong>Objective: </strong>This study aimed to compare the effects of blended vs traditional, face-to-face, active, socioconstructivist learning on the acquisition of knowledge and skills by family medicine residents.</p><p><strong>Methods: </strong>We conducted a randomized controlled trial to compare a blended course and a traditional course. The blended course involved 2.5 hours of asynchronous e-learning and a 3-hour face-to-face session. The traditional course involved 5.5 hours of face-to-face teaching. Both courses were grounded in socioconstructivist principles and actively engaged residents. The primary outcome was residents' self-assessment of knowledge and skills. Secondary outcomes included satisfaction with knowledge- or skill-related learning objectives and academic achievement at 6 months.</p><p><strong>Results: </strong>We included 155 family medicine residents (n=78, 50.3% in the blended course and n=77, 49.7% in the traditional course). There was no significant difference between groups regarding the primary outcome (mean difference 0.40 [maximum mean difference 20] points, 95% CI -0.21 to 1.02; P=.19; Cohen d=0.21). No significant differences were observed for the secondary outcomes except for knowledge self-assessment, which was higher in the blended course but not educationally meaningful (mean difference 0.40 [maximum possible 10] points, 95% CI 0.07-0.71; P=.02; Cohen d=0.39).</p><p><strong>Conclusions: </strong>Blended courses can help sustain socioconstructivist small-group teaching methods while accommodating a growing family medicine resident population, with no deleterious impact on knowledge and skill self-assessments.</p>","PeriodicalId":36236,"journal":{"name":"JMIR Medical Education","volume":"12 ","pages":"e86387"},"PeriodicalIF":3.2,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12871943/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Education
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1