This study aimed to evaluate the impact of a teaching model combining massive open online course (MOOC) with virtual simulation experiments on clinical skills training. A quasi-experimental design was employed in the Second Clinical College of Wuhan University. The control group received traditional face-to-face teaching, while the experimental group received a teaching model combining MOOC instruction with virtual simulation experiments. After completing the course, both groups participated in a theoretical test and a Structured Clinical Skills Assessment to assess their mastery of clinical skills. Additionally, a questionnaire survey was conducted to assess the effectiveness of the experimental teaching model and to gauge student satisfaction. The experimental group's theoretical test scores were significantly higher than those of the control group, and the experimental group's Structured Clinical Skills Assessment scores were significantly higher than those of the control group, with statistically significant differences (p < 0.05). The post-course questionnaire survey of the experimental group showed that most students were satisfied with the teaching model. In the general surgery experiment course, the teaching model combining MOOC instruction with virtual simulation experiments is an effective teaching method that improves surgical skill performance and increases student satisfaction.
{"title":"MOOC-virtual simulation integration enhances surgical clinical skills: a quasi-experimental study.","authors":"Wang Zhang, Zonghuan Li, Pengcheng Li, Changhuan Liu, Zheng Wang, Yuping Liu, Lekai Zhu, Yulong Shi, Xue Fang, Xinghuan Wang, Zhe Xie, Xin Wang","doi":"10.1080/10872981.2025.2579396","DOIUrl":"10.1080/10872981.2025.2579396","url":null,"abstract":"<p><p>This study aimed to evaluate the impact of a teaching model combining massive open online course (MOOC) with virtual simulation experiments on clinical skills training. A quasi-experimental design was employed in the Second Clinical College of Wuhan University. The control group received traditional face-to-face teaching, while the experimental group received a teaching model combining MOOC instruction with virtual simulation experiments. After completing the course, both groups participated in a theoretical test and a Structured Clinical Skills Assessment to assess their mastery of clinical skills. Additionally, a questionnaire survey was conducted to assess the effectiveness of the experimental teaching model and to gauge student satisfaction. The experimental group's theoretical test scores were significantly higher than those of the control group, and the experimental group's Structured Clinical Skills Assessment scores were significantly higher than those of the control group, with statistically significant differences (<i>p</i> < 0.05). The post-course questionnaire survey of the experimental group showed that most students were satisfied with the teaching model. In the general surgery experiment course, the teaching model combining MOOC instruction with virtual simulation experiments is an effective teaching method that improves surgical skill performance and increases student satisfaction.</p>","PeriodicalId":47656,"journal":{"name":"Medical Education Online","volume":"30 1","pages":"2579396"},"PeriodicalIF":3.8,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12587814/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145439662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31Epub Date: 2025-11-09DOI: 10.1080/10872981.2025.2579079
Patrick McGown, Jennifer Forshaw, Lisa Gould
Introduction: Access to mentorship is frequently cited as a priority for doctors, however formal mentorship programmes throughout training are lacking for residents. Near-peer mentorship is a faculty-light option to potentially bridge this mentorship gap, however the literature in the clinical postgraduate setting is not comprehensive. We aim to evaluate the benefits and pitfalls of near-peer mentorship in a postgraduate setting.
Materials and methods: This study conducted a narrative review based on previously published frameworks. An exhaustive search of the literature was conducted using PubMed and OVID Medline. The findings were analysed using Framework synthesis, in which the pre-existing framework of benefits and pitfalls of near-peer mentorship were applied. After allocation to the frameworks' overarching themes, data was analysed thematically.
Results: Across 10 identified studies (one quantitative, three qualitative, six mixed-methods), near-peer mentorship was perceived to be beneficial (72-100% approval), with 85-99% of mentors and mentees desiring continuation of schemes at their units (seven UK-based, two Australian, one Canadian). The main themes identified were Mentee benefits, including careers advice and development of transferable skills; Mentor benefits, including leadership and organisational skill improvements; Organisational benefits, including reduced faculty workload and an enhanced sense of community. Pitfalls included a perceived lack of mentor expertise, shortage of time and resources, and unsupportive mentoring relationships.
Discussion: This research suggests that near-peer mentorship offers benefits for the mentee, mentor and organisation, if care is taken to mitigate the potential pitfalls. The main benefits versus traditional senior faculty mentorship derive from the concept of social and cognitive congruence, whereby a more closely relatable tutor is better able to tailor learning in terms understood by the tutee. Practical recommendations to optimise near-pear mentorship include mentorship pyramids, matching of pairings within the same clinical sites, and a hybrid matching approach with the option for mentee-led selection.
{"title":"Near-peer mentorship for newly qualified doctors; what are the benefits, and what methods can be used to overcome the pitfalls?","authors":"Patrick McGown, Jennifer Forshaw, Lisa Gould","doi":"10.1080/10872981.2025.2579079","DOIUrl":"10.1080/10872981.2025.2579079","url":null,"abstract":"<p><strong>Introduction: </strong>Access to mentorship is frequently cited as a priority for doctors, however formal mentorship programmes throughout training are lacking for residents. Near-peer mentorship is a faculty-light option to potentially bridge this mentorship gap, however the literature in the clinical postgraduate setting is not comprehensive. We aim to evaluate the benefits and pitfalls of near-peer mentorship in a postgraduate setting.</p><p><strong>Materials and methods: </strong>This study conducted a narrative review based on previously published frameworks. An exhaustive search of the literature was conducted using PubMed and OVID Medline. The findings were analysed using Framework synthesis, in which the pre-existing framework of <i>benefits</i> and <i>pitfalls</i> of near-peer mentorship were applied. After allocation to the frameworks' overarching themes, data was analysed thematically.</p><p><strong>Results: </strong>Across 10 identified studies (one quantitative, three qualitative, six mixed-methods), near-peer mentorship was perceived to be beneficial (72-100% approval), with 85-99% of mentors and mentees desiring continuation of schemes at their units (seven UK-based, two Australian, one Canadian). The main themes identified were Mentee benefits, including careers advice and development of transferable skills; Mentor benefits, including leadership and organisational skill improvements; Organisational benefits, including reduced faculty workload and an enhanced sense of community. Pitfalls included a perceived lack of mentor expertise, shortage of time and resources, and unsupportive mentoring relationships.</p><p><strong>Discussion: </strong>This research suggests that near-peer mentorship offers benefits for the mentee, mentor and organisation, if care is taken to mitigate the potential pitfalls. The main benefits versus traditional senior faculty mentorship derive from the concept of social and cognitive congruence, whereby a more closely relatable tutor is better able to tailor learning in terms understood by the tutee. Practical recommendations to optimise near-pear mentorship include mentorship pyramids, matching of pairings within the same clinical sites, and a hybrid matching approach with the option for mentee-led selection.</p>","PeriodicalId":47656,"journal":{"name":"Medical Education Online","volume":"30 1","pages":"2579079"},"PeriodicalIF":3.8,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12604131/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145482475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31Epub Date: 2025-10-16DOI: 10.1080/10872981.2025.2569936
Mikio Hayashi, Rintaro Imafuku, Katsumi Nishiya
The late submission of assignments by medical students is a chronic problem. Students' characteristics and past academic performances are useful indicators for predicting delays during medical school. However, medical students' perspectives on factors that lead to delays in assignment submission remain unexplored. To address this issue, this study examines the factors that influence the late submission of assignments in a medical university setting from the perspective of medical students. This exploratory qualitative case study recruited 20 undergraduate medical students who had submitted their assignments late at a private Japanese medical university. Data were collected through face-to-face, semi-structured interviews and analyzed using thematic analysis from a constructivist paradigm. A qualitative analysis revealed the following three themes regarding the factors influencing the late submission of assignments: (1) factors related to the medical university curriculum, including its structure, course offerings, and academic requirements, (2) factors related to the instructions from and authority of faculty, and (3) situational factors that exacerbate delays in student assignment submissions. Medical students felt that the nature of the assignments, which were assigned with no apparent regard for student schedules and offered a limited evaluative metric, was often irrational. The students implicitly accepted this situation while being aware of the infeasible submission deadlines. Medical students may have an internal sense of the perceived irrationality of faculty-imposed assignments and the authority behind them, and may strategically neglect some in favor of efficient course completion. Faculty members should ensure their assignments are consistent with course contents.
{"title":"Factors influencing the late submission of assignments in a medical university environment: a qualitative case study.","authors":"Mikio Hayashi, Rintaro Imafuku, Katsumi Nishiya","doi":"10.1080/10872981.2025.2569936","DOIUrl":"10.1080/10872981.2025.2569936","url":null,"abstract":"<p><p>The late submission of assignments by medical students is a chronic problem. Students' characteristics and past academic performances are useful indicators for predicting delays during medical school. However, medical students' perspectives on factors that lead to delays in assignment submission remain unexplored. To address this issue, this study examines the factors that influence the late submission of assignments in a medical university setting from the perspective of medical students. This exploratory qualitative case study recruited 20 undergraduate medical students who had submitted their assignments late at a private Japanese medical university. Data were collected through face-to-face, semi-structured interviews and analyzed using thematic analysis from a constructivist paradigm. A qualitative analysis revealed the following three themes regarding the factors influencing the late submission of assignments: (1) factors related to the medical university curriculum, including its structure, course offerings, and academic requirements, (2) factors related to the instructions from and authority of faculty, and (3) situational factors that exacerbate delays in student assignment submissions. Medical students felt that the nature of the assignments, which were assigned with no apparent regard for student schedules and offered a limited evaluative metric, was often irrational. The students implicitly accepted this situation while being aware of the infeasible submission deadlines. Medical students may have an internal sense of the perceived irrationality of faculty-imposed assignments and the authority behind them, and may strategically neglect some in favor of efficient course completion. Faculty members should ensure their assignments are consistent with course contents.</p>","PeriodicalId":47656,"journal":{"name":"Medical Education Online","volume":"30 1","pages":"2569936"},"PeriodicalIF":3.8,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12536618/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145309562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31Epub Date: 2025-11-20DOI: 10.1080/10872981.2025.2479755
Jayne Victoria Cullen, Hugh Alberti
The decision to trust a learner to actively participate in care is a fundamental part of clinical training. However, concerns about patient safety mean that clinical practice often presents newly qualified doctors with situations they are unable to manage independently. 'Entrustable professional activities' (EPAs) have been proposed as a possible solution to this unpreparedness. Understanding how undergraduate GP teachers make entrustment decisions, which is a gap in the existing literature, may be an appropriate first step towards understanding whether EPAs can be applied to undergraduate primary care. To explore teachers' experiences of entrustment decision-making in an undergraduate primary care context. This is an interpretive phenomenological study. Data were collected through semi-structured interviews with final year GP teachers. Interviews were audio recorded, transcribed and subjected to thematic analysis. Five GP teachers were interviewed. Five themes and twenty-four subthemes were produced. These captured participants' experiences of entrustment decision-making, as well as the factors influencing these decisions. The teacher's personal approach to risk was a key consideration. The experience of GP teachers is also shaped by their context and challenges such as managing complexity and uncertainty that are inherent to the GP role. Entrustment decision-making is complex and is experienced individually by different teachers, resulting in a range of approaches to supervision. Despite these differences there are various common factors that impact the decision of whether to entrust. This study allows several recommendations to be made that may enable us to move towards entrustment of undergraduate students in a primary care context.
{"title":"A question of risk: how do undergraduate GP teachers experience entrustment decision-making in primary care?","authors":"Jayne Victoria Cullen, Hugh Alberti","doi":"10.1080/10872981.2025.2479755","DOIUrl":"10.1080/10872981.2025.2479755","url":null,"abstract":"<p><p>The decision to trust a learner to actively participate in care is a fundamental part of clinical training. However, concerns about patient safety mean that clinical practice often presents newly qualified doctors with situations they are unable to manage independently. 'Entrustable professional activities' (EPAs) have been proposed as a possible solution to this unpreparedness. Understanding how undergraduate GP teachers make entrustment decisions, which is a gap in the existing literature, may be an appropriate first step towards understanding whether EPAs can be applied to undergraduate primary care. To explore teachers' experiences of entrustment decision-making in an undergraduate primary care context. This is an interpretive phenomenological study. Data were collected through semi-structured interviews with final year GP teachers. Interviews were audio recorded, transcribed and subjected to thematic analysis. Five GP teachers were interviewed. Five themes and twenty-four subthemes were produced. These captured participants' experiences of entrustment decision-making, as well as the factors influencing these decisions. The teacher's personal approach to risk was a key consideration. The experience of GP teachers is also shaped by their context and challenges such as managing complexity and uncertainty that are inherent to the GP role. Entrustment decision-making is complex and is experienced individually by different teachers, resulting in a range of approaches to supervision. Despite these differences there are various common factors that impact the decision of whether to entrust. This study allows several recommendations to be made that may enable us to move towards entrustment of undergraduate students in a primary care context.</p>","PeriodicalId":47656,"journal":{"name":"Medical Education Online","volume":"30 1","pages":"2479755"},"PeriodicalIF":3.8,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12636540/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145565984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31Epub Date: 2025-11-29DOI: 10.1080/10872981.2025.2592430
Pablo Ros-Arlanzón, Renato Gutarra-Ávila, Vicente Arrarte-Esteban, Vicente Bertomeu-González, Luis Hernández-Blasco, Mar Masiá, Laura Navarro-Canto, Juan Nieto-Navarro, Javier Abarca, Angel P Sempere
Large language models (LLMs) are increasingly used in healthcare and medical education, but their performance on institution-authored multiple-choice questions (MCQs), particularly with negative marking, remains unclear. To compare the examination performance of five contemporary LLMs with enrolled medical students on final multiple-choice (MCQ-style) course exams across four clinical courses. We conducted a comparative cross-sectional study at Miguel Hernández University (Spain) in 2025. Final exams in Infectious Diseases, Neurology, Respiratory Medicine, and Cardiovascular Medicine were administered under routine conditions in Spanish. Five LLMs (OpenAI o1, GPT-4o, DeepSeek R1, Microsoft Copilot, and Google Gemini 1.5 Flash) completed all MCQs in two independent runs. Scores were averaged and test-retest was estimated with Gwet's AC1. Student scores (n = 442) were summarized as mean ± SD or median (IQR). Pairwise differences between models were explored with McNemar's test; student-LLM contrasts were descriptive. Across courses, LLMs consistently exceeded the student median and, in several instances, the highest student score. Mean LLM courses scores ranged 7.46-9.88, versus student means 4.28-7.32. OpenAI o1 achieved the highest mean in three courses; Copilot led in Cardiovascular Medicine (text-only subset due to image limitations). All LLMs answered every MCQ and short term test-retest agreement was high (AC1 0.79-1.00). Aggregated across courses, LLMs averaged 8.75 compared with 5.76 for students. On department-set Spanish MCQ exams with negative marking, LLMs outperformed enrolled medical students, answered every item, and showed high short-term reproducibility. These findings support cautious, faculty-supervised use of LLMs as adjuncts to MCQ assessment (e.g. automated pretesting, feedback). Confirmation across institutions, languages, and image-rich formats, and evaluation of educational impact beyond accuracy are needed.
{"title":"When AI models take the exam: large language models vs medical students on multiple-choice course exams.","authors":"Pablo Ros-Arlanzón, Renato Gutarra-Ávila, Vicente Arrarte-Esteban, Vicente Bertomeu-González, Luis Hernández-Blasco, Mar Masiá, Laura Navarro-Canto, Juan Nieto-Navarro, Javier Abarca, Angel P Sempere","doi":"10.1080/10872981.2025.2592430","DOIUrl":"10.1080/10872981.2025.2592430","url":null,"abstract":"<p><p>Large language models (LLMs) are increasingly used in healthcare and medical education, but their performance on institution-authored multiple-choice questions (MCQs), particularly with negative marking, remains unclear. To compare the examination performance of five contemporary LLMs with enrolled medical students on final multiple-choice (MCQ-style) course exams across four clinical courses. We conducted a comparative cross-sectional study at Miguel Hernández University (Spain) in 2025. Final exams in Infectious Diseases, Neurology, Respiratory Medicine, and Cardiovascular Medicine were administered under routine conditions in Spanish. Five LLMs (OpenAI o1, GPT-4o, DeepSeek R1, Microsoft Copilot, and Google Gemini 1.5 Flash) completed all MCQs in two independent runs. Scores were averaged and test-retest was estimated with Gwet's AC1. Student scores (<i>n</i> = 442) were summarized as mean ± SD or median (IQR). Pairwise differences between models were explored with McNemar's test; student-LLM contrasts were descriptive. Across courses, LLMs consistently exceeded the student median and, in several instances, the highest student score. Mean LLM courses scores ranged 7.46-9.88, versus student means 4.28-7.32. OpenAI o1 achieved the highest mean in three courses; Copilot led in Cardiovascular Medicine (text-only subset due to image limitations). All LLMs answered every MCQ and short term test-retest agreement was high (AC1 0.79-1.00). Aggregated across courses, LLMs averaged 8.75 compared with 5.76 for students. On department-set Spanish MCQ exams with negative marking, LLMs outperformed enrolled medical students, answered every item, and showed high short-term reproducibility. These findings support cautious, faculty-supervised use of LLMs as adjuncts to MCQ assessment (e.g. automated pretesting, feedback). Confirmation across institutions, languages, and image-rich formats, and evaluation of educational impact beyond accuracy are needed.</p>","PeriodicalId":47656,"journal":{"name":"Medical Education Online","volume":"30 1","pages":"2592430"},"PeriodicalIF":3.8,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12667333/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145641194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31Epub Date: 2025-12-01DOI: 10.1080/10872981.2025.2585637
Do-Hwan Kim, Ye Ji Kang, Young-Mee Lee
The rapid evolution of artificial intelligence (AI) and its growing role in clinical settings have made AI education a priority in undergraduate medical education. To support this, AI curricula must align with existing medical education frameworks while addressing AI's distinctive characteristics. This article outlines twelve actionable tips to guide the development and implementation of such curricula. These include defining the purpose and scope of AI education within the broader context of existing competency frameworks and digital health. The curriculum should be structured to allow for progressive deepening and integration of content, prioritizing key elements. Additionally, sustainable AI education depends on securing institutional resources, providing learners with authentic experiences, and ensuring continuous evaluation and improvement of the curriculum. Together, these approaches aim to help medical schools prepare students to practice effectively in a future where AI is a core component of medical practice.
{"title":"Twelve tips for developing and implementing AI curriculum for undergraduate medical education.","authors":"Do-Hwan Kim, Ye Ji Kang, Young-Mee Lee","doi":"10.1080/10872981.2025.2585637","DOIUrl":"10.1080/10872981.2025.2585637","url":null,"abstract":"<p><p>The rapid evolution of artificial intelligence (AI) and its growing role in clinical settings have made AI education a priority in undergraduate medical education. To support this, AI curricula must align with existing medical education frameworks while addressing AI's distinctive characteristics. This article outlines twelve actionable tips to guide the development and implementation of such curricula. These include defining the purpose and scope of AI education within the broader context of existing competency frameworks and digital health. The curriculum should be structured to allow for progressive deepening and integration of content, prioritizing key elements. Additionally, sustainable AI education depends on securing institutional resources, providing learners with authentic experiences, and ensuring continuous evaluation and improvement of the curriculum. Together, these approaches aim to help medical schools prepare students to practice effectively in a future where AI is a core component of medical practice.</p>","PeriodicalId":47656,"journal":{"name":"Medical Education Online","volume":"30 1","pages":"2585637"},"PeriodicalIF":3.8,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12673982/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145655875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31Epub Date: 2025-10-29DOI: 10.1080/10872981.2025.2574647
Yavuz Selim Kıyak, Stanisław Górski, Tomasz Tokarek, Michał Pers, Andrzej A Kononowicz
In this study, we conducted a descriptive study to evaluate the quality of KFQs generated by OpenAI's o3 model. We developed a reusable generic prompt for KFQ generation, designed in alignment with the Medical Council of Canada's KFQ development guidelines. We also created an evaluation metric to systematically assess the quality of the KFQs based on the KFQ development guideline. Twenty unique cardiology-focused KFQs were created using recent European Society of Cardiology guidelines as reference. Each KFQ was independently assessed by two cardiology experts using the quality checklist, with disagreements resolved by a third reviewer. Descriptive statistics were used to summarize checklist compliance and final acceptability ratings. Of the 20 KFQs, 3 (15%) were rated 'Accept as is' and 17 (85%) 'Accept with minor revisions'; none required major revisions or were rejected. The overall compliance rate across checklist criteria was 93.7%, with perfect scores in domains such as key feature definition, scenario plausibility, and alignment between questions and scenarios. Lower performance was observed for inclusion of genuinely harmful 'killer' responses (50%), plausibility of distractors (77.8%), and active language use in phrasing the question (80%). The findings showed that an LLM, guided by a structured prompt, can generate KFQs that closely adhere to established quality standards, with most requiring only minor refinements. While expert review remains essential to ensure clinical accuracy and patient safety, AI-assisted workflows have strong potential to streamline KFQ development and enhance the scalability of CDM assessment in medical education.
{"title":"Large language models for generating key-feature questions in medical education.","authors":"Yavuz Selim Kıyak, Stanisław Górski, Tomasz Tokarek, Michał Pers, Andrzej A Kononowicz","doi":"10.1080/10872981.2025.2574647","DOIUrl":"10.1080/10872981.2025.2574647","url":null,"abstract":"<p><p>In this study, we conducted a descriptive study to evaluate the quality of KFQs generated by OpenAI's o3 model. We developed a reusable generic prompt for KFQ generation, designed in alignment with the Medical Council of Canada's KFQ development guidelines. We also created an evaluation metric to systematically assess the quality of the KFQs based on the KFQ development guideline. Twenty unique cardiology-focused KFQs were created using recent European Society of Cardiology guidelines as reference. Each KFQ was independently assessed by two cardiology experts using the quality checklist, with disagreements resolved by a third reviewer. Descriptive statistics were used to summarize checklist compliance and final acceptability ratings. Of the 20 KFQs, 3 (15%) were rated 'Accept as is' and 17 (85%) 'Accept with minor revisions'; none required major revisions or were rejected. The overall compliance rate across checklist criteria was 93.7%, with perfect scores in domains such as key feature definition, scenario plausibility, and alignment between questions and scenarios. Lower performance was observed for inclusion of genuinely harmful 'killer' responses (50%), plausibility of distractors (77.8%), and active language use in phrasing the question (80%). The findings showed that an LLM, guided by a structured prompt, can generate KFQs that closely adhere to established quality standards, with most requiring only minor refinements. While expert review remains essential to ensure clinical accuracy and patient safety, AI-assisted workflows have strong potential to streamline KFQ development and enhance the scalability of CDM assessment in medical education.</p>","PeriodicalId":47656,"journal":{"name":"Medical Education Online","volume":"30 1","pages":"2574647"},"PeriodicalIF":3.8,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12573569/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145393963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31Epub Date: 2025-11-15DOI: 10.1080/10872981.2025.2585634
Emma J Hastings, M Courtney Hughes, Lei Hua
The overturning of Roe v. Wade in Dobbs v. Jackson Women's Health Organization (2022) has led to varying abortion laws across states in the U.S., potentially influencing medical education choices. While some research has examined abortion policy changes and medical residencies and fellowship decisions, none has investigated abortion policy and medical school decisions. This study aims to determine whether pre-medical students' personal views on abortion influence their willingness to attend medical schools in states where abortion is illegal at all stages EXCEPT to save the life of the mother. A cross-sectional survey was distributed to pre-medical organizations at the two largest four-year institutions in each U.S. state and the District of Columbia, except Wyoming, which has only one four-year institution. The survey collected demographic data, political affiliation, intended medical specialty, and personal stance on abortion. Participants indicated their willingness to attend medical school in states with different types of abortion policies. There were 182 completed surveys from participants in 20 different states. Analysis showed that students who believed abortion is acceptable and should be legal at all stages were significantly less willing to attend medical school in states where abortion is illegal EXCEPT to save the life of the mother. Abortion policy may influence medical school decisions among pre-medical students, which may have long-term implications for physician distribution, particularly in states with restrictive abortion laws. Future research should explore how these trends impact healthcare workforce shortages and access to reproductive care.
在多布斯诉杰克逊妇女健康组织(Dobbs v. Jackson Women’s Health Organization, 2022)中,罗伊诉韦德案(Roe v. Wade)的判决被推翻,导致美国各州的堕胎法各不相同,这可能会影响医学教育的选择。虽然一些研究审查了堕胎政策的变化和医疗住院医师和奖学金的决定,但没有一个研究调查堕胎政策和医学院的决定。本研究旨在确定预科医学生对堕胎的个人观点是否会影响他们在除挽救母亲生命外所有阶段堕胎都是非法的州就读医学院的意愿。一项横断面调查分发给美国各州和哥伦比亚特区两所最大的四年制大学的医学预科组织,怀俄明州只有一所四年制大学。该调查收集了人口统计数据、政治派别、预期的医学专业和个人对堕胎的立场。参与者表示,他们愿意在堕胎政策不同的州就读医学院。共有来自20个不同州的参与者完成了182份调查。分析表明,那些认为堕胎是可以接受的,并且在所有阶段都应该是合法的学生,在除拯救母亲生命外堕胎为非法的州,明显不太愿意去医学院上学。堕胎政策可能影响医学院预科学生的医学院决定,这可能对医生分布产生长期影响,特别是在有限制性堕胎法的州。未来的研究应探讨这些趋势如何影响医疗保健人力短缺和获得生殖保健。
{"title":"Overturning Roe v. Wade and pre-medical students' views and medical school choices.","authors":"Emma J Hastings, M Courtney Hughes, Lei Hua","doi":"10.1080/10872981.2025.2585634","DOIUrl":"10.1080/10872981.2025.2585634","url":null,"abstract":"<p><p>The overturning of Roe v. Wade in Dobbs v. Jackson Women's Health Organization (2022) has led to varying abortion laws across states in the U.S., potentially influencing medical education choices. While some research has examined abortion policy changes and medical residencies and fellowship decisions, none has investigated abortion policy and medical school decisions. This study aims to determine whether pre-medical students' personal views on abortion influence their willingness to attend medical schools in states where abortion is illegal at all stages EXCEPT to save the life of the mother. A cross-sectional survey was distributed to pre-medical organizations at the two largest four-year institutions in each U.S. state and the District of Columbia, except Wyoming, which has only one four-year institution. The survey collected demographic data, political affiliation, intended medical specialty, and personal stance on abortion. Participants indicated their willingness to attend medical school in states with different types of abortion policies. There were 182 completed surveys from participants in 20 different states. Analysis showed that students who believed abortion is acceptable and should be legal at all stages were significantly less willing to attend medical school in states where abortion is illegal EXCEPT to save the life of the mother. Abortion policy may influence medical school decisions among pre-medical students, which may have long-term implications for physician distribution, particularly in states with restrictive abortion laws. Future research should explore how these trends impact healthcare workforce shortages and access to reproductive care.</p>","PeriodicalId":47656,"journal":{"name":"Medical Education Online","volume":"30 1","pages":"2585634"},"PeriodicalIF":3.8,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12621340/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145530853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31Epub Date: 2025-09-20DOI: 10.1080/10872981.2025.2559674
Maaike P Smit, Karsten A van Loon, Reinoud J B J Gemke, Matthijs de Hoog, Janielle van der Velden
Despite the use of structured competency frameworks and supporting educational tools, medical residents often continue to struggle with personalising their learning, managing feedback, and balancing professional demands with personal well-being. These persistent challenges suggest that conventional support systems may not fully address the nuanced needs of residents, contributing to feelings of disconnection and stress during training. The Resident Talks Podcast was developed to provide residents with accessible insights and practical tools to support their personal and professional development, particularly in areas such as work-life balance, stress management, and feedback integration. The podcast was developed in the Netherlands by a research group studying EPA-based residency training. It was inspired by insights from their qualitative studies and shaped in collaboration with residents. Social media engagement ensured the content remained relevant to their evolving needs. Topics ranged from professional identity, feedback culture and mental health to more sensitive or underrepresented issues such as pregnancy, diversity and addiction. Each 30-minute episode was distributed across platforms like Spotify and Apple Podcasts to maximise accessibility. To date, the podcast has released 21 episodes and garnered over 14,700 downloads. Feedback highlights its practical value and relevance in addressing issues that are not always openly discussed in formal training. The Resident Talks Podcast demonstrates the potential of innovative media to supplement resident support systems, addressing personal and professional challenges in a flexible and accessible way. Further research is needed to evaluate whether and how this format provides meaningful support in daily residency practice.
{"title":"Development and early reach of a podcast-based innovation to support medical residents' personal and professional development.","authors":"Maaike P Smit, Karsten A van Loon, Reinoud J B J Gemke, Matthijs de Hoog, Janielle van der Velden","doi":"10.1080/10872981.2025.2559674","DOIUrl":"10.1080/10872981.2025.2559674","url":null,"abstract":"<p><p>Despite the use of structured competency frameworks and supporting educational tools, medical residents often continue to struggle with personalising their learning, managing feedback, and balancing professional demands with personal well-being. These persistent challenges suggest that conventional support systems may not fully address the nuanced needs of residents, contributing to feelings of disconnection and stress during training. The <i>Resident Talks Podcast</i> was developed to provide residents with accessible insights and practical tools to support their personal and professional development, particularly in areas such as work-life balance, stress management, and feedback integration. The podcast was developed in the Netherlands by a research group studying EPA-based residency training. It was inspired by insights from their qualitative studies and shaped in collaboration with residents. Social media engagement ensured the content remained relevant to their evolving needs. Topics ranged from professional identity, feedback culture and mental health to more sensitive or underrepresented issues such as pregnancy, diversity and addiction. Each 30-minute episode was distributed across platforms like Spotify and Apple Podcasts to maximise accessibility. To date, the podcast has released 21 episodes and garnered over 14,700 downloads. Feedback highlights its practical value and relevance in addressing issues that are not always openly discussed in formal training. The <i>Resident Talks Podcast</i> demonstrates the potential of innovative media to supplement resident support systems, addressing personal and professional challenges in a flexible and accessible way. Further research is needed to evaluate whether and how this format provides meaningful support in daily residency practice.</p>","PeriodicalId":47656,"journal":{"name":"Medical Education Online","volume":"30 1","pages":"2559674"},"PeriodicalIF":3.8,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12451950/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145092783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31Epub Date: 2025-10-12DOI: 10.1080/10872981.2025.2568575
Mamta Parikh, Jennifer L Rosenthal, Luis Fernando Santana, Frederick J Meyers
Since the establishment of the Flexnerian model of medical education, single degree (MD) physician-scientists have significantly advanced biomedical research and clinical medicine. However, institutional emphasis has predominantly favoured dual-degree MD/PhD Medical Scientist Training Programs (MSTPs). Recognizing that many successful academic physician-scientists do not hold PhDs, there remains a critical need for structured research training and mentorship targeting single degree students. The University of California Davis School of Medicine developed the Academic Research Careers for Medical Doctors (ARC-MD) program as part of a broader institutional strategy to accelerate innovation, aligning closely with an institutional framework emphasizing thematic breadth and reciprocal interactions between basic and clinical departments. ARC-MD strategically integrates a five-year research-intensive pathway into the traditional four-year MD curriculum, annually enrolling 4-8 students. The program includes an introductory pre-matriculation course, a longitudinal curriculum spanning medical school, focused mentorship fostering interdisciplinary collaboration, a dedicated research year emphasizing reciprocal exchanges between clinical and basic science research, professional identity formation, and financial support through tuition scholarships and stipends. Since its inception in 2019, ARC-MD has enrolled 41 students and graduated two cohorts. Program evaluation surveys were administered in 2024 and 2025 across all five years of training. Student-reported research self-efficacy, measured using the Clinical Research Appraisal Inventory, was lowest among newly matriculating students (pre-program) and increased in subsequent years. Student-reported identity as physician-scientists and student assessments of mentors also showed overall improvement across training levels, though trajectories were not strictly linear.
自从Flexnerian医学教育模式建立以来,单学位(MD)医师科学家显著地推进了生物医学研究和临床医学。然而,机构的重点主要是支持双学位医学博士/博士医学科学家培训计划(mstp)。认识到许多成功的学术医师科学家没有博士学位,仍然迫切需要针对单一学位的学生进行结构化的研究培训和指导。加州大学戴维斯分校医学院开发了医学博士学术研究职业(ARC-MD)项目,作为加速创新的更广泛机构战略的一部分,与强调主题广度和基础和临床部门之间相互作用的机构框架紧密结合。ARC-MD战略性地将五年制研究密集型课程整合到传统的四年制医学博士课程中,每年招收4-8名学生。该计划包括一个入门的预科课程,一个跨越医学院的纵向课程,重点指导促进跨学科合作,一个专门的研究年,强调临床和基础科学研究之间的互惠交流,专业身份的形成,以及通过学费奖学金和津贴提供的财政支持。自2019年成立以来,ARC-MD已经招收了41名学生,并毕业了两届。项目评估调查在2024年和2025年进行,涵盖所有五年的培训。使用临床研究评估量表(Clinical research assessment Inventory)测量的学生报告的研究自我效能感,在刚入学的学生(预课程)中最低,随后几年增加。学生报告的医生科学家身份和学生对导师的评估也显示出在培训水平上的总体改善,尽管轨迹不是严格的线性的。
{"title":"Academic research careers for medical doctors (ARC-MD): a five-year UC Davis training program to foster the next generation of physician-scientists.","authors":"Mamta Parikh, Jennifer L Rosenthal, Luis Fernando Santana, Frederick J Meyers","doi":"10.1080/10872981.2025.2568575","DOIUrl":"10.1080/10872981.2025.2568575","url":null,"abstract":"<p><p>Since the establishment of the Flexnerian model of medical education, single degree (MD) physician-scientists have significantly advanced biomedical research and clinical medicine. However, institutional emphasis has predominantly favoured dual-degree MD/PhD Medical Scientist Training Programs (MSTPs). Recognizing that many successful academic physician-scientists do not hold PhDs, there remains a critical need for structured research training and mentorship targeting single degree students. The University of California Davis School of Medicine developed the Academic Research Careers for Medical Doctors (ARC-MD) program as part of a broader institutional strategy to accelerate innovation, aligning closely with an institutional framework emphasizing thematic breadth and reciprocal interactions between basic and clinical departments. ARC-MD strategically integrates a five-year research-intensive pathway into the traditional four-year MD curriculum, annually enrolling 4-8 students. The program includes an introductory pre-matriculation course, a longitudinal curriculum spanning medical school, focused mentorship fostering interdisciplinary collaboration, a dedicated research year emphasizing reciprocal exchanges between clinical and basic science research, professional identity formation, and financial support through tuition scholarships and stipends. Since its inception in 2019, ARC-MD has enrolled 41 students and graduated two cohorts. Program evaluation surveys were administered in 2024 and 2025 across all five years of training. Student-reported research self-efficacy, measured using the Clinical Research Appraisal Inventory, was lowest among newly matriculating students (pre-program) and increased in subsequent years. Student-reported identity as physician-scientists and student assessments of mentors also showed overall improvement across training levels, though trajectories were not strictly linear.</p>","PeriodicalId":47656,"journal":{"name":"Medical Education Online","volume":"30 1","pages":"2568575"},"PeriodicalIF":3.8,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12519581/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145281593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}