
Advances in simulation (London, England): latest articles

Preparing Italian residents for global medical practice: the role of internationalization in education.
IF 4.7 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-25 DOI: 10.1186/s41077-025-00394-8
Claudia Ebm, Cherrelle Smith, Manuela Milani, Mia Karamatsu, Nick Pokrajac, Bernard Dannenberg, Maurizio Cecconi
Advances in simulation (London, England), vol. 10(1), p. 61. Journal Article. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12645667/pdf/
Citations: 0
The Advocacy-Inquiry Rubric (AIR): a standard to build debriefing and feedback skills.
IF 4.7 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-24 DOI: 10.1186/s41077-025-00381-z
Clément Buléon, Demian Szyld, Robert Simon, Lon Setnik, Walter J Eppich, Mary Fey, James A Lipshaw, Janice C Palaganas, Jenny W Rudolph

Background: Teaching and learning debriefing and feedback skills, especially to a level of mastery, is challenging without an agreed-upon standard. There are a number of rating scales and rubrics to identify and evaluate debriefing and feedback skills that focus on an entire feedback or debriefing conversation. However, there is no rubric to assess and provide feedback on one of these conversations' most widely used microskills, the Advocacy-Inquiry technique. This study aimed to develop and preliminarily test, through an international expert consensus process, the Advocacy-Inquiry Rubric (AIR), a tool designed to support the teaching, coaching, and assessment of Advocacy-Inquiry, a widely used yet challenging debriefing microskill.

Method: Using a four-round Delphi process, we achieved expert consensus on the behavioral markers of effective and ineffective Advocacy-Inquiry techniques. Thirty-nine experts from 13 countries identified and refined a set of key behavioral anchors for each of Advocacy-Inquiry's five elements: Preview, Observation, Point of View, Inquiry, and Listen. These descriptors were embedded first in a seven-point numeric Behaviorally Anchored Rating Scale, then in a three-point emoji-based version, and finally in a teaching and learning version. The AIR underwent two rounds of usability testing and inter-rater testing of the emoji version. Using an interpretation-use argument approach, evidence was collected for AIR's validity across scoring, generalization, extrapolation, and implication.

Results: The Delphi process established descriptors for each element of Advocacy-Inquiry, categorized by proficiency level (beginner to advanced). Usability testing enhanced the AIR's graphic layout to support both numeric ratings and formative feedback. The AIR was adapted into three tailored versions: a numeric AIR for detailed evaluation and progress tracking, an emoji AIR for peer assessment, and a teaching and learning AIR. Evidence for validity was assessed, highlighting both strengths and gaps.

Conclusion: AIR is an empirical rubric based on expert-derived criteria to support teaching, coaching, and assessing Advocacy-Inquiry microskills. The AIR offers a structured framework for self-, peer-, and mentor-led feedback and assessment to enhance a core skill of facilitators. By anchoring assessments in clear behavioral descriptors, the AIR aims to improve the quality of feedback and debriefing conversations. Future work should focus on rater training, reliability testing, and exploring the AIR's impact on real-world outcomes.

Advances in simulation (London, England), vol. 10(1), p. 60. Journal Article. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12645724/pdf/
Citations: 0
Training communication skills in a multiuser medical virtual reality simulation: a qualitative, observational study.
IF 4.7 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-24 DOI: 10.1186/s41077-025-00386-8
Lotte Cools, Rani Van Schoors, Fien Depaepe, Eline Dancet, Nicolas Delvaux

Background: Simulation-based education is a well-established training technique in medical curricula, including for communication skills. Virtual reality (VR) technology can enhance this form of experience-based learning. How VR interacts with training communication skills for interpersonal and interprofessional medical encounters is, however, unclear. This study investigates how VR influences communication skills and behaviors in patient-student and team encounters in medical undergraduate simulations, in order to make recommendations for VR simulation-based communication skills training (CST).

Methods: We conducted a study with 22 third-year medical students completing a dyadic VR simulation (Smart Collaboration Tutor software). We coded communication skills and behaviors for team and patient-student communication in videorecorded VR simulations. We then analyzed communication patterns and finally developed themes for VR-mediated CST.

Results: Our findings revealed that students preferred the core communication skill of asking questions, informing, and thinking aloud as process communication skills in a VR simulation. Nonverbal and paraverbal behaviors were used with unclear intent. VR negatively impacted the focus of attention and flow of simulation-based communication skills training.

Discussion: Dyadic VR simulations tend to emphasize team and task-oriented communication. Their value for patient-student and relation-oriented communication is unclear. VR influenced conversational turn-taking by altering visual and auditory perceptions. Cognitive load increased, potentially diverting attention from communication goals and observational focus.

Conclusion: Multiuser VR simulation shows certain possibilities for CST in medical undergraduate simulations. Recommendations on the contextual design of VR simulations, however, need to be taken into account to safeguard the focus of attention and flow of CST.

Advances in simulation (London, England), vol. 10(1), p. 59. Journal Article. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12642034/pdf/
Citations: 0
Ability of AI detection tools and humans to accurately identify different forms of AI-generated written content.
IF 4.7 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-22 DOI: 10.1186/s41077-025-00396-6
Adam Cheng, Yiqun Lin, Gabriel Reedy, Christine Joseph, Samantha Wirkowski, Viviane Mallette, Vikhashni Nagesh, David Krieser, Aaron Calhoun

Background: The increasing use of artificial intelligence (AI) by scholars presents a pressing challenge to healthcare publishing. While legitimate use can potentially accelerate scholarship, unethical approaches also exist, leading to factually inaccurate and biased text that may degrade scholarship. Numerous online AI detection tools exist that provide a percentage score of AI use. These can assist authors and editors in navigating this landscape. In this study, we compared the scores from three AI detection tools (ZeroGPT, PhraslyAI, and Grammarly AI Detector) across five plausible conditions of AI use and evaluated them against human assessments.

Methods: Thirty open access articles published in the journals Advances in Simulation and Simulation in Healthcare prior to 2022 were selected, and the article introductions were extracted. Five experimental conditions were examined, including: (1) 100% human written; (2) human written, light AI editing; (3) human written, heavy AI editing; (4) AI written text from human content; and (5) 100% AI written from article title. The resulting materials were assessed by three open-access AI detection tools and five blinded human raters. Results were summarized descriptively and compared using repeated measures analysis of variance (ANOVA), intraclass correlation coefficients (ICC), and Bland-Altman plots.

Results: The three AI detection tools were able to differentiate between the five test conditions (p < 0.001 for all), but varied significantly in absolute score, with ICC ranging from 0.57 to 0.95, raising concerns regarding overall reliability of these tools. Human scoring was far less consistent, with an overall accuracy of 19%, indistinguishable from chance.

Conclusion: While existing AI detection tools can meaningfully distinguish plausible AI use conditions, reliability across these tools is variable. Human scoring accuracy is uniformly low. Use of AI detection tools by scholars and journal editors may assist in determining potentially unethical use but they should not be relied upon alone at this time.
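
The Bland-Altman comparison named in the Methods can be sketched as follows. This is a minimal illustration, not the authors' analysis code: the function name `bland_altman` and the sample scores are hypothetical, and the limits of agreement assume the conventional mean difference plus or minus 1.96 sample standard deviations.

```python
from statistics import mean, stdev


def bland_altman(scores_a, scores_b):
    """Return (bias, lower_limit, upper_limit) for two paired score lists."""
    if len(scores_a) != len(scores_b) or len(scores_a) < 2:
        raise ValueError("need two paired lists with at least two items each")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    bias = mean(diffs)              # average disagreement between the two tools
    spread = 1.96 * stdev(diffs)    # 1.96 x sample SD of the paired differences
    return bias, bias - spread, bias + spread


# Hypothetical percentage scores from two detectors on the same five texts.
tool_a = [10.0, 35.0, 60.0, 80.0, 95.0]
tool_b = [5.0, 30.0, 70.0, 75.0, 100.0]
bias, lower, upper = bland_altman(tool_a, tool_b)
print(f"bias={bias:.2f}, limits of agreement=({lower:.2f}, {upper:.2f})")
```

Wide limits around a near-zero bias would indicate that two tools agree on average but can differ substantially on any single text, which is the kind of variability in absolute score that the Results describe.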

Advances in simulation (London, England), p. 66. Journal Article. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12752165/pdf/
Citations: 0
Demographic biases in AI-generated simulated patient cohorts: a comparative analysis against census benchmarks.
IF 4.7 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-18 DOI: 10.1186/s41077-025-00385-9
Miriam Veenhuizen, Andrew O'Malley

Background: Generative artificial intelligence models are being introduced as low-cost tools for creating simulated patient cohorts in undergraduate medical education. Their educational value, however, depends on the extent to which the synthetic populations mirror real-world demographic diversity. We therefore assessed whether two commonly deployed large language models produce patient profiles that reflect the current age, sex, and ethnic composition of the UK.

Methods: GPT-3.5-turbo-0125 and GPT-4-mini-2024-07-18 were each prompted, without demographic steering, to generate 250 UK-based 'patients'. Age was returned directly by the model; sex and ethnicity were inferred from given and family names using a validated census-derived classifier. Observed frequencies for each demographic variable were compared with England and Wales 2021 census expectations by chi-square goodness-of-fit tests.

Results: Both cohorts diverged significantly from census benchmarks (p < 0.0001 for every variable). Age distributions showed an absence of very young and older individuals, with certain middle-aged groups overrepresented (GPT-3.5: χ²(17) = 1310.4, p < 0.0001; GPT-4-mini: χ²(17) = 1866.1, p < 0.0001). Neither model produced patients younger than 25 years; GPT-3.5 generated no one older than 47 years and GPT-4-mini no one older than 56 years. Gender proportions also differed markedly, skewing heavily toward males (GPT-3.5: χ²(1) = 23.84, p < 0.0001; GPT-4-mini: χ²(1) = 191.7, p < 0.0001). Male patients constituted 64.7% and 92.8% of the two cohorts. Name diversity was limited: GPT-3.5 yielded 104 unique first-last-name combinations, whereas GPT-4-mini produced only nine. Ethnic profiles were similarly imbalanced, featuring overrepresentation of some groups and complete absence of others (χ²(10) = 42.19, p < 0.0001).

Conclusions: In their default state, the evaluated models create synthetic patient pools that exclude younger, older, female and most minority-ethnic representations. Such demographically narrow outputs threaten to normalise biased clinical expectations and may undermine efforts to prepare students for equitable practice. Baseline auditing of model behaviour is therefore essential, providing a benchmark against which prompt-engineering or data-curation strategies can be evaluated before generative systems are integrated into formal curricula.
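
The chi-square goodness-of-fit test described in the Methods can be illustrated with a short sketch. The counts and the 49% / 51% population split below are hypothetical placeholders, not the study's data or the census figures, and the helper names `chi_square_gof` and `p_value_df1` are invented for this example. The closed-form p-value shown applies only to one degree of freedom.

```python
import math


def chi_square_gof(observed, expected_props):
    """Chi-square goodness-of-fit statistic and degrees of freedom."""
    if len(observed) != len(expected_props):
        raise ValueError("observed and expected_props must align")
    if abs(sum(expected_props) - 1.0) > 1e-9:
        raise ValueError("expected proportions must sum to 1")
    total = sum(observed)
    # Sum of (observed - expected)^2 / expected over all categories.
    stat = sum(
        (obs - total * prop) ** 2 / (total * prop)
        for obs, prop in zip(observed, expected_props)
    )
    return stat, len(observed) - 1


def p_value_df1(stat):
    """Upper-tail p-value, valid only for one degree of freedom."""
    return math.erfc(math.sqrt(stat / 2.0))


# Hypothetical counts: 160 male and 90 female simulated patients,
# tested against an assumed 49% / 51% population split.
stat, df = chi_square_gof([160, 90], [0.49, 0.51])
print(f"chi2({df}) = {stat:.2f}, p = {p_value_df1(stat):.2g}")
```

For variables with more categories, such as age bands or ethnic groups, the same statistic applies with more degrees of freedom, and the p-value would normally come from a statistics library rather than the one-degree shortcut used here.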

Advances in simulation (London, England), vol. 10(1), p. 58. Journal Article. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12625206/pdf/
Citations: 0
Efficacy of publicly accessible tourniquets: a systematic review of layperson performance utilizing simulation models.
IF 4.7 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-18 DOI: 10.1186/s41077-025-00390-y
Steven Bordonaro, Christopher Negro, Karl Neubecker, Eric C Nemec, Suzanne J Rose

Background: A large portion of preventable deaths is a result of uncontrolled bleeding due to a delay in medical intervention. While publicly accessible tourniquets raise the concern of incorrect application by laypeople, tourniquets have proven efficacy and can be effectively applied by bystanders. This systematic review aims to identify if tourniquets applied by laypeople using a basic manikin or tourniquet trainer extremity with little to no training can effectively control bleeding.

Methods: The authors used EBSCOHost to simultaneously search the following databases: Cumulative Index to Nursing and Allied Health Literature (CINAHL) Ultimate, Academic Search Premier, Cochrane Central Register of Controlled Trials, Cochrane Database of Systematic Reviews, and Medical Literature Analysis and Retrieval System Online (MEDLINE) with Full Text. The Boolean search strategy was: tourniquet AND (layperson OR laypeople) AND ((bleeding AND control) OR (hemorrhage AND control) OR "stop the bleed") NOT surgery. The search was limited to the period from January 1, 2013, to August 31, 2023. Inclusion criteria were layperson participants in peer-reviewed randomized controlled or clinical trials, available in English, that assessed at least one outcome measure related to the efficacy of tourniquet application in a simulated context. Articles including duplicate data and those regarding tourniquet use/efficacy in settings other than prehospital care or bleeding control were excluded. Two independent reviewers selected studies according to prespecified inclusion and exclusion criteria. Risk of bias was assessed using the Cochrane RoB 2 tool.

Results: The initial search identified 83 studies, with 10 retained for inclusion in this review. Two different windlass rod tourniquets and one ratcheting strap tourniquet performed the best in terms of successful application by laypeople. Completing formal bleeding control training increased the average application success rate compared to no prior training. The Layperson Audiovisual Assist Tourniquet was the only audiovisual point-of-care aid that significantly increased the rate of successful applications. Just-in-Time visual cards also increased success rates significantly, showing comparable benefits to manufacturer instructions.

Conclusion: Although some laypeople can successfully place tourniquets without prior training, successful placement rates can be improved with point-of-care aids and formal bleeding control training using a basic manikin or tourniquet trainer extremity.
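
The Boolean search strategy in the Methods can be read as a simple predicate over a record's text. The sketch below is only an illustration of that logic, not how EBSCOHost evaluates queries: the function names `has` and `matches_search` are hypothetical, matching is by whole word without truncation or field restrictions, and the trailing NOT is assumed to exclude any record that mentions surgery.

```python
import re


def has(term, text):
    """True if the word or phrase occurs in text (case-insensitive)."""
    return re.search(r"\b" + re.escape(term) + r"\b", text, re.IGNORECASE) is not None


def matches_search(text):
    """Apply the review's Boolean strategy to one title or abstract."""
    population = has("layperson", text) or has("laypeople", text)
    bleeding = (
        (has("bleeding", text) and has("control", text))
        or (has("hemorrhage", text) and has("control", text))
        or has("stop the bleed", text)
    )
    # Assumption: the trailing NOT excludes any record mentioning surgery.
    return (
        has("tourniquet", text)
        and population
        and bleeding
        and not has("surgery", text)
    )


print(matches_search("Layperson tourniquet use for bleeding control"))
print(matches_search("Tourniquet bleeding control by laypeople in surgery"))
```

The first record satisfies every clause, while the second is excluded by the NOT term even though it matches the rest of the query.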

Advances in simulation (London, England), vol. 10(1), p. 57. Journal Article. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12625582/pdf/
Citations: 0
Frequency of team simulation and reduction in maternal deaths following Safer Births Bundle of Care implementation-a prospective observational study.
IF 4.7 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-14 DOI: 10.1186/s41077-025-00387-7
Kjetil Torgeirsen, Benjamin Kamala, Estomih Mduma, Florence Salvatory Kalabamu, Robert Moshiro, Doris Østergaard, Jan Terje Kvaløy, Hege Langli Ersdal

Background: Safer Births Bundle of Care (SBBC) is a continuous quality improvement (CQI) program, implemented in 30 facilities in Tanzania, resulting in a 75% reduction in maternal deaths. Simulation training was introduced as a component of the CQI efforts, targeting individual and team skills, focusing on identified clinical needs.

Objective: The aim of this study was to describe the frequency of documented simulation sessions and the number of recurrent participants and associations with changes in maternal death.

Methods: SBBC was a stepped-wedge cluster randomised implementation study in 30 facilities in 5 regions of Tanzania from 2020 through 2023. The SimBegin® facilitator training program was introduced to train facilitators and support implementation of a training cascade. Fifteen selected healthcare workers were trained in three levels of SimBegin® to become facilitators (level 1) and mentors (level 2). Eight were trained to become instructors (level 3). In total, 90 local facilitators were trained to review local clinical data, run simulation sessions, and document in logbooks. Clinical data were collected from patient files by independent data collectors and looped back to the facilities on a weekly basis. Training interventions were planned, conducted, and evaluated based on identified gaps. Output measures were the frequency of simulation sessions, the number of recurring participants, and maternal death within 7 days postpartum the following month.

Results: Overall, 281,165 parturient women were included in this study. The SBBC implementation period was 24-32 months, and 1280 simulation sessions were documented. Maternal deaths declined from 240/100,000 births at baseline to 60/100,000 after the start of SBBC. There was an association between the frequency of simulation sessions and the reduction in maternal deaths (23% reduction per unit increase on the log scale, P = 0.0018), and between the number of recurring participants and the reduction in maternal deaths (16% reduction per unit increase on the log scale, P = 0.0006).

Conclusion: This study documents a significant and clinically relevant association between the frequency of and participation in simulation sessions and the reduction of maternal deaths the following month.

Trial registration: SBBC main protocol ISRCTN Registry: ISRCTN30541755. Prospectively registered 12.10.2020.
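
The headline figures in the Results can be checked with a few lines of arithmetic. The sketch below is only an illustration: the 240 and 60 per 100,000 rates and the 23% figure come from the abstract, while the function names and the assumption that "log scale" means the natural logarithm with a constant multiplicative effect per log unit are hypothetical and are not stated in the source.

```python
import math


def relative_reduction(baseline: float, follow_up: float) -> float:
    """Fractional reduction from a baseline rate to a follow-up rate."""
    return (baseline - follow_up) / baseline


def rate_multiplier(reduction_per_log_unit: float, exposure_ratio: float) -> float:
    """Multiplier on the outcome when the exposure is scaled by exposure_ratio.

    ASSUMPTION (not from the abstract): the exposure is on a natural-log
    scale and each one-unit increase multiplies the outcome by
    (1 - reduction_per_log_unit).
    """
    return (1.0 - reduction_per_log_unit) ** math.log(exposure_ratio)


# Maternal deaths per 100,000 births, as reported in the abstract.
overall = relative_reduction(240, 60)
print(f"Overall reduction: {overall:.0%}")  # prints "Overall reduction: 75%"

# Under the stated assumption, what doubling the session frequency would imply.
doubling = rate_multiplier(0.23, 2.0)
print(f"Multiplier when sessions double: {doubling:.3f}")
```

The first value reproduces the 75% reduction quoted in the Background. The second is an interpretation aid only; the published model may use a different log base or covariate adjustment.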

Advances in simulation (London, England), 10(1):56. Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12619334/pdf/
Citations: 0
Simulated-based training for ultrasound-guided popliteal sciatic nerve block: determining the learning curve and transference to real patient.
IF 4.7 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-11 DOI: 10.1186/s41077-025-00389-5
Pablo F Miranda, Andrea L Araneda, Natalia P Molina, Felipe G Miranda, Christopher Morrison, Marcia A Corvetto, Fernando R Altermatt

Background: The following study aims to determine the learning curve experienced by anesthesia residents when training for an ultrasound-guided popliteal sciatic block and the transference of this training to real patient situations.

Methods: After approval by the ethics committee, eleven first-year anesthesia residents were recruited to participate in a simulation-based training program to perform a single-shot, in-plane popliteal sciatic block. Training consisted of 10 individual one-hour sessions, held weekly, with direct feedback from the instructor, using a specific Laerdal® popliteal sciatic block phantom. At the end of each session, the resident's performance was assessed. Residents were videotaped while performing the block, and the recordings were evaluated using a validated global rating scale (GRS). Additionally, a motion-tracking device attached to the operator's hands (Imperial College Surgical Assessment Device, ICSAD) recorded the total distance traveled by both hands (total path length, TPL), number of movements (NM), and total procedure time (TPT). One week later, the same assessment was done on a real patient.

Results: Ten residents completed the training and the assessments. Median GRS scores improved significantly from 15 to 28.3 over the course of training (p = 0.006). Regarding ICSAD measures, TPT decreased from 126 to 63.4 s (p = 0.002), while the decrease in TPL from 11.07 to 9.4 m was not statistically significant (p = 0.322). When comparing the last simulated session with the subsequent measurement in an actual patient, median values of GRS, TPL and NM did not differ.

Conclusions: This simulation-based training program significantly improved residents' proficiency in an ultrasound-guided popliteal sciatic block. The learning curve plateaued at session 7, and this improvement was transferred to the real patient setting. As expected, residents needed more time for the first block on a real patient than for the last simulated session.

Clinical trial number: ClinicalTrials.gov, identifier NCT06081790.

Advances in simulation (London, England), 10(1):55. Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12607028/pdf/
Citations: 0
Exploring trainee experiences in a structured virtual reality laparoscopic training programme for general surgeons: a longitudinal case study.
IF 4.7 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-28 DOI: 10.1186/s41077-025-00359-x
Aditi Siddharth, Sotiris Mastoridis, Michael Silva, Debbie Aitken, Helen Higham

Background: The acquisition and maintenance of technical skills in surgical specialties has become increasingly challenging for postgraduate trainees, exacerbated by factors such as the shift from traditional apprenticeship models, reduced operative time, and the impact of the COVID-19 pandemic. Virtual reality (VR) simulators offer a promising adjunct to traditional surgical training, though their integration into routine practice remains underexplored.

Objective: This qualitative study investigates the experiences and motivations of general surgical trainees who engaged with a VR laparoscopic simulator as part of a structured training program.

Methods: A case study methodology was chosen to explore the experiences of 22 general surgery trainees using a VR laparoscopic simulator over a period of 3 months. Each trainee was advised to practise a minimum of five repetitions across 25 laparoscopic simulator exercises. The study was designed using Kopta's theory of technical skill learning, focusing on the cognitive phase, where trainees repetitively practised individual steps with feedback. Data collection involved qualitative questionnaires, semi-structured interviews (with seven of the trainees, 8 months later), and quantitative data from the simulator. The qualitative data were analysed using thematic analysis, and descriptive statistical tests were applied to the quantitative data for triangulation.

Results: The study identified key factors influencing trainee engagement, including ease of access, the importance of periodic rather than frequent simulation sessions, Annual Review of Competency Progression (ARCP) overview and the value of setting specific performance goals. The findings suggest that simulation can effectively complement traditional surgical training when incorporated into routine practice, with potential for broader application if barriers such as time constraints and access issues are addressed.

Conclusion: This study contributes to the literature on surgical education by highlighting the need for targeted strategies to enhance the use of simulation as an adjunct alongside more traditional training.

Advances in simulation (London, England), 10(1):54. Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12570560/pdf/
Citations: 0
Comparison of fluoroscopy time and procedure time of endovascular interventions with and without prior angiography simulator training: a meta-analysis.
IF 4.7 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-27 DOI: 10.1186/s41077-025-00382-y
Timo C Meine, Johannes M Dorl, Anselm A Derda, Nima Mahmoudi, Hans-Jonas Meyer

Background: Angiography simulator training (AST) can help to train important clinical aspects of complex angiography procedures before real patient contact. The aim of the present analysis was to synthesize the results of studies on endovascular interventions performed by interventionalists with and without AST in a meta-analysis.

Methods: A systematic literature search was performed in PubMed, Web of Science and CINAHL to identify all relevant studies. Inclusion criteria were original research, English language, and comparison of procedure time (PT) and fluoroscopy time (FT) for endovascular interventions performed by interventionalists with and without AST. Study quality was assessed using a modified Downs and Black instrument (maximum 8 points). Heterogeneity (study design and I²) was assessed, and a fixed- or random-effects model was applied to pool the effect, the mean difference (MD), from the individual studies. All tests were two-sided, and the level of significance was 0.05.

Results: Overall, 9 studies with 10 datasets and 7774 interventions were included. Study quality was 7 ± 0 for both PT and FT. Heterogeneity was present in the studies on PT (I² = 61%) and FT (I² = 99%), so a random-effects model was applied. The MD for PT between the AST group and the control group was significant at -2.63 min (p = 0.02). In contrast, the MD for FT, -1.33 min, was not significant (p = 0.21).

Conclusion: AST translates into an improved PT and similar FT in real interventions compared to conventional training. Angiography simulators offer a valuable, radiation-free alternative and expand training opportunities. Evidence is limited by study heterogeneity.
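
The pooling step described in the Methods (a mean difference per study, an I² heterogeneity check, then a random-effects model) can be sketched as follows. This is a minimal illustration, not the authors' analysis: the abstract does not name the estimator, so the DerSimonian-Laird method is assumed here, and the function name `pool_random_effects` and the three study values are hypothetical placeholders rather than data from the included studies.

```python
import math


def pool_random_effects(effects, variances):
    """Pool per-study mean differences with a DerSimonian-Laird random-effects model.

    Returns (pooled effect, standard error, tau^2, I^2 in percent).
    """
    weights = [1.0 / v for v in variances]  # inverse-variance (fixed-effect) weights
    sum_w = sum(weights)
    fixed = sum(w * y for w, y in zip(weights, effects)) / sum_w

    # Cochran's Q measures dispersion of study effects around the fixed-effect mean.
    q = sum(w * (y - fixed) ** 2 for w, y in zip(weights, effects))
    df = len(effects) - 1

    # Between-study variance (tau^2), truncated at zero.
    c = sum_w - sum(w ** 2 for w in weights) / sum_w
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0

    # I^2: share of total variability attributed to heterogeneity.
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0

    # Random-effects weights add tau^2 to each study's variance.
    re_weights = [1.0 / (v + tau2) for v in variances]
    pooled = sum(w * y for w, y in zip(re_weights, effects)) / sum(re_weights)
    se = math.sqrt(1.0 / sum(re_weights))
    return pooled, se, tau2, i2


# HYPOTHETICAL mean differences in minutes (AST minus control) and their variances.
example_effects = [-4.0, -1.0, -3.0]
example_variances = [1.0, 1.0, 1.0]

pooled, se, tau2, i2 = pool_random_effects(example_effects, example_variances)
print(f"Pooled MD = {pooled:.2f} min, SE = {se:.2f}, tau^2 = {tau2:.2f}, I^2 = {i2:.1f}%")
```

A negative pooled MD favours the simulator-trained group. When I² is high, as reported for FT, the random-effects weights become nearly equal across studies and the pooled estimate is correspondingly less precise.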

Advances in simulation (London, England), 10(1):53. Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12560282/pdf/
Citations: 0