首页 > 最新文献

JMIR Formative Research最新文献

英文 中文
Firearm Violence and Health in Policymaker Discourse: Mixed Methods Social Media Analysis. 政策制定者话语中的枪支暴力与健康:混合方法社会媒体分析。
IF 2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-16 DOI: 10.2196/80397
Vivek A Ashok, William J K Vervilles, Katherine S Kellom, Anyun Chatterjee, Isabella Ntigbu, Okechi Boms, Andrea Szabo, Joel A Fein, Therese S Richmond, Jonathan Purtle, Matthew D Kearney, Zachary F Meisel
<p><strong>Background: </strong>Since 2019, firearm violence has remained the leading cause of death for US children and adolescents aged 1-19 years. This crisis has spurred action from policymakers, health professionals, and advocates. However, political polarization has contributed to divergent views on the causes and appropriate responses to firearm violence. Communication by elected officials, especially on social media, plays a critical role in shaping public opinion and policy agendas. Understanding how state policymakers discuss firearm violence, including the use of causal blame, calls to action, and health-related narratives, can inform more effective public health strategies.</p><p><strong>Objective: </strong>This study aimed to examine how Pennsylvania state legislators discuss firearms and firearm violence on social media and assess the extent to which their messaging aligns with public health perspectives.</p><p><strong>Methods: </strong>We conducted a 2-phase mixed methods analysis of X (formerly known as Twitter; X Corp) posts by Pennsylvania state legislators from May 27, 2017, to July 26, 2022. Posts were grouped into 3 time periods surrounding the Tree of Life Synagogue mass shooting in Pittsburgh. Using a Boolean search strategy, we identified 4573 posts related to firearms and firearm violence. After removing reposts and non-English content, we randomly sampled 1491 (32.6%) original posts authored by 152 unique legislators. Posts were coded using a structured codebook based on the Multiple Streams Framework to capture rhetorical framing, causal blame, and policy content. Interrater reliability was high (Holsti coefficient >0.8). We used chi-square tests and multivariable logistic regression to assess associations between rhetorical elements and policy mentions, adjusting for time period.</p><p><strong>Results: </strong>Mass shootings were the most frequently referenced category of firearm violence, peaking after the Tree of Life shooting (22/43, 51% vs 91/118, 77.1% vs 140/220, 63.6%; P=.004), while firearm suicide was rarely discussed. Posts using advocacy frames were nearly 5 times more likely to mention policy (adjusted odds ratio [aOR] 4.67, 95% CI 3.55-6.16), whereas those referencing mass shootings (aOR 0.54, 95% CI 0.37-0.77) or emotional appeals (aOR 0.53, 95% CI 0.40-0.69) were significantly less likely to do so. Most posts used general advocacy (aOR 2.97, 95% CI 2.13-4.13) and vague blame (aOR 8.26, 95% CI 6.02-11.35), resulting in nonspecific policy suggestions. Posts that attributed blame to firearm access were strongly associated with specific policy proposals (aOR 6.37, 95% CI 4.29-9.47) and inversely associated with general policy mentions (aOR 0.26, 95% CI 0.17-0.42). Only 9.4% (133/1422) of posts used health frames; when present, they more often referenced physical consequences (58/133, 43.6% vs 216/1358, 15.9%; P<.001).</p><p><strong>Conclusions: </strong>Pennsylvania legislators primarily focused on mass sho
背景:自2019年以来,枪支暴力仍然是美国1-19岁儿童和青少年死亡的主要原因。这场危机促使政策制定者、卫生专业人员和倡导者采取行动。然而,政治两极分化导致了对枪支暴力的起因和适当反应的不同看法。民选官员的沟通,特别是在社交媒体上的沟通,在塑造公众舆论和政策议程方面发挥着关键作用。了解州决策者如何讨论枪支暴力,包括使用因果责任、行动呼吁和与健康有关的叙述,可以为更有效的公共卫生战略提供信息。目的:本研究旨在研究宾夕法尼亚州立法者如何在社交媒体上讨论枪支和枪支暴力,并评估他们的信息传达与公共卫生观点一致的程度。方法:我们对2017年5月27日至2022年7月26日期间宾夕法尼亚州立法者发布的X(以前称为Twitter; X Corp)帖子进行了两阶段混合方法分析。在匹兹堡生命之树犹太教堂枪击案前后,帖子被分为三个时间段。使用布尔搜索策略,我们确定了4573个与枪支和枪支暴力有关的帖子。在删除转发和非英语内容后,我们随机抽取了152位独特立法者撰写的1491篇(32.6%)原创文章。使用基于多流框架的结构化代码本对帖子进行编码,以捕获修辞框架,因果责任和政策内容。信度高(Holsti系数>.8)。我们使用卡方检验和多变量逻辑回归来评估修辞元素与政策提及之间的关联,并根据时间段进行调整。结果:大规模枪击是枪支暴力中被提及频率最高的类别,在“生命之树”枪击案之后达到高峰(22/43,51% vs 91/118, 77.1% vs 140/220, 63.6%; P= 0.004),而枪支自杀很少被讨论。使用倡导框架的帖子提及政策的可能性几乎是其5倍(调整后的优势比[aOR] 4.67, 95% CI 3.55-6.16),而提及大规模枪击事件(aOR 0.54, 95% CI 0.37-0.77)或情感诉求(aOR 0.53, 95% CI 0.40-0.69)的帖子提及政策的可能性明显较低。多数帖子采用一般性倡导(aOR 2.97, 95% CI 2.13-4.13)和模糊指责(aOR 8.26, 95% CI 6.02-11.35),导致政策建议不明确。将责任归咎于枪支获取的帖子与具体政策建议密切相关(aOR为6.37,95% CI为4.29-9.47),与一般政策提及呈负相关(aOR为0.26,95% CI为0.17-0.42)。只有9.4%(133/1422)的岗位使用保健框架;当他们在场时,他们更经常提到身体后果(58/133,43.6% vs 216/1358, 15.9%)。结论:宾夕法尼亚州的立法者主要关注大规模枪击事件,依赖于情感或象征性的语言,而没有提出具体的政策。健康框架很少,通常侧重于后果而不是预防。调查结果强调,有机会支持决策者制定健康知情的信息战略,促进可操作的枪支暴力预防政策,特别是针对预防的政策。
{"title":"Firearm Violence and Health in Policymaker Discourse: Mixed Methods Social Media Analysis.","authors":"Vivek A Ashok, William J K Vervilles, Katherine S Kellom, Anyun Chatterjee, Isabella Ntigbu, Okechi Boms, Andrea Szabo, Joel A Fein, Therese S Richmond, Jonathan Purtle, Matthew D Kearney, Zachary F Meisel","doi":"10.2196/80397","DOIUrl":"10.2196/80397","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Since 2019, firearm violence has remained the leading cause of death for US children and adolescents aged 1-19 years. This crisis has spurred action from policymakers, health professionals, and advocates. However, political polarization has contributed to divergent views on the causes and appropriate responses to firearm violence. Communication by elected officials, especially on social media, plays a critical role in shaping public opinion and policy agendas. Understanding how state policymakers discuss firearm violence, including the use of causal blame, calls to action, and health-related narratives, can inform more effective public health strategies.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aimed to examine how Pennsylvania state legislators discuss firearms and firearm violence on social media and assess the extent to which their messaging aligns with public health perspectives.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We conducted a 2-phase mixed methods analysis of X (formerly known as Twitter; X Corp) posts by Pennsylvania state legislators from May 27, 2017, to July 26, 2022. Posts were grouped into 3 time periods surrounding the Tree of Life Synagogue mass shooting in Pittsburgh. Using a Boolean search strategy, we identified 4573 posts related to firearms and firearm violence. After removing reposts and non-English content, we randomly sampled 1491 (32.6%) original posts authored by 152 unique legislators. Posts were coded using a structured codebook based on the Multiple Streams Framework to capture rhetorical framing, causal blame, and policy content. Interrater reliability was high (Holsti coefficient &gt;0.8). We used chi-square tests and multivariable logistic regression to assess associations between rhetorical elements and policy mentions, adjusting for time period.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Mass shootings were the most frequently referenced category of firearm violence, peaking after the Tree of Life shooting (22/43, 51% vs 91/118, 77.1% vs 140/220, 63.6%; P=.004), while firearm suicide was rarely discussed. Posts using advocacy frames were nearly 5 times more likely to mention policy (adjusted odds ratio [aOR] 4.67, 95% CI 3.55-6.16), whereas those referencing mass shootings (aOR 0.54, 95% CI 0.37-0.77) or emotional appeals (aOR 0.53, 95% CI 0.40-0.69) were significantly less likely to do so. Most posts used general advocacy (aOR 2.97, 95% CI 2.13-4.13) and vague blame (aOR 8.26, 95% CI 6.02-11.35), resulting in nonspecific policy suggestions. Posts that attributed blame to firearm access were strongly associated with specific policy proposals (aOR 6.37, 95% CI 4.29-9.47) and inversely associated with general policy mentions (aOR 0.26, 95% CI 0.17-0.42). Only 9.4% (133/1422) of posts used health frames; when present, they more often referenced physical consequences (58/133, 43.6% vs 216/1358, 15.9%; P&lt;.001).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;Pennsylvania legislators primarily focused on mass sho","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e80397"},"PeriodicalIF":2.0,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12707438/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145768127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gamified Simulation for Onboarding Health Care Teams in Emergency Care: Development and Preliminary Feasibility Study. 急救护理团队入职游戏化模拟:发展及初步可行性研究。
IF 2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-15 DOI: 10.2196/72202
Stephane Gobron, Antoine Lestrade, Artan Sadiku, Alexandre Bentvelzen, Leila Ait Kaci, Jean-Michel Carrier, Emmanuelle Guyot, Pierre-Nicolas Carron

Background: High staff turnover is a widespread issue across nearly all hospital departments, often exceeding 20% annually. This constant flux disrupts continuity of care and creates a recurring challenge: how to rapidly integrate new employees into complex clinical environments, both physically and functionally. Traditional onboarding methods struggle to meet this demand, particularly in services operating 24/7, such as emergency departments (EDs).

Objective: This formative study presents the design and implementation of a web-based 3D gamified simulation platform aimed at improving staff onboarding in clinical environments. The paper outlines both the technical architecture-with guidance for hospital IT departments-and the acceptability and usability for permanent staff, who play a key role in ensuring onboarding continuity. We sought to assess whether such a tool could be autonomously managed and well received by health care professionals.

Methods: The intervention consisted of 2 linked components: a real-time, browser-based 3D simulation replicating the hospital's ED and a web-based quest editor allowing nontechnical staff to update training content. The system supports self-paced onboarding through location-based tasks, object searches, quizzes, and simulated staff interactions. Two preliminary usability studies were conducted: one with 37 ED staff members testing the 3D simulation and another with 9 users exploring the quest editor. Feedback was gathered through anonymous questionnaires and a descriptive analysis.

Results: Early results showed high feasibility and acceptability. Among 3D simulation testers (n=37), 90% (33/37) found the tool helpful for understanding the department's structure, and 81% (30/37) believed it would be useful for new staff. The inclusion of personal anecdotes and gamified tasks was viewed as engaging and motivating. The quest editor (n=9) was positively rated by 91% (8/9) of users, who appreciated the ability to autonomously update content without IT support. These findings support the dual promise of the platform (ie, pedagogical flexibility and technical sustainability).

Conclusions: This work demonstrates the feasibility of a gamified simulation platform designed for high-turnover clinical environments. It highlights both the operational deployment framework and the early acceptability among key staff members. While further validation with actual new hires is needed, this formative study shows promising potential for generalization beyond emergency care. The modular and editable nature of the system makes it a viable solution for scalable onboarding in other hospital departments.

背景:高员工流失率是几乎所有医院部门普遍存在的问题,通常每年超过20%。这种不断的变化破坏了护理的连续性,并产生了一个反复出现的挑战:如何快速地将新员工融入复杂的临床环境,无论是身体上还是功能上。传统的入职方法难以满足这一需求,特别是在急诊等全天候运营的服务部门。目的:本形成性研究提出了基于web的3D游戏化模拟平台的设计和实现,旨在改善临床环境中的员工入职。本文概述了技术架构(为医院IT部门提供指导)和长期员工的可接受性和可用性,长期员工在确保入职连续性方面发挥着关键作用。我们试图评估这样一个工具是否可以自主管理,并受到卫生保健专业人员的欢迎。方法:干预包括两个相互关联的组件:一个基于浏览器的实时3D模拟复制医院急诊科,一个基于网络的任务编辑器,允许非技术人员更新培训内容。该系统通过基于位置的任务、对象搜索、测验和模拟员工互动支持自定进度的入职。我们进行了两项初步的可用性研究:一项是由37名ED员工测试3D模拟,另一项是由9名用户探索任务编辑器。通过匿名问卷调查和描述性分析收集反馈。结果:初步结果具有较高的可行性和可接受性。在3D模拟测试人员(n=37)中,90%(33/37)的人认为该工具有助于了解部门结构,81%(30/37)的人认为该工具对新员工有用。个人轶事和游戏化的任务被认为是引人入胜和激励人心的。任务编辑器(n=9)得到91%(8/9)用户的积极评价,他们欣赏在没有IT支持的情况下自主更新内容的能力。这些发现支持了该平台的双重承诺(即教学灵活性和技术可持续性)。结论:这项工作证明了为高周转临床环境设计的游戏化模拟平台的可行性。它突出了业务部署框架和关键工作人员的早期可接受性。虽然需要对实际的新员工进行进一步的验证,但这项形成性研究显示了在急诊护理之外推广的良好潜力。该系统的模块化和可编辑特性使其成为其他医院部门可扩展入职的可行解决方案。
{"title":"Gamified Simulation for Onboarding Health Care Teams in Emergency Care: Development and Preliminary Feasibility Study.","authors":"Stephane Gobron, Antoine Lestrade, Artan Sadiku, Alexandre Bentvelzen, Leila Ait Kaci, Jean-Michel Carrier, Emmanuelle Guyot, Pierre-Nicolas Carron","doi":"10.2196/72202","DOIUrl":"10.2196/72202","url":null,"abstract":"<p><strong>Background: </strong>High staff turnover is a widespread issue across nearly all hospital departments, often exceeding 20% annually. This constant flux disrupts continuity of care and creates a recurring challenge: how to rapidly integrate new employees into complex clinical environments, both physically and functionally. Traditional onboarding methods struggle to meet this demand, particularly in services operating 24/7, such as emergency departments (EDs).</p><p><strong>Objective: </strong>This formative study presents the design and implementation of a web-based 3D gamified simulation platform aimed at improving staff onboarding in clinical environments. The paper outlines both the technical architecture-with guidance for hospital IT departments-and the acceptability and usability for permanent staff, who play a key role in ensuring onboarding continuity. We sought to assess whether such a tool could be autonomously managed and well received by health care professionals.</p><p><strong>Methods: </strong>The intervention consisted of 2 linked components: a real-time, browser-based 3D simulation replicating the hospital's ED and a web-based quest editor allowing nontechnical staff to update training content. The system supports self-paced onboarding through location-based tasks, object searches, quizzes, and simulated staff interactions. Two preliminary usability studies were conducted: one with 37 ED staff members testing the 3D simulation and another with 9 users exploring the quest editor. Feedback was gathered through anonymous questionnaires and a descriptive analysis.</p><p><strong>Results: </strong>Early results showed high feasibility and acceptability. Among 3D simulation testers (n=37), 90% (33/37) found the tool helpful for understanding the department's structure, and 81% (30/37) believed it would be useful for new staff. The inclusion of personal anecdotes and gamified tasks was viewed as engaging and motivating. The quest editor (n=9) was positively rated by 91% (8/9) of users, who appreciated the ability to autonomously update content without IT support. These findings support the dual promise of the platform (ie, pedagogical flexibility and technical sustainability).</p><p><strong>Conclusions: </strong>This work demonstrates the feasibility of a gamified simulation platform designed for high-turnover clinical environments. It highlights both the operational deployment framework and the early acceptability among key staff members. While further validation with actual new hires is needed, this formative study shows promising potential for generalization beyond emergency care. The modular and editable nature of the system makes it a viable solution for scalable onboarding in other hospital departments.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e72202"},"PeriodicalIF":2.0,"publicationDate":"2025-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12704704/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145762805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Role of Practitioner- and User-Set Goals in Engagement and Psychological Distress Among Kooth Digital Health Users: Retrospective Analysis. 从业者和用户设定的目标在参与和心理困扰中的作用在Kooth数字健康用户:回顾性分析。
IF 2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-15 DOI: 10.2196/70818
Jacqlyn Yourell, Jennifer Huberty, Terry Hanley, Louisa Salhi
<p><strong>Background: </strong>Youth and young adult mental health concerns are rising globally, with digital mental health platforms offering a promising solution for accessible support. Among the various features these platforms provide, goal setting and achievement have been shown to positively influence behavior change and mental health outcomes. However, there is limited understanding of how user-set goals compare to those set collaboratively with a practitioner regarding their impact on user engagement and mental health outcomes in digital mental health platforms.</p><p><strong>Objective: </strong>The purpose of this study was to examine the relationship between various goal-related variables (eg, the number of goals created and progress in user-set and practitioner-set goals) and user engagement as well as mental health (ie, psychological distress) on a free digital mental health platform. A secondary exploratory aim was to assess how different user-presenting issues were associated with platform engagement.</p><p><strong>Methods: </strong>We leveraged secondary data from a free, web-based mental health platform for youth aged 10 to 25 years in the United Kingdom that offers goal-setting features, emotional journaling, peer support, asynchronous chat with practitioners, and various self-guided well-being activities. Data included in the analyses were from youth and young adults (mean age 15.84 years, SD 2.88; 522/691, 75.5% female) who engaged with the goal-setting feature and completed both pre- and postengagement psychological distress measures between January 2020 and December 2023. We examined the relationship between user-set goals and practitioner-set goals on user engagement and psychological distress via linear regressions. The impact of different user-presenting issues on engagement was also explored via linear regression.</p><p><strong>Results: </strong>The number of practitioner-set goals created was positively associated with platform engagement (β=.16; P<.001), whereas the number of self-set goals and goal progress, whether self or practitioner set, were not. Progress on practitioner-set goals was significantly associated with reduced psychological distress (β=-.27; P<.001), while progress on self-set goals showed no significant association (P=.16). Physical health-related and school-related presenting issues were the strongest predictors of increased platform engagement (β=.21; P<.001 and β=.17; P<.001, respectively).</p><p><strong>Conclusions: </strong>These findings underscore the importance of collaborative goal setting in improving mental health outcomes for youth and young adults on digital mental health platforms. By highlighting the role of guided support and goal progression, this study enhances our understanding of how digital mental health platforms can better support young people's mental health and well-being. This paper also highlights how digital mental health platforms can serve as a valuable resource for addr
背景:青年和年轻人的心理健康问题正在全球范围内上升,数字心理健康平台为获得支持提供了一个有希望的解决方案。在这些平台提供的各种功能中,目标设定和成就已被证明对行为改变和心理健康结果有积极影响。然而,对于用户设定的目标与与从业者合作设定的目标在数字心理健康平台中对用户参与度和心理健康结果的影响如何进行比较,人们的理解有限。目的:本研究的目的是研究在一个免费的数字心理健康平台上,各种目标相关变量(例如,创建的目标数量和用户设定目标和从业者设定目标的进展)与用户参与度以及心理健康(即心理困扰)之间的关系。第二个探索目标是评估不同的用户呈现问题与平台参与度之间的关系。方法:我们利用了来自英国一个免费的、基于网络的10至25岁青少年心理健康平台的二手数据,该平台提供目标设定功能、情绪日志、同伴支持、与从业者的异步聊天以及各种自我指导的健康活动。纳入分析的数据来自青年和年轻人(平均年龄15.84岁,标准差2.88;522/691,75.5%女性),他们参与了目标设定特征,并在2020年1月至2023年12月期间完成了参与前和参与后的心理困扰测量。我们通过线性回归检验了用户设定目标和医生设定目标对用户参与度和心理困扰的影响。我们还通过线性回归探讨了不同用户呈现问题对用户粘性的影响。结果:医生设定目标的数量与平台参与度呈正相关(β= 0.16);结论:这些发现强调了协作目标设定对改善数字心理健康平台上青少年和年轻人心理健康结果的重要性。通过强调指导性支持和目标进展的作用,本研究增强了我们对数字心理健康平台如何更好地支持年轻人心理健康和福祉的理解。本文还强调了数字心理健康平台如何作为解决广泛心理健康需求的宝贵资源。
{"title":"The Role of Practitioner- and User-Set Goals in Engagement and Psychological Distress Among Kooth Digital Health Users: Retrospective Analysis.","authors":"Jacqlyn Yourell, Jennifer Huberty, Terry Hanley, Louisa Salhi","doi":"10.2196/70818","DOIUrl":"10.2196/70818","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Youth and young adult mental health concerns are rising globally, with digital mental health platforms offering a promising solution for accessible support. Among the various features these platforms provide, goal setting and achievement have been shown to positively influence behavior change and mental health outcomes. However, there is limited understanding of how user-set goals compare to those set collaboratively with a practitioner regarding their impact on user engagement and mental health outcomes in digital mental health platforms.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;The purpose of this study was to examine the relationship between various goal-related variables (eg, the number of goals created and progress in user-set and practitioner-set goals) and user engagement as well as mental health (ie, psychological distress) on a free digital mental health platform. A secondary exploratory aim was to assess how different user-presenting issues were associated with platform engagement.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We leveraged secondary data from a free, web-based mental health platform for youth aged 10 to 25 years in the United Kingdom that offers goal-setting features, emotional journaling, peer support, asynchronous chat with practitioners, and various self-guided well-being activities. Data included in the analyses were from youth and young adults (mean age 15.84 years, SD 2.88; 522/691, 75.5% female) who engaged with the goal-setting feature and completed both pre- and postengagement psychological distress measures between January 2020 and December 2023. We examined the relationship between user-set goals and practitioner-set goals on user engagement and psychological distress via linear regressions. The impact of different user-presenting issues on engagement was also explored via linear regression.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;The number of practitioner-set goals created was positively associated with platform engagement (β=.16; P&lt;.001), whereas the number of self-set goals and goal progress, whether self or practitioner set, were not. Progress on practitioner-set goals was significantly associated with reduced psychological distress (β=-.27; P&lt;.001), while progress on self-set goals showed no significant association (P=.16). Physical health-related and school-related presenting issues were the strongest predictors of increased platform engagement (β=.21; P&lt;.001 and β=.17; P&lt;.001, respectively).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;These findings underscore the importance of collaborative goal setting in improving mental health outcomes for youth and young adults on digital mental health platforms. By highlighting the role of guided support and goal progression, this study enhances our understanding of how digital mental health platforms can better support young people's mental health and well-being. This paper also highlights how digital mental health platforms can serve as a valuable resource for addr","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e70818"},"PeriodicalIF":2.0,"publicationDate":"2025-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750070/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145762788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Impact of a Digital Peer-Supported App on Daily Steps and Lifestyle Changes Among Individuals With Prediabetes and Early-Stage Type 2 Diabetes: Prospective, Nonrandomized Controlled Trial. 数字同伴支持应用程序对糖尿病前期和早期2型糖尿病患者每日步数和生活方式改变的影响:前瞻性、非随机对照试验
IF 2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-15 DOI: 10.2196/75953
Shota Yoshihara, Takeo Shibata, Mizuki Kosone, Megumi Shibuya, Kayoko Takahashi

Background: Physical activity is a simple, low-risk intervention that could be integrated into daily life to improve glycemic control in individuals with prediabetes and early-stage type 2 diabetes mellitus (T2DM). However, maintaining physical activity remains challenging, even when its benefits are well understood. Although digital peer support has the potential to promote and maintain physical activity, its effectiveness has not yet been sufficiently established.

Objective: This study examined the impact of a digital peer-supported app on daily step goal achievement and average daily step counts among individuals with prediabetes and early-stage T2DM.

Methods: This 3-month, prospective, nonrandomized controlled trial recruited participants aged 40-79 years with prediabetes or early-stage T2DM. The participants were divided into a digital peer-supported app group and a control group. The digital peer-supported app group tracked their daily steps, shared their progress with small peer groups, and received real-time feedback and support within the app. The control group tracked their steps individually using a pedometer. The primary outcome was the achievement rate of daily step goals. Secondary outcomes included the average daily step count, BMI, glycosylated hemoglobin A1c level, blood pressure, and self-reported lifestyle behaviors.

Results: A total of 32 participants (digital peer-supported app group: n=18 and control group: n=14) completed the study. The digital peer-supported app group reported a significantly higher median daily step goal achievement rate (57.2%, IQR 32.2%-90% vs 26.7%, IQR 10%-64.4%; P=.04) and daily step count (6854, IQR 4846-10388 steps vs 3946, IQR 3176-6832 steps; P<.03) compared to the control group. No significant differences were observed in glycosylated hemoglobin A1c levels, blood pressure, BMI, or lifestyle behaviors.

Conclusions: Our findings inform research in this field by suggesting that a digital peer-supported app may support daily step goal achievement and increase step counts among individuals with prediabetes and early-stage T2DM over the 3-month study period. The digital peer-supported app facilitated real-time feedback, peer approval, and continuous engagement to support participation in light physical activity.

Trial registration: UMIN-CTR UMIN000039466; https://center6.umin.ac.jp/cgi-open-bin/ctr_e/ctr_view.cgi?recptno=R000044999.

背景:体育活动是一种简单、低风险的干预措施,可以融入日常生活,改善糖尿病前期和早期2型糖尿病(T2DM)患者的血糖控制。然而,保持身体活动仍然具有挑战性,即使它的好处被充分理解。尽管数字同伴支持具有促进和维持身体活动的潜力,但其有效性尚未得到充分证实。目的:本研究考察了数字同伴支持应用程序对糖尿病前期和早期T2DM患者每日步数目标实现和平均每日步数的影响。方法:这项为期3个月的前瞻性非随机对照试验招募了年龄在40-79岁之间的糖尿病前期或早期T2DM患者。参与者被分为数字同伴支持的应用程序组和对照组。数字同伴支持的应用程序组跟踪他们的每日步数,与同伴小组分享他们的进展,并在应用程序中获得实时反馈和支持。对照组使用计步器单独跟踪他们的步数。主要结果是每日步数目标的完成率。次要结果包括平均每日步数、BMI、糖化血红蛋白A1c水平、血压和自我报告的生活方式行为。结果:共有32名参与者(数字同伴支持应用程序组:n=18,对照组:n=14)完成了研究。数字同伴支持的应用程序组报告了更高的每日步数目标成成率中位数(57.2%,IQR 32.2%-90% vs 26.7%, IQR 10%-64.4%; P= 0.04)和每日步数(6854,IQR 4846-10388步vs 3946, IQR 3176-6832步)、P1c水平、血压、BMI或生活方式行为。结论:我们的研究结果表明,在3个月的研究期间,数字同伴支持的应用程序可以支持糖尿病前期和早期T2DM患者实现每日步数目标并增加步数,从而为该领域的研究提供了信息。这款由同行支持的数字应用程序促进了实时反馈、同行认可和持续参与,以支持参与轻度体育活动。试验注册:UMIN-CTR UMIN000039466;https://center6.umin.ac.jp/cgi-open-bin/ctr_e/ctr_view.cgi?recptno=R000044999。
{"title":"The Impact of a Digital Peer-Supported App on Daily Steps and Lifestyle Changes Among Individuals With Prediabetes and Early-Stage Type 2 Diabetes: Prospective, Nonrandomized Controlled Trial.","authors":"Shota Yoshihara, Takeo Shibata, Mizuki Kosone, Megumi Shibuya, Kayoko Takahashi","doi":"10.2196/75953","DOIUrl":"10.2196/75953","url":null,"abstract":"<p><strong>Background: </strong>Physical activity is a simple, low-risk intervention that could be integrated into daily life to improve glycemic control in individuals with prediabetes and early-stage type 2 diabetes mellitus (T2DM). However, maintaining physical activity remains challenging, even when its benefits are well understood. Although digital peer support has the potential to promote and maintain physical activity, its effectiveness has not yet been sufficiently established.</p><p><strong>Objective: </strong>This study examined the impact of a digital peer-supported app on daily step goal achievement and average daily step counts among individuals with prediabetes and early-stage T2DM.</p><p><strong>Methods: </strong>This 3-month, prospective, nonrandomized controlled trial recruited participants aged 40-79 years with prediabetes or early-stage T2DM. The participants were divided into a digital peer-supported app group and a control group. The digital peer-supported app group tracked their daily steps, shared their progress with small peer groups, and received real-time feedback and support within the app. The control group tracked their steps individually using a pedometer. The primary outcome was the achievement rate of daily step goals. Secondary outcomes included the average daily step count, BMI, glycosylated hemoglobin A<sub>1c</sub> level, blood pressure, and self-reported lifestyle behaviors.</p><p><strong>Results: </strong>A total of 32 participants (digital peer-supported app group: n=18 and control group: n=14) completed the study. The digital peer-supported app group reported a significantly higher median daily step goal achievement rate (57.2%, IQR 32.2%-90% vs 26.7%, IQR 10%-64.4%; P=.04) and daily step count (6854, IQR 4846-10388 steps vs 3946, IQR 3176-6832 steps; P<.03) compared to the control group. No significant differences were observed in glycosylated hemoglobin A<sub>1c</sub> levels, blood pressure, BMI, or lifestyle behaviors.</p><p><strong>Conclusions: </strong>Our findings inform research in this field by suggesting that a digital peer-supported app may support daily step goal achievement and increase step counts among individuals with prediabetes and early-stage T2DM over the 3-month study period. The digital peer-supported app facilitated real-time feedback, peer approval, and continuous engagement to support participation in light physical activity.</p><p><strong>Trial registration: </strong>UMIN-CTR UMIN000039466; https://center6.umin.ac.jp/cgi-open-bin/ctr_e/ctr_view.cgi?recptno=R000044999.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e75953"},"PeriodicalIF":2.0,"publicationDate":"2025-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750071/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145762780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hype vs Reality in the Integration of Artificial Intelligence in Clinical Workflows. 人工智能在临床工作流程整合中的炒作与现实。
IF 2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-12 DOI: 10.2196/70921
Alaa Abd-Alrazaq, Barry Solaiman, Yosra Magdi Mekki, Dena Al-Thani, Faisal Farooq, Metab Alkubeyyer, Mohamed Ziyad Abubacker, Rawan AlSaad, Sarah Aziz, Ahmed Serag, Rajat Thomas, Javaid Sheikh, Arfan Ahmed

Unlabelled: Artificial intelligence (AI) has the capacity to transform health care by improving clinical decision-making, optimizing workflows, and enhancing patient outcomes. However, this potential remains limited by a complex set of technological, human, and ethical barriers that constrain its safe and equitable implementation. This paper argues for a holistic, systems-based approach to AI integration that addresses these challenges as interconnected rather than isolated. It identifies key technological barriers, including limited explainability, algorithmic bias, integration and interoperability issues, lack of generalizability, and difficulties in validation. Human factors such as resistance to change, insufficient stakeholder engagement, and education and resource constraints further impede adoption, whereas ethical and legal challenges related to liability, privacy, informed consent, and inequity compound these obstacles. Addressing these issues requires transparent model design, diverse datasets, participatory development, and adaptive governance. Recommendations emerging from this synthesis are as follows: (1) establish standardized international regulatory and governance frameworks; (2) promote multidisciplinary co-design involving clinicians, developers, and patients; (3) invest in clinician education, AI literacy, and continuous training; (4) ensure equitable resource allocation through dedicated funding and public-private partnerships; (5) prioritize multimodal, explainable, and ethically aligned AI development; and (6) focus on long-term evaluation of AI in real-world settings to ensure adaptive, transparent, and inclusive deployment. Adopting these measures can align innovation with accountability, enabling health care systems to harness AI's transformative potential responsibly and sustainably to advance patient care and health equity.

未标记:人工智能(AI)有能力通过改善临床决策、优化工作流程和提高患者治疗效果来改变医疗保健。然而,这一潜力仍然受到一系列复杂的技术、人力和道德障碍的限制,这些障碍制约了其安全和公平的实施。本文主张采用一种整体的、基于系统的人工智能集成方法,将这些挑战视为相互关联而不是孤立的。它确定了关键的技术障碍,包括有限的可解释性、算法偏差、集成和互操作性问题、缺乏通用性以及验证中的困难。抵制变革、利益相关者参与不足、教育和资源限制等人为因素进一步阻碍了采用,而与责任、隐私、知情同意和不公平相关的道德和法律挑战使这些障碍更加复杂。解决这些问题需要透明的模型设计、多样化的数据集、参与式开发和适应性治理。从这一综合得出的建议如下:(1)建立标准化的国际监管和治理框架;(2)促进临床医生、研发人员和患者的多学科协同设计;(3)投资临床医生教育、人工智能素养和持续培训;(4)通过专项资金和公私伙伴关系确保公平的资源分配;(5)优先考虑多模式、可解释和符合伦理的人工智能开发;(6)专注于在现实环境中对人工智能进行长期评估,以确保自适应、透明和包容的部署。采取这些措施可以使创新与问责制相结合,使卫生保健系统能够负责任地和可持续地利用人工智能的变革潜力,促进患者护理和卫生公平。
{"title":"Hype vs Reality in the Integration of Artificial Intelligence in Clinical Workflows.","authors":"Alaa Abd-Alrazaq, Barry Solaiman, Yosra Magdi Mekki, Dena Al-Thani, Faisal Farooq, Metab Alkubeyyer, Mohamed Ziyad Abubacker, Rawan AlSaad, Sarah Aziz, Ahmed Serag, Rajat Thomas, Javaid Sheikh, Arfan Ahmed","doi":"10.2196/70921","DOIUrl":"10.2196/70921","url":null,"abstract":"<p><strong>Unlabelled: </strong>Artificial intelligence (AI) has the capacity to transform health care by improving clinical decision-making, optimizing workflows, and enhancing patient outcomes. However, this potential remains limited by a complex set of technological, human, and ethical barriers that constrain its safe and equitable implementation. This paper argues for a holistic, systems-based approach to AI integration that addresses these challenges as interconnected rather than isolated. It identifies key technological barriers, including limited explainability, algorithmic bias, integration and interoperability issues, lack of generalizability, and difficulties in validation. Human factors such as resistance to change, insufficient stakeholder engagement, and education and resource constraints further impede adoption, whereas ethical and legal challenges related to liability, privacy, informed consent, and inequity compound these obstacles. Addressing these issues requires transparent model design, diverse datasets, participatory development, and adaptive governance. Recommendations emerging from this synthesis are as follows: (1) establish standardized international regulatory and governance frameworks; (2) promote multidisciplinary co-design involving clinicians, developers, and patients; (3) invest in clinician education, AI literacy, and continuous training; (4) ensure equitable resource allocation through dedicated funding and public-private partnerships; (5) prioritize multimodal, explainable, and ethically aligned AI development; and (6) focus on long-term evaluation of AI in real-world settings to ensure adaptive, transparent, and inclusive deployment. Adopting these measures can align innovation with accountability, enabling health care systems to harness AI's transformative potential responsibly and sustainably to advance patient care and health equity.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e70921"},"PeriodicalIF":2.0,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12700513/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145742601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Consumer Co-Design of an Online Resource to Build Communication Skills of Health Consumers: Mixed Methods Study. 消费者共同设计线上资源以建立健康消费者沟通技巧:混合方法研究。
IF 2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-12 DOI: 10.2196/77263
Alison Beauchamp, Julieanne Hilbers, Natali Cvetanovska, Anna Wong Shee, Lidia Horvat, Sandra Rogers, Andrea Cooper, Elizabeth Flemming-Judge, Sue Rawlinson, Rebecca Jessup
<p><strong>Background: </strong>Information provided by health professionals can be complex and is often not well understood by health care consumers, leading to adverse outcomes. Clinician-led communication approaches such as "teach-back" can improve consumer understanding, yet are infrequently used by clinicians. A possible solution is to build consumers' skills to proactively check their understanding rather than waiting for the clinician to do so; however, there are few educational resources to support consumers in building these skills.</p><p><strong>Objective: </strong>This study aimed to co-design a web-based learning resource for consumers to check they have understood information provided by a clinician (ie, to "check-back").</p><p><strong>Methods: </strong>This mixed methods study used a co-design approach, consisting of 2 phases. The study was conducted during the COVID-19 pandemic, and all activities were conducted online, via email or telephone. Phase 1 (needs assessment) involved first establishing an Expert Panel of consumers, clinicians, and academic experts to guide all co-design steps of the study. Next, we sought to understand issues around health communication through focus groups and interviews with consumers and clinicians. Participants were recruited from outpatient settings and consumer representative programs within 3 health services in Victoria, Australia. Focus groups and interviews aimed to identify factors that might influence consumers' use of check-back. Deductive analysis based on the Capability, Opportunity, and Motivation-Behavior (COM-B) model was used to identify initial themes; these were discussed in depth with the Expert Panel and barriers within each theme identified. A rapid literature review was undertaken to identify strategies for web-based communication training for consumers. Phase 2 (creation of the online resource) involved an iterative process. In an online meeting, Expert Panel members brainstormed ideas for addressing barriers and prioritized these ideas for inclusion in the resource. Several drafts of the content were written before a draft online version was built. This draft was reviewed by the Expert Panel, who recommended extensive revisions. Following these revisions, we conducted an online survey and focus group with consumers and clinicians from Phase 1 to identify further improvements. Findings from this consultation were used to make final changes to the online resource.</p><p><strong>Results: </strong>The Expert Panel included 12 members. Phase 1 focus groups and interviews were held with 39 consumers and 16 clinicians. Five themes were identified: self-efficacy, pre-existing skills, clinician attitudes, information complexity, and internal barriers such as embarrassment. Phase 2 survey and focus group participants identified several issues with the second draft of the resource, focusing on functionality, accessibility, and layout. Usability and acceptability of the resource were rated
背景:卫生专业人员提供的信息可能很复杂,卫生保健消费者往往不能很好地理解,从而导致不良后果。临床医生主导的沟通方式,如“反馈”,可以提高消费者的理解,但很少被临床医生使用。一个可能的解决方案是培养消费者的技能,主动检查他们的理解,而不是等待临床医生这样做;然而,很少有教育资源来支持消费者培养这些技能。目的:本研究旨在共同设计一个基于网络的学习资源,供消费者检查他们是否理解了临床医生提供的信息(即“检查”)。方法:本研究采用协同设计方法,共分为两个阶段。这项研究是在COVID-19大流行期间进行的,所有活动都是通过在线、电子邮件或电话进行的。第一阶段(需求评估)首先涉及建立一个由消费者、临床医生和学术专家组成的专家小组,以指导研究的所有共同设计步骤。接下来,我们试图通过焦点小组和与消费者和临床医生的访谈来了解与健康沟通有关的问题。参与者从澳大利亚维多利亚州3个卫生服务机构的门诊设置和消费者代表项目中招募。焦点小组和访谈旨在确定可能影响消费者使用check-back的因素。基于能力、机会和动机-行为(COM-B)模型的演绎分析用于确定初始主题;与专家小组深入讨论了这些问题,并确定了每个主题中的障碍。进行了一项快速文献审查,以确定为消费者提供网络沟通培训的策略。阶段2(在线资源的创建)涉及一个迭代过程。在一次在线会议上,专家小组成员对解决障碍的想法进行了头脑风暴,并对这些想法进行了优先排序,以便纳入资源。在建立在线版本的草稿之前,已经写了几份内容草稿。专家小组审查了该草案,并建议进行广泛修订。在这些修订之后,我们与第一阶段的消费者和临床医生进行了在线调查和焦点小组讨论,以确定进一步的改进。这次咨询的结果被用于对在线资源进行最终修改。结果:专家组共有12名成员。第一阶段对39名消费者和16名临床医生进行了焦点小组和访谈。确定了五个主题:自我效能感、已有技能、临床医生态度、信息复杂性和尴尬等内部障碍。阶段2调查和焦点小组参与者确定了资源第二稿的几个问题,重点是功能、可访问性和布局。参与者对资源的可用性和可接受性评价很高。结论:研究结果强调了使用协同设计开发以消费者为中心的基于网络的学习资源的价值。需要进一步评价以证明其在提高消费者理解方面的有效性。
{"title":"Consumer Co-Design of an Online Resource to Build Communication Skills of Health Consumers: Mixed Methods Study.","authors":"Alison Beauchamp, Julieanne Hilbers, Natali Cvetanovska, Anna Wong Shee, Lidia Horvat, Sandra Rogers, Andrea Cooper, Elizabeth Flemming-Judge, Sue Rawlinson, Rebecca Jessup","doi":"10.2196/77263","DOIUrl":"10.2196/77263","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Information provided by health professionals can be complex and is often not well understood by health care consumers, leading to adverse outcomes. Clinician-led communication approaches such as \"teach-back\" can improve consumer understanding, yet are infrequently used by clinicians. A possible solution is to build consumers' skills to proactively check their understanding rather than waiting for the clinician to do so; however, there are few educational resources to support consumers in building these skills.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aimed to co-design a web-based learning resource for consumers to check they have understood information provided by a clinician (ie, to \"check-back\").&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;This mixed methods study used a co-design approach, consisting of 2 phases. The study was conducted during the COVID-19 pandemic, and all activities were conducted online, via email or telephone. Phase 1 (needs assessment) involved first establishing an Expert Panel of consumers, clinicians, and academic experts to guide all co-design steps of the study. Next, we sought to understand issues around health communication through focus groups and interviews with consumers and clinicians. Participants were recruited from outpatient settings and consumer representative programs within 3 health services in Victoria, Australia. Focus groups and interviews aimed to identify factors that might influence consumers' use of check-back. Deductive analysis based on the Capability, Opportunity, and Motivation-Behavior (COM-B) model was used to identify initial themes; these were discussed in depth with the Expert Panel and barriers within each theme identified. A rapid literature review was undertaken to identify strategies for web-based communication training for consumers. Phase 2 (creation of the online resource) involved an iterative process. In an online meeting, Expert Panel members brainstormed ideas for addressing barriers and prioritized these ideas for inclusion in the resource. Several drafts of the content were written before a draft online version was built. This draft was reviewed by the Expert Panel, who recommended extensive revisions. Following these revisions, we conducted an online survey and focus group with consumers and clinicians from Phase 1 to identify further improvements. Findings from this consultation were used to make final changes to the online resource.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;The Expert Panel included 12 members. Phase 1 focus groups and interviews were held with 39 consumers and 16 clinicians. Five themes were identified: self-efficacy, pre-existing skills, clinician attitudes, information complexity, and internal barriers such as embarrassment. Phase 2 survey and focus group participants identified several issues with the second draft of the resource, focusing on functionality, accessibility, and layout. Usability and acceptability of the resource were rated","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e77263"},"PeriodicalIF":2.0,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12700338/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145742427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Factors Associated With Suicidal Ideation Among Persons With Disabilities in South Korea: Retrospective Observational Study. 韩国残疾人自杀意念相关因素:回顾性观察研究
IF 2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-12 DOI: 10.2196/80261
In-Hwan Oh, Sooyeon Jo, Joon Lee

Background: South Korea has the highest suicide rate among the Organisation for Economic Co-operation and Development nations, with particularly elevated figures among persons with disabilities. Research has shown a strong correlation between suicidal ideation and suicide attempts.

Objective: This study aimed to investigate the factors that contribute to suicidal ideation among persons with disabilities in South Korea, utilizing machine learning methods based on national survey data.

Methods: We employed data from the 2020 National Survey on Persons with Disabilities in South Korea, which included 7025 respondents. The primary variable of interest was the answer to the question, "have you thought about taking your own life in the past year?" The dataset was divided into training (80%) and test (20%) subsets. Because the survey contained too many questions (n=1394), feature selection was conducted using random forest variable importance to identify the top 100 features. Subsequently, 5 machine learning models were trained to predict suicidal ideation based on the selected features: logistic regression, support vector machine, random forest, Extreme Gradient Boosting (XGBoost), and feed-forward neural network.

Results: A total of 6832 persons with disabilities responded to the suicidal ideation question and were included in the study. The most common types of primary disability were physical disability (n=1773, 26.0%) and hearing disability (n=979, 14.3%). Of the 6832 persons with disabilities, 12.1% (n=829) indicated they had had suicidal thoughts in the past year. Significant factors that impacted suicidal ideation included intense feelings of sadness, difficulties associated with their disabilities, and overall health satisfaction. Among the models tested, the random forest model exhibited the best predictive performance with a median area under the receiver operating characteristic curve of 0.905 (IQR 0.895-0.913), a median precision of 0.592 (IQR 0.561-0.616), and a median recall of 0.588 (IQR 0.564-0.620).

Conclusions: This study highlights critical predictors of suicidal ideation in persons with disabilities in South Korea, underscoring the necessity for focused mental health interventions. The results demonstrate the potential of machine learning to identify these factors, which can aid in the development of future suicide prevention strategies. Future work is warranted to investigate if the factors identified in this study are causal.

背景:韩国是经济合作与发展组织成员国中自杀率最高的国家,其中残疾人的自杀率尤其高。研究表明,自杀意念和自杀企图之间存在很强的相关性。目的:本研究旨在利用基于国家调查数据的机器学习方法,调查导致韩国残疾人自杀意念的因素。方法:我们采用了2020年韩国残疾人全国调查的数据,其中包括7025名受访者。最重要的变量是对这个问题的回答,“在过去的一年里,你有没有想过结束自己的生命?”数据集被分为训练子集(80%)和测试子集(20%)。由于调查包含的问题太多(n=1394),因此使用随机森林变量重要性进行特征选择,以确定前100个特征。随后,根据选择的特征训练5个机器学习模型来预测自杀意念:逻辑回归、支持向量机、随机森林、极端梯度增强(XGBoost)和前馈神经网络。结果:共有6832名残疾人回答了自杀意念问题,并被纳入研究。最常见的主要残疾类型是身体残疾(n=1773, 26.0%)和听力残疾(n=979, 14.3%)。在6832名残疾人士中,12.1%(829人)表示在过去一年曾有自杀念头。影响自杀意念的重要因素包括强烈的悲伤情绪、与残疾相关的困难以及整体健康满意度。其中,随机森林模型预测效果最好,受试者工作特征曲线下的中位数面积为0.905 (IQR 0.895 ~ 0.913),中位数精度为0.592 (IQR 0.561 ~ 0.616),中位数召回率为0.588 (IQR 0.564 ~ 0.620)。结论:本研究强调了韩国残疾人自杀意念的关键预测因素,强调了重点心理健康干预的必要性。研究结果证明了机器学习识别这些因素的潜力,这有助于制定未来的自杀预防策略。未来的工作需要调查本研究中确定的因素是否有因果关系。
{"title":"Factors Associated With Suicidal Ideation Among Persons With Disabilities in South Korea: Retrospective Observational Study.","authors":"In-Hwan Oh, Sooyeon Jo, Joon Lee","doi":"10.2196/80261","DOIUrl":"10.2196/80261","url":null,"abstract":"<p><strong>Background: </strong>South Korea has the highest suicide rate among the Organisation for Economic Co-operation and Development nations, with particularly elevated figures among persons with disabilities. Research has shown a strong correlation between suicidal ideation and suicide attempts.</p><p><strong>Objective: </strong>This study aimed to investigate the factors that contribute to suicidal ideation among persons with disabilities in South Korea, utilizing machine learning methods based on national survey data.</p><p><strong>Methods: </strong>We employed data from the 2020 National Survey on Persons with Disabilities in South Korea, which included 7025 respondents. The primary variable of interest was the answer to the question, \"have you thought about taking your own life in the past year?\" The dataset was divided into training (80%) and test (20%) subsets. Because the survey contained too many questions (n=1394), feature selection was conducted using random forest variable importance to identify the top 100 features. Subsequently, 5 machine learning models were trained to predict suicidal ideation based on the selected features: logistic regression, support vector machine, random forest, Extreme Gradient Boosting (XGBoost), and feed-forward neural network.</p><p><strong>Results: </strong>A total of 6832 persons with disabilities responded to the suicidal ideation question and were included in the study. The most common types of primary disability were physical disability (n=1773, 26.0%) and hearing disability (n=979, 14.3%). Of the 6832 persons with disabilities, 12.1% (n=829) indicated they had had suicidal thoughts in the past year. Significant factors that impacted suicidal ideation included intense feelings of sadness, difficulties associated with their disabilities, and overall health satisfaction. Among the models tested, the random forest model exhibited the best predictive performance with a median area under the receiver operating characteristic curve of 0.905 (IQR 0.895-0.913), a median precision of 0.592 (IQR 0.561-0.616), and a median recall of 0.588 (IQR 0.564-0.620).</p><p><strong>Conclusions: </strong>This study highlights critical predictors of suicidal ideation in persons with disabilities in South Korea, underscoring the necessity for focused mental health interventions. The results demonstrate the potential of machine learning to identify these factors, which can aid in the development of future suicide prevention strategies. Future work is warranted to investigate if the factors identified in this study are causal.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e80261"},"PeriodicalIF":2.0,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12700334/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145742369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploration of Factors That Affect Engagement With the Experience Sampling Method and Service Users' Experience of This Within the AVATAR2 Trial: Mixed Methods Study. 在AVATAR2试验中使用体验抽样方法和服务用户对此体验的影响因素探索:混合方法研究。
IF 2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-12 DOI: 10.2196/78204
Sophie Dennard, Philippa Garety, Clementine Edwards, Andrew Gumley, Oliver Owrid, Lucy Miller, Stephanie Allan, Alison Duerden, Francis Yanga, Chloe Burns, Helena Fletcher, Amy Grant

Background: Experience sampling methodology (ESM) is an assessment method used in psychosis research. Symptom severity and gender may be associated with ESM engagement. Exploring qualitative experiences of using ESM among people with psychosis should aid developing more relevant, accessible digital assessments.

Objective: This study aimed to examine factors that could affect engagement with ESM, such as associations of completion rates with age, ethnicity, gender, and clinical severity. It also aimed to explore qualitatively service users' experiences of using this data collection method.

Methods: Data from 134/207 AVATAR2 trial (ISRCTN55682735) participants were used to evaluate associations between demographic variables, symptom severity, and ESM completion rates. Trial participants were purposively sampled to participate in an interview to discuss their experiences of using ESM or to discuss reasons why they chose not to use it.

Results: Multiple regression analyses of 134 participants found that age, gender, ethnicity, and clinical severity were not associated with ESM completion rates (F5,128=0.548; P=.74). A thematic analysis of 17 participant interviews found 3 overarching themes: Factors affecting engagement with ESM, Perceived benefits of ESM, and Suggestions for improvement. These themes described how ESM has multiple benefits for people with psychosis, including increasing knowledge and awareness of mental health. ESM was straightforward and easy to use; however, engaging in other activities, experiencing positive symptoms, little experience using technology, and trial involvement impacted engagement. Participant's decision to use ESM could be influenced by concerns about security and privacy.

Conclusions: Recommendations are made on how engagement with ESM can be improved, making it easier to use this method with this population, including providing increased support or training when using digital-based assessment or intervention as well as providing information on how digital data are used and recorded.

背景:经验抽样法(ESM)是一种用于精神病研究的评估方法。症状严重程度和性别可能与ESM参与有关。探索在精神病患者中使用ESM的定性经验应该有助于开发更相关、更容易获得的数字评估。目的:本研究旨在研究可能影响ESM参与的因素,如完成率与年龄、种族、性别和临床严重程度的关系。它也旨在探索定性服务用户使用这种数据收集方法的体验。方法:使用来自134/207 AVATAR2试验(ISRCTN55682735)参与者的数据来评估人口统计学变量、症状严重程度和ESM完成率之间的关系。试验参与者被有目的地抽样参加访谈,讨论他们使用ESM的经验或讨论他们选择不使用它的原因。结果:134名参与者的多元回归分析发现,年龄、性别、种族和临床严重程度与ESM完成率无关(f5128 =0.548; P= 0.74)。对17位参与者访谈的专题分析发现了3个主要主题:影响参与ESM的因素、ESM的感知利益和改进建议。这些主题描述了ESM如何为精神病患者带来多重益处,包括增加对精神卫生的知识和意识。ESM简单易用;然而,参与其他活动、经历积极症状、缺乏使用技术的经验和参与试验都会影响参与度。参与者使用ESM的决定可能会受到对安全和隐私的担忧的影响。结论:就如何改善ESM的参与提出了建议,使其更容易在这一人群中使用这种方法,包括在使用基于数字的评估或干预时提供更多的支持或培训,以及提供有关如何使用和记录数字数据的信息。
{"title":"Exploration of Factors That Affect Engagement With the Experience Sampling Method and Service Users' Experience of This Within the AVATAR2 Trial: Mixed Methods Study.","authors":"Sophie Dennard, Philippa Garety, Clementine Edwards, Andrew Gumley, Oliver Owrid, Lucy Miller, Stephanie Allan, Alison Duerden, Francis Yanga, Chloe Burns, Helena Fletcher, Amy Grant","doi":"10.2196/78204","DOIUrl":"10.2196/78204","url":null,"abstract":"<p><strong>Background: </strong>Experience sampling methodology (ESM) is an assessment method used in psychosis research. Symptom severity and gender may be associated with ESM engagement. Exploring qualitative experiences of using ESM among people with psychosis should aid developing more relevant, accessible digital assessments.</p><p><strong>Objective: </strong>This study aimed to examine factors that could affect engagement with ESM, such as associations of completion rates with age, ethnicity, gender, and clinical severity. It also aimed to explore qualitatively service users' experiences of using this data collection method.</p><p><strong>Methods: </strong>Data from 134/207 AVATAR2 trial (ISRCTN55682735) participants were used to evaluate associations between demographic variables, symptom severity, and ESM completion rates. Trial participants were purposively sampled to participate in an interview to discuss their experiences of using ESM or to discuss reasons why they chose not to use it.</p><p><strong>Results: </strong>Multiple regression analyses of 134 participants found that age, gender, ethnicity, and clinical severity were not associated with ESM completion rates (F5,128=0.548; P=.74). A thematic analysis of 17 participant interviews found 3 overarching themes: Factors affecting engagement with ESM, Perceived benefits of ESM, and Suggestions for improvement. These themes described how ESM has multiple benefits for people with psychosis, including increasing knowledge and awareness of mental health. ESM was straightforward and easy to use; however, engaging in other activities, experiencing positive symptoms, little experience using technology, and trial involvement impacted engagement. Participant's decision to use ESM could be influenced by concerns about security and privacy.</p><p><strong>Conclusions: </strong>Recommendations are made on how engagement with ESM can be improved, making it easier to use this method with this population, including providing increased support or training when using digital-based assessment or intervention as well as providing information on how digital data are used and recorded.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e78204"},"PeriodicalIF":2.0,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12700335/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145742448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Radiomics-Based AI Model to Assist Clinicians in Intracranial Hemorrhage Diagnosis: External Validation Study. 基于放射组学的人工智能模型协助临床医生颅内出血诊断:外部验证研究。
IF 2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-11 DOI: 10.2196/81038
Salita Angkurawaranon, Natipat Jitmahawong, Kittisak Unsrisong, Phattanun Thabarsa, Chakri Madla, Withawat Vuthiwong, Thanwa Sudsang, Chaisiri Angkurawaranon, Patrinee Traisathit, Papangkorn Inkeaw

Background: Early identification of the etiology of spontaneous intracerebral hemorrhage (ICH) could significantly contribute to planning a suitable treatment strategy. A notable radiomics-based artificial intelligence (AI) model for classifying causes of spontaneous ICH from brain computed tomography scans has been previously proposed.

Objective: This study aimed to externally validate and assess the utility of this AI model.

Methods: This study used 69 computed tomography scans from a separate cohort to evaluate the AI model's performance in classifying nontraumatic ICHs into primary, tumorous, and vascular malformation related. We also assessed the accuracy, sensitivity, specificity, and positive predictive value of clinicians, radiologists, and trainees in identifying the ICH causes before and after using the model's assistance. The performances were statistically analyzed by specialty and expertise levels.

Results: The AI model achieved an overall accuracy of 0.65 in classifying the 3 causes of ICH. The model's assistance improved overall diagnostic performance, narrowing the gap between nonradiology and radiology groups, as well as between trainees and experts. The accuracy increased from 0.68 to 0.72, from 0.72 to 0.76, from 0.69 to 0.74, and from 0.72 to 0.75 for nonradiologists, radiologists, trainees, and specialists, respectively. With the model's support, radiology professionals demonstrated the highest accuracy, highlighting the model's potential to enhance diagnostic consistency across different levels.

Conclusions: When applied to an external dataset, the accuracy of the AI model in categorizing spontaneous ICHs based on radiomics decreased. However, using the model as an assistant substantially improved the performance of all reader groups, including trainees and radiology and nonradiology specialists.

背景:早期识别自发性脑出血(ICH)的病因有助于制定合适的治疗策略。一个著名的基于放射组学的人工智能(AI)模型已经被提出,用于从脑计算机断层扫描中分类自发性脑出血的原因。目的:本研究旨在外部验证和评估该人工智能模型的实用性。方法:本研究使用来自单独队列的69个计算机断层扫描来评估AI模型在将非创伤性ICHs分类为原发性、肿瘤性和血管畸形相关方面的表现。我们还评估了临床医生、放射科医生和受训人员在使用模型帮助前后识别脑出血原因的准确性、敏感性、特异性和阳性预测值。按专业和专业水平对成绩进行统计分析。结果:人工智能模型对ICH的3个原因进行分类,总体准确率为0.65。该模型的帮助提高了整体诊断性能,缩小了非放射组和放射组之间的差距,以及实习生和专家之间的差距。非放射科医生、放射科医生、培训生和专科医生的准确率分别从0.68提高到0.72、从0.72提高到0.76、从0.69提高到0.74、从0.72提高到0.75。在该模型的支持下,放射学专业人员展示了最高的准确性,突出了该模型在提高不同级别诊断一致性方面的潜力。结论:当应用于外部数据集时,基于放射组学的AI模型对自发性ICHs进行分类的准确性下降。然而,使用该模型作为助手大大提高了所有读者群体的表现,包括实习生、放射学和非放射学专家。
{"title":"Radiomics-Based AI Model to Assist Clinicians in Intracranial Hemorrhage Diagnosis: External Validation Study.","authors":"Salita Angkurawaranon, Natipat Jitmahawong, Kittisak Unsrisong, Phattanun Thabarsa, Chakri Madla, Withawat Vuthiwong, Thanwa Sudsang, Chaisiri Angkurawaranon, Patrinee Traisathit, Papangkorn Inkeaw","doi":"10.2196/81038","DOIUrl":"10.2196/81038","url":null,"abstract":"<p><strong>Background: </strong>Early identification of the etiology of spontaneous intracerebral hemorrhage (ICH) could significantly contribute to planning a suitable treatment strategy. A notable radiomics-based artificial intelligence (AI) model for classifying causes of spontaneous ICH from brain computed tomography scans has been previously proposed.</p><p><strong>Objective: </strong>This study aimed to externally validate and assess the utility of this AI model.</p><p><strong>Methods: </strong>This study used 69 computed tomography scans from a separate cohort to evaluate the AI model's performance in classifying nontraumatic ICHs into primary, tumorous, and vascular malformation related. We also assessed the accuracy, sensitivity, specificity, and positive predictive value of clinicians, radiologists, and trainees in identifying the ICH causes before and after using the model's assistance. The performances were statistically analyzed by specialty and expertise levels.</p><p><strong>Results: </strong>The AI model achieved an overall accuracy of 0.65 in classifying the 3 causes of ICH. The model's assistance improved overall diagnostic performance, narrowing the gap between nonradiology and radiology groups, as well as between trainees and experts. The accuracy increased from 0.68 to 0.72, from 0.72 to 0.76, from 0.69 to 0.74, and from 0.72 to 0.75 for nonradiologists, radiologists, trainees, and specialists, respectively. With the model's support, radiology professionals demonstrated the highest accuracy, highlighting the model's potential to enhance diagnostic consistency across different levels.</p><p><strong>Conclusions: </strong>When applied to an external dataset, the accuracy of the AI model in categorizing spontaneous ICHs based on radiomics decreased. However, using the model as an assistant substantially improved the performance of all reader groups, including trainees and radiology and nonradiology specialists.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e81038"},"PeriodicalIF":2.0,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12697913/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145742629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large Language Model Evaluation in Traditional Chinese Medicine for Stroke: Quantitative Benchmarking Study. 中风中医大语言模型评价:定量标杆研究。
IF 2 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-11 DOI: 10.2196/81545
Hulin Long, Yang Deng, Yaoguang Guo, Zifan Shen, Yuzhu Zhang, Ji Bao, Yang He

Background: The application of large language models (LLMs) in medicine is rapidly advancing. However, evaluating LLM capabilities in specialized domains such as traditional Chinese medicine (TCM), which possesses a unique theoretical system and cognitive framework, remains a sizable challenge.

Objective: This study aimed to provide an empirical evaluation of different LLM types in the specialized domain of TCM stroke.

Methods: The Traditional Chinese Medicine-Stroke Evaluation Dataset (TCM-SED), a 203-question benchmark, was systematically constructed. The dataset includes 3 paradigms (short-answer questions, multiple-choice questions, and essay questions) and covers multiple knowledge dimensions, including diagnosis, pattern differentiation and treatment, herbal formulas, acupuncture, interpretation of classical texts, and patient communication. Gold standard answers were established through a multiexpert cross-validation and consensus process. The TCM-SED was subsequently used to comprehensively test 2 representative LLM models: GPT-4o (a leading international general-purpose model) and DeepSeek-R1 (a large model primarily trained on Chinese corpora).

Results: The test results revealed a differentiation in model capabilities across cognitive levels. In objective sections emphasizing precise knowledge recall, DeepSeek-R1 comprehensively outperformed GPT-4o, achieving an accuracy lead of more than 17% in the multiple-choice section (96/137, 70.1% vs 72/137, 52.6%, respectively). Conversely, in the essay section, which tested knowledge integration and complex reasoning, GPT-4o's performance notably surpassed that of DeepSeek-R1. For instance, in the interpretation of classical texts category, GPT-4o achieved a scoring rate of 90.5% (181/200), far exceeding DeepSeek-R1 (147/200, 73.5%).

Conclusions: This empirical study demonstrates that Chinese-centric models have a substantial advantage in static knowledge tasks within the TCM domain, whereas leading general-purpose models exhibit stronger dynamic reasoning and content generation capabilities. The TCM-SED, developed as the benchmark for this study, serves as an effective quantitative tool for evaluating and selecting appropriate LLMs for TCM scenarios. It also offers a valuable data foundation and a new research direction for future model optimization and alignment.

背景:大语言模型(LLMs)在医学领域的应用正在迅速发展。然而,评估法学硕士在具有独特理论体系和认知框架的专业领域(如中医)的能力仍然是一个相当大的挑战。目的:本研究旨在对中医中风专科领域不同LLM类型进行实证评价。方法:系统构建包含203个问题的中医卒中评价数据集(TCM-SED)。该数据集包括3个范例(简答题、多项选择题和作文题),涵盖了多个知识维度,包括诊断、辨证论治、中药方剂、针灸、经典文本解读和患者沟通。金标准答案是通过多专家交叉验证和共识过程建立的。随后,TCM-SED被用于全面测试2个具有代表性的LLM模型:gpt - 40(国际领先的通用模型)和DeepSeek-R1(主要在中文语料库上训练的大型模型)。结果:测试结果揭示了不同认知水平的模型能力差异。在强调精确知识回忆的客观部分,DeepSeek-R1全面优于gpt - 40,在多项选择部分的准确率领先17%以上(96/137、70.1% vs 72/137、52.6%)。相反,在测试知识整合和复杂推理的作文部分,gpt - 40的表现明显超过了DeepSeek-R1。例如,在经典文本的解释类别中,gpt - 40的得分率达到了90.5%(181/200),远远超过DeepSeek-R1(147/200, 73.5%)。结论:本实证研究表明,以中文为中心的模型在中医领域的静态知识任务中具有显著优势,而领先的通用模型则表现出更强的动态推理和内容生成能力。作为本研究的基准,TCM- sed是评估和选择适合TCM方案的法学硕士的有效定量工具。为今后的模型优化与对齐提供了有价值的数据基础和新的研究方向。
{"title":"Large Language Model Evaluation in Traditional Chinese Medicine for Stroke: Quantitative Benchmarking Study.","authors":"Hulin Long, Yang Deng, Yaoguang Guo, Zifan Shen, Yuzhu Zhang, Ji Bao, Yang He","doi":"10.2196/81545","DOIUrl":"10.2196/81545","url":null,"abstract":"<p><strong>Background: </strong>The application of large language models (LLMs) in medicine is rapidly advancing. However, evaluating LLM capabilities in specialized domains such as traditional Chinese medicine (TCM), which possesses a unique theoretical system and cognitive framework, remains a sizable challenge.</p><p><strong>Objective: </strong>This study aimed to provide an empirical evaluation of different LLM types in the specialized domain of TCM stroke.</p><p><strong>Methods: </strong>The Traditional Chinese Medicine-Stroke Evaluation Dataset (TCM-SED), a 203-question benchmark, was systematically constructed. The dataset includes 3 paradigms (short-answer questions, multiple-choice questions, and essay questions) and covers multiple knowledge dimensions, including diagnosis, pattern differentiation and treatment, herbal formulas, acupuncture, interpretation of classical texts, and patient communication. Gold standard answers were established through a multiexpert cross-validation and consensus process. The TCM-SED was subsequently used to comprehensively test 2 representative LLM models: GPT-4o (a leading international general-purpose model) and DeepSeek-R1 (a large model primarily trained on Chinese corpora).</p><p><strong>Results: </strong>The test results revealed a differentiation in model capabilities across cognitive levels. In objective sections emphasizing precise knowledge recall, DeepSeek-R1 comprehensively outperformed GPT-4o, achieving an accuracy lead of more than 17% in the multiple-choice section (96/137, 70.1% vs 72/137, 52.6%, respectively). Conversely, in the essay section, which tested knowledge integration and complex reasoning, GPT-4o's performance notably surpassed that of DeepSeek-R1. For instance, in the interpretation of classical texts category, GPT-4o achieved a scoring rate of 90.5% (181/200), far exceeding DeepSeek-R1 (147/200, 73.5%).</p><p><strong>Conclusions: </strong>This empirical study demonstrates that Chinese-centric models have a substantial advantage in static knowledge tasks within the TCM domain, whereas leading general-purpose models exhibit stronger dynamic reasoning and content generation capabilities. The TCM-SED, developed as the benchmark for this study, serves as an effective quantitative tool for evaluating and selecting appropriate LLMs for TCM scenarios. It also offers a valuable data foundation and a new research direction for future model optimization and alignment.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e81545"},"PeriodicalIF":2.0,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12741655/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145742579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Formative Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1