Inka Napiwotzki, Julian Laue, Flora Caldarone, Maximilian Idahl, Uwe Hadler, Haithem Amrani, Elisabeth Hildt, Kai G Kahl, Wolfgang Nejdl
Background: Large language models (LLMs) have rapidly advanced across numerous fields, including mental health care. A shortage of trained therapists and mental health care providers has driven informal use of LLMs for therapeutic support. However, their clinical utility remains poorly defined.
Objective: This study aimed to systematically evaluate and compare the therapeutic knowledge and single-turn response capabilities of LLMs versus psychotherapists in training in the context of behavioral activation (BA) therapy for depression, and to assess how both groups' performance changed when provided with structured therapeutic training materials.
Methods: Six LLMs and 8 human participants completed a questionnaire on depression and BA with 20 multiple-choice items and 10 therapy scenarios, each with 3 open-ended items, that postulated empathic response, use of validation strategies, and theory of mind capabilities. Human participants completed the questionnaire before and after a 5-hour workshop and 5-week period with learning materials. The LLMs received identical training content as context during the second test. All open-ended questions were rated on 5-point scales by 2 experts.
Results: At baseline, the LLMs demonstrated higher knowledge scores than human participants (61.0 vs 52.0 out of 100 points) and were rated higher in empathy (U=2.0; P=.005; r=0.917), validation quality (U=2.5; P=.006; r=0.896), anticipation of cognition (U=0.0; P=.002; r=1.000), and anticipation of emotion (U=0.0; P=.002; r=1.000). Following BA training, the LLMs maintained their performance advantage across multiple-choice and open-ended items.
Conclusions: The results suggest that LLMs may generate high-quality therapeutic single-turn responses that integrate clinical knowledge with empathetic communication. The findings hint at LLMs' potential as valuable tools in mental health care, although further clinical trials are needed to evaluate their performance in ongoing therapeutic relationships and clinical outcomes.
{"title":"Comparing Human and AI Therapists in Behavioral Activation for Depression: Cross-Sectional Questionnaire Study.","authors":"Inka Napiwotzki, Julian Laue, Flora Caldarone, Maximilian Idahl, Uwe Hadler, Haithem Amrani, Elisabeth Hildt, Kai G Kahl, Wolfgang Nejdl","doi":"10.2196/78138","DOIUrl":"10.2196/78138","url":null,"abstract":"<p><strong>Background: </strong>Large language models (LLMs) have rapidly advanced across numerous fields, including mental health care. A shortage of trained therapists and mental health care providers has driven informal use of LLMs for therapeutic support. However, their clinical utility remains poorly defined.</p><p><strong>Objective: </strong>This study aimed to systematically evaluate and compare the therapeutic knowledge and single-turn response capabilities of LLMs versus psychotherapists in training in the context of behavioral activation (BA) therapy for depression, and to assess how both groups' performance changed when provided with structured therapeutic training materials.</p><p><strong>Methods: </strong>Six LLMs and 8 human participants completed a questionnaire on depression and BA with 20 multiple-choice items and 10 therapy scenarios, each with 3 open-ended items, that postulated empathic response, use of validation strategies, and theory of mind capabilities. Human participants completed the questionnaire before and after a 5-hour workshop and 5-week period with learning materials. The LLMs received identical training content as context during the second test. All open-ended questions were rated on 5-point scales by 2 experts.</p><p><strong>Results: </strong>At baseline, the LLMs demonstrated higher knowledge scores than human participants (61.0 vs 52.0 out of 100 points) and were rated higher in empathy (U=2.0; P=.005; r=0.917), validation quality (U=2.5; P=.006; r=0.896), anticipation of cognition (U=0.0; P=.002; r=1.000), and anticipation of emotion (U=0.0; P=.002; r=1.000). Following BA training, the LLMs maintained their performance advantage across multiple-choice and open-ended items.</p><p><strong>Conclusions: </strong>The results suggest that LLMs may generate high-quality therapeutic single-turn responses that integrate clinical knowledge with empathetic communication. The findings hint at LLMs' potential as valuable tools in mental health care, although further clinical trials are needed to evaluate their performance in ongoing therapeutic relationships and clinical outcomes.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e78138"},"PeriodicalIF":2.0,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12677730/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145677474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Roselyne Clouet, Thomas Remaud, Camille Boeffard, Samuel Serisier, Alexis Gaudin, Tony Prud'homme
Background: Video games are becoming increasingly accessible and occupy a prominent position among students' leisure activities. Recent studies have demonstrated that engagement with video games can facilitate the development of specific abilities, such as visuospatial skills and hand-eye coordination. Thus, it seemed relevant to investigate whether the practice of playing video games could enhance the technical capabilities of novice dental students.
Objective: The objective of this study was to ascertain whether dental students who identified themselves as video game players exhibited differences in fundamental technical skills in comparison to students who identified as nonplayers. This preliminary study aimed to validate the methodology and measurement tools for a subsequent prospective study.
Methods: Second-year dental students who were novices in the field of preclinical dental practice were divided into 2 groups: one designated nonplayers and the other players. The visual, motor, and cognitive coordination of the students was assessed using 3 different tests. The initial assessment focused on evaluating spatial ability, while the subsequent assessments addressed arm-hand coordination and the velocity of execution. The study data were collected in September 2021.
Results: This study included 92 second-year dental students (mean age 19.49, SD 0.8 years). Thirty-seven students were placed in the player group (40%), while 55 were placed in the nonplayer group (60%). The findings showed no statistically significant differences between the two groups when the 3 distinct tests were administered. The nonplayer group made fewer mistakes on the test evaluating spatial ability (P=.051) and achieved a higher score on the modified Precision Manual Dexterity Test, which evaluates arm-hand coordination, compared to the player group (P=.17), but without statistical significance. The nonplayer group took more time to perform the pulpotomy during the third test, which evaluated execution speed, compared to the player group, again without significance (P=.87).
Conclusions: This study did not demonstrate significant differences between player and nonplayer dental students who participated in the study with regard to their fundamental technical abilities in a preclinical training environment. Nevertheless, it validated the feasibility of a methodology for a future longitudinal study to concentrate on the evolution of acquiring technical skills during preclinical training in these two populations. Consequently, further investigation is warranted to determine the potential impact of video games on the acquisition of surgical skills, including in dentistry.
{"title":"Impact of the Practice of Playing Video Games on Technical Skills Development in Preclinical Dental Education: Preliminary Cross-Sectional Observational Study.","authors":"Roselyne Clouet, Thomas Remaud, Camille Boeffard, Samuel Serisier, Alexis Gaudin, Tony Prud'homme","doi":"10.2196/80082","DOIUrl":"10.2196/80082","url":null,"abstract":"<p><strong>Background: </strong>Video games are becoming increasingly accessible and occupy a prominent position among students' leisure activities. Recent studies have demonstrated that engagement with video games can facilitate the development of specific abilities, such as visuospatial skills and hand-eye coordination. Thus, it seemed relevant to investigate whether the practice of playing video games could enhance the technical capabilities of novice dental students.</p><p><strong>Objective: </strong>The objective of this study was to ascertain whether dental students who identified themselves as video game players exhibited differences in fundamental technical skills in comparison to students who identified as nonplayers. This preliminary study aimed to validate the methodology and measurement tools for a subsequent prospective study.</p><p><strong>Methods: </strong>Second-year dental students who were novices in the field of preclinical dental practice were divided into 2 groups: one designated nonplayers and the other players. The visual, motor, and cognitive coordination of the students was assessed using 3 different tests. The initial assessment focused on evaluating spatial ability, while the subsequent assessments addressed arm-hand coordination and the velocity of execution. The study data were collected in September 2021.</p><p><strong>Results: </strong>This study included 92 second-year dental students (mean age 19.49, SD 0.8 years). Thirty-seven students were placed in the player group (40%), while 55 were placed in the nonplayer group (60%). The findings showed no statistically significant differences between the two groups when the 3 distinct tests were administered. The nonplayer group made fewer mistakes on the test evaluating spatial ability (P=.051) and achieved a higher score on the modified Precision Manual Dexterity Test, which evaluates arm-hand coordination, compared to the player group (P=.17), but without statistical significance. The nonplayer group took more time to perform the pulpotomy during the third test, which evaluated execution speed, compared to the player group, again without significance (P=.87).</p><p><strong>Conclusions: </strong>This study did not demonstrate significant differences between player and nonplayer dental students who participated in the study with regard to their fundamental technical abilities in a preclinical training environment. Nevertheless, it validated the feasibility of a methodology for a future longitudinal study to concentrate on the evolution of acquiring technical skills during preclinical training in these two populations. Consequently, further investigation is warranted to determine the potential impact of video games on the acquisition of surgical skills, including in dentistry.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e80082"},"PeriodicalIF":2.0,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12712560/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yun Hao Leong, Lathiga Nambiar, Victoria Y J Tay, Sui An Lie, Ke Yuhe
Background: Large language models (LLMs) are increasingly used in medical education for feedback and grading; yet their role in postgraduate examination preparation remains uncertain due to inconsistent grading, hallucinations, and user acceptance.
Objective: This study evaluates the Personalized Anesthesia Study Support (PASS), a specialized GPT-4 model developed to assist candidates preparing for Singapore's postgraduate specialist anesthesiology examination. We assessed user acceptance, grading interrater reliability, and hallucination detection rates to determine the feasibility of integrating specialized LLMs into high-stakes examination preparation.
Methods: PASS was built on OpenAI's GPT-4 and adapted with domain-specific prompts and references. Twenty-one senior anesthesiology residents completed a mock short answer question examination, which was independently graded by 3 human examiners and 3 PASS iterations. Participants reviewed feedback from both PASS and standard GPT-4 and completed a technology acceptance model (TAM) survey. Grading reliability was evaluated using Cohen and Fleiss κ. Hallucination rates were assessed by participants and examiners.
Results: Of the 21 participants, 17 (81%) completed the TAM survey, generating 136 responses. PASS scored significantly higher than standard GPT-4 in usefulness (mean 4.25, SD 0.50 vs mean 3.44, SD 0.82; P<.001), efficiency (mean 4.12, SD 0.61 vs mean 3.41, SD 0.74; P<.001), and likelihood of future use (mean 4.13, SD 0.75 vs mean 3.59, SD 0.90; P<.001), with no significant difference in ease of use (mean 4.56, SD 0.63 vs mean 4.50, SD 0.61; P=.35). Internal grading reliability was moderate for PASS (κ=0.522) and fair for human examiners (κ=0.275). Across 316 PASS-generated responses, 67 hallucinations and 189 deviations were labeled. Hallucination labeling rates were comparable between candidates (10/67, 15%) and examiners (57/249, 22.9%; P=.21), while examiners labeled significantly more deviations (168/249, 67.5% vs 21/67, 31%; P<.001).
Conclusions: PASS demonstrated strong user acceptance and grading reliability, suggesting feasibility in high-stakes examination preparation. Experienced learners could identify major hallucinations at comparable rates to examiners, suggesting potential in self-directed learning but with continued need for caution. Further research should refine grading accuracy and explore multicenter evaluation of specialized LLMs for postgraduate medical education.
{"title":"Feasibility of a Specialized Large Language Model for Postgraduate Medical Examination Preparation: Single-Center Proof-Of-Concept Study.","authors":"Yun Hao Leong, Lathiga Nambiar, Victoria Y J Tay, Sui An Lie, Ke Yuhe","doi":"10.2196/77580","DOIUrl":"10.2196/77580","url":null,"abstract":"<p><strong>Background: </strong>Large language models (LLMs) are increasingly used in medical education for feedback and grading; yet their role in postgraduate examination preparation remains uncertain due to inconsistent grading, hallucinations, and user acceptance.</p><p><strong>Objective: </strong>This study evaluates the Personalized Anesthesia Study Support (PASS), a specialized GPT-4 model developed to assist candidates preparing for Singapore's postgraduate specialist anesthesiology examination. We assessed user acceptance, grading interrater reliability, and hallucination detection rates to determine the feasibility of integrating specialized LLMs into high-stakes examination preparation.</p><p><strong>Methods: </strong>PASS was built on OpenAI's GPT-4 and adapted with domain-specific prompts and references. Twenty-one senior anesthesiology residents completed a mock short answer question examination, which was independently graded by 3 human examiners and 3 PASS iterations. Participants reviewed feedback from both PASS and standard GPT-4 and completed a technology acceptance model (TAM) survey. Grading reliability was evaluated using Cohen and Fleiss κ. Hallucination rates were assessed by participants and examiners.</p><p><strong>Results: </strong>Of the 21 participants, 17 (81%) completed the TAM survey, generating 136 responses. PASS scored significantly higher than standard GPT-4 in usefulness (mean 4.25, SD 0.50 vs mean 3.44, SD 0.82; P<.001), efficiency (mean 4.12, SD 0.61 vs mean 3.41, SD 0.74; P<.001), and likelihood of future use (mean 4.13, SD 0.75 vs mean 3.59, SD 0.90; P<.001), with no significant difference in ease of use (mean 4.56, SD 0.63 vs mean 4.50, SD 0.61; P=.35). Internal grading reliability was moderate for PASS (κ=0.522) and fair for human examiners (κ=0.275). Across 316 PASS-generated responses, 67 hallucinations and 189 deviations were labeled. Hallucination labeling rates were comparable between candidates (10/67, 15%) and examiners (57/249, 22.9%; P=.21), while examiners labeled significantly more deviations (168/249, 67.5% vs 21/67, 31%; P<.001).</p><p><strong>Conclusions: </strong>PASS demonstrated strong user acceptance and grading reliability, suggesting feasibility in high-stakes examination preparation. Experienced learners could identify major hallucinations at comparable rates to examiners, suggesting potential in self-directed learning but with continued need for caution. Further research should refine grading accuracy and explore multicenter evaluation of specialized LLMs for postgraduate medical education.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e77580"},"PeriodicalIF":2.0,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12712563/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Meghan Romanelli, Julien Rouvere, Isaac A Sanders, Aarthi Padmanabhan, Emily Peake, Thomas D Hull, Tim Althoff
<p><strong>Background: </strong>Gender minority populations experience an increased risk of depression and report significant barriers to accessing mental health services. While digital mental health (DMH) technologies may address barriers, it remains unclear how gender minority clients engage with DMH services and if DMH improves their clinical outcomes.</p><p><strong>Objective: </strong>This naturalistic study explored gender differences in 15-week clinical outcomes of clients receiving technology-mediated psychotherapy from a large DMH provider.</p><p><strong>Methods: </strong>This study used observational data of clients who signed up for Talkspace (Talkspace, Inc) between February 2017 and July 2021. The analytic sample included Talkspace clients (N=20,156) with a baseline 8-item Patient Health Questionnaire (PHQ-8) score ≥10. Participants completed at least 2 PHQ-8 assessments over 15 weeks of treatment. Multilevel linear models tested gender differences in depressive symptom trajectories over the course of treatment (model 1) while also controlling for baseline PHQ-8 scores (model 2) and treatment engagement indicators (model 3). Sensitivity analyses reestimated model 2 among clients who submitted a PHQ-8 survey during the week 15 assessment period and among those who discontinued treatment beforehand. Reasons for service cancellation were also described for the latter group. Gender differences in secondary clinical outcomes were examined via chi-square and Fisher exact tests.</p><p><strong>Results: </strong>In all models, there were significant week-by-gender interactions. When controlling for baseline PHQ-8 scores, rates of symptom change were significantly slower for gender-diverse participants (b=0.60; P<.001), nonbinary participants (b=0.81; P<.001), and transgender women (b=0.87; P=.007), but not for women (P=.98) or transgender men (P=.38) compared to men. By week 15, adjusted PHQ-8 scores declined 8.7 points for both men and women, versus 4.4-7.4 points for gender minority clients. Sensitivity analyses indicated attenuated symptom improvement among week-15 completers, with transgender women showing the slowest changes (b=0.76; P=.02). Among earlier dropouts, weekly symptom reductions were steep overall (eg, week 3: b=-4.06, P<.001; week 6: b=-2.31, P<.001) while certain gender minority subgroups worsened (eg, adjusted scores for transgender women increased from 15.41 at baseline to 16.08 at final week 3 PHQ-8 survey submissions). Cancellation data (3450/20,156, 17.12%) confirmed discontinuation reasons related to both symptom improvement (928/3691 reasons, 25.14%) and potential barriers to treatment engagement (eg, cost: 1431/3691, 38.77%; poor service fit or poor perceived effectiveness: 677/3691, 18.34%). Gender differences were observed in rates of treatment response (weeks 3-12; all P≤.02), symptom remission (weeks 3, 6, 9, and 15; all P≤.047), and clinically significant symptom reduction (all time points, all P≤.03). Symptom d
{"title":"Gender Differences in Trajectories of Depressive Symptoms Among Talkspace Clients: Naturalistic Observational Study.","authors":"Meghan Romanelli, Julien Rouvere, Isaac A Sanders, Aarthi Padmanabhan, Emily Peake, Thomas D Hull, Tim Althoff","doi":"10.2196/75290","DOIUrl":"10.2196/75290","url":null,"abstract":"<p><strong>Background: </strong>Gender minority populations experience an increased risk of depression and report significant barriers to accessing mental health services. While digital mental health (DMH) technologies may address barriers, it remains unclear how gender minority clients engage with DMH services and if DMH improves their clinical outcomes.</p><p><strong>Objective: </strong>This naturalistic study explored gender differences in 15-week clinical outcomes of clients receiving technology-mediated psychotherapy from a large DMH provider.</p><p><strong>Methods: </strong>This study used observational data of clients who signed up for Talkspace (Talkspace, Inc) between February 2017 and July 2021. The analytic sample included Talkspace clients (N=20,156) with a baseline 8-item Patient Health Questionnaire (PHQ-8) score ≥10. Participants completed at least 2 PHQ-8 assessments over 15 weeks of treatment. Multilevel linear models tested gender differences in depressive symptom trajectories over the course of treatment (model 1) while also controlling for baseline PHQ-8 scores (model 2) and treatment engagement indicators (model 3). Sensitivity analyses reestimated model 2 among clients who submitted a PHQ-8 survey during the week 15 assessment period and among those who discontinued treatment beforehand. Reasons for service cancellation were also described for the latter group. Gender differences in secondary clinical outcomes were examined via chi-square and Fisher exact tests.</p><p><strong>Results: </strong>In all models, there were significant week-by-gender interactions. When controlling for baseline PHQ-8 scores, rates of symptom change were significantly slower for gender-diverse participants (b=0.60; P<.001), nonbinary participants (b=0.81; P<.001), and transgender women (b=0.87; P=.007), but not for women (P=.98) or transgender men (P=.38) compared to men. By week 15, adjusted PHQ-8 scores declined 8.7 points for both men and women, versus 4.4-7.4 points for gender minority clients. Sensitivity analyses indicated attenuated symptom improvement among week-15 completers, with transgender women showing the slowest changes (b=0.76; P=.02). Among earlier dropouts, weekly symptom reductions were steep overall (eg, week 3: b=-4.06, P<.001; week 6: b=-2.31, P<.001) while certain gender minority subgroups worsened (eg, adjusted scores for transgender women increased from 15.41 at baseline to 16.08 at final week 3 PHQ-8 survey submissions). Cancellation data (3450/20,156, 17.12%) confirmed discontinuation reasons related to both symptom improvement (928/3691 reasons, 25.14%) and potential barriers to treatment engagement (eg, cost: 1431/3691, 38.77%; poor service fit or poor perceived effectiveness: 677/3691, 18.34%). Gender differences were observed in rates of treatment response (weeks 3-12; all P≤.02), symptom remission (weeks 3, 6, 9, and 15; all P≤.047), and clinically significant symptom reduction (all time points, all P≤.03). Symptom d","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e75290"},"PeriodicalIF":2.0,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12675994/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lijiang Zhang, Jingjing Xia, Hui Chen, Yang Bai, Jun Wang, Liuan Wang, Wenjie Ren
Background: Internet health care plays a crucial role in addressing the challenge of distributing high-quality medical resources and promoting the optimal allocation of these resources and health equity in China. Online medical consultation (OMC) plays a more significant role than online health information seeking (OHIS). Currently, the proportion of Chinese patients using OMC is low. Therefore, it is essential to enhance patient engagement with OMC and fully leverage the role of internet health care in optimizing the allocation of medical resources.
Objective: This study aims to explore the correlation mechanisms of online medical community users' switching behaviors from OHIS to OMC.
Methods: This study is based on the knowledge-attitude-practice theory, which combines the social support theory and the health belief model to construct a research model of users' willingness to transition from OHIS to OMC. The study adopts a questionnaire survey and structural equation modeling method to conduct an empirical study.
Results: Gaining knowledge about information support has a significant positive impact on perceived susceptibility (β=.339, P<.001), perceived severity (β=.348, P<.001), and perceived benefits (β=.361, P<.001), while having a significant negative impact on perceived barriers (β=-.285, P<.001). Gaining knowledge about emotional support positively affects perceived susceptibility (β=.220, P<.001) and perceived benefits (β=.149, P<.01) but does not significantly influence perceived severity (β=-.006, P>.05) or perceived barriers (β=.099, P>.05). Perceived susceptibility (β=.123, P<.05), perceived severity (β=.174, P<.001), and perceived benefits (β=.273, P<.001) positively influence patients' transition to online consultation behavior, whereas perceived barriers (β=-.112, P<.05) negatively impact this switch. In addition, we found that gaining knowledge about information support not only directly affects patients' behavior in switching to online consultations but also impacts patients' OMCs through perceived susceptibility (14.23%), perceived severity (13.17%), and perceived benefits (25.28%). In contrast, gaining knowledge about emotional support does not directly influence patient behavior transfer; it operates only through perceived susceptibility (46.95%) and perceived benefit (52.90%).
Conclusions: This study integrated the knowledge-attitude-practice framework, social support theory, and health belief model to uncover the internal logic of patients' behavioral transfers within online health communities. It confirmed the mediating role of the cognitive-emotional dual-drive pathway and health beliefs. The findings provide a scientific basis for the functional design of online health care platforms and for precise health knowledge dissemination strategies.
背景:互联网医疗在解决中国优质医疗资源配置挑战、促进优质医疗资源优化配置和卫生公平方面发挥着至关重要的作用。在线医疗咨询(OMC)比在线健康信息查询(OHIS)发挥更大的作用。目前,中国患者使用OMC的比例较低。因此,加强患者对OMC的参与,充分发挥互联网医疗在优化医疗资源配置中的作用至关重要。目的:本研究旨在探讨网络医疗社区用户从OHIS到OMC转换行为的相关机制。方法:本研究以知识-态度-实践理论为基础,结合社会支持理论和健康信念模型,构建用户从健康卫生服务向健康卫生服务转变意愿的研究模型。本研究采用问卷调查法和结构方程建模法进行实证研究。结果:获得信息支持知识对感知易感性(β= 0.339, P.05)或感知障碍(β= 0.099, P. 0.05)有显著的正向影响。结论:本研究整合了知识-态度-实践框架、社会支持理论和健康信念模型,揭示了网络健康社区患者行为转移的内在逻辑。证实了认知-情绪双驱动通路与健康信念的中介作用。研究结果为网络医疗平台的功能设计和精准的健康知识传播策略提供了科学依据。
{"title":"The Mechanism of Online Health Information Seeking Switching to Online Medical Consultation: Cross-Sectional Study.","authors":"Lijiang Zhang, Jingjing Xia, Hui Chen, Yang Bai, Jun Wang, Liuan Wang, Wenjie Ren","doi":"10.2196/78397","DOIUrl":"10.2196/78397","url":null,"abstract":"<p><strong>Background: </strong>Internet health care plays a crucial role in addressing the challenge of distributing high-quality medical resources and promoting the optimal allocation of these resources and health equity in China. Online medical consultation (OMC) plays a more significant role than online health information seeking (OHIS). Currently, the proportion of Chinese patients using OMC is low. Therefore, it is essential to enhance patient engagement with OMC and fully leverage the role of internet health care in optimizing the allocation of medical resources.</p><p><strong>Objective: </strong>This study aims to explore the correlation mechanisms of online medical community users' switching behaviors from OHIS to OMC.</p><p><strong>Methods: </strong>This study is based on the knowledge-attitude-practice theory, which combines the social support theory and the health belief model to construct a research model of users' willingness to transition from OHIS to OMC. The study adopts a questionnaire survey and structural equation modeling method to conduct an empirical study.</p><p><strong>Results: </strong>Gaining knowledge about information support has a significant positive impact on perceived susceptibility (β=.339, P<.001), perceived severity (β=.348, P<.001), and perceived benefits (β=.361, P<.001), while having a significant negative impact on perceived barriers (β=-.285, P<.001). Gaining knowledge about emotional support positively affects perceived susceptibility (β=.220, P<.001) and perceived benefits (β=.149, P<.01) but does not significantly influence perceived severity (β=-.006, P>.05) or perceived barriers (β=.099, P>.05). Perceived susceptibility (β=.123, P<.05), perceived severity (β=.174, P<.001), and perceived benefits (β=.273, P<.001) positively influence patients' transition to online consultation behavior, whereas perceived barriers (β=-.112, P<.05) negatively impact this switch. In addition, we found that gaining knowledge about information support not only directly affects patients' behavior in switching to online consultations but also impacts patients' OMCs through perceived susceptibility (14.23%), perceived severity (13.17%), and perceived benefits (25.28%). In contrast, gaining knowledge about emotional support does not directly influence patient behavior transfer; it operates only through perceived susceptibility (46.95%) and perceived benefit (52.90%).</p><p><strong>Conclusions: </strong>This study integrated the knowledge-attitude-practice framework, social support theory, and health belief model to uncover the internal logic of patients' behavioral transfers within online health communities. It confirmed the mediating role of the cognitive-emotional dual-drive pathway and health beliefs. The findings provide a scientific basis for the functional design of online health care platforms and for precise health knowledge dissemination strategies.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e78397"},"PeriodicalIF":2.0,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12778383/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Cisgender women account for 23% of new HIV diagnoses in the United States, but there are significant socio-structural barriers to engagement and retention in the preexposure prophylaxis (PrEP) cascade, particularly for women of color.
Objective: In response to the lack of evidence-based interventions to improve PrEP initiation, adherence, and persistence among women in the United States, we developed and piloted a clinic-based, socio-structural intervention to measure (1) the feasibility of delivering the adapted intervention and (2) clinic team and patient perspectives on the intervention, in preparation for a future trial on engagement and retention in the PrEP cascade among women.
Methods: We previously applied the ADAPT-ITT (Assessment, Decision, Adaptation, Production, Topical experts, Integration, Training, Testing) model to develop a culturally appropriate, evidence-based intervention, responsive to Black women's HIV prevention needs. In the present study, we set out to complete the Training and Testing phases: namely, to hire and train a PrEP navigator and to train clinic staff to deliver the adapted intervention. We completed a 4-month pilot to assess the feasibility of delivering the intervention and collecting outcomes of interest and initial outcome trends (compared to baseline). We further assessed the clinic team and patient perspectives on the intervention to understand the potential for future scale and delivery.
Results: The clinic team participants found the adapted Women's PrEP project (W-PrEP) intervention both highly feasible and relevant to patients, with minimal impact on clinic flow. Patient participants reported that the W-PrEP intervention was highly relevant and appreciated the education and counseling from the PrEP navigator-many learning about PrEP for the first time from the navigator and health care provider. The outcome measures were both feasible to collect and appropriate to capture the primary outcomes of interest. Finally, the W-PrEP intervention increased the proportion of patients counseled about PrEP from 65% to 76% of patients seen (P<.001).
Conclusions: The intervention and outcome data collection was feasible, and open-ended clinic team and patient perspectives showed positive feedback about the intervention and relevance. The associated increase in PrEP counseling is promising but necessitates further evaluation of the effects of the W-PrEP intervention on the PrEP cascade (eg, initiation, persistence, adherence) in a larger randomized trial.
{"title":"\"The Women's PrEP Project,\" a Clinic-Based, Socio-Structural Intervention to Improve the Provision of Preexposure Prophylaxis for Cisgender Women: Interrupted Time Series Pilot.","authors":"Rachel Scott, Shawnika J Hull, Deanna Kerrigan, Yan Wang, Mandi Pratt-Chapman, Tara Mathias-Prabhu, Naquia Unwala, Marisa Sadauskas, Bat-Zion Hose, Marjanna Smith, Patricia Moriarty, Tranessa Hanson, Ariam Tedla, Pamela Lotke, Peggy Ye, Hannah Arem","doi":"10.2196/80653","DOIUrl":"10.2196/80653","url":null,"abstract":"<p><strong>Background: </strong>Cisgender women account for 23% of new HIV diagnoses in the United States, but there are significant socio-structural barriers to engagement and retention in the preexposure prophylaxis (PrEP) cascade, particularly for women of color.</p><p><strong>Objective: </strong>In response to the lack of evidence-based interventions to improve PrEP initiation, adherence, and persistence among women in the United States, we developed and piloted a clinic-based, socio-structural intervention to measure (1) the feasibility of delivering the adapted intervention and (2) clinic team and patient perspectives on the intervention, in preparation for a future trial on engagement and retention in the PrEP cascade among women.</p><p><strong>Methods: </strong>We previously applied the ADAPT-ITT (Assessment, Decision, Adaptation, Production, Topical experts, Integration, Training, Testing) model to develop a culturally appropriate, evidence-based intervention, responsive to Black women's HIV prevention needs. In the present study, we set out to complete the Training and Testing phases: namely, to hire and train a PrEP navigator and to train clinic staff to deliver the adapted intervention. We completed a 4-month pilot to assess the feasibility of delivering the intervention and collecting outcomes of interest and initial outcome trends (compared to baseline). We further assessed the clinic team and patient perspectives on the intervention to understand the potential for future scale and delivery.</p><p><strong>Results: </strong>The clinic team participants found the adapted Women's PrEP project (W-PrEP) intervention both highly feasible and relevant to patients, with minimal impact on clinic flow. Patient participants reported that the W-PrEP intervention was highly relevant and appreciated the education and counseling from the PrEP navigator-many learning about PrEP for the first time from the navigator and health care provider. The outcome measures were both feasible to collect and appropriate to capture the primary outcomes of interest. Finally, the W-PrEP intervention increased the proportion of patients counseled about PrEP from 65% to 76% of patients seen (P<.001).</p><p><strong>Conclusions: </strong>The intervention and outcome data collection was feasible, and open-ended clinic team and patient perspectives showed positive feedback about the intervention and relevance. The associated increase in PrEP counseling is promising but necessitates further evaluation of the effects of the W-PrEP intervention on the PrEP cascade (eg, initiation, persistence, adherence) in a larger randomized trial.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e80653"},"PeriodicalIF":2.0,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12674590/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christina Wornom, Brooklyn Brekke-Kumley, Tavsimran Luthra, Lynn J Smith, Jane C Harrington
<p><strong>Background: </strong>Migraines are a significant health concern affecting millions of individuals, often requiring habitual tracking of potential triggers to mitigate or predict episodes. Digital health tools such as mobile apps offer a scalable solution for personalized tracking and pattern recognition. Migraine Insight is one such app that facilitates daily logging to quantitatively assess individualized patterns of events preceding migraine onset. However, while various triggers have been reported in migraine literature, there is limited large-scale electronic user-driven data on the frequency and relative impact of specific triggers.</p><p><strong>Objective: </strong>This study aims to address this gap by analyzing user-reported data from Migraine Insight to identify the most frequently reported triggers and evaluate their potential associations with migraine onset.</p><p><strong>Methods: </strong>Food-associated migraine triggers were identified by performing a noninterventional, retrospective analysis of self-reported data obtained via the Migraine Insight app. A collaboration was made with the representatives of the Migraine Insight app to extrapolate the data needed for the study. A preliminary keyword list was assessed from a raw data set of 2605 data entry values, extracted from a 30-day period of September 19 to October 18, 2021, to identify high ranking self-reported events, classified by dietary habits, environmental conditions, body physiology, and medications. The variables examined included the following: dietary items, environmental conditions, body physiology, and electronics. Food items were combined into similar groups, considering variable spelling and descriptions of self-reported events. The association of migraine onset after consumption of top 5 dietary products was evaluated to establish the frequency of migraine episodes for all users who reported the items.</p><p><strong>Results: </strong>Collectively, food (n=353) and beverage (n=252) totaled the highest reported entries, with chocolate, wine, tea, coffee, and cheese as the highest ranked foods for prevalence of reporting across all users and for frequency of migraine onset for users who consumed the items within 48 hours. The 4 highest nonfood entries were altered sleep patterns (n=245), stress or anxiety (n=199), rain or storm conditions (n=192), and bright light or brightness (n=191). Statistical analysis showed that chocolate was the only food trigger significantly associated with migraines (P=.003 vs 50%; P=.04 vs average). Consumption of tea approached significance (P=.051), while consumption of coffee, cheese, and wine were not significant. These findings suggest that chocolate is the most consistent dietary trigger.</p><p><strong>Conclusions: </strong>High-risk foods, environmental conditions, stress, and lighting with highest prevalence reporting have previously been reported in medical literature, implicating that a migraine tracking app is a vali
背景:偏头痛是影响数百万人的重大健康问题,通常需要习惯性地跟踪潜在的触发因素来减轻或预测发作。移动应用程序等数字健康工具为个性化跟踪和模式识别提供了可扩展的解决方案。偏头痛洞察就是这样一个应用程序,有助于每天记录,定量评估偏头痛发作前事件的个性化模式。然而,尽管偏头痛文献中报道了各种触发因素,但关于特定触发因素的频率和相对影响的大规模电子用户驱动数据有限。目的:本研究旨在通过分析用户报告的偏头痛数据来解决这一差距,以确定最频繁报告的触发因素,并评估其与偏头痛发作的潜在关联。方法:通过对偏头痛洞察应用程序获得的自我报告数据进行非干预性回顾性分析,确定与食物相关的偏头痛触发因素。与偏头痛洞察应用程序的代表合作,推断研究所需的数据。从2021年9月19日至10月18日的30天期间提取的2605个数据输入值的原始数据集中评估初步关键字列表,以确定高排名的自我报告事件,按饮食习惯、环境条件、身体生理和药物进行分类。研究的变量包括:饮食项目、环境条件、身体生理和电子产品。考虑到不同的拼写和自我报告事件的描述,食物项目被组合成类似的组。对食用前5种膳食产品后偏头痛发作的关联进行了评估,以确定所有报告这些产品的用户偏头痛发作的频率。结果:总的来说,食物(n=353)和饮料(n=252)总共报告的条目最多,巧克力、葡萄酒、茶、咖啡和奶酪是所有用户报告的患病率最高的食物,并且在48小时内食用这些物品的用户偏头痛发作的频率最高。4个最高的非食物条目是改变的睡眠模式(n=245),压力或焦虑(n=199),下雨或暴风雨条件(n=192),明亮的光线或亮度(n=191)。统计分析显示,巧克力是唯一与偏头痛显著相关的食物(P= 0.003 vs 50%; P= 0.04 vs平均值)。茶的摄入量接近显著性(P= 0.051),而咖啡、奶酪和葡萄酒的摄入量则不显著。这些发现表明,巧克力是最一致的饮食诱因。结论:高风险食物、环境条件、压力和照明在医学文献中报道的发病率最高,这意味着偏头痛跟踪应用程序是纸质日记的有效替代品。
{"title":"Self-Reported Triggers Evaluation of High-Risk Dietary and Environmental Factors Preceding Migraine Onset by Using a Mobile Tracking App (Migraine Insight): Comparative Analysis Study.","authors":"Christina Wornom, Brooklyn Brekke-Kumley, Tavsimran Luthra, Lynn J Smith, Jane C Harrington","doi":"10.2196/59951","DOIUrl":"10.2196/59951","url":null,"abstract":"<p><strong>Background: </strong>Migraines are a significant health concern affecting millions of individuals, often requiring habitual tracking of potential triggers to mitigate or predict episodes. Digital health tools such as mobile apps offer a scalable solution for personalized tracking and pattern recognition. Migraine Insight is one such app that facilitates daily logging to quantitatively assess individualized patterns of events preceding migraine onset. However, while various triggers have been reported in migraine literature, there is limited large-scale electronic user-driven data on the frequency and relative impact of specific triggers.</p><p><strong>Objective: </strong>This study aims to address this gap by analyzing user-reported data from Migraine Insight to identify the most frequently reported triggers and evaluate their potential associations with migraine onset.</p><p><strong>Methods: </strong>Food-associated migraine triggers were identified by performing a noninterventional, retrospective analysis of self-reported data obtained via the Migraine Insight app. A collaboration was made with the representatives of the Migraine Insight app to extrapolate the data needed for the study. A preliminary keyword list was assessed from a raw data set of 2605 data entry values, extracted from a 30-day period of September 19 to October 18, 2021, to identify high ranking self-reported events, classified by dietary habits, environmental conditions, body physiology, and medications. The variables examined included the following: dietary items, environmental conditions, body physiology, and electronics. Food items were combined into similar groups, considering variable spelling and descriptions of self-reported events. The association of migraine onset after consumption of top 5 dietary products was evaluated to establish the frequency of migraine episodes for all users who reported the items.</p><p><strong>Results: </strong>Collectively, food (n=353) and beverage (n=252) totaled the highest reported entries, with chocolate, wine, tea, coffee, and cheese as the highest ranked foods for prevalence of reporting across all users and for frequency of migraine onset for users who consumed the items within 48 hours. The 4 highest nonfood entries were altered sleep patterns (n=245), stress or anxiety (n=199), rain or storm conditions (n=192), and bright light or brightness (n=191). Statistical analysis showed that chocolate was the only food trigger significantly associated with migraines (P=.003 vs 50%; P=.04 vs average). Consumption of tea approached significance (P=.051), while consumption of coffee, cheese, and wine were not significant. These findings suggest that chocolate is the most consistent dietary trigger.</p><p><strong>Conclusions: </strong>High-risk foods, environmental conditions, stress, and lighting with highest prevalence reporting have previously been reported in medical literature, implicating that a migraine tracking app is a vali","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e59951"},"PeriodicalIF":2.0,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12712558/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zi Yang, Xiaojuan Wang, Jianlin Wang, Qi Guang, Xueqian Ding, Hao Liu, Yunpeng Xu, Jing Zhao, Ming Bai
Background: Cardiovascular diseases (CVDs) remain the leading global cause of mortality, yet traditional electrocardiogram (ECG) interpretation suffers from subjective variability and limited sensitivity to complex pathologies.
Objective: To address these challenges, we propose the Cardiovascular Multimodal Prediction Network (CaMPNet), a Transformer-based multimodal architecture that integrates raw 12-lead ECG waveforms, nine structured machine-measured the electrocardiogram (ECG) features, and demographic data (age and sex) through cross-attention fusion.
Methods: The model was trained on 384,877 records from the MIMIC-IV-ECG database and evaluated across 12 cardiovascular disease labels. To further assess temporal robustness, a temporal external validation was performed using the most recent 10% of the data, withheld chronologically from model development.
Results: On the internal test set, the model achieved a mean Area Under the Curve (AUC) of 0.845 and Area Under the Precision-Recall Curve (AUPRC) of 0.489, outperforming the ResNet-ECG baseline (AUC 0.848 but F1 0.152) and all single-modality variants. Subgroup analyses demonstrated consistent performance across demographics (male AUC 0.846 vs female 0.843; youngest quartile 0.884 vs oldest 0.811). CaMPNet retained moderate discriminative ability in temporal external validation with a mean AUC of 0.715 and AUPRC of 0.298, though performance declined due to temporal distribution shifts. Despite this, major disease categories such as atrial fibrillation, heart failure, and normal rhythm maintained high AUCs (> 0.84). Attention-based visualization revealed clinically interpretable patterns (e.g., ST-segment elevations in ST-Segment Elevation Myocardial Infarction), and ablation experiments verified the model's tolerance to missing structured inputs.
Conclusions: CaMPNet demonstrates robust and interpretable multimodal ECG-based diagnosis, offering a scalable framework for comorbidity screening and continual learning under real-world temporal dynamics.
{"title":"Multimodal Transformer-Based Electrocardiogram Analysis for Cardiovascular Comorbidity Detection: Model Development and Validation Study.","authors":"Zi Yang, Xiaojuan Wang, Jianlin Wang, Qi Guang, Xueqian Ding, Hao Liu, Yunpeng Xu, Jing Zhao, Ming Bai","doi":"10.2196/80815","DOIUrl":"10.2196/80815","url":null,"abstract":"<p><strong>Background: </strong>Cardiovascular diseases (CVDs) remain the leading global cause of mortality, yet traditional electrocardiogram (ECG) interpretation suffers from subjective variability and limited sensitivity to complex pathologies.</p><p><strong>Objective: </strong>To address these challenges, we propose the Cardiovascular Multimodal Prediction Network (CaMPNet), a Transformer-based multimodal architecture that integrates raw 12-lead ECG waveforms, nine structured machine-measured the electrocardiogram (ECG) features, and demographic data (age and sex) through cross-attention fusion.</p><p><strong>Methods: </strong>The model was trained on 384,877 records from the MIMIC-IV-ECG database and evaluated across 12 cardiovascular disease labels. To further assess temporal robustness, a temporal external validation was performed using the most recent 10% of the data, withheld chronologically from model development.</p><p><strong>Results: </strong>On the internal test set, the model achieved a mean Area Under the Curve (AUC) of 0.845 and Area Under the Precision-Recall Curve (AUPRC) of 0.489, outperforming the ResNet-ECG baseline (AUC 0.848 but F1 0.152) and all single-modality variants. Subgroup analyses demonstrated consistent performance across demographics (male AUC 0.846 vs female 0.843; youngest quartile 0.884 vs oldest 0.811). CaMPNet retained moderate discriminative ability in temporal external validation with a mean AUC of 0.715 and AUPRC of 0.298, though performance declined due to temporal distribution shifts. Despite this, major disease categories such as atrial fibrillation, heart failure, and normal rhythm maintained high AUCs (> 0.84). Attention-based visualization revealed clinically interpretable patterns (e.g., ST-segment elevations in ST-Segment Elevation Myocardial Infarction), and ablation experiments verified the model's tolerance to missing structured inputs.</p><p><strong>Conclusions: </strong>CaMPNet demonstrates robust and interpretable multimodal ECG-based diagnosis, offering a scalable framework for comorbidity screening and continual learning under real-world temporal dynamics.</p><p><strong>Clinicaltrial: </strong></p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":" ","pages":""},"PeriodicalIF":2.0,"publicationDate":"2025-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12758841/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145661201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lisa Sheehy, Emma Gal-Dev, Heidi Sveistrup, Martin Bilodeau, Hillel Finestone
Background: Commercial markerless motion capture (MMC) systems show promise for use in rehabilitation and have been validated for the assessment of various parameters. However, no prior studies have evaluated MMC systems to detect stance asymmetry.
Objective: The objective of this study was to assess the accuracy and variability of the Jintronix Weight Shift Tool MMC system to estimate the percentage of weight borne on each foot.
Methods: Twelve healthy younger adults, 12 healthy older adults, and 12 people living with stroke were recruited for this cross-sectional study. The percentage of weight borne on each foot was simultaneously recorded by the Weight Shift Tool and a validated pressure mat during 2 series (raising the arm to capture the recording and without arm raise) with left lean, right lean, and equal stance. Agreement between the Weight Shift Tool and the pressure mat was assessed using Bland-Altman analyses.
Results: Bias was greatest for older adults for all stances except for right lean without arm raise. Variability was greatest for people living with stroke for all stances except for left lean with arm raise. On average, the limits of agreement were narrower during equal stance. Although bias between the Weight Shift Tool and the pressure mat was small to moderate (0.0%-11.7%), the limits of agreement were wide (12.8%-33.6% above and below the bias).
Conclusions: The Weight Shift Tool is not clinically acceptable for the estimation of the percentage of weight on each foot due to high variability. Investigation of other MMC systems is required to confirm the validity of MMC for clinical assessment.
{"title":"Accuracy and Variability of a Commercial Markerless Motion Capture System Compared to a Pressure Mat for Weight Distribution in Standing: Cross-Sectional Observational Study.","authors":"Lisa Sheehy, Emma Gal-Dev, Heidi Sveistrup, Martin Bilodeau, Hillel Finestone","doi":"10.2196/73575","DOIUrl":"10.2196/73575","url":null,"abstract":"<p><strong>Background: </strong>Commercial markerless motion capture (MMC) systems show promise for use in rehabilitation and have been validated for the assessment of various parameters. However, no prior studies have evaluated MMC systems to detect stance asymmetry.</p><p><strong>Objective: </strong>The objective of this study was to assess the accuracy and variability of the Jintronix Weight Shift Tool MMC system to estimate the percentage of weight borne on each foot.</p><p><strong>Methods: </strong>Twelve healthy younger adults, 12 healthy older adults, and 12 people living with stroke were recruited for this cross-sectional study. The percentage of weight borne on each foot was simultaneously recorded by the Weight Shift Tool and a validated pressure mat during 2 series (raising the arm to capture the recording and without arm raise) with left lean, right lean, and equal stance. Agreement between the Weight Shift Tool and the pressure mat was assessed using Bland-Altman analyses.</p><p><strong>Results: </strong>Bias was greatest for older adults for all stances except for right lean without arm raise. Variability was greatest for people living with stroke for all stances except for left lean with arm raise. On average, the limits of agreement were narrower during equal stance. Although bias between the Weight Shift Tool and the pressure mat was small to moderate (0.0%-11.7%), the limits of agreement were wide (12.8%-33.6% above and below the bias).</p><p><strong>Conclusions: </strong>The Weight Shift Tool is not clinically acceptable for the estimation of the percentage of weight on each foot due to high variability. Investigation of other MMC systems is required to confirm the validity of MMC for clinical assessment.</p>","PeriodicalId":14841,"journal":{"name":"JMIR Formative Research","volume":"9 ","pages":"e73575"},"PeriodicalIF":2.0,"publicationDate":"2025-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12709157/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145661250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}