Katherine Williams, Fiona Lobban, Elizabeth Chamberlain
Background: Although research has found online peer support forums to be helpful for those with mental health conditions, no studies have explored the experiences of those who use forums for support with postpartum psychosis (PP) specifically.
Objective: This study aimed to understand the lived experiences of using online forums for PP, and how this form of support differs from professional and other informal support.
Methods: This study used a qualitative approach, including semistructured interviews with 8 participants. Recruitment took place via an online forum run by a charity called Action on Postpartum Psychosis. Transcripts were analyzed using interpretative phenomenological analysis.
Results: Four themes were developed in line with participants' experiences (1) from isolation to connection: validation, growth, and hope from shared experiences; (2) complementing not replacing: filling the gaps in support; (3) impacts of privacy, representation, and readiness to share on engagement; and (4) relational experiences within peer support: altruism, boundaries, and comparison. All participants believed forums were helpful to their well-being and recovery; however, some also reported difficulties with engagement, comparison, and regulating their own use. Findings suggest that forums may benefit from being designed in a way that protects users and their identities, for example, via trigger warnings and setting boundaries.
Conclusions: Peer online forums offer a unique and potentially effective addition to existing support provided by professionals and personal connections. Professionals should signpost people experiencing PP to forums, but should also understand the support that may be needed in terms of monitoring use and ensuring that appropriate boundaries are put into place.
{"title":"Experiences of Using Online Peer Forums Among People With Postpartum Psychosis: Interpretative Phenomenological Study.","authors":"Katherine Williams, Fiona Lobban, Elizabeth Chamberlain","doi":"10.2196/80717","DOIUrl":"10.2196/80717","url":null,"abstract":"<p><strong>Background: </strong>Although research has found online peer support forums to be helpful for those with mental health conditions, no studies have explored the experiences of those who use forums for support with postpartum psychosis (PP) specifically.</p><p><strong>Objective: </strong>This study aimed to understand the lived experiences of using online forums for PP, and how this form of support differs from professional and other informal support.</p><p><strong>Methods: </strong>This study used a qualitative approach, including semistructured interviews with 8 participants. Recruitment took place via an online forum run by a charity called Action on Postpartum Psychosis. Transcripts were analyzed using interpretative phenomenological analysis.</p><p><strong>Results: </strong>Four themes were developed in line with participants' experiences (1) from isolation to connection: validation, growth, and hope from shared experiences; (2) complementing not replacing: filling the gaps in support; (3) impacts of privacy, representation, and readiness to share on engagement; and (4) relational experiences within peer support: altruism, boundaries, and comparison. All participants believed forums were helpful to their well-being and recovery; however, some also reported difficulties with engagement, comparison, and regulating their own use. Findings suggest that forums may benefit from being designed in a way that protects users and their identities, for example, via trigger warnings and setting boundaries.</p><p><strong>Conclusions: </strong>Peer online forums offer a unique and potentially effective addition to existing support provided by professionals and personal connections. Professionals should signpost people experiencing PP to forums, but should also understand the support that may be needed in terms of monitoring use and ensuring that appropriate boundaries are put into place.</p>","PeriodicalId":36351,"journal":{"name":"JMIR Human Factors","volume":"12 ","pages":"e80717"},"PeriodicalIF":3.0,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12780708/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145828690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Thomas Gant, Kathryn M Taylor, Thomas Ward, Philippa Garety, Amy Hardy
<p><strong>Background: </strong>Despite the promise of digital therapeutics in providing scalable interventions for psychosis, translating them from clinical trials to routine care is challenging. SloMo is an evidence-based, digitally supported cognitive behavioral therapy for psychosis comprising a web-based therapy platform and mobile app. The therapy encourages individuals to slow down for a moment, to address fast-thinking habits fueling paranoia. SloMo has received a NICE Early Value Assessment recommendation for use in the National Health Service to address evidence gaps related to its use in the real world, and an implementation study is underway.</p><p><strong>Objective: </strong>This study aimed to optimize the SloMo software for implementation by addressing limitations of the first release, reducing technology complexity, and improving user experience, to increase equitable outcomes.</p><p><strong>Methods: </strong>An inclusive, human-centered design methodology was used to optimize SloMo. The redesign sought to reduce the technology's complexity and improve the user experience for diverse patients and therapists. The Double Diamond framework structured the iterative redesign, integrating insights from patient and public involvement consultants, therapists, and a transdisciplinary co-design team. The Double Diamond process was facilitated through 24 transdisciplinary workshops. These were supported by the following methods: identifying implementation barriers through desk research of SloMo's evidence and qualitative interviews with experts by experience (n=2); redefining user needs; iteratively developing solutions through user testing sessions with service user consultants (n=32); and validating the minimum viable product through think-aloud testing sessions with therapist (n=10) and service user (n=11) consultants.</p><p><strong>Results: </strong>Users wanted a form of cognitive behavioral therapy for psychosis that was usable, trustworthy, enjoyable, personalized, normalizing, and memorable. The redesign, therefore, included a minimalist user interface, more diverse lived experience vignette content, enhanced agency over data, greater representation of protected characteristics and their intersectionality, and intuitive navigation. Feedback from a purposively sampled patient and therapist sample validated the redesign as it was associated with a strong user experience, particularly in relation to usability and usefulness.</p><p><strong>Conclusions: </strong>The study produced a redesign of the SloMo software optimized for real-world use, whilst retaining fidelity to the therapeutic content of the previous version. Through an inclusive, human-centered approach, the optimized design of SloMo addresses barriers to adoption by reducing complexity and fostering accessibility. This study underscores the value of integrating lived experience involvement into digital therapeutics to support scalable, equitable, and sustainable mental health
{"title":"Optimizing SloMo, a Digitally Supported Therapy Targeting Paranoia, for Implementation: Inclusive, Human-Centered Design Study.","authors":"Thomas Gant, Kathryn M Taylor, Thomas Ward, Philippa Garety, Amy Hardy","doi":"10.2196/75377","DOIUrl":"10.2196/75377","url":null,"abstract":"<p><strong>Background: </strong>Despite the promise of digital therapeutics in providing scalable interventions for psychosis, translating them from clinical trials to routine care is challenging. SloMo is an evidence-based, digitally supported cognitive behavioral therapy for psychosis comprising a web-based therapy platform and mobile app. The therapy encourages individuals to slow down for a moment, to address fast-thinking habits fueling paranoia. SloMo has received a NICE Early Value Assessment recommendation for use in the National Health Service to address evidence gaps related to its use in the real world, and an implementation study is underway.</p><p><strong>Objective: </strong>This study aimed to optimize the SloMo software for implementation by addressing limitations of the first release, reducing technology complexity, and improving user experience, to increase equitable outcomes.</p><p><strong>Methods: </strong>An inclusive, human-centered design methodology was used to optimize SloMo. The redesign sought to reduce the technology's complexity and improve the user experience for diverse patients and therapists. The Double Diamond framework structured the iterative redesign, integrating insights from patient and public involvement consultants, therapists, and a transdisciplinary co-design team. The Double Diamond process was facilitated through 24 transdisciplinary workshops. These were supported by the following methods: identifying implementation barriers through desk research of SloMo's evidence and qualitative interviews with experts by experience (n=2); redefining user needs; iteratively developing solutions through user testing sessions with service user consultants (n=32); and validating the minimum viable product through think-aloud testing sessions with therapist (n=10) and service user (n=11) consultants.</p><p><strong>Results: </strong>Users wanted a form of cognitive behavioral therapy for psychosis that was usable, trustworthy, enjoyable, personalized, normalizing, and memorable. The redesign, therefore, included a minimalist user interface, more diverse lived experience vignette content, enhanced agency over data, greater representation of protected characteristics and their intersectionality, and intuitive navigation. Feedback from a purposively sampled patient and therapist sample validated the redesign as it was associated with a strong user experience, particularly in relation to usability and usefulness.</p><p><strong>Conclusions: </strong>The study produced a redesign of the SloMo software optimized for real-world use, whilst retaining fidelity to the therapeutic content of the previous version. Through an inclusive, human-centered approach, the optimized design of SloMo addresses barriers to adoption by reducing complexity and fostering accessibility. This study underscores the value of integrating lived experience involvement into digital therapeutics to support scalable, equitable, and sustainable mental health ","PeriodicalId":36351,"journal":{"name":"JMIR Human Factors","volume":"12 ","pages":"e75377"},"PeriodicalIF":3.0,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12770921/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145811545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dylan Hamitouche, Youcef Barkat, Deven Parekh, Eva Hammer, David Benrimoh
Background: Making optimal use of mobile health technologies requires the validation of digital biomarkers, which in turn demands high levels of participant adherence and retention. However, current remote digital health studies have high attrition rates and low participant adherence, which may introduce bias and limit the generalizability of their findings.
Objective: This study aimed to identify longitudinal indicators of participant retention and adherence, which may serve to develop strategies to improve data collection in digital health studies and improve understanding of how study cohorts are shaped by participant withdrawal and nonadherence.
Methods: We performed secondary analyses on the Brighten study, which consisted of 2 remote, smartphone-based randomized controlled trials evaluating mobile apps for depression treatment, enrolling 2193 participants in total. Participants were asked, after baseline assessment, to complete 7 digital questionnaires regularly. We assessed adherence to digital questionnaires, engagement (postbaseline participation), and retention rates (the proportion of participants who continued completing questionnaires over time) as outcomes. We investigated the relationship between these outcomes and both static measures (eg, demographics and average questionnaire scores) and dynamic measures (eg, changes in questionnaire scores over time).
Results: The study included 2201 participants, of whom 1093 completed at least 1 nonbaseline questionnaire, with a median completion rate of 37.6% (IQR 15.5%-67.9%). We found significantly higher adherence rates in participants who were less depressed on average over the course of the study (t752=-5.63; P<.001) and in those who perceived clinical improvement (t744=3.78; P=.001). There were significant demographic differences in adherence and engagement, including differences by gender, race, education, income, and income satisfaction. Participants who were more depressed at baseline were more likely to withdraw before completing any nonbaseline questionnaire (t1917=-2.53; P=.01). However, participants who showed improvement in depressive symptoms during the study showed better adherence (Mann-Whitney U=127,084; P<.001) and retention (hazard ratio 0.78, 95% CI 0.67-0.91; P=.002), despite showing greater depressive symptoms at baseline.
Conclusions: We show that participants' clinical trajectory of depressive symptoms, as well as their perception of improvement, are important indicators of engagement, adherence, and retention. Expanding knowledge regarding these longitudinal indicators may improve interpretation of outcomes and help build strategies to improve retention and adherence in future clinical trials.
{"title":"Dynamic Indicators of Adherence and Retention in Adults Using a Digital Mental Health App: Longitudinal Observational Analysis From the Brighten Study.","authors":"Dylan Hamitouche, Youcef Barkat, Deven Parekh, Eva Hammer, David Benrimoh","doi":"10.2196/69464","DOIUrl":"10.2196/69464","url":null,"abstract":"<p><strong>Background: </strong>Making optimal use of mobile health technologies requires the validation of digital biomarkers, which in turn demands high levels of participant adherence and retention. However, current remote digital health studies have high attrition rates and low participant adherence, which may introduce bias and limit the generalizability of their findings.</p><p><strong>Objective: </strong>This study aimed to identify longitudinal indicators of participant retention and adherence, which may serve to develop strategies to improve data collection in digital health studies and improve understanding of how study cohorts are shaped by participant withdrawal and nonadherence.</p><p><strong>Methods: </strong>We performed secondary analyses on the Brighten study, which consisted of 2 remote, smartphone-based randomized controlled trials evaluating mobile apps for depression treatment, enrolling 2193 participants in total. Participants were asked, after baseline assessment, to complete 7 digital questionnaires regularly. We assessed adherence to digital questionnaires, engagement (postbaseline participation), and retention rates (the proportion of participants who continued completing questionnaires over time) as outcomes. We investigated the relationship between these outcomes and both static measures (eg, demographics and average questionnaire scores) and dynamic measures (eg, changes in questionnaire scores over time).</p><p><strong>Results: </strong>The study included 2201 participants, of whom 1093 completed at least 1 nonbaseline questionnaire, with a median completion rate of 37.6% (IQR 15.5%-67.9%). We found significantly higher adherence rates in participants who were less depressed on average over the course of the study (t752=-5.63; P<.001) and in those who perceived clinical improvement (t744=3.78; P=.001). There were significant demographic differences in adherence and engagement, including differences by gender, race, education, income, and income satisfaction. Participants who were more depressed at baseline were more likely to withdraw before completing any nonbaseline questionnaire (t1917=-2.53; P=.01). However, participants who showed improvement in depressive symptoms during the study showed better adherence (Mann-Whitney U=127,084; P<.001) and retention (hazard ratio 0.78, 95% CI 0.67-0.91; P=.002), despite showing greater depressive symptoms at baseline.</p><p><strong>Conclusions: </strong>We show that participants' clinical trajectory of depressive symptoms, as well as their perception of improvement, are important indicators of engagement, adherence, and retention. Expanding knowledge regarding these longitudinal indicators may improve interpretation of outcomes and help build strategies to improve retention and adherence in future clinical trials.</p>","PeriodicalId":36351,"journal":{"name":"JMIR Human Factors","volume":"12 ","pages":"e69464"},"PeriodicalIF":3.0,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12721583/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145811581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yanli Lyu, Xi Li, Huixian Li, Mingyu Gu, Xuedong Xu, Yipei Wang, Changxiao Jin
<p><strong>Background: </strong>Telemedicine services have been developing rapidly worldwide. Following the 2018 policy enabling telemedicine for follow-up patients, this service model has gradually gained popularity in China. However, little has been done to understand the policy's implementation across different types of medical institutions or to evaluate its effectiveness.</p><p><strong>Objective: </strong>This study aims to (1) investigate the patient eligibility assessment process in various types of institutions for telemedicine services in Beijing, (2) elucidate institutions' rationale for adopting such approaches, (3) analyze discrepancies between policy and practice, and (4) provide references for the development of telemedicine services.</p><p><strong>Methods: </strong>This mixed methods study involved 36 medical institutions in Beijing, determined based on placing in the top 20% for both service volume and service quality in 2023. The study was conducted in 2 phases. First, quantitative analyses were conducted based on the questionnaires collected from each institution's contact to gather information about the assessment process and patient prerequisites. Subsequently, qualitative analyses were conducted through thematic analysis of 36 semistructured interviews with each institution's contact to acquire their considerations of this practice.</p><p><strong>Results: </strong>These 36 institutions contributed 968,786 telemedicine visits, representing 89.5% of the total service volume in Beijing. In practice, each of the visits underwent a 2-stage eligibility assessment before the physician officially accepted the patient's request. In the first stage, for assessment approaches, 86.2% (25/29) of the nonprofit, tertiary hospitals and 14.3% (1/7) of the private, for-profit institutions automatically assessed patient eligibility, while others did it manually. The assessment was based on the scope of previous visit location, diagnosis of previous visits, and visit interval. For the scope of visit location, 22 hospitals required prior visits to the same institution. For diagnosis, 7 hospitals required patients to have an identical diagnosis. For visit intervals, 11 hospitals required it to be within 6 months. The second stage assessment was conducted by physicians. Compared with policy requirements, nonprofit hospitals had stricter requirements for the scope of visit locations. The main reasons for these discrepancies included distrust in the medical outcomes from other institutions (19/26, 73.1%) and difficulties in handling interinstitutional medical disputes (18/26, 69.2%). In addition, 61.1% (22/36) of the institutions indicated that terminations of telemedicine services were primarily due to the patient's conditions.</p><p><strong>Conclusions: </strong>This pioneering multicenter, mixed methods study delineated the patient eligibility assessment process for telemedicine services in Beijing. Discrepancies were identified between real-world
{"title":"Bridging Policy and Practice in Telemedicine Follow-Up Identification: Multicenter Mixed Methods Study in Beijing.","authors":"Yanli Lyu, Xi Li, Huixian Li, Mingyu Gu, Xuedong Xu, Yipei Wang, Changxiao Jin","doi":"10.2196/75964","DOIUrl":"10.2196/75964","url":null,"abstract":"<p><strong>Background: </strong>Telemedicine services have been developing rapidly worldwide. Following the 2018 policy enabling telemedicine for follow-up patients, this service model has gradually gained popularity in China. However, little has been done to understand the policy's implementation across different types of medical institutions or to evaluate its effectiveness.</p><p><strong>Objective: </strong>This study aims to (1) investigate the patient eligibility assessment process in various types of institutions for telemedicine services in Beijing, (2) elucidate institutions' rationale for adopting such approaches, (3) analyze discrepancies between policy and practice, and (4) provide references for the development of telemedicine services.</p><p><strong>Methods: </strong>This mixed methods study involved 36 medical institutions in Beijing, determined based on placing in the top 20% for both service volume and service quality in 2023. The study was conducted in 2 phases. First, quantitative analyses were conducted based on the questionnaires collected from each institution's contact to gather information about the assessment process and patient prerequisites. Subsequently, qualitative analyses were conducted through thematic analysis of 36 semistructured interviews with each institution's contact to acquire their considerations of this practice.</p><p><strong>Results: </strong>These 36 institutions contributed 968,786 telemedicine visits, representing 89.5% of the total service volume in Beijing. In practice, each of the visits underwent a 2-stage eligibility assessment before the physician officially accepted the patient's request. In the first stage, for assessment approaches, 86.2% (25/29) of the nonprofit, tertiary hospitals and 14.3% (1/7) of the private, for-profit institutions automatically assessed patient eligibility, while others did it manually. The assessment was based on the scope of previous visit location, diagnosis of previous visits, and visit interval. For the scope of visit location, 22 hospitals required prior visits to the same institution. For diagnosis, 7 hospitals required patients to have an identical diagnosis. For visit intervals, 11 hospitals required it to be within 6 months. The second stage assessment was conducted by physicians. Compared with policy requirements, nonprofit hospitals had stricter requirements for the scope of visit locations. The main reasons for these discrepancies included distrust in the medical outcomes from other institutions (19/26, 73.1%) and difficulties in handling interinstitutional medical disputes (18/26, 69.2%). In addition, 61.1% (22/36) of the institutions indicated that terminations of telemedicine services were primarily due to the patient's conditions.</p><p><strong>Conclusions: </strong>This pioneering multicenter, mixed methods study delineated the patient eligibility assessment process for telemedicine services in Beijing. Discrepancies were identified between real-world","PeriodicalId":36351,"journal":{"name":"JMIR Human Factors","volume":"12 ","pages":"e75964"},"PeriodicalIF":3.0,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12716420/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145795094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anthony Kelly, Niharika Bhardwaj, Trine Theresa Holmberg Sainte-Marie, Pepijn Van de Ven, Ruth Melia, John Eustis Williams, Kim Mathiasen, Amalie Søgaard Nielsen
Background: Trust in artificial intelligence (AI) remains a critical barrier to the adoption of AI in mental health care. This study explores the formation of trust in an AI mental health model and its human-computer interface among clinicians at a web-based mental health clinic in the Region of Southern Denmark with national coverage.
Objective: This study aims to explore clinicians' perspectives on how trust is built in the context of an AI-supported mental health screening model and to identify the factors that influence this process.
Methods: This was a qualitative case study using semistructured interviews with clinicians involved in the pilot of a mental health AI model. Thematic analysis was used to identify key factors contributing to trust formation.
Results: Clinicians' initial attitudes toward AI were shaped by prior positive experiences with AI and their perception of AI's potential to reduce cognitive load in routine screening. Trust development followed a sequential pattern resembling a "trust journey": (1) sense-making, (2) risk appraisal, and (3) conditional decision to rely. Trust formation was supported by the explainability of the model, particularly through (1) visualization of confidence and uncertainty through violin plots, aligning with the clinicians' expectations of decision ambiguity; (2) feature attribution for and against predictions, which mirrored clinical reasoning; and (3) use of pseudo-sumscores in the AI model, which increased interpretability by presenting explanations in familiar clinical formats. Trust was contextually bounded to low-risk clinical scenarios, such as preinterview patient screening, and contingent on safety protocols (eg, suicide risk flagging). The use of both structured and unstructured patient data was seen as key to expanding trust into more complex clinical contexts. Participants also expressed a need for ongoing evaluation data to reinforce and maintain trust.
Conclusions: Clinicians' trust in AI tools is contextually and sequentially constructed, influenced by both model performance and alignment with clinical reasoning. Interpretability features were essential in establishing intrinsic trust, particularly when presented in ways that resonate with clinical norms. For broader acceptance and responsible deployment, trust must be supported by rigorous evaluation data and the inclusion of clinically relevant data types in model design.
{"title":"Investigating How Clinicians Form Trust in an AI-Based Mental Health Model: Qualitative Case Study.","authors":"Anthony Kelly, Niharika Bhardwaj, Trine Theresa Holmberg Sainte-Marie, Pepijn Van de Ven, Ruth Melia, John Eustis Williams, Kim Mathiasen, Amalie Søgaard Nielsen","doi":"10.2196/79658","DOIUrl":"10.2196/79658","url":null,"abstract":"<p><strong>Background: </strong>Trust in artificial intelligence (AI) remains a critical barrier to the adoption of AI in mental health care. This study explores the formation of trust in an AI mental health model and its human-computer interface among clinicians at a web-based mental health clinic in the Region of Southern Denmark with national coverage.</p><p><strong>Objective: </strong>This study aims to explore clinicians' perspectives on how trust is built in the context of an AI-supported mental health screening model and to identify the factors that influence this process.</p><p><strong>Methods: </strong>This was a qualitative case study using semistructured interviews with clinicians involved in the pilot of a mental health AI model. Thematic analysis was used to identify key factors contributing to trust formation.</p><p><strong>Results: </strong>Clinicians' initial attitudes toward AI were shaped by prior positive experiences with AI and their perception of AI's potential to reduce cognitive load in routine screening. Trust development followed a sequential pattern resembling a \"trust journey\": (1) sense-making, (2) risk appraisal, and (3) conditional decision to rely. Trust formation was supported by the explainability of the model, particularly through (1) visualization of confidence and uncertainty through violin plots, aligning with the clinicians' expectations of decision ambiguity; (2) feature attribution for and against predictions, which mirrored clinical reasoning; and (3) use of pseudo-sumscores in the AI model, which increased interpretability by presenting explanations in familiar clinical formats. Trust was contextually bounded to low-risk clinical scenarios, such as preinterview patient screening, and contingent on safety protocols (eg, suicide risk flagging). The use of both structured and unstructured patient data was seen as key to expanding trust into more complex clinical contexts. Participants also expressed a need for ongoing evaluation data to reinforce and maintain trust.</p><p><strong>Conclusions: </strong>Clinicians' trust in AI tools is contextually and sequentially constructed, influenced by both model performance and alignment with clinical reasoning. Interpretability features were essential in establishing intrinsic trust, particularly when presented in ways that resonate with clinical norms. For broader acceptance and responsible deployment, trust must be supported by rigorous evaluation data and the inclusion of clinically relevant data types in model design.</p>","PeriodicalId":36351,"journal":{"name":"JMIR Human Factors","volume":"12 ","pages":"e79658"},"PeriodicalIF":3.0,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12716233/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145795075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hui Wen Lim, Jennifer Sumner, Abigail Ang, Camille Keck, Emily Hwee Hoon Chew, Alexander Wenjun Yip
Background: Technology is rapidly reshaping conventional hospital environments into smart spaces, enhancing care, improving clinical workflows, and reducing workloads. However, successful implementation depends not only on the effectiveness of the technology but also on organizational readiness for change.
Objective: This study aimed to identify the key enablers and barriers to readiness for change for a smart hospital ward initiative.
Methods: We conducted a qualitative study to gauge organizational readiness for change for a smart ward initiative. Using purposive sampling, we captured diverse views from clinicians, IT staff, operational support staff, and health care redesign staff. Data were coded deductively under 3 key domains in Weiner's theory of organizational readiness: change efficacy, change commitment, and contextual factors. Subthemes were derived inductively under each domain.
Results: We interviewed 19 participants, including clinicians and support staff. Six subthemes emerged: (1) perceived valence and feasibility; (2) transparency and trust in management; (3) shared understanding and readiness to act; (4) resources, training, and staff capability; (5) innovation culture; and (6) past experiences. Participants viewed the initiative as valuable and were motivated to change, citing that the institution's innovation culture was a key enabler. However, there were key barriers, including unclear timelines, inconsistent training, limited resources, and a lack of infrastructure to support innovation. Concerns about overreliance on technology were also prominent, with staff wary of its impact on clinical judgment and system reliability.
Conclusions: Enabling readiness for the smart ward initiative requires transparent communication of timelines and project awareness, particularly for ground staff, the development of training frameworks, and adequate prioritization of innovation. Alleviating commonly reported technology concerns, such as overreliance, loss of human touch, and system reliability, will also be key to adoption and sustainability.
{"title":"Organizational Readiness for Change in the Era of Smart Hospital Wards: Qualitative Study of Health Care Workers' Insights.","authors":"Hui Wen Lim, Jennifer Sumner, Abigail Ang, Camille Keck, Emily Hwee Hoon Chew, Alexander Wenjun Yip","doi":"10.2196/81932","DOIUrl":"10.2196/81932","url":null,"abstract":"<p><strong>Background: </strong>Technology is rapidly reshaping conventional hospital environments into smart spaces, enhancing care, improving clinical workflows, and reducing workloads. However, successful implementation depends not only on the effectiveness of the technology but also on organizational readiness for change.</p><p><strong>Objective: </strong>This study aimed to identify the key enablers and barriers to readiness for change for a smart hospital ward initiative.</p><p><strong>Methods: </strong>We conducted a qualitative study to gauge organizational readiness for change for a smart ward initiative. Using purposive sampling, we captured diverse views from clinicians, IT staff, operational support staff, and health care redesign staff. Data were coded deductively under 3 key domains in Weiner's theory of organizational readiness: change efficacy, change commitment, and contextual factors. Subthemes were derived inductively under each domain.</p><p><strong>Results: </strong>We interviewed 19 participants, including clinicians and support staff. Six subthemes emerged: (1) perceived valence and feasibility; (2) transparency and trust in management; (3) shared understanding and readiness to act; (4) resources, training, and staff capability; (5) innovation culture; and (6) past experiences. Participants viewed the initiative as valuable and were motivated to change, citing that the institution's innovation culture was a key enabler. However, there were key barriers, including unclear timelines, inconsistent training, limited resources, and a lack of infrastructure to support innovation. Concerns about overreliance on technology were also prominent, with staff wary of its impact on clinical judgment and system reliability.</p><p><strong>Conclusions: </strong>Enabling readiness for the smart ward initiative requires transparent communication of timelines and project awareness, particularly for ground staff, the development of training frameworks, and adequate prioritization of innovation. Alleviating commonly reported technology concerns, such as overreliance, loss of human touch, and system reliability, will also be key to adoption and sustainability.</p>","PeriodicalId":36351,"journal":{"name":"JMIR Human Factors","volume":"12 ","pages":"e81932"},"PeriodicalIF":3.0,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12757706/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145782296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Michael Wakeman, Lydia Tesfaye, Tim Gregory, Erin Leahy, Gunnar Baskin, Greg Gruse, Brandon Kendrick, Sherine El-Toukhy
Background: Health messages are integral to smoking cessation interventions. Common approaches to health message development include expert-crafted messages and audience-generated messages, which produce messages that can be monotonic, didactic, and limited in number. We introduce an alternative approach to health message development that relies on user-generated content available on open-content platforms as a source of health messages.
Objective: We examined the acceptability of user-generated content curated from Twitter (subsequently rebranded X) as a source of health support messages in a newly developed smoking cessation mobile intervention called Quit Journey and the optimal timing and frequency with which health messages can be deployed to support app users in real time.
Methods: A total of 12 semistructured focus groups were held with 38 young adults with low socioeconomic status who smoked cigarettes, wanted to quit, and were aged 18 to 29 years. Focus groups were held virtually on GoTo Meeting, audio recorded, and transcribed verbatim. Deductive thematic analysis was used, with themes based on 5 constructs from the second unified theory of acceptance and use of technology (ie, effort expectancy, facilitating conditions, hedonic motivation, performance expectancy, and social influence) and negative, neutral, and positive sentiment.
Results: Participants perceived user-generated content positively (56/108, 51.9% of the quotes) and focused on their perceived usefulness (37/108, 34.3% of the quotes). User-generated content was perceived as authentic, nonrepetitive support from people with similar real-life experiences. Negative or sarcastic user-generated content elicited negative reactions from participants. Participants preferred receiving 3 or fewer daily messages, ideally before cravings. Suggestions focused on the need to screen user-generated content before its inclusion in the app library and allow app users to customize message frequency and timing.
Conclusions: User-generated content was deemed an acceptable source of health messages. This content can improve the efficacy and effectiveness of smoking cessation interventions by increasing their pool of unique messages that may be better received and more persuasive than expert-curated content. User-generated content can be used to curate health messages for all medical conditions and behaviors with relevant publicly available online content for integration in behavioral interventions given its high volume, brevity, and narrative-like nature. Future research is needed to investigate the effects of user-generated content on health behaviors and identify the theoretical mechanisms for these effects.
{"title":"Perceptions of User-Generated Content as a Source of Health Messages in Smoking Cessation Mobile Interventions: Focus Group Study.","authors":"Michael Wakeman, Lydia Tesfaye, Tim Gregory, Erin Leahy, Gunnar Baskin, Greg Gruse, Brandon Kendrick, Sherine El-Toukhy","doi":"10.2196/76804","DOIUrl":"10.2196/76804","url":null,"abstract":"<p><strong>Background: </strong>Health messages are integral to smoking cessation interventions. Common approaches to health message development include expert-crafted messages and audience-generated messages, which produce messages that can be monotonic, didactic, and limited in number. We introduce an alternative approach to health message development that relies on user-generated content available on open-content platforms as a source of health messages.</p><p><strong>Objective: </strong>We examined the acceptability of user-generated content curated from Twitter (subsequently rebranded X) as a source of health support messages in a newly developed smoking cessation mobile intervention called Quit Journey and the optimal timing and frequency with which health messages can be deployed to support app users in real time.</p><p><strong>Methods: </strong>A total of 12 semistructured focus groups were held with 38 young adults with low socioeconomic status who smoked cigarettes, wanted to quit, and were aged 18 to 29 years. Focus groups were held virtually on GoTo Meeting, audio recorded, and transcribed verbatim. Deductive thematic analysis was used, with themes based on 5 constructs from the second unified theory of acceptance and use of technology (ie, effort expectancy, facilitating conditions, hedonic motivation, performance expectancy, and social influence) and negative, neutral, and positive sentiment.</p><p><strong>Results: </strong>Participants perceived user-generated content positively (56/108, 51.9% of the quotes) and focused on their perceived usefulness (37/108, 34.3% of the quotes). User-generated content was perceived as authentic, nonrepetitive support from people with similar real-life experiences. Negative or sarcastic user-generated content elicited negative reactions from participants. Participants preferred receiving 3 or fewer daily messages, ideally before cravings. Suggestions focused on the need to screen user-generated content before its inclusion in the app library and allow app users to customize message frequency and timing.</p><p><strong>Conclusions: </strong>User-generated content was deemed an acceptable source of health messages. This content can improve the efficacy and effectiveness of smoking cessation interventions by increasing their pool of unique messages that may be better received and more persuasive than expert-curated content. User-generated content can be used to curate health messages for all medical conditions and behaviors with relevant publicly available online content for integration in behavioral interventions given its high volume, brevity, and narrative-like nature. Future research is needed to investigate the effects of user-generated content on health behaviors and identify the theoretical mechanisms for these effects.</p>","PeriodicalId":36351,"journal":{"name":"JMIR Human Factors","volume":"12 ","pages":"e76804"},"PeriodicalIF":3.0,"publicationDate":"2025-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12711135/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145776117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
<p><strong>Background: </strong>Although various apps have been developed to support health behaviors, they are mostly commercial, possibly limiting the number of users. The ME-BYO index was developed by Kanagawa Prefecture in 2019 to comprehensively and numerically measure and visualize an individual's current health status and future disease risk by quantifying data. The ME-BYO index is free of charge, so it can be made available to as many prefectural residents as possible for health promotion. Effective online strength training programs are being developed that, when combined with ME-BYO index measurements, will help with both exercise habits and health management.</p><p><strong>Objective: </strong>In this study, we aimed to validate the fidelity, feasibility, and acceptability of self-measurement using the ME-BYO index during the implementation of an online strength training program.</p><p><strong>Methods: </strong>Participants were 23 adults aged 40 years or older who did not regularly perform muscle strengthening exercises. The strength training program was performed twice a week for 8 weeks (16 sessions in total), and the ME-BYO index was explained to the participants so that they could self-measure the ME-BYO index with a smartphone on the day of the strength training program, before its implementation. The ME-BYO index during the study period was continuously collected from the app, and the ME-BYO index adherence rate was calculated. Questionnaires were used to assess the feasibility (difficulty of measurement and motivation to improve lifestyle) and acceptability (intention to maintain measurement and appropriate frequency of measurement) of implementing and continuing the ME-BYO index measurements. Changes in the ME-BYO index between the first and second halves of the program period, examination of items strongly related to changes in the ME-BYO index, and a comparison of physical fitness and health outcomes before and after the program period were conducted.</p><p><strong>Results: </strong>The mean ME-BYO index adherence rate during the strength training program was 89.4% (SD 17%). Regarding acceptability, the participants were highly motivated to continue measuring the ME-BYO index (77%), and the appropriate frequency of measurement was once a week and twice a week (31% and 31%, respectively). Panel data analysis of the self-measured ME-BYO index showed no significant change in the ME-BYO index score; however, it increased to a higher score. Examination of the items that increased the overall score indicated that systolic blood pressure, mental resilience, and Mini-Cog scores were the relevant factors. The pre- and postprogram measurements showed no significant changes in items other than physical fitness.</p><p><strong>Conclusions: </strong>The fidelity, feasibility, and acceptability of measuring the ME-BYO index during a regular online strength training program were high, suggesting that self-measurement of the ME-BYO index could
{"title":"Use of the ME-BYO Index, a Mobile Health App, During an Online Strength Training Program in Adults: Fidelity, Feasibility, and Acceptability Study.","authors":"Yoshinobu Saito, Naoki Kikuchi, Kaname Watanabe, Sho Nakamura, Hiroto Narimatsu","doi":"10.2196/63123","DOIUrl":"10.2196/63123","url":null,"abstract":"<p><strong>Background: </strong>Although various apps have been developed to support health behaviors, they are mostly commercial, possibly limiting the number of users. The ME-BYO index was developed by Kanagawa Prefecture in 2019 to comprehensively and numerically measure and visualize an individual's current health status and future disease risk by quantifying data. The ME-BYO index is free of charge, so it can be made available to as many prefectural residents as possible for health promotion. Effective online strength training programs are being developed that, when combined with ME-BYO index measurements, will help with both exercise habits and health management.</p><p><strong>Objective: </strong>In this study, we aimed to validate the fidelity, feasibility, and acceptability of self-measurement using the ME-BYO index during the implementation of an online strength training program.</p><p><strong>Methods: </strong>Participants were 23 adults aged 40 years or older who did not regularly perform muscle strengthening exercises. The strength training program was performed twice a week for 8 weeks (16 sessions in total), and the ME-BYO index was explained to the participants so that they could self-measure the ME-BYO index with a smartphone on the day of the strength training program, before its implementation. The ME-BYO index during the study period was continuously collected from the app, and the ME-BYO index adherence rate was calculated. Questionnaires were used to assess the feasibility (difficulty of measurement and motivation to improve lifestyle) and acceptability (intention to maintain measurement and appropriate frequency of measurement) of implementing and continuing the ME-BYO index measurements. Changes in the ME-BYO index between the first and second halves of the program period, examination of items strongly related to changes in the ME-BYO index, and a comparison of physical fitness and health outcomes before and after the program period were conducted.</p><p><strong>Results: </strong>The mean ME-BYO index adherence rate during the strength training program was 89.4% (SD 17%). Regarding acceptability, the participants were highly motivated to continue measuring the ME-BYO index (77%), and the appropriate frequency of measurement was once a week and twice a week (31% and 31%, respectively). Panel data analysis of the self-measured ME-BYO index showed no significant change in the ME-BYO index score; however, it increased to a higher score. Examination of the items that increased the overall score indicated that systolic blood pressure, mental resilience, and Mini-Cog scores were the relevant factors. The pre- and postprogram measurements showed no significant changes in items other than physical fitness.</p><p><strong>Conclusions: </strong>The fidelity, feasibility, and acceptability of measuring the ME-BYO index during a regular online strength training program were high, suggesting that self-measurement of the ME-BYO index could","PeriodicalId":36351,"journal":{"name":"JMIR Human Factors","volume":"12 ","pages":"e63123"},"PeriodicalIF":3.0,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12707806/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145769423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
<p><strong>Background: </strong>ChatGPT excels in natural language tasks, but its performance in the Chinese National Medical Licensing Examination (NMLE) and Chinese medical education remains underexplored. Meanwhile, Chinese corpus-based large language models (LLMs) such as ERNIE Bot, Tongyi Qianwen, Doubao, and DeepSeek have emerged, yet their effectiveness in the NMLE awaits systematic evaluation.</p><p><strong>Objective: </strong>This study aimed to quantitatively compare the performance of 6 LLMs (GPT-3.5, GPT-4, ERNIE Bot, Tongyi Qianwen, Doubao, and DeepSeek) in answering NMLE questions from 2018 to 2024 and analyze their feasibility as supplementary tools in Chinese medical education.</p><p><strong>Methods: </strong>We selected questions from the 4 content units of the NMLE's General Written test (2018-2024), preprocessed image- and table-based content into standardized text, and input the questions into each model. We evaluated the accuracy, comprehensiveness, and logical coherence of the responses, with quantitative comparison centered on scores and accuracy rates against the official answer keys (passing score: 360/600).</p><p><strong>Results: </strong>GPT-4 outperformed GPT-3.5 across all units, achieving average accuracies of 66.57% (SD 3.21%; unit 1), 69.05% (SD 2.87%; unit 2), 71.71% (SD 2.53%; unit 3), and 80.67% (SD 2.19%; unit 4), with consistent scores above the passing threshold. Among the Chinese models, DeepSeek demonstrated the highest overall performance, with an average score of 454.8 (SD 17.3) and average accuracies of 73.2% (unit 1, SD 2.89%) and 71.5% (unit 3, SD 2.64%), as well as average accuracies of 70.3% (unit 2, SD 3.02%) and 78.2% (unit 4, SD 2.47%). ERNIE Bot (mean score 442.3, SD 19.6; unit 1 accuracy =70.8%, SD 3.01%; unit 2 accuracy =68.7%, SD 3.15%; unit 3 accuracy =69.1%, SD 2.93%; unit 4 accuracy =68.3%, SD 2.76%), Tongyi Qianwen (mean score 426.5, SD 21.4; unit 1 accuracy =67.4%, SD 3.22%; unit 2 accuracy =65.9%, SD 3.31%; unit 3 accuracy =66.2%, SD 3.08%; unit 4 accuracy =67.2%, SD 2.89%), and Doubao (mean score 413.7, SD 23.1; unit 1 accuracy =65.2%, SD 3.45%; unit 2 accuracy =63.8%, SD 3.52%; unit 3 accuracy =64.1%, SD 3.27%; unit 4 accuracy =62.8%, SD 3.11%) all exceeded the passing score. DeepSeek's overall average accuracy (75.8%, SD 2.73%) was significantly higher than those of the other Chinese models (χ²₁=11.4, P=.001 vs ERNIE Bot; χ²₁=28.7, P<.001 vs Tongyi Qianwen; χ²₁=45.3, P<.001 vs Doubao). GPT-4's overall average accuracy (77.0%, SD 2.58%) was slightly higher than that of DeepSeek but not statistically significant (χ²₁=2.2, P=.14), while both outperformed GPT-3.5 (overall accuracy =68.5%, SD 3.67%; χ²₁=89.8, P<.001 for GPT-4 vs GPT-3.5; χ²₁=76.3, P<.001 for DeepSeek vs GPT-3.5).</p><p><strong>Conclusions: </strong>GPT-4 and Chinese-developed LLMs such as DeepSeek show potential as supplementary tools in Chinese medical education given their solid performance on the NMLE. However, further
{"title":"Multiple Large Language Models' Performance on the Chinese Medical Licensing Examination: Quantitative Comparative Study.","authors":"Yanyu Diao, Mengyuan Wu, Jingwen Xu, Yifeng Pan","doi":"10.2196/77978","DOIUrl":"10.2196/77978","url":null,"abstract":"<p><strong>Background: </strong>ChatGPT excels in natural language tasks, but its performance in the Chinese National Medical Licensing Examination (NMLE) and Chinese medical education remains underexplored. Meanwhile, Chinese corpus-based large language models (LLMs) such as ERNIE Bot, Tongyi Qianwen, Doubao, and DeepSeek have emerged, yet their effectiveness in the NMLE awaits systematic evaluation.</p><p><strong>Objective: </strong>This study aimed to quantitatively compare the performance of 6 LLMs (GPT-3.5, GPT-4, ERNIE Bot, Tongyi Qianwen, Doubao, and DeepSeek) in answering NMLE questions from 2018 to 2024 and analyze their feasibility as supplementary tools in Chinese medical education.</p><p><strong>Methods: </strong>We selected questions from the 4 content units of the NMLE's General Written test (2018-2024), preprocessed image- and table-based content into standardized text, and input the questions into each model. We evaluated the accuracy, comprehensiveness, and logical coherence of the responses, with quantitative comparison centered on scores and accuracy rates against the official answer keys (passing score: 360/600).</p><p><strong>Results: </strong>GPT-4 outperformed GPT-3.5 across all units, achieving average accuracies of 66.57% (SD 3.21%; unit 1), 69.05% (SD 2.87%; unit 2), 71.71% (SD 2.53%; unit 3), and 80.67% (SD 2.19%; unit 4), with consistent scores above the passing threshold. Among the Chinese models, DeepSeek demonstrated the highest overall performance, with an average score of 454.8 (SD 17.3) and average accuracies of 73.2% (unit 1, SD 2.89%) and 71.5% (unit 3, SD 2.64%), as well as average accuracies of 70.3% (unit 2, SD 3.02%) and 78.2% (unit 4, SD 2.47%). ERNIE Bot (mean score 442.3, SD 19.6; unit 1 accuracy =70.8%, SD 3.01%; unit 2 accuracy =68.7%, SD 3.15%; unit 3 accuracy =69.1%, SD 2.93%; unit 4 accuracy =68.3%, SD 2.76%), Tongyi Qianwen (mean score 426.5, SD 21.4; unit 1 accuracy =67.4%, SD 3.22%; unit 2 accuracy =65.9%, SD 3.31%; unit 3 accuracy =66.2%, SD 3.08%; unit 4 accuracy =67.2%, SD 2.89%), and Doubao (mean score 413.7, SD 23.1; unit 1 accuracy =65.2%, SD 3.45%; unit 2 accuracy =63.8%, SD 3.52%; unit 3 accuracy =64.1%, SD 3.27%; unit 4 accuracy =62.8%, SD 3.11%) all exceeded the passing score. DeepSeek's overall average accuracy (75.8%, SD 2.73%) was significantly higher than those of the other Chinese models (χ²₁=11.4, P=.001 vs ERNIE Bot; χ²₁=28.7, P<.001 vs Tongyi Qianwen; χ²₁=45.3, P<.001 vs Doubao). GPT-4's overall average accuracy (77.0%, SD 2.58%) was slightly higher than that of DeepSeek but not statistically significant (χ²₁=2.2, P=.14), while both outperformed GPT-3.5 (overall accuracy =68.5%, SD 3.67%; χ²₁=89.8, P<.001 for GPT-4 vs GPT-3.5; χ²₁=76.3, P<.001 for DeepSeek vs GPT-3.5).</p><p><strong>Conclusions: </strong>GPT-4 and Chinese-developed LLMs such as DeepSeek show potential as supplementary tools in Chinese medical education given their solid performance on the NMLE. However, further ","PeriodicalId":36351,"journal":{"name":"JMIR Human Factors","volume":"12 ","pages":"e77978"},"PeriodicalIF":3.0,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12707437/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145769344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kyle De Boer, John Simpson, Purnima Sharma, Trevor J Steele
Background: Needleless access devices are essential for intravenous therapy but can be a source of contamination and catheter-related bloodstream infections (CRBSIs) if not disinfected properly. The BD PosiFlush SafeScrub (Becton, Dickinson and Company) is designed to support aseptic nontouch technique (ANTT) by incorporating a built-in reminder to "scrub-the-hub" before flushing. This feature can help improve compliance with disinfection practices and may reduce the risk of microbial contamination.
Objective: This study aimed to evaluate compliance with scrubbing before flushing using the BD PosiFlush SafeScrub in a simulated clinical environment compared with standard disinfection and flushing practices (alcohol swabs and prefilled saline syringes).
Methods: A cross-sectional, within-subjects, simulated-use compliance study was conducted with health care professionals familiarized with BD PosiFlush SafeScrub (a prefilled BD PosiFlush Syringe with an integrated disinfecting unit). Compliance was defined according to the disinfection procedure specified for each scenario; participants were considered compliant with standard practice if they followed their own institutional policy (ranging from 5 to 30 seconds or based on stroke counts), while compliance with BD PosiFlush SafeScrub required scrubbing for at least 10 seconds with a minimum of 8 clockwise and 8 counterclockwise rotations, in accordance with the instructions for use (IFU). Compliance with disinfection was monitored and recorded for both the BD PosiFlush SafeScrub and standard disinfection and flushing practice.
Results: Compliance with catheter hub disinfection was assessed among 60 participants for BD PosiFlush SafeScrub and 57 participants for standard practice. During preaccess procedures (Flush 1), BD PosiFlush SafeScrub achieved 46% compliance versus 21% with standard practice, representing an absolute improvement of 25% and a 119% relative improvement (P<.001). During the postmedication procedure (Flush 2), compliance was 22% with BD PosiFlush SafeScrub compared with 13% for standard practice, corresponding to a 9% absolute improvement and 69% relative improvement, although not statistically significant (P=.12). Overall, the compliance rate was 34% (81/240 interactions) in the BD PosiFlush SafeScrub group compared with 17% (39/228 interactions) in the standard practice group, representing an absolute improvement of 17% and a relative improvement of 100% (P<.001).
Conclusions: The BD PosiFlush SafeScrub, with its integrated disinfection unit, yielded approximately double the scrub-the-hub compliance (34%) before flushing compared to the standard practice of alcohol pads and prefilled saline syringes (17%), supporting its role in facilitating adherence to ANTT, which may reduce microbial growth.
{"title":"Enhancing Safety in Clinical Practice: Cross-Sectional, Within-Subjects, Simulated-Use Compliance Study Using BD PosiFlush SafeScrub.","authors":"Kyle De Boer, John Simpson, Purnima Sharma, Trevor J Steele","doi":"10.2196/78967","DOIUrl":"10.2196/78967","url":null,"abstract":"<p><strong>Background: </strong>Needleless access devices are essential for intravenous therapy but can be a source of contamination and catheter-related bloodstream infections (CRBSIs) if not disinfected properly. The BD PosiFlush SafeScrub (Becton, Dickinson and Company) is designed to support aseptic nontouch technique (ANTT) by incorporating a built-in reminder to \"scrub-the-hub\" before flushing. This feature can help improve compliance with disinfection practices and may reduce the risk of microbial contamination.</p><p><strong>Objective: </strong>This study aimed to evaluate compliance with scrubbing before flushing using the BD PosiFlush SafeScrub in a simulated clinical environment compared with standard disinfection and flushing practices (alcohol swabs and prefilled saline syringes).</p><p><strong>Methods: </strong>A cross-sectional, within-subjects, simulated-use compliance study was conducted with health care professionals familiarized with BD PosiFlush SafeScrub (a prefilled BD PosiFlush Syringe with an integrated disinfecting unit). Compliance was defined according to the disinfection procedure specified for each scenario; participants were considered compliant with standard practice if they followed their own institutional policy (ranging from 5 to 30 seconds or based on stroke counts), while compliance with BD PosiFlush SafeScrub required scrubbing for at least 10 seconds with a minimum of 8 clockwise and 8 counterclockwise rotations, in accordance with the instructions for use (IFU). Compliance with disinfection was monitored and recorded for both the BD PosiFlush SafeScrub and standard disinfection and flushing practice.</p><p><strong>Results: </strong>Compliance with catheter hub disinfection was assessed among 60 participants for BD PosiFlush SafeScrub and 57 participants for standard practice. During preaccess procedures (Flush 1), BD PosiFlush SafeScrub achieved 46% compliance versus 21% with standard practice, representing an absolute improvement of 25% and a 119% relative improvement (P<.001). During the postmedication procedure (Flush 2), compliance was 22% with BD PosiFlush SafeScrub compared with 13% for standard practice, corresponding to a 9% absolute improvement and 69% relative improvement, although not statistically significant (P=.12). Overall, the compliance rate was 34% (81/240 interactions) in the BD PosiFlush SafeScrub group compared with 17% (39/228 interactions) in the standard practice group, representing an absolute improvement of 17% and a relative improvement of 100% (P<.001).</p><p><strong>Conclusions: </strong>The BD PosiFlush SafeScrub, with its integrated disinfection unit, yielded approximately double the scrub-the-hub compliance (34%) before flushing compared to the standard practice of alcohol pads and prefilled saline syringes (17%), supporting its role in facilitating adherence to ANTT, which may reduce microbial growth.</p>","PeriodicalId":36351,"journal":{"name":"JMIR Human Factors","volume":" ","pages":"e78967"},"PeriodicalIF":3.0,"publicationDate":"2025-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750072/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145574753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}