Pub Date : 2025-10-15DOI: 10.1136/bmjhci-2025-101640
Santiago Frid, Octavi Bassegoda, Maria Araceli Camacho Mahamud, Gemma Sanjuan, Miguel Ángel Armengol de la Hoz, Leo Celi, Isaac Cano Franco, Gerard Anmella, Tomas Cuñat López, Ana Lucía Arellano, Lina María Leguízamo-Martínez, Laura Mezquita, Petter Axcell Peñafiel Macías, Antonio Gallardo-Pizarro, Ruben González Colom, Arturo Renú Jornet, Guillem Bracons Cucó, Xavier Borrat Frigola
Objectives: To describe the implementation of a multidisciplinary, ethically grounded hackathon as a model to develop and evaluate generative AI (GenAI) solutions for real-world clinical challenges within a hospital setting.
Methods: The GenAI Health Hackathon (GAHH) organised at Hospital Clínic de Barcelona included 13 challenges were selected via an internal call based on clinical impact, feasibility and data availability. Participants accessed anonymised real-world data through a secure cloud environment. Teams employed large language models and retrieval-augmented generation to build prototypes addressing tasks such as clinical text structuring, decision support and workflow automation. Human-in-the-loop validation, explainability and regulatory safeguards were emphasised.
Results: The hackathon yielded multiple AI prototypes tested on real data. Results varied: entity recognition reached 90.5% accuracy, summarisation >90% clinician concordance and nutritional models achieved F1 scores of 0.75-0.93. Lower scores (F1<0.52, Jaccard Index <0.4) were seen in complex reasoning or multilingual tasks. Bias was explored in 10 projects, with mitigations such as stratified sampling, prompt tuning, disclaimers and expert oversight. A transferable framework was proposed to replicate responsible GenAI hackathons in clinical contexts.
Discussion: Interdisciplinary collaboration and real-world testing proved essential for aligning GenAI with clinical needs. The hackathon revealed challenges in bias, evaluation and integration but offered a transferable framework for responsible innovation under General Data Protection Regulation and the European Union Artificial Intelligence Act.
Conclusions: The GAHH demonstrated that GenAI can be safely and effectively applied in healthcare with rigorous governance and interdisciplinary collaboration, offering a scalable model for responsible AI innovation.
目的:描述一个多学科的、基于伦理的黑客马拉松的实施,作为一个模型来开发和评估生成人工智能(GenAI)解决方案,以应对医院环境中现实世界的临床挑战。方法:在Clínic de Barcelona医院组织的GenAI健康黑客马拉松(GAHH)包括13项挑战,这些挑战是通过基于临床影响、可行性和数据可用性的内部呼吁选择的。参与者通过安全的云环境访问匿名的真实数据。团队使用大型语言模型和检索增强生成来构建解决诸如临床文本结构、决策支持和工作流自动化等任务的原型。强调了人在循环验证、可解释性和监管保障。结果:黑客马拉松产生了多个在真实数据上测试的AI原型。结果各不相同:实体识别准确率达到90.5%,总结准确率达到90%,临床医师一致性和营养模型的F1评分达到0.75-0.93。低分数(f1)讨论:跨学科合作和真实世界的测试被证明是使GenAI符合临床需求的关键。黑客马拉松揭示了偏见、评估和整合方面的挑战,但根据《通用数据保护条例》和《欧盟人工智能法案》,为负责任的创新提供了一个可转移的框架。结论:GAHH表明,通过严格的治理和跨学科合作,GenAI可以安全有效地应用于医疗保健领域,为负责任的人工智能创新提供了可扩展的模式。
{"title":"Bridging generative AI and healthcare practice: insights from the GenAI Health Hackathon at Hospital Clínic de Barcelona.","authors":"Santiago Frid, Octavi Bassegoda, Maria Araceli Camacho Mahamud, Gemma Sanjuan, Miguel Ángel Armengol de la Hoz, Leo Celi, Isaac Cano Franco, Gerard Anmella, Tomas Cuñat López, Ana Lucía Arellano, Lina María Leguízamo-Martínez, Laura Mezquita, Petter Axcell Peñafiel Macías, Antonio Gallardo-Pizarro, Ruben González Colom, Arturo Renú Jornet, Guillem Bracons Cucó, Xavier Borrat Frigola","doi":"10.1136/bmjhci-2025-101640","DOIUrl":"10.1136/bmjhci-2025-101640","url":null,"abstract":"<p><strong>Objectives: </strong>To describe the implementation of a multidisciplinary, ethically grounded hackathon as a model to develop and evaluate generative AI (GenAI) solutions for real-world clinical challenges within a hospital setting.</p><p><strong>Methods: </strong>The GenAI Health Hackathon (GAHH) organised at Hospital Clínic de Barcelona included 13 challenges were selected via an internal call based on clinical impact, feasibility and data availability. Participants accessed anonymised real-world data through a secure cloud environment. Teams employed large language models and retrieval-augmented generation to build prototypes addressing tasks such as clinical text structuring, decision support and workflow automation. Human-in-the-loop validation, explainability and regulatory safeguards were emphasised.</p><p><strong>Results: </strong>The hackathon yielded multiple AI prototypes tested on real data. Results varied: entity recognition reached 90.5% accuracy, summarisation >90% clinician concordance and nutritional models achieved F1 scores of 0.75-0.93. Lower scores (F1<0.52, Jaccard Index <0.4) were seen in complex reasoning or multilingual tasks. Bias was explored in 10 projects, with mitigations such as stratified sampling, prompt tuning, disclaimers and expert oversight. A transferable framework was proposed to replicate responsible GenAI hackathons in clinical contexts.</p><p><strong>Discussion: </strong>Interdisciplinary collaboration and real-world testing proved essential for aligning GenAI with clinical needs. The hackathon revealed challenges in bias, evaluation and integration but offered a transferable framework for responsible innovation under General Data Protection Regulation and the European Union Artificial Intelligence Act.</p><p><strong>Conclusions: </strong>The GAHH demonstrated that GenAI can be safely and effectively applied in healthcare with rigorous governance and interdisciplinary collaboration, offering a scalable model for responsible AI innovation.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12530383/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145298316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-15DOI: 10.1136/bmjhci-2025-101632
Marcello Di Pumpo, Maria Rosaria Gualano, Danilo Buonsenso, Francesca Raffaelli, Daniele Donà, Vittorio Maio, Patrizia Laurenti, Walter Ricciardi, Leonardo Villani
Objectives: Antimicrobial resistance is a critical public health threat. Large language models (LLMs) show great capability for providing health information. This study evaluates the effectiveness of LLMs in providing information on antibiotic use and infection management.
Methods: Using a mixed-method approach, responses to healthcare expert-designed scenarios from ChatGPT 3.5, ChatGPT 4.0, Claude 2.0 and Gemini 1.0, in both Italian and English, were analysed. Computational text analysis assessed readability, lexical diversity and sentiment, while content quality was assessed by three experts via DISCERN tool.
Results: 16 scenarios were developed. A total of 101 outputs and 5454 Likert-scale (1-5) scores were obtained for the analysis. A general positive performance gradient was found from ChatGPT 3.5 and 4.0 to Claude to Gemini. Gemini, although producing only five outputs before self-inhibition, consistently outperformed the other models across almost all metrics, producing more detailed, accessible, varied content and a positive overtone. ChatGPT 4.0 demonstrated the highest lexical diversity. A difference in performance by language was observed. All models showed a median score of 1 (IQR=2) regarding the domain addressing antimicrobial resistance.
Discussion: The study highlights a positive performance gradient towards Gemini, which showed superior content quality, accessibility and contextual awareness, although acknowledging its smaller dataset. Generating appropriate content to address antimicrobial resistance proved challenging.
Conclusions: LLMs offer great promise to provide appropriate medical information. However, they should play a supporting role rather than representing a replacement option for medical professionals, confirming the need for expert oversight and improved artificial intelligence design.
{"title":"Large language models as information providers for appropriate antimicrobial use: computational text analysis and expert-rated comparison of ChatGPT, Claude and Gemini.","authors":"Marcello Di Pumpo, Maria Rosaria Gualano, Danilo Buonsenso, Francesca Raffaelli, Daniele Donà, Vittorio Maio, Patrizia Laurenti, Walter Ricciardi, Leonardo Villani","doi":"10.1136/bmjhci-2025-101632","DOIUrl":"10.1136/bmjhci-2025-101632","url":null,"abstract":"<p><strong>Objectives: </strong>Antimicrobial resistance is a critical public health threat. Large language models (LLMs) show great capability for providing health information. This study evaluates the effectiveness of LLMs in providing information on antibiotic use and infection management.</p><p><strong>Methods: </strong>Using a mixed-method approach, responses to healthcare expert-designed scenarios from ChatGPT 3.5, ChatGPT 4.0, Claude 2.0 and Gemini 1.0, in both Italian and English, were analysed. Computational text analysis assessed readability, lexical diversity and sentiment, while content quality was assessed by three experts via DISCERN tool.</p><p><strong>Results: </strong>16 scenarios were developed. A total of 101 outputs and 5454 Likert-scale (1-5) scores were obtained for the analysis. A general positive performance gradient was found from ChatGPT 3.5 and 4.0 to Claude to Gemini. Gemini, although producing only five outputs before self-inhibition, consistently outperformed the other models across almost all metrics, producing more detailed, accessible, varied content and a positive overtone. ChatGPT 4.0 demonstrated the highest lexical diversity. A difference in performance by language was observed. All models showed a median score of 1 (IQR=2) regarding the domain addressing antimicrobial resistance.</p><p><strong>Discussion: </strong>The study highlights a positive performance gradient towards Gemini, which showed superior content quality, accessibility and contextual awareness, although acknowledging its smaller dataset. Generating appropriate content to address antimicrobial resistance proved challenging.</p><p><strong>Conclusions: </strong>LLMs offer great promise to provide appropriate medical information. However, they should play a supporting role rather than representing a replacement option for medical professionals, confirming the need for expert oversight and improved artificial intelligence design.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12530423/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145298349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: To evaluate the impact of implementing a multidisciplinary integrated telehealth platform in central Taiwan on healthcare accessibility, emergency response and chronic disease management.
Methods: We analysed data from 26 institutions within a central Taiwan telehealth network between 2022 and 2024. The study evaluated the use and benefits of teleconsultation, artificial intelligence-assisted ECG monitoring during prehospital ambulance transfers and outcomes in patients with cryptogenic stroke following the platform integration. Satisfaction surveys were performed.
Results: By 2024, more than 300 teleconsultations were performed across 26 partner facilities. Non-emergent referral rates fell from 30% in 2022 to 10% in 2024 following teleconsultations. Emergent stroke teleconsultations allowed thrombolytic therapy within the golden hour in 83% of cases. At-home ECG monitoring helped detect atrial fibrillation in 25% of cryptogenic stroke patients within 2 weeks, ensuring timely recall and initiation of appropriate antiarrhythmic therapy to prevent recurrent stroke. Surveys indicated that 83% of healthcare providers and patients were satisfied with telehealth services.
Discussion: The single-centre study showcases a multidisciplinary integrated telehealth model. However, confounders existed, including changes in the healthcare system, selection bias and technology disparities. Satisfaction data may be biased. The short timeframe precludes long-term analysis, underscoring the need for broader, controlled studies to assess the sustained impact of telehealth.
Conclusion: The integrated telehealth centre model provides a scalable and replicable approach for healthcare delivery. Studies for long-term benefits and outcomes will help improve telehealth models.
{"title":"Implementing an integrated multidisciplinary telehealth platform: a case study at Taichung Veterans General Hospital.","authors":"Pei-Ju Tu, Jin-An Huang, Chi-Sheng Wang, Pi-Shan Hsu, Shi-Yi Lin, Yi-Ting Tsai, Ching-Tsung Chen, Chia-Hua Chu, Hui-Mei Huang, Jiunn-Cherng Lin, Hsin-Ju Tu, Yi-Ju Chen","doi":"10.1136/bmjhci-2025-101484","DOIUrl":"10.1136/bmjhci-2025-101484","url":null,"abstract":"<p><strong>Objectives: </strong>To evaluate the impact of implementing a multidisciplinary integrated telehealth platform in central Taiwan on healthcare accessibility, emergency response and chronic disease management.</p><p><strong>Methods: </strong>We analysed data from 26 institutions within a central Taiwan telehealth network between 2022 and 2024. The study evaluated the use and benefits of teleconsultation, artificial intelligence-assisted ECG monitoring during prehospital ambulance transfers and outcomes in patients with cryptogenic stroke following the platform integration. Satisfaction surveys were performed.</p><p><strong>Results: </strong>By 2024, more than 300 teleconsultations were performed across 26 partner facilities. Non-emergent referral rates fell from 30% in 2022 to 10% in 2024 following teleconsultations. Emergent stroke teleconsultations allowed thrombolytic therapy within the golden hour in 83% of cases. At-home ECG monitoring helped detect atrial fibrillation in 25% of cryptogenic stroke patients within 2 weeks, ensuring timely recall and initiation of appropriate antiarrhythmic therapy to prevent recurrent stroke. Surveys indicated that 83% of healthcare providers and patients were satisfied with telehealth services.</p><p><strong>Discussion: </strong>The single-centre study showcases a multidisciplinary integrated telehealth model. However, confounders existed, including changes in the healthcare system, selection bias and technology disparities. Satisfaction data may be biased. The short timeframe precludes long-term analysis, underscoring the need for broader, controlled studies to assess the sustained impact of telehealth.</p><p><strong>Conclusion: </strong>The integrated telehealth centre model provides a scalable and replicable approach for healthcare delivery. Studies for long-term benefits and outcomes will help improve telehealth models.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12530431/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145298362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-10DOI: 10.1136/bmjhci-2024-101418
Dorothee A Busch, Mats L Richter, Jens Hüsers, Mareike Przysucha, Florian Kücking, Jonathan M Mang, Maurice Moelleken, Joachim Dissemond, Jan Heggemann, Guido Hafer, Carola Berking, Cornelia Erfurt-Berge, Ursula H Hübner
Study objectives: Chronic wounds represent a significant economic and personal burden. For their successful treatment, the causes must be known and treated. Wounds caused by pyoderma gangrenosum (PG), a rare inflammatory skin disease, are often misdiagnosed. This study, therefore, aims to develop a machine learning model capable of differentiating PG from other wound types, focusing on chronic leg wounds to address this diagnostic challenge.
Methods: We used 3674 wound photographs from three specialised wound centres with the four most common types of foot and leg ulcers and the rare inflammatory differential diagnosis PG. The convolutional neural network classifier ConvNeXt 'B' was pretrained on LAION2B, ImageNet12k and ImageNet 1k before being trained and fine-tuned on an 85:15 train, validation split.
Results: The model achieved an overall high accuracy in multiclass classification of the chronic wounds (unbalanced accuracy 90%, balanced accuracy 87%). The sensitivity for identifying PG was 94%, while the sensitivity forother chronic wound types was 97% for diabetic foot ulcers (DFU), 92% for venous leg ulcers (VLU), 78% for mixed leg ulcers and 74% for arterial leg ulcers.
Discussion: The machine learning model effectively differentiates PG from the most common leg and foot ulcers and was very accurate for classifying DFU and VLU. A higher rate of misclassifications occurred for the other vascular ulcers, that is, mixed and arterial leg ulcers. This aligns with the challenges in clinical practice.
Conclusion: Despite the limited number of wound images, this novel multiclass wound classification model accurately identified PG and differentiated leg and foot ulcer subtypes, providing a foundation for a diagnostic support system.
{"title":"Machine learning model to classify chronic leg wounds and identify pyoderma gangrenosum.","authors":"Dorothee A Busch, Mats L Richter, Jens Hüsers, Mareike Przysucha, Florian Kücking, Jonathan M Mang, Maurice Moelleken, Joachim Dissemond, Jan Heggemann, Guido Hafer, Carola Berking, Cornelia Erfurt-Berge, Ursula H Hübner","doi":"10.1136/bmjhci-2024-101418","DOIUrl":"10.1136/bmjhci-2024-101418","url":null,"abstract":"<p><strong>Study objectives: </strong>Chronic wounds represent a significant economic and personal burden. For their successful treatment, the causes must be known and treated. Wounds caused by pyoderma gangrenosum (PG), a rare inflammatory skin disease, are often misdiagnosed. This study, therefore, aims to develop a machine learning model capable of differentiating PG from other wound types, focusing on chronic leg wounds to address this diagnostic challenge.</p><p><strong>Methods: </strong>We used 3674 wound photographs from three specialised wound centres with the four most common types of foot and leg ulcers and the rare inflammatory differential diagnosis PG. The convolutional neural network classifier ConvNeXt 'B' was pretrained on LAION2B, ImageNet12k and ImageNet 1k before being trained and fine-tuned on an 85:15 train, validation split.</p><p><strong>Results: </strong>The model achieved an overall high accuracy in multiclass classification of the chronic wounds (unbalanced accuracy 90%, balanced accuracy 87%). The sensitivity for identifying PG was 94%, while the sensitivity forother chronic wound types was 97% for diabetic foot ulcers (DFU), 92% for venous leg ulcers (VLU), 78% for mixed leg ulcers and 74% for arterial leg ulcers.</p><p><strong>Discussion: </strong>The machine learning model effectively differentiates PG from the most common leg and foot ulcers and was very accurate for classifying DFU and VLU. A higher rate of misclassifications occurred for the other vascular ulcers, that is, mixed and arterial leg ulcers. This aligns with the challenges in clinical practice.</p><p><strong>Conclusion: </strong>Despite the limited number of wound images, this novel multiclass wound classification model accurately identified PG and differentiated leg and foot ulcer subtypes, providing a foundation for a diagnostic support system.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12517012/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145273708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-10DOI: 10.1136/bmjhci-2024-101261
Thomas Hinneh, Bernard Mensah, Charles Kwanin, Chidi Okonkwo, Samuel Byiringiro
Objective: The increasing burden of hypertension in Africa underscores the need to embrace digital health innovations to improve delivery and access to quality hypertension care. This review aimed at (1) identifying barriers and facilitators to the implementation and uptake of digital health tools and (2) examining the scope and use of digital health tools based on the Practical Reviews in Self-Management Support (PRISMS) taxonomy.
Materials and methods: We searched PubMed, CINAHL (Medline) and HINARI from inception to June 2024. The Joanna Briggs Institute (Population, Concept, and Context (PCC)) framework guided the formulation of research questions, and the PRISMS taxonomy was used to analyse the functions of digital tools.
Results: Sixteen studies (k=16) across three African regions were included. Common digital health tools were mobile Health (mHealth) or electronic Health applications and short message service (SMS)-based interventions. Supported self-management functions included medication adherence (k=10), lifestyle counselling (k=12) and home blood pressure monitoring (k=9). Implementation strategies included prior training (k=10), continuous digital support (k=10) and provision of resources including BP devices and data credit (k=11). Targeted users were healthcare workers (k=8), patients (k=11) or both (k=3). Barriers included limited digital literacy, poor communication among healthcare workers, privacy concerns and weak internet infrastructure. Facilitators included competency-based training, contextual adaptations, continuous technical support and enhanced user experience through effective feedback systems between users.
Conclusion: Digital health tools, particularly mHealth apps and SMS, support key hypertension self-management tasks in African settings. Addressing technological and contextual barriers while reinforcing training and support systems is critical to successful implementation and scale-up.
目标:非洲高血压负担的日益加重凸显了采用数字卫生创新以改善提供和获得高质量高血压护理的必要性。该审查旨在(1)确定实施和采用数字健康工具的障碍和促进因素,(2)根据自我管理支持实践审查(PRISMS)分类法检查数字健康工具的范围和使用情况。材料与方法:检索PubMed、CINAHL (Medline)和HINARI数据库,检索时间为建站至2024年6月。乔安娜布里格斯研究所(Population, Concept, and Context, PCC)框架指导了研究问题的制定,PRISMS分类法用于分析数字工具的功能。结果:16项研究(k=16)涵盖了三个非洲地区。常见的数字保健工具是移动保健(mHealth)或电子保健应用程序和基于短信服务(SMS)的干预措施。支持的自我管理功能包括药物依从性(k=10)、生活方式咨询(k=12)和家庭血压监测(k=9)。实施策略包括预先培训(k=10)、持续数字化支持(k=10)和提供包括BP设备和数据信用在内的资源(k=11)。目标用户是医护人员(k=8)、患者(k=11)或两者(k=3)。障碍包括数字素养有限、卫生保健工作者之间沟通不畅、隐私问题和互联网基础设施薄弱。促进因素包括基于能力的培训、上下文适应、持续的技术支持和通过用户之间有效的反馈系统增强的用户体验。结论:数字健康工具,特别是移动健康应用程序和短信,支持非洲环境中的关键高血压自我管理任务。在加强培训和支持系统的同时,解决技术和背景障碍对于成功实施和扩大规模至关重要。
{"title":"Digital health tools in hypertension management in sub-Saharan Africa: a scoping review of barriers and facilitators of adoption into mainstream healthcare.","authors":"Thomas Hinneh, Bernard Mensah, Charles Kwanin, Chidi Okonkwo, Samuel Byiringiro","doi":"10.1136/bmjhci-2024-101261","DOIUrl":"10.1136/bmjhci-2024-101261","url":null,"abstract":"<p><strong>Objective: </strong>The increasing burden of hypertension in Africa underscores the need to embrace digital health innovations to improve delivery and access to quality hypertension care. This review aimed at (1) identifying barriers and facilitators to the implementation and uptake of digital health tools and (2) examining the scope and use of digital health tools based on the Practical Reviews in Self-Management Support (PRISMS) taxonomy.</p><p><strong>Materials and methods: </strong>We searched PubMed, CINAHL (Medline) and HINARI from inception to June 2024. The Joanna Briggs Institute (Population, Concept, and Context (PCC)) framework guided the formulation of research questions, and the PRISMS taxonomy was used to analyse the functions of digital tools.</p><p><strong>Results: </strong>Sixteen studies (k=16) across three African regions were included. Common digital health tools were mobile Health (mHealth) or electronic Health applications and short message service (SMS)-based interventions. Supported self-management functions included medication adherence (k=10), lifestyle counselling (k=12) and home blood pressure monitoring (k=9). Implementation strategies included prior training (k=10), continuous digital support (k=10) and provision of resources including BP devices and data credit (k=11). Targeted users were healthcare workers (k=8), patients (k=11) or both (k=3). Barriers included limited digital literacy, poor communication among healthcare workers, privacy concerns and weak internet infrastructure. Facilitators included competency-based training, contextual adaptations, continuous technical support and enhanced user experience through effective feedback systems between users.</p><p><strong>Conclusion: </strong>Digital health tools, particularly mHealth apps and SMS, support key hypertension self-management tasks in African settings. Addressing technological and contextual barriers while reinforcing training and support systems is critical to successful implementation and scale-up.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12517033/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145273733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-10DOI: 10.1136/bmjhci-2025-101709
Yang Fann
{"title":"Transforming healthcare with evidence-based digital health innovations.","authors":"Yang Fann","doi":"10.1136/bmjhci-2025-101709","DOIUrl":"10.1136/bmjhci-2025-101709","url":null,"abstract":"","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12517031/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145273713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-10DOI: 10.1136/bmjhci-2025-101448
Jennifer Ziegler, Barret N M Rush, Asher A Mendelson, Sylvain A Lother, Leo Celi
{"title":"Inflation of the journal impact factor.","authors":"Jennifer Ziegler, Barret N M Rush, Asher A Mendelson, Sylvain A Lother, Leo Celi","doi":"10.1136/bmjhci-2025-101448","DOIUrl":"10.1136/bmjhci-2025-101448","url":null,"abstract":"","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12516997/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145273782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: This study developed clinical evidence-based pathways (CEBPWs) to standardise treatment protocols, align diagnosis-reimbursement criteria, detect upcoding and enable early overtreatment warnings.
Methods: The CEBPWs were developed based on hospitalised patient-level data from 1 January 2022 to 31 June 2024 in 166 public hospitals in 16 administrative districts of Shanghai. It includes a total of 5 319 336 cases, involving 3 688 108 groups of 'diagnosis+therapy'. 2.61 billion records of hospitalisation charges and 876.45 million records of outpatient charges were collected. GROWTH algorithm was used to find the combination of frequently charged items for examination, treatment, drugs and devices in 'diagnosis+therapy' group.
Results: CEBPWs comprise five key elements: objective evidence identification, accurate classification, value weighting, frequency weighting and temporal sequencing of evidence. We applied CEBPWs to 224 diseases, detecting issues including upcoding, overtreatment and fragmented care episodes to enhance healthcare quality. CEBPWs achieve 100% coverage in diagnostics, therapy and consumables, with 81.81% drug coverage. The pilot evaluation showed that there were violations in 433 cases, with a frequency deviation of 8.64% and cost deviation of 10.82%, with 8.95% for diagnosis, 9.44% for therapy, 14.81% for drugs and 8.98% for consumables.
Discussion: We were developed CEBPWs, breaking the limitations of the clinical pathways is that the experience of clinical experts rather than objective criterion based on the characteristics of big data and lack of diagnostic and therapy standards integrated with payment standards.
Conclusion: The results indicate that CEBPW is critical tool for hospital management and regulation, address many drawbacks of clinical pathways.
{"title":"Development of data-driven clinical pathways: the big data clinical evidence-based pathways.","authors":"Xin Cui, Mengyun Sui, Hua Xie, Wen Chen, Wenqi Tian, Peiwen Wang, Xiaohua Jiang, Chen Fu, Su Xu","doi":"10.1136/bmjhci-2024-101312","DOIUrl":"10.1136/bmjhci-2024-101312","url":null,"abstract":"<p><strong>Objectives: </strong>This study developed clinical evidence-based pathways (CEBPWs) to standardise treatment protocols, align diagnosis-reimbursement criteria, detect upcoding and enable early overtreatment warnings.</p><p><strong>Methods: </strong>The CEBPWs were developed based on hospitalised patient-level data from 1 January 2022 to 31 June 2024 in 166 public hospitals in 16 administrative districts of Shanghai. It includes a total of 5 319 336 cases, involving 3 688 108 groups of 'diagnosis+therapy'. 2.61 billion records of hospitalisation charges and 876.45 million records of outpatient charges were collected. GROWTH algorithm was used to find the combination of frequently charged items for examination, treatment, drugs and devices in 'diagnosis+therapy' group.</p><p><strong>Results: </strong>CEBPWs comprise five key elements: objective evidence identification, accurate classification, value weighting, frequency weighting and temporal sequencing of evidence. We applied CEBPWs to 224 diseases, detecting issues including upcoding, overtreatment and fragmented care episodes to enhance healthcare quality. CEBPWs achieve 100% coverage in diagnostics, therapy and consumables, with 81.81% drug coverage. The pilot evaluation showed that there were violations in 433 cases, with a frequency deviation of 8.64% and cost deviation of 10.82%, with 8.95% for diagnosis, 9.44% for therapy, 14.81% for drugs and 8.98% for consumables.</p><p><strong>Discussion: </strong>We were developed CEBPWs, breaking the limitations of the clinical pathways is that the experience of clinical experts rather than objective criterion based on the characteristics of big data and lack of diagnostic and therapy standards integrated with payment standards.</p><p><strong>Conclusion: </strong>The results indicate that CEBPW is critical tool for hospital management and regulation, address many drawbacks of clinical pathways.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12506050/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145249602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: Due to the big disease burden of undiagnosed chronic obstructive pulmonary disease (COPD), we aimed to investigate the differences in the characteristics and risk factors of patients with undiagnosed COPD in China.
Methods: We used data from the 'Happy Breathing' Programme through April 2023. Current study is a cohort design. Participants were divided into high risk, undiagnosed and diagnosed COPD. Univariate logistic regression, lasso regression, decision tree, random forest and gradient boosting machine were used to screen the variables. Comparisons were conducted between undiagnosed and patients with diagnosed COPD.
Results: A total of 1603 high-risk, 4688 undiagnosed and 1634 patients with diagnosed COPD were identified. Patients with undiagnosed COPD had the lowest level of education, the poorest COPD-related knowledge and most biofuel users compared with high-risk populations and diagnosed patients (p<0.001). After multivariable logistic regression analysis, COPD-related knowledge score (OR=0.96, 95% CI 0.95 to 0.97), COPD Assessment Test Score (OR=1.01, 95% CI 1.00 to 1.02) and modified Medical Research Council Dyspnea Scale (OR=1.26, 95% CI 1.14 to 1.39) remained significant. Analysis of follow-up data showed that patients with undiagnosed COPD had lighter symptoms and experienced less acute exacerbations than diagnosed patients (p<0.001).
Discussion: Most patients with COPD remain undiagnosed until they feel dyspnoea or hospitalisation due to acute exacerbation. Undiagnosed COPD contributes significantly to the disease burden.
Conclusion: In China, patients with undiagnosed COPD were poorly educated, consumed more biofuels, smoked more and had limited COPD-related knowledge. Patients with undiagnosed COPD are also at risk of acute exacerbation.
{"title":"Characteristics and risk factors of patients with undiagnosed COPD in China: results of a nationwide study from the 'Happy Breathing' Programme with mixed methods evaluation.","authors":"Xingyao Tang, Jun Pan, Fang Fang, Yong Li, JiePing Lei, Hongtao Niu, Wei Li, Fen Dong, Zhoude Zheng, Yaodie Peng, Ting Yang, Chen Wang, Cunbo Jia, Ke Huang","doi":"10.1136/bmjhci-2024-101323","DOIUrl":"10.1136/bmjhci-2024-101323","url":null,"abstract":"<p><strong>Objectives: </strong>Due to the big disease burden of undiagnosed chronic obstructive pulmonary disease (COPD), we aimed to investigate the differences in the characteristics and risk factors of patients with undiagnosed COPD in China.</p><p><strong>Methods: </strong>We used data from the 'Happy Breathing' Programme through April 2023. Current study is a cohort design. Participants were divided into high risk, undiagnosed and diagnosed COPD. Univariate logistic regression, lasso regression, decision tree, random forest and gradient boosting machine were used to screen the variables. Comparisons were conducted between undiagnosed and patients with diagnosed COPD.</p><p><strong>Results: </strong>A total of 1603 high-risk, 4688 undiagnosed and 1634 patients with diagnosed COPD were identified. Patients with undiagnosed COPD had the lowest level of education, the poorest COPD-related knowledge and most biofuel users compared with high-risk populations and diagnosed patients (p<0.001). After multivariable logistic regression analysis, COPD-related knowledge score (OR=0.96, 95% CI 0.95 to 0.97), COPD Assessment Test Score (OR=1.01, 95% CI 1.00 to 1.02) and modified Medical Research Council Dyspnea Scale (OR=1.26, 95% CI 1.14 to 1.39) remained significant. Analysis of follow-up data showed that patients with undiagnosed COPD had lighter symptoms and experienced less acute exacerbations than diagnosed patients (p<0.001).</p><p><strong>Discussion: </strong>Most patients with COPD remain undiagnosed until they feel dyspnoea or hospitalisation due to acute exacerbation. Undiagnosed COPD contributes significantly to the disease burden.</p><p><strong>Conclusion: </strong>In China, patients with undiagnosed COPD were poorly educated, consumed more biofuels, smoked more and had limited COPD-related knowledge. Patients with undiagnosed COPD are also at risk of acute exacerbation.</p><p><strong>Trial registration number: </strong>NCT04318912.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12506152/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145243852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-05DOI: 10.1136/bmjhci-2025-101512
William Clackett, Ian A Zealley, Zelei Yang, Ghali Salahia, Richard D White
Objectives: This study aimed to evaluate the feasibility of using a large language model (LLM) to generate patient information leaflets (PILs) with improved readability based on PILs in the field of interventional radiology.
Methods: PILs were acquired from the Cardiovascular and Interventional Radiology Society of Europe website, reformatted, and uploaded to the GPT-4 user interface with a prompt aimed to simplify the language. Automated readability metrics were used to evaluate the readability of original and LLM-modified PILs. Factual accuracy was assessed by human evaluation from three consultant interventional radiologists using an agreed marking scheme.
Results: LLM-modified PILs had significantly lower mean reading grade (9.5±0.5) compared with original PILs (11.1±0.1) (p<0.01). However, the recommended reading grade of 6 (expected to be understood by 11- to 12-year-old children) was not achieved. Human evaluation revealed that most LLM-modified PILs had minor concerns regarding factual accuracy, but no errors that could result in serious patient harm were detected.
Discussion: LLMs appear to be a powerful tool in improving the readability of PILs within the field of interventional radiology. However, clinical experts are still required in PIL development to ensure the factual accuracy of these augmented documents is not compromised.
Conclusion: LLMs should be considered as a useful tool to assist with the development and revision of PILs in the field of interventional radiology.
{"title":"Better understanding: can a large language model safely improve readability of patient information leaflets in interventional radiology?","authors":"William Clackett, Ian A Zealley, Zelei Yang, Ghali Salahia, Richard D White","doi":"10.1136/bmjhci-2025-101512","DOIUrl":"10.1136/bmjhci-2025-101512","url":null,"abstract":"<p><strong>Objectives: </strong>This study aimed to evaluate the feasibility of using a large language model (LLM) to generate patient information leaflets (PILs) with improved readability based on PILs in the field of interventional radiology.</p><p><strong>Methods: </strong>PILs were acquired from the Cardiovascular and Interventional Radiology Society of Europe website, reformatted, and uploaded to the GPT-4 user interface with a prompt aimed to simplify the language. Automated readability metrics were used to evaluate the readability of original and LLM-modified PILs. Factual accuracy was assessed by human evaluation from three consultant interventional radiologists using an agreed marking scheme.</p><p><strong>Results: </strong>LLM-modified PILs had significantly lower mean reading grade (9.5±0.5) compared with original PILs (11.1±0.1) (p<0.01). However, the recommended reading grade of 6 (expected to be understood by 11- to 12-year-old children) was not achieved. Human evaluation revealed that most LLM-modified PILs had minor concerns regarding factual accuracy, but no errors that could result in serious patient harm were detected.</p><p><strong>Discussion: </strong>LLMs appear to be a powerful tool in improving the readability of PILs within the field of interventional radiology. However, clinical experts are still required in PIL development to ensure the factual accuracy of these augmented documents is not compromised.</p><p><strong>Conclusion: </strong>LLMs should be considered as a useful tool to assist with the development and revision of PILs in the field of interventional radiology.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12506176/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145237997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}