Pub Date : 2025-11-13DOI: 10.1136/bmjhci-2025-101641
Yanjia Cao, Jiashuo Sun, Sali Ahmed, Pakwanja Twea, Jonathan Chiwanda Banda, David A Watkins, Yanfang Su
Objectives: The prevalence of non-communicable diseases (NCDs) is rising in low- and middle-income countries, including Malawi, yet spatial inequalities in NCD healthcare coverage remain poorly understood. In this research, we aim to: (1) develop a novel hierarchical geospatial framework to assess population coverage and accessibility of NCD services in Malawi and (2) identify underserved areas and provide evidence for targeted resource allocation.
Methods: Using 2019 Malawi Harmonized Health Facility Assessment Survey, hierarchical catchment areas were defined by facility type-primary healthcare (PHCs), district-level and central hospitals, with distance thresholds of 5 km walking, 25 km driving and 100 km driving, respectively. Incorporating facility readiness, we computed population coverage at the third administrative level. When estimating spatial accessibility, we used enhanced two-step floating catchment area, applying Gaussian distance decay for chronic conditions and inverse power for acute conditions.
Results: Secondary and tertiary facilities (STFs) covered over 60% of population, providing broader NCD service than PHCs, where coverage was lower than 20%, particularly for acute conditions. Population coverage was higher in central and southeastern Malawi, notably around Mzuzu, Lilongwe and Blantyre. However, at least 24% of the population were not covered for any NCD conditions. Additionally, only 11.9% of the population lived in regions of high or very high accessibility to PHCs.
Discussion: We found substantial geographic inequalities in NCD service coverage and access, highlighting underserved regions and the demand to strengthen PHC readiness.
Conclusion: This hierarchical geospatial approach offers insights for resource allocation and improving healthcare equity in other low-resource settings.
{"title":"Examining healthcare inequality for non-communicable diseases in Malawi: a hierarchical geospatial modelling approach.","authors":"Yanjia Cao, Jiashuo Sun, Sali Ahmed, Pakwanja Twea, Jonathan Chiwanda Banda, David A Watkins, Yanfang Su","doi":"10.1136/bmjhci-2025-101641","DOIUrl":"10.1136/bmjhci-2025-101641","url":null,"abstract":"<p><strong>Objectives: </strong>The prevalence of non-communicable diseases (NCDs) is rising in low- and middle-income countries, including Malawi, yet spatial inequalities in NCD healthcare coverage remain poorly understood. In this research, we aim to: (1) develop a novel hierarchical geospatial framework to assess population coverage and accessibility of NCD services in Malawi and (2) identify underserved areas and provide evidence for targeted resource allocation.</p><p><strong>Methods: </strong>Using 2019 Malawi Harmonized Health Facility Assessment Survey, hierarchical catchment areas were defined by facility type-primary healthcare (PHCs), district-level and central hospitals, with distance thresholds of 5 km walking, 25 km driving and 100 km driving, respectively. Incorporating facility readiness, we computed population coverage at the third administrative level. When estimating spatial accessibility, we used enhanced two-step floating catchment area, applying Gaussian distance decay for chronic conditions and inverse power for acute conditions.</p><p><strong>Results: </strong>Secondary and tertiary facilities (STFs) covered over 60% of population, providing broader NCD service than PHCs, where coverage was lower than 20%, particularly for acute conditions. Population coverage was higher in central and southeastern Malawi, notably around Mzuzu, Lilongwe and Blantyre. However, at least 24% of the population were not covered for any NCD conditions. Additionally, only 11.9% of the population lived in regions of high or very high accessibility to PHCs.</p><p><strong>Discussion: </strong>We found substantial geographic inequalities in NCD service coverage and access, highlighting underserved regions and the demand to strengthen PHC readiness.</p><p><strong>Conclusion: </strong>This hierarchical geospatial approach offers insights for resource allocation and improving healthcare equity in other low-resource settings.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12625903/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145522493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-29DOI: 10.1136/bmjhci-2024-101360
Rishi Ramessur, Roxanne Crosby-Nwaobi, Claire Lovegrove, Julia Theodossiades, Rachel Thomas, Estelle Ioannidou, Radhika Rampat, Peter B M Thomas
Objectives: To explore factors important to patients when choosing a secondary care provider.
Methods: A survey was distributed to 376 participants at Moorfields Eye Hospital comprising both free-text responses and Likert scale elements. Word frequency analysis was applied to free-text responses, and K-means analysis to identify clusters from Likert responses.
Results: Reputation, expertise, quality of care and clinical outcomes were the most important factors driving patient preference-more so than practicalities such as travel, ease of access, parking and proximity to healthcare provider. One of two identified clusters of patients appeared to place higher value on societal benefits (eg, sustainability, carbon footprint minimisation, cost-efficiency for National Health Service (NHS)) than the other.
Discussion: The current NHS approach of highlighting travel distance, wait times and Care Quality Commission (CQC) ratings when presenting choice is inadequate to support informed choice of provider. Further work in a more representative cohort and exploration of real-world patient decisions is warranted.
{"title":"Exploring factors influencing patient choice in outpatient ophthalmology provider in the North London region: a patient survey.","authors":"Rishi Ramessur, Roxanne Crosby-Nwaobi, Claire Lovegrove, Julia Theodossiades, Rachel Thomas, Estelle Ioannidou, Radhika Rampat, Peter B M Thomas","doi":"10.1136/bmjhci-2024-101360","DOIUrl":"10.1136/bmjhci-2024-101360","url":null,"abstract":"<p><strong>Objectives: </strong>To explore factors important to patients when choosing a secondary care provider.</p><p><strong>Methods: </strong>A survey was distributed to 376 participants at Moorfields Eye Hospital comprising both free-text responses and Likert scale elements. Word frequency analysis was applied to free-text responses, and K-means analysis to identify clusters from Likert responses.</p><p><strong>Results: </strong>Reputation, expertise, quality of care and clinical outcomes were the most important factors driving patient preference-more so than practicalities such as travel, ease of access, parking and proximity to healthcare provider. One of two identified clusters of patients appeared to place higher value on societal benefits (eg, sustainability, carbon footprint minimisation, cost-efficiency for National Health Service (NHS)) than the other.</p><p><strong>Discussion: </strong>The current NHS approach of highlighting travel distance, wait times and Care Quality Commission (CQC) ratings when presenting choice is inadequate to support informed choice of provider. Further work in a more representative cohort and exploration of real-world patient decisions is warranted.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12574417/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145399732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objective: Inefficiencies and high administrative burdens are commonly reported in multidisciplinary tumour boards (TBs). The objective of the study was to evaluate the impact of a digital solution on the common challenges encountered in the urogenital and gynaecology cancer TBs in Central Netherlands.
Methods: A single-arm, mixed-methods observational study was conducted over 32 weeks in 2023. Data were collected from surveys and the Vitaly system across baseline and post implementation TB meetings.
Results: The implementation of Vitaly resulted in a 43% reduction in preparation time for urology TBs and a 44% reduction for gynaecology TBs. Postponement rates decreased by nearly 50% for urology and maintained significantly below the 5% acceptable threshold, for both urology and gynaecology TBs.
Discussion and conclusions: Despite challenges in data collection due to varied hospital environments, the study showed positive outcomes that underscore the potential benefits of such digital transformation and broader applicability and scalability across various clinical settings.
{"title":"Digital tumour board solution enhances case preparation time and reduces postponements: an implementer report.","authors":"Katja Brne, Katharina Abraham, Mieke Arnoldus, Kim Romijnders, Špela Uršič Bensa, Jaap Trappenburg","doi":"10.1136/bmjhci-2024-101332","DOIUrl":"10.1136/bmjhci-2024-101332","url":null,"abstract":"<p><strong>Objective: </strong>Inefficiencies and high administrative burdens are commonly reported in multidisciplinary tumour boards (TBs). The objective of the study was to evaluate the impact of a digital solution on the common challenges encountered in the urogenital and gynaecology cancer TBs in Central Netherlands.</p><p><strong>Methods: </strong>A single-arm, mixed-methods observational study was conducted over 32 weeks in 2023. Data were collected from surveys and the Vitaly system across baseline and post implementation TB meetings.</p><p><strong>Results: </strong>The implementation of Vitaly resulted in a 43% reduction in preparation time for urology TBs and a 44% reduction for gynaecology TBs. Postponement rates decreased by nearly 50% for urology and maintained significantly below the 5% acceptable threshold, for both urology and gynaecology TBs.</p><p><strong>Discussion and conclusions: </strong>Despite challenges in data collection due to varied hospital environments, the study showed positive outcomes that underscore the potential benefits of such digital transformation and broader applicability and scalability across various clinical settings.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12606494/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145399754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-28DOI: 10.1136/bmjhci-2025-101516
Surui Liang, Xiaojiao Wang, Jing Jing Su, Winnie Wai In Wong, Rick Yiu Cho Kwan
Objectives: The study aims to evaluate the effectiveness of nature-based virtual reality (NBVR) interventions in alleviating physical symptoms (eg, pain, fatigue, nausea) and mental symptoms (eg, anxiety, depression, distress) in patients with cancer undergoing chemotherapy.
Methods: This systematic review protocol is developed in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. We will conduct a comprehensive electronic search of healthcare databases for studies published in English from 2012 to February 2025. The databases to be searched include PubMed, EMBASE, Web of Science, PsycINFO, Allied and Complementary Medicine, CINAHL Complete and Cochrane Library. We will include randomised controlled trials investigating NBVR aimed at improving the physical and psychological outcomes for patients with cancer during chemotherapy, compared with waitlist controls or traditional methods. The risk of bias will be systematically evaluated using the Cochrane Risk of Bias tool.
Results: As this is a protocol, no results are available yet. The review will present quantitative and qualitative syntheses of NBVR effects on physical and psychological outcomes in chemotherapy patients.
Discussion: By evaluating rigorously designed trials, this review will clarify the potential of NBVR as a supportive intervention for patients with cancer during chemotherapy. Findings are expected to inform clinical practice and guide future research directions in integrating immersive natural environments into cancer care.
Conclusion: This systematic review will provide comprehensive evidence on NBVR interventions for patients with cancer receiving chemotherapy, highlighting their potential benefits for physical symptom relief and mental health improvement.
Prospero registration number: CRD420251016213.
目的:本研究旨在评估基于自然的虚拟现实(NBVR)干预在缓解癌症化疗患者身体症状(如疼痛、疲劳、恶心)和精神症状(如焦虑、抑郁、苦恼)方面的有效性。方法:本系统评价方案是根据系统评价和荟萃分析指南的首选报告项目制定的。我们将在2012年至2025年2月期间对以英文发表的研究进行全面的医疗数据库电子检索。检索数据库包括PubMed、EMBASE、Web of Science、PsycINFO、Allied and Complementary Medicine、CINAHL Complete和Cochrane Library。我们将纳入调查NBVR的随机对照试验,目的是与等候名单对照或传统方法相比,改善癌症患者化疗期间的生理和心理结果。将使用Cochrane偏倚风险工具对偏倚风险进行系统评估。结果:由于这是一个方案,目前还没有结果。本综述将介绍NBVR对化疗患者生理和心理结果的定量和定性综合影响。讨论:通过评估严格设计的试验,本综述将阐明NBVR作为化疗期间癌症患者支持干预的潜力。研究结果有望为临床实践提供信息,并指导未来将沉浸式自然环境融入癌症治疗的研究方向。结论:本系统综述将为癌症化疗患者的NBVR干预提供全面的证据,强调其在缓解身体症状和改善心理健康方面的潜在益处。普洛斯彼罗注册号:CRD420251016213。
{"title":"Effects of nature-based virtual reality interventions on physical and mental health symptoms in patients with cancer undergoing chemotherapy: a systematic review and meta-analysis protocol.","authors":"Surui Liang, Xiaojiao Wang, Jing Jing Su, Winnie Wai In Wong, Rick Yiu Cho Kwan","doi":"10.1136/bmjhci-2025-101516","DOIUrl":"10.1136/bmjhci-2025-101516","url":null,"abstract":"<p><strong>Objectives: </strong>The study aims to evaluate the effectiveness of nature-based virtual reality (NBVR) interventions in alleviating physical symptoms (eg, pain, fatigue, nausea) and mental symptoms (eg, anxiety, depression, distress) in patients with cancer undergoing chemotherapy.</p><p><strong>Methods: </strong>This systematic review protocol is developed in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. We will conduct a comprehensive electronic search of healthcare databases for studies published in English from 2012 to February 2025. The databases to be searched include PubMed, EMBASE, Web of Science, PsycINFO, Allied and Complementary Medicine, CINAHL Complete and Cochrane Library. We will include randomised controlled trials investigating NBVR aimed at improving the physical and psychological outcomes for patients with cancer during chemotherapy, compared with waitlist controls or traditional methods. The risk of bias will be systematically evaluated using the Cochrane Risk of Bias tool.</p><p><strong>Results: </strong>As this is a protocol, no results are available yet. The review will present quantitative and qualitative syntheses of NBVR effects on physical and psychological outcomes in chemotherapy patients.</p><p><strong>Discussion: </strong>By evaluating rigorously designed trials, this review will clarify the potential of NBVR as a supportive intervention for patients with cancer during chemotherapy. Findings are expected to inform clinical practice and guide future research directions in integrating immersive natural environments into cancer care.</p><p><strong>Conclusion: </strong>This systematic review will provide comprehensive evidence on NBVR interventions for patients with cancer receiving chemotherapy, highlighting their potential benefits for physical symptom relief and mental health improvement.</p><p><strong>Prospero registration number: </strong>CRD420251016213.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12574406/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145386892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-22DOI: 10.1136/bmjhci-2025-101631
Elma Jelin, Lilja Charlotte Storset, Rebecka M Norman, Hilde Hestad Hestad Iversen, Lina Harvold Ellingsen-Dalskau, Petter Mæhlum, Erik Velldal, Lilja Øvrelid, Oyvind Bjertnaes
Background: Automatic analysis of free-text patient comments enables the efficient processing of large feedback volumes, reducing reliance on manual review. A 2021 review examined natural language processing (NLP) and sentiment analysis (SA) in patient experience research; however, recent advances in deep learning and generative artificial intelligence (AI) call for an updated synthesis.
Objectives: This scoping review aims to map and summarise recent studies applying SA to unstructured patient experience data related to healthcare services.
Methods: Following Joanna Briggs Institute methodology and PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines, we conducted a comprehensive search across Medline, CINAHL, Web of Science, Cochrane, Embase and APA PsycINFO. We included studies published from January 2020 to March 2024 in English or a Scandinavian language. Eligible studies analysed patient feedback using NLP techniques and described the development or validation of SA models. Two reviewers independently screened the studies and extracted data, which were presented in tables both tabular and narrative forms.
Results: 30 studies were included, primarily from the USA, Europe and Asia. Patient comments were mostly sourced from online platforms such as social media. Feedback largely concerned hospital care. 18 studies employed rule-based SA approaches, while 12 applied supervised machine learning (ML) and only 4 studies used deep learning models. Few addressed the visualisation or practical applications in healthcare.
Conclusion: Despite significant progress, modern methods like deep learning and generative AI remain underused in SA of patient-experience data. Limited focus on implementation restricts SA's role in quality improvement. Future research should assess advanced methods and their cost-effectiveness versus traditional ML.
背景:对自由文本患者评论的自动分析能够有效处理大量反馈,减少对人工审查的依赖。2021年的一篇综述研究了患者体验研究中的自然语言处理(NLP)和情感分析(SA);然而,深度学习和生成式人工智能(AI)的最新进展要求进行更新的综合。目的:本综述旨在绘制和总结将SA应用于与医疗服务相关的非结构化患者体验数据的最新研究。方法:根据Joanna Briggs研究所的方法和PRISMA-ScR(首选报告项目为系统评价和荟萃分析扩展范围评价)指南,我们在Medline, CINAHL, Web of Science, Cochrane, Embase和APA PsycINFO上进行了全面的检索。我们纳入了从2020年1月到2024年3月用英语或斯堪的纳维亚语发表的研究。合格的研究使用NLP技术分析了患者反馈,并描述了SA模型的开发或验证。两名审稿人独立筛选研究并提取数据,以表格和叙述形式呈现。结果:纳入了30项研究,主要来自美国、欧洲和亚洲。患者的评论大多来自社交媒体等在线平台。反馈主要与医院护理有关。18项研究采用了基于规则的SA方法,12项研究应用了监督机器学习(ML),只有4项研究使用了深度学习模型。很少有人讨论可视化或在医疗保健中的实际应用。结论:尽管取得了重大进展,但深度学习和生成式人工智能等现代方法在患者体验数据的SA中仍未得到充分利用。对实施的有限关注限制了SA在质量改进中的作用。未来的研究应该评估先进的方法及其与传统机器学习相比的成本效益。
{"title":"From words to action? A scoping review on automatic sentiment analysis of patient experience comments from online sources and surveys.","authors":"Elma Jelin, Lilja Charlotte Storset, Rebecka M Norman, Hilde Hestad Hestad Iversen, Lina Harvold Ellingsen-Dalskau, Petter Mæhlum, Erik Velldal, Lilja Øvrelid, Oyvind Bjertnaes","doi":"10.1136/bmjhci-2025-101631","DOIUrl":"10.1136/bmjhci-2025-101631","url":null,"abstract":"<p><strong>Background: </strong>Automatic analysis of free-text patient comments enables the efficient processing of large feedback volumes, reducing reliance on manual review. A 2021 review examined natural language processing (NLP) and sentiment analysis (SA) in patient experience research; however, recent advances in deep learning and generative artificial intelligence (AI) call for an updated synthesis.</p><p><strong>Objectives: </strong>This scoping review aims to map and summarise recent studies applying SA to unstructured patient experience data related to healthcare services.</p><p><strong>Methods: </strong>Following Joanna Briggs Institute methodology and PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines, we conducted a comprehensive search across Medline, CINAHL, Web of Science, Cochrane, Embase and APA PsycINFO. We included studies published from January 2020 to March 2024 in English or a Scandinavian language. Eligible studies analysed patient feedback using NLP techniques and described the development or validation of SA models. Two reviewers independently screened the studies and extracted data, which were presented in tables both tabular and narrative forms.</p><p><strong>Results: </strong>30 studies were included, primarily from the USA, Europe and Asia. Patient comments were mostly sourced from online platforms such as social media. Feedback largely concerned hospital care. 18 studies employed rule-based SA approaches, while 12 applied supervised machine learning (ML) and only 4 studies used deep learning models. Few addressed the visualisation or practical applications in healthcare.</p><p><strong>Conclusion: </strong>Despite significant progress, modern methods like deep learning and generative AI remain underused in SA of patient-experience data. Limited focus on implementation restricts SA's role in quality improvement. Future research should assess advanced methods and their cost-effectiveness versus traditional ML.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12548608/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145342827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-22DOI: 10.1136/bmjhci-2024-101378
Priyanka D Sood, Star Liu, Rita R Kalyani, Hadi Kharrazi
Objective: Significant variations exist in computable phenotype definitions to identify patients with type 2 diabetes (T2D) using electronic health records (EHRs). These variations cause challenges in identifying T2D populations for clinical research. To address these challenges, this study compares the variations in common phenotypes in identifying patients with T2D using EHRs.
Methods: A retrospective data analysis was performed using clinical data extracted from EHRs of 207 813 adult patients captured 2017-2019. Multiple T2D phenotypes were used: (1) Surveillance, Prevention and Management of Diabetes Mellitus, (2) Centers for Medicare and Medicaid Services Chronic Conditions Data Warehouse (CCW), (3) eMERGE Northwestern Group, (4) Durham Diabetes Coalition (DDC) and (5) a definition developed by a panel of experts at Johns Hopkins.
Results: Each phenotype definition identified a different T2D population with a unique composition of demographics and clinical features. Although the identified patients overlapped across phenotypes, only 22.7% (47 326) of the population was commonly identified across all definitions. Of the phenotypes, DDC identified the greatest number of patients with T2D (139 832, 67.3%), while CCW had the highest mean age (65.3 years), the highest percentage of black patients (35%) and the highest mean Charlson comorbidity score of 2.96. DDC identified patients with T2D with the lowest means of inpatient (0.64) and emergency room (1.06) visits.
Conclusion: Our study highlights the complexity of computable T2D phenotypes in translating commonly agreed T2D clinical definitions when applied against retrospective EHR data. Our findings provide an understanding of using appropriate phenotypes to identify, enrol and analyse T2D populations of interest using EHR data.
{"title":"Comparing computable type 2 diabetes phenotype definitions in identifying populations of interest for clinical research.","authors":"Priyanka D Sood, Star Liu, Rita R Kalyani, Hadi Kharrazi","doi":"10.1136/bmjhci-2024-101378","DOIUrl":"10.1136/bmjhci-2024-101378","url":null,"abstract":"<p><strong>Objective: </strong>Significant variations exist in computable phenotype definitions to identify patients with type 2 diabetes (T2D) using electronic health records (EHRs). These variations cause challenges in identifying T2D populations for clinical research. To address these challenges, this study compares the variations in common phenotypes in identifying patients with T2D using EHRs.</p><p><strong>Methods: </strong>A retrospective data analysis was performed using clinical data extracted from EHRs of 207 813 adult patients captured 2017-2019. Multiple T2D phenotypes were used: (1) Surveillance, Prevention and Management of Diabetes Mellitus, (2) Centers for Medicare and Medicaid Services Chronic Conditions Data Warehouse (CCW), (3) eMERGE Northwestern Group, (4) Durham Diabetes Coalition (DDC) and (5) a definition developed by a panel of experts at Johns Hopkins.</p><p><strong>Results: </strong>Each phenotype definition identified a different T2D population with a unique composition of demographics and clinical features. Although the identified patients overlapped across phenotypes, only 22.7% (47 326) of the population was commonly identified across all definitions. Of the phenotypes, DDC identified the greatest number of patients with T2D (139 832, 67.3%), while CCW had the highest mean age (65.3 years), the highest percentage of black patients (35%) and the highest mean Charlson comorbidity score of 2.96. DDC identified patients with T2D with the lowest means of inpatient (0.64) and emergency room (1.06) visits.</p><p><strong>Conclusion: </strong>Our study highlights the complexity of computable T2D phenotypes in translating commonly agreed T2D clinical definitions when applied against retrospective EHR data. Our findings provide an understanding of using appropriate phenotypes to identify, enrol and analyse T2D populations of interest using EHR data.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12548627/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145342819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: To develop a machine learning (ML)-based predictive model for assessing the risk of pre-eclampsia using routinely collected clinical data.
Methods: We retrospectively analysed data from 2444 pregnant women who delivered at Chi Mei Medical Center between 2015 and 2019, excluding patients under 20 years old. Pre-eclampsia was defined as blood pressure >140/90 mm Hg with proteinuria. Key features included gestational age, body weight, blood pressure and medical history. Five ML models were trained-logistic regression, random forest, light gradient boosting machine, extreme gradient boosting (XGBoost) and multilayer perceptron-using a 70/30 train-validation split. Synthetic minority oversampling technique was applied to address class imbalance. SHapley Additive exPlanations (SHAP) analysis was used for feature importance.
Results: Among the five models, XGBoost showed the best performance with the highest accuracy, sensitivity, specificity and an area under the receiver operating characteristics curve of 0.921. SHAP analysis identified diastolic blood pressure, systolic blood pressure and urine glucose as top predictors.
Discussion: Our findings demonstrate that ML, particularly XGBoost, can effectively predict the risk of pre-eclampsia using standard clinical data. This approach avoids the need for expensive tests while maintaining high accuracy.
Conclusion: The XGBoost model offers a cost-effective and accurate method for pre-eclampsia risk prediction, enabling real-time assessment and supporting early intervention. Future studies will focus on larger data sets and clinical integration.
目的:利用常规收集的临床数据,开发一种基于机器学习(ML)的预测模型,用于评估子痫前期风险。方法:我们回顾性分析2015年至2019年在奇美医疗中心分娩的2444名孕妇的数据,不包括20岁以下的患者。先兆子痫定义为血压bb0 140/90 mm Hg伴蛋白尿。主要特征包括胎龄、体重、血压和病史。使用70/30的训练-验证分割,训练了5个ML模型-逻辑回归,随机森林,轻梯度增强机,极端梯度增强机(XGBoost)和多层感知器。采用合成少数派过采样技术解决类不平衡问题。特征重要性采用SHapley加性解释(SHAP)分析。结果:5种模型中,XGBoost表现最佳,准确度、灵敏度、特异度最高,受试者工作特征曲线下面积为0.921。SHAP分析确定舒张压、收缩压和尿糖是最重要的预测因子。讨论:我们的研究结果表明,ML,特别是XGBoost,可以使用标准临床数据有效预测子痫前期的风险。这种方法避免了昂贵的测试,同时保持了较高的准确性。结论:XGBoost模型为子痫前期风险预测提供了一种经济、准确的方法,可实时评估并支持早期干预。未来的研究将集中于更大的数据集和临床整合。
{"title":"Machine learning predictive system to predict the risk of developing pre-eclampsia.","authors":"Ing-Luen Shyu, Chung-Feng Liu, Yung-Chieh Tsai, Yu-Shan Ma, Tian-Ni Kuo, Shiue Yow-Ling","doi":"10.1136/bmjhci-2024-101151","DOIUrl":"10.1136/bmjhci-2024-101151","url":null,"abstract":"<p><strong>Objectives: </strong>To develop a machine learning (ML)-based predictive model for assessing the risk of pre-eclampsia using routinely collected clinical data.</p><p><strong>Methods: </strong>We retrospectively analysed data from 2444 pregnant women who delivered at Chi Mei Medical Center between 2015 and 2019, excluding patients under 20 years old. Pre-eclampsia was defined as blood pressure >140/90 mm Hg with proteinuria. Key features included gestational age, body weight, blood pressure and medical history. Five ML models were trained-logistic regression, random forest, light gradient boosting machine, extreme gradient boosting (XGBoost) and multilayer perceptron-using a 70/30 train-validation split. Synthetic minority oversampling technique was applied to address class imbalance. SHapley Additive exPlanations (SHAP) analysis was used for feature importance.</p><p><strong>Results: </strong>Among the five models, XGBoost showed the best performance with the highest accuracy, sensitivity, specificity and an area under the receiver operating characteristics curve of 0.921. SHAP analysis identified diastolic blood pressure, systolic blood pressure and urine glucose as top predictors.</p><p><strong>Discussion: </strong>Our findings demonstrate that ML, particularly XGBoost, can effectively predict the risk of pre-eclampsia using standard clinical data. This approach avoids the need for expensive tests while maintaining high accuracy.</p><p><strong>Conclusion: </strong>The XGBoost model offers a cost-effective and accurate method for pre-eclampsia risk prediction, enabling real-time assessment and supporting early intervention. Future studies will focus on larger data sets and clinical integration.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12542561/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145311934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: Rapid discrimination of infections caused by Mycobacterium tuberculosis (MTB) and non-tuberculous mycobacteria (NTM) is crucial in clinical settings. Despite overlapping clinical and radiological features, the two require markedly different therapeutic approaches and public health responses. Current laboratory methods are time-consuming and complex, underscoring the urgent need for a simple and efficient diagnostic tool to inform public health decision-making.
Methods: Demographic, haematological and biochemical data were collected from two hospitals in Jiangsu province, China, between December 2018 and October 2024. A total of 400 patients were included in the training cohort, with 66 patients used for external validation. Six machine learning models were developed using routine laboratory features, and their performance was evaluated using multiple metrics.
Results: The random forest (RF) model outperformed others using 49 routine lab features, achieving 82.71% accuracy in the internal cohort and 87.69% in external validation. SHapley Additive exPlanations (SHAP) model identified the top 10 critical features influencing model decisions, namely, chloride, sodium, gender, prealbumin, high-density lipoprotein, procalcitonin, albumin, globulin, total protein and creatine. Based on these indicators, an interactive web-based tool was developed (https://mtb-ntm.streamlit.app).
Discussion: The features identified by the model align with established clinical parameters and existing studies. Certain previously underestimated variables, such as Cl and Na, exhibited substantial importance in distinguishing between MTB and NTM, offering valuable insights for the development of decision-support tools.
Conclusion: Routine laboratory indicators coupled with the RF model demonstrated potential capacity as an auxiliary diagnostic tool for discriminating MTB and NTM disease, offering effective medical support in resource-limited and remote settings.
{"title":"Rapid discrimination of <i>Mycobacterium tuberculosis</i> and non-tuberculous mycobacteria disease via interpretive machine learning analysis of routine laboratory tests.","authors":"Jia-Wei Tang, Xue-Song Xiong, Ting-Ting Huang, Yu-Lu Zhang, Lin-Fei Yao, Wen-Wen Zhang, Yun-Yun Xie, Quan-Fa Liang, Zhi-Xuan Tan, Kun Jiang, Xin Liu, Liang Wang","doi":"10.1136/bmjhci-2025-101575","DOIUrl":"10.1136/bmjhci-2025-101575","url":null,"abstract":"<p><strong>Objectives: </strong>Rapid discrimination of infections caused by <i>Mycobacterium tuberculosis</i> (MTB) and non-tuberculous mycobacteria (NTM) is crucial in clinical settings. Despite overlapping clinical and radiological features, the two require markedly different therapeutic approaches and public health responses. Current laboratory methods are time-consuming and complex, underscoring the urgent need for a simple and efficient diagnostic tool to inform public health decision-making.</p><p><strong>Methods: </strong>Demographic, haematological and biochemical data were collected from two hospitals in Jiangsu province, China, between December 2018 and October 2024. A total of 400 patients were included in the training cohort, with 66 patients used for external validation. Six machine learning models were developed using routine laboratory features, and their performance was evaluated using multiple metrics.</p><p><strong>Results: </strong>The random forest (RF) model outperformed others using 49 routine lab features, achieving 82.71% accuracy in the internal cohort and 87.69% in external validation. SHapley Additive exPlanations (SHAP) model identified the top 10 critical features influencing model decisions, namely, chloride, sodium, gender, prealbumin, high-density lipoprotein, procalcitonin, albumin, globulin, total protein and creatine. Based on these indicators, an interactive web-based tool was developed (https://mtb-ntm.streamlit.app).</p><p><strong>Discussion: </strong>The features identified by the model align with established clinical parameters and existing studies. Certain previously underestimated variables, such as Cl and Na, exhibited substantial importance in distinguishing between MTB and NTM, offering valuable insights for the development of decision-support tools.</p><p><strong>Conclusion: </strong>Routine laboratory indicators coupled with the RF model demonstrated potential capacity as an auxiliary diagnostic tool for discriminating MTB and NTM disease, offering effective medical support in resource-limited and remote settings.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12542539/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145312016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-15DOI: 10.1136/bmjhci-2025-101640
Santiago Frid, Octavi Bassegoda, Maria Araceli Camacho Mahamud, Gemma Sanjuan, Miguel Ángel Armengol de la Hoz, Leo Celi, Isaac Cano Franco, Gerard Anmella, Tomas Cuñat López, Ana Lucía Arellano, Lina María Leguízamo-Martínez, Laura Mezquita, Petter Axcell Peñafiel Macías, Antonio Gallardo-Pizarro, Ruben González Colom, Arturo Renú Jornet, Guillem Bracons Cucó, Xavier Borrat Frigola
Objectives: To describe the implementation of a multidisciplinary, ethically grounded hackathon as a model to develop and evaluate generative AI (GenAI) solutions for real-world clinical challenges within a hospital setting.
Methods: The GenAI Health Hackathon (GAHH) organised at Hospital Clínic de Barcelona included 13 challenges were selected via an internal call based on clinical impact, feasibility and data availability. Participants accessed anonymised real-world data through a secure cloud environment. Teams employed large language models and retrieval-augmented generation to build prototypes addressing tasks such as clinical text structuring, decision support and workflow automation. Human-in-the-loop validation, explainability and regulatory safeguards were emphasised.
Results: The hackathon yielded multiple AI prototypes tested on real data. Results varied: entity recognition reached 90.5% accuracy, summarisation >90% clinician concordance and nutritional models achieved F1 scores of 0.75-0.93. Lower scores (F1<0.52, Jaccard Index <0.4) were seen in complex reasoning or multilingual tasks. Bias was explored in 10 projects, with mitigations such as stratified sampling, prompt tuning, disclaimers and expert oversight. A transferable framework was proposed to replicate responsible GenAI hackathons in clinical contexts.
Discussion: Interdisciplinary collaboration and real-world testing proved essential for aligning GenAI with clinical needs. The hackathon revealed challenges in bias, evaluation and integration but offered a transferable framework for responsible innovation under General Data Protection Regulation and the European Union Artificial Intelligence Act.
Conclusions: The GAHH demonstrated that GenAI can be safely and effectively applied in healthcare with rigorous governance and interdisciplinary collaboration, offering a scalable model for responsible AI innovation.
目的:描述一个多学科的、基于伦理的黑客马拉松的实施,作为一个模型来开发和评估生成人工智能(GenAI)解决方案,以应对医院环境中现实世界的临床挑战。方法:在Clínic de Barcelona医院组织的GenAI健康黑客马拉松(GAHH)包括13项挑战,这些挑战是通过基于临床影响、可行性和数据可用性的内部呼吁选择的。参与者通过安全的云环境访问匿名的真实数据。团队使用大型语言模型和检索增强生成来构建解决诸如临床文本结构、决策支持和工作流自动化等任务的原型。强调了人在循环验证、可解释性和监管保障。结果:黑客马拉松产生了多个在真实数据上测试的AI原型。结果各不相同:实体识别准确率达到90.5%,总结准确率达到90%,临床医师一致性和营养模型的F1评分达到0.75-0.93。低分数(f1)讨论:跨学科合作和真实世界的测试被证明是使GenAI符合临床需求的关键。黑客马拉松揭示了偏见、评估和整合方面的挑战,但根据《通用数据保护条例》和《欧盟人工智能法案》,为负责任的创新提供了一个可转移的框架。结论:GAHH表明,通过严格的治理和跨学科合作,GenAI可以安全有效地应用于医疗保健领域,为负责任的人工智能创新提供了可扩展的模式。
{"title":"Bridging generative AI and healthcare practice: insights from the GenAI Health Hackathon at Hospital Clínic de Barcelona.","authors":"Santiago Frid, Octavi Bassegoda, Maria Araceli Camacho Mahamud, Gemma Sanjuan, Miguel Ángel Armengol de la Hoz, Leo Celi, Isaac Cano Franco, Gerard Anmella, Tomas Cuñat López, Ana Lucía Arellano, Lina María Leguízamo-Martínez, Laura Mezquita, Petter Axcell Peñafiel Macías, Antonio Gallardo-Pizarro, Ruben González Colom, Arturo Renú Jornet, Guillem Bracons Cucó, Xavier Borrat Frigola","doi":"10.1136/bmjhci-2025-101640","DOIUrl":"10.1136/bmjhci-2025-101640","url":null,"abstract":"<p><strong>Objectives: </strong>To describe the implementation of a multidisciplinary, ethically grounded hackathon as a model to develop and evaluate generative AI (GenAI) solutions for real-world clinical challenges within a hospital setting.</p><p><strong>Methods: </strong>The GenAI Health Hackathon (GAHH) organised at Hospital Clínic de Barcelona included 13 challenges were selected via an internal call based on clinical impact, feasibility and data availability. Participants accessed anonymised real-world data through a secure cloud environment. Teams employed large language models and retrieval-augmented generation to build prototypes addressing tasks such as clinical text structuring, decision support and workflow automation. Human-in-the-loop validation, explainability and regulatory safeguards were emphasised.</p><p><strong>Results: </strong>The hackathon yielded multiple AI prototypes tested on real data. Results varied: entity recognition reached 90.5% accuracy, summarisation >90% clinician concordance and nutritional models achieved F1 scores of 0.75-0.93. Lower scores (F1<0.52, Jaccard Index <0.4) were seen in complex reasoning or multilingual tasks. Bias was explored in 10 projects, with mitigations such as stratified sampling, prompt tuning, disclaimers and expert oversight. A transferable framework was proposed to replicate responsible GenAI hackathons in clinical contexts.</p><p><strong>Discussion: </strong>Interdisciplinary collaboration and real-world testing proved essential for aligning GenAI with clinical needs. The hackathon revealed challenges in bias, evaluation and integration but offered a transferable framework for responsible innovation under General Data Protection Regulation and the European Union Artificial Intelligence Act.</p><p><strong>Conclusions: </strong>The GAHH demonstrated that GenAI can be safely and effectively applied in healthcare with rigorous governance and interdisciplinary collaboration, offering a scalable model for responsible AI innovation.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12530383/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145298316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-15DOI: 10.1136/bmjhci-2025-101632
Marcello Di Pumpo, Maria Rosaria Gualano, Danilo Buonsenso, Francesca Raffaelli, Daniele Donà, Vittorio Maio, Patrizia Laurenti, Walter Ricciardi, Leonardo Villani
Objectives: Antimicrobial resistance is a critical public health threat. Large language models (LLMs) show great capability for providing health information. This study evaluates the effectiveness of LLMs in providing information on antibiotic use and infection management.
Methods: Using a mixed-method approach, responses to healthcare expert-designed scenarios from ChatGPT 3.5, ChatGPT 4.0, Claude 2.0 and Gemini 1.0, in both Italian and English, were analysed. Computational text analysis assessed readability, lexical diversity and sentiment, while content quality was assessed by three experts via DISCERN tool.
Results: 16 scenarios were developed. A total of 101 outputs and 5454 Likert-scale (1-5) scores were obtained for the analysis. A general positive performance gradient was found from ChatGPT 3.5 and 4.0 to Claude to Gemini. Gemini, although producing only five outputs before self-inhibition, consistently outperformed the other models across almost all metrics, producing more detailed, accessible, varied content and a positive overtone. ChatGPT 4.0 demonstrated the highest lexical diversity. A difference in performance by language was observed. All models showed a median score of 1 (IQR=2) regarding the domain addressing antimicrobial resistance.
Discussion: The study highlights a positive performance gradient towards Gemini, which showed superior content quality, accessibility and contextual awareness, although acknowledging its smaller dataset. Generating appropriate content to address antimicrobial resistance proved challenging.
Conclusions: LLMs offer great promise to provide appropriate medical information. However, they should play a supporting role rather than representing a replacement option for medical professionals, confirming the need for expert oversight and improved artificial intelligence design.
{"title":"Large language models as information providers for appropriate antimicrobial use: computational text analysis and expert-rated comparison of ChatGPT, Claude and Gemini.","authors":"Marcello Di Pumpo, Maria Rosaria Gualano, Danilo Buonsenso, Francesca Raffaelli, Daniele Donà, Vittorio Maio, Patrizia Laurenti, Walter Ricciardi, Leonardo Villani","doi":"10.1136/bmjhci-2025-101632","DOIUrl":"10.1136/bmjhci-2025-101632","url":null,"abstract":"<p><strong>Objectives: </strong>Antimicrobial resistance is a critical public health threat. Large language models (LLMs) show great capability for providing health information. This study evaluates the effectiveness of LLMs in providing information on antibiotic use and infection management.</p><p><strong>Methods: </strong>Using a mixed-method approach, responses to healthcare expert-designed scenarios from ChatGPT 3.5, ChatGPT 4.0, Claude 2.0 and Gemini 1.0, in both Italian and English, were analysed. Computational text analysis assessed readability, lexical diversity and sentiment, while content quality was assessed by three experts via DISCERN tool.</p><p><strong>Results: </strong>16 scenarios were developed. A total of 101 outputs and 5454 Likert-scale (1-5) scores were obtained for the analysis. A general positive performance gradient was found from ChatGPT 3.5 and 4.0 to Claude to Gemini. Gemini, although producing only five outputs before self-inhibition, consistently outperformed the other models across almost all metrics, producing more detailed, accessible, varied content and a positive overtone. ChatGPT 4.0 demonstrated the highest lexical diversity. A difference in performance by language was observed. All models showed a median score of 1 (IQR=2) regarding the domain addressing antimicrobial resistance.</p><p><strong>Discussion: </strong>The study highlights a positive performance gradient towards Gemini, which showed superior content quality, accessibility and contextual awareness, although acknowledging its smaller dataset. Generating appropriate content to address antimicrobial resistance proved challenging.</p><p><strong>Conclusions: </strong>LLMs offer great promise to provide appropriate medical information. However, they should play a supporting role rather than representing a replacement option for medical professionals, confirming the need for expert oversight and improved artificial intelligence design.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12530423/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145298349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}