Pub Date : 2025-07-09DOI: 10.1016/j.mcpdig.2025.100249
Jonathan Nover MBA, RN , Matthew Bai MD , Prem Tismina MS , Ganesh Raut MS , Dhavalkumar Patel MS , Girish N. Nadkarni MD, MPH , Benjamin S. Abella MD, MPhil , Eyal Klang MD , Robert Freeman DNP, RN, NE-BC
Objective
To prospectively compare nurse predictions with a machine learning (ML) model for hospital admissions and evaluate whether adding the nurse prediction to ML outputs enhances predictive performance.
Patients and Methods
In this prospective, observational study at 6 hospitals in a large mixed quaternary/community emergency department (ED) system (annual ED census ∼500,000), triage nurses recorded a binary admission prediction for adult patients. These predictions were compared with an ensemble ML model (XGBoost + Bio-Clinical BERT) trained on structured data (demographics, vital signs, and medical history) and triage text. Nurse predictions were similarly analyzed and then integrated with the ML output to assess for improvement in predictive accuracy.
Results
The ensemble ML model (XGBoost + Bio-Clinical BERT) was trained on 1.8 million historical ED visits (January 2019 to December 2023). It was then tested on 46,912 prospective ED visits with recorded nurse predictions (September 1, 2024 to October 31, 2024). In the prospective arm, nurse predictions yielded an accuracy of 81.6% (95% CI, 81.3-81.9), a sensitivity of 64.8% (63.7-65.8), and a specificity of 85.7% (85.3-86.0). At a 0.30 probability threshold, the ML model attained an accuracy of 85.4% (85.0-85.7) and a sensitivity of 70.8% (69.8-71.7). Combining nurse predictions with the ML output did not improve accuracy beyond the model alone.
Conclusion
Machine learning-based predictions outperformed triage nurse estimates for hospital admissions. These findings suggest that an admission prediction system anchored by ML can perform reliably using data available at triage.
{"title":"Comparing Machine Learning and Nurse Predictions for Hospital Admissions in a Multisite Emergency Care System","authors":"Jonathan Nover MBA, RN , Matthew Bai MD , Prem Tismina MS , Ganesh Raut MS , Dhavalkumar Patel MS , Girish N. Nadkarni MD, MPH , Benjamin S. Abella MD, MPhil , Eyal Klang MD , Robert Freeman DNP, RN, NE-BC","doi":"10.1016/j.mcpdig.2025.100249","DOIUrl":"10.1016/j.mcpdig.2025.100249","url":null,"abstract":"<div><h3>Objective</h3><div>To prospectively compare nurse predictions with a machine learning (ML) model for hospital admissions and evaluate whether adding the nurse prediction to ML outputs enhances predictive performance.</div></div><div><h3>Patients and Methods</h3><div>In this prospective, observational study at 6 hospitals in a large mixed quaternary/community emergency department (ED) system (annual ED census ∼500,000), triage nurses recorded a binary admission prediction for adult patients. These predictions were compared with an ensemble ML model (XGBoost + Bio-Clinical BERT) trained on structured data (demographics, vital signs, and medical history) and triage text. Nurse predictions were similarly analyzed and then integrated with the ML output to assess for improvement in predictive accuracy.</div></div><div><h3>Results</h3><div>The ensemble ML model (XGBoost + Bio-Clinical BERT) was trained on 1.8 million historical ED visits (January 2019 to December 2023). It was then tested on 46,912 prospective ED visits with recorded nurse predictions (September 1, 2024 to October 31, 2024). In the prospective arm, nurse predictions yielded an accuracy of 81.6% (95% CI, 81.3-81.9), a sensitivity of 64.8% (63.7-65.8), and a specificity of 85.7% (85.3-86.0). At a 0.30 probability threshold, the ML model attained an accuracy of 85.4% (85.0-85.7) and a sensitivity of 70.8% (69.8-71.7). Combining nurse predictions with the ML output did not improve accuracy beyond the model alone.</div></div><div><h3>Conclusion</h3><div>Machine learning-based predictions outperformed triage nurse estimates for hospital admissions. These findings suggest that an admission prediction system anchored by ML can perform reliably using data available at triage.</div></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"3 3","pages":"Article 100249"},"PeriodicalIF":0.0,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144739648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-08DOI: 10.1016/j.mcpdig.2025.100251
Lovisa Hellsten MPH , Viktor H. Ahlqvist PhD , Anna M. Nielsen RM, PhD , Gunnar Brandén PhD , Anna Mia Ekström MD, PhD , Kyriaki Kosidou MD, PhD
Objective
To examine uptake of in-person and digital sexual and reproductive health (SRH) services among adolescents and young adults, quantify uptake across time, and explore whether the introduction of digital services affected the sociodemographic composition of users.
Patients and Methods
This Swedish total population study included all Stockholm residents aged 12-22 years between January 1st 2018 and December 31st 2022. The primary outcome was in-person or digital visits (chat and video) of SRH services within a year, identified using regional health care registries. Sociodemographic predictors included sex, age, migrant background, parental education, and household income, analyzed with repeated-measures multivariable regressions.
Results
Among the 454,405 individuals, 23.96% had at some point used SRH services (80.01% women) between 2018 and 2022. In-person visits remained the predominant mode of contact. Women had higher annual utilization rate of both in-person (women: 15.27%; 95% CI, 15.13-15.40; men: 1.75%; 95% CI, 1.72-1.78) and digital visits (women: 2.23%; 95% CI, 2.16-2.30; men: 0.12%; 95% CI, 0.11-0.13). Significantly lower uptake was also observed in the lowest income quintile (digital: adjusted odds ratio [aOR], 0.34; 95% CI, 0.31-0.36; in-person: aOR, 0.43; 95% CI, 0.42-0.45) compared with the highest quintile (reference group). Among digital visits, chat was more equitably used than video consultations across sociodemographic groups, including smaller differences between the highest and lowest income quintiles (chat: aOR, 0.59; 95% CI, 0.54-0.65; video: aOR, 0.25; 95% CI, 0.23-0.27). Only modest reductions in socioeconomic disparities were observed after the introduction of digital services.
Conclusions
Sociodemographic disparities in utilization were not alleviated by the introduction of digital visits; in-person users were also the primary digital users. Chat could be more equitable than video, but further research is needed.
{"title":"Youth Uptake of Digital Sexual and Reproductive Health Services Across Sociodemographic Groups (2018-2022): A Total Population Study from Stockholm, Sweden","authors":"Lovisa Hellsten MPH , Viktor H. Ahlqvist PhD , Anna M. Nielsen RM, PhD , Gunnar Brandén PhD , Anna Mia Ekström MD, PhD , Kyriaki Kosidou MD, PhD","doi":"10.1016/j.mcpdig.2025.100251","DOIUrl":"10.1016/j.mcpdig.2025.100251","url":null,"abstract":"<div><h3>Objective</h3><div>To examine uptake of in-person and digital sexual and reproductive health (SRH) services among adolescents and young adults, quantify uptake across time, and explore whether the introduction of digital services affected the sociodemographic composition of users.</div></div><div><h3>Patients and Methods</h3><div>This Swedish total population study included all Stockholm residents aged 12-22 years between January 1st 2018 and December 31st 2022. The primary outcome was in-person or digital visits (chat and video) of SRH services within a year, identified using regional health care registries. Sociodemographic predictors included sex, age, migrant background, parental education, and household income, analyzed with repeated-measures multivariable regressions.</div></div><div><h3>Results</h3><div>Among the 454,405 individuals, 23.96% had at some point used SRH services (80.01% women) between 2018 and 2022. In-person visits remained the predominant mode of contact. Women had higher annual utilization rate of both in-person (women: 15.27%; 95% CI, 15.13-15.40; men: 1.75%; 95% CI, 1.72-1.78) and digital visits (women: 2.23%; 95% CI, 2.16-2.30; men: 0.12%; 95% CI, 0.11-0.13). Significantly lower uptake was also observed in the lowest income quintile (digital: adjusted odds ratio [aOR], 0.34; 95% CI, 0.31-0.36; in-person: aOR, 0.43; 95% CI, 0.42-0.45) compared with the highest quintile (reference group). Among digital visits, chat was more equitably used than video consultations across sociodemographic groups, including smaller differences between the highest and lowest income quintiles (chat: aOR, 0.59; 95% CI, 0.54-0.65; video: aOR, 0.25; 95% CI, 0.23-0.27). Only modest reductions in socioeconomic disparities were observed after the introduction of digital services.</div></div><div><h3>Conclusions</h3><div>Sociodemographic disparities in utilization were not alleviated by the introduction of digital visits; in-person users were also the primary digital users. Chat could be more equitable than video, but further research is needed.</div></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"3 3","pages":"Article 100251"},"PeriodicalIF":0.0,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144724163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-07DOI: 10.1016/j.mcpdig.2025.100250
Maria Normand Larsen MD , Tatjana Sandreva Dreisig MD , Maja Kjær Rasmussen MSc , Anders N.Ø. Schultz MD , Thyge Lynghøj Nielsen MD, PhD , Thea K. Fischer MD, DMSc
Given the imbalance between high care demand and strained hospital capacity, hospital-at-home (HaH) models offer a potential solution by providing hospital-level care in patients’ homes. This scoping review maps the literature on hospital-led virtual health care within HaH models for acute infections, focusing on intervention characteristics and evaluation designs. Following Johanna Briggs Institute guidelines and PRISMA-ScR, we included studies on virtual and hybrid HaH models using telemedicine for remote monitoring and interventions. The literature searches were performed from October 3, 2022 to October 22, 2022, and updated on July 11, 2024 and identified 15,062 potentially relevant records. From these, 79 studies met the inclusion criteria, highlighting the diversity of HaH models and their evaluations. Hybrid models provided broader treatment options, but many studies lacked detailed intervention descriptions, complicating implementation and meta-analyses. Most studies evaluated patient outcomes, with limited attention to health care staff and relatives. Nearly 45,000 participants were assessed, but only 254 participated in randomized controlled trials, indicating a need for more high-level evidence. Relevant gaps remain, including model heterogeneity and inconsistent reporting.
{"title":"Virtual Health Care in Hospital-at-Home Models for Patients With Acute Infections: A Scoping Review","authors":"Maria Normand Larsen MD , Tatjana Sandreva Dreisig MD , Maja Kjær Rasmussen MSc , Anders N.Ø. Schultz MD , Thyge Lynghøj Nielsen MD, PhD , Thea K. Fischer MD, DMSc","doi":"10.1016/j.mcpdig.2025.100250","DOIUrl":"10.1016/j.mcpdig.2025.100250","url":null,"abstract":"<div><div>Given the imbalance between high care demand and strained hospital capacity, hospital-at-home (HaH) models offer a potential solution by providing hospital-level care in patients’ homes. This scoping review maps the literature on hospital-led virtual health care within HaH models for acute infections, focusing on intervention characteristics and evaluation designs. Following Johanna Briggs Institute guidelines and PRISMA-ScR, we included studies on virtual and hybrid HaH models using telemedicine for remote monitoring and interventions. The literature searches were performed from October 3, 2022 to October 22, 2022, and updated on July 11, 2024 and identified 15,062 potentially relevant records. From these, 79 studies met the inclusion criteria, highlighting the diversity of HaH models and their evaluations. Hybrid models provided broader treatment options, but many studies lacked detailed intervention descriptions, complicating implementation and meta-analyses. Most studies evaluated patient outcomes, with limited attention to health care staff and relatives. Nearly 45,000 participants were assessed, but only 254 participated in randomized controlled trials, indicating a need for more high-level evidence. Relevant gaps remain, including model heterogeneity and inconsistent reporting.</div></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"3 3","pages":"Article 100250"},"PeriodicalIF":0.0,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144748793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-04DOI: 10.1016/j.mcpdig.2025.100247
Jack B. Korleski MD , Regina M. Koch MD , Thanh P. Ho MD , Steven I. Robinson MBBS , Scott H. Okuno MD , Joerg Herrmann MD , Brittany L. Siontis MD
Objective
The objective of this study was to understand the utility of artificial intelligence-enabled electrocardiogram (AI-ECG) to assess the tolerability of anthracycline chemotherapy.
Patients and Methods
From December 18, 2006 to October 15, 2020, we identified adults with sarcoma who were treated with anthracycline chemotherapy at our institution who had an ECG within 1 year prior to treatment initiation. Utilizing previously defined AI-ECG nomograms, we obtained age and ejection fraction (EF) predictions. Changes in AI-ECG age were correlated with chemotherapy tolerance (the rates of dose reductions, treatment delays, and early discontinuation). We measured the sensitivity and specificity of the ECG to predict an EF of less than 50% or 35% prior to treatment and compared how changes in the AI-ECG EF prediction related to changes in echocardiography-based EF.
Results
Forty patients met the eligibility criteria. Sixty-eight percent of the patients were men. The median age was 56.5 years (18-76 years). We did not find differences in chemotherapy tolerance between patients who had an elevated or decreased ECG age. There was a trend `toward higher rates of dose reductions in patients with high ECG aging (odds ratio, 5.13; P=.32). The AI-ECG low EF prediction had a sensitivity of 100% and a specificity of 94% to isolate patients with an EF of less than 50% prior to treatment. Two patients’ EF decreased more than 10% after treatment, and both cases showed significant increases in the low EF prediction.
Conclusion
Overall, AI-based predictions on ECG tracings could be a way to monitor for decreases in EF during treatment with anthracycline chemotherapy. We recommend further studies to evaluate AI-ECG aging as a marker for chemotherapy tolerance.
{"title":"Predicting Tolerance to Anthracycline Chemotherapy Using Electrocardiogram-Based Artificial Intelligence in Sarcoma","authors":"Jack B. Korleski MD , Regina M. Koch MD , Thanh P. Ho MD , Steven I. Robinson MBBS , Scott H. Okuno MD , Joerg Herrmann MD , Brittany L. Siontis MD","doi":"10.1016/j.mcpdig.2025.100247","DOIUrl":"10.1016/j.mcpdig.2025.100247","url":null,"abstract":"<div><h3>Objective</h3><div>The objective of this study was to understand the utility of artificial intelligence-enabled electrocardiogram (AI-ECG) to assess the tolerability of anthracycline chemotherapy.</div></div><div><h3>Patients and Methods</h3><div>From December 18, 2006 to October 15, 2020, we identified adults with sarcoma who were treated with anthracycline chemotherapy at our institution who had an ECG within 1 year prior to treatment initiation. Utilizing previously defined AI-ECG nomograms, we obtained age and ejection fraction (EF) predictions. Changes in AI-ECG age were correlated with chemotherapy tolerance (the rates of dose reductions, treatment delays, and early discontinuation). We measured the sensitivity and specificity of the ECG to predict an EF of less than 50% or 35% prior to treatment and compared how changes in the AI-ECG EF prediction related to changes in echocardiography-based EF.</div></div><div><h3>Results</h3><div>Forty patients met the eligibility criteria. Sixty-eight percent of the patients were men. The median age was 56.5 years (18-76 years). We did not find differences in chemotherapy tolerance between patients who had an elevated or decreased ECG age. There was a trend `toward higher rates of dose reductions in patients with high ECG aging (odds ratio, 5.13; <em>P</em>=.32). The AI-ECG low EF prediction had a sensitivity of 100% and a specificity of 94% to isolate patients with an EF of less than 50% prior to treatment. Two patients’ EF decreased more than 10% after treatment, and both cases showed significant increases in the low EF prediction.</div></div><div><h3>Conclusion</h3><div>Overall, AI-based predictions on ECG tracings could be a way to monitor for decreases in EF during treatment with anthracycline chemotherapy. We recommend further studies to evaluate AI-ECG aging as a marker for chemotherapy tolerance.</div></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"3 3","pages":"Article 100247"},"PeriodicalIF":0.0,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144686358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-26DOI: 10.1016/j.mcpdig.2025.100246
Mike Zack MD, PhD, MPH, Danil N. Stupichev MSc, Alex J. Moore BSc, Ioan D. Slobodchikov MSc, David G. Sokolov MSc, Igor F. Trifonov MSc, Allan Gobbs MSc
Pharmacogenomics is entering a transformative phase as high-throughput “omics” techniques become increasingly integrated with state-of-the-art artificial intelligence (AI) methods. Although early successes in single-gene pharmacogenetics reported clear clinical benefits, many drug response phenotypes are governed by intricate networks of genomic variants, epigenetic modifications, and metabolic pathways. Multi-omics approaches address this complexity by capturing genomic, transcriptomic, proteomic, and metabolomic data layers, offering a comprehensive view of patient-specific biology. Advanced AI models, including deep neural networks, graph neural networks, and representation learning techniques, further enhance this landscape by detecting hidden patterns, filling gaps in incomplete data sets, and enabling in silico simulations of treatment responses. Such capabilities not only improve predictive accuracy but also deepen mechanistic insights, revealing how gene–gene and gene–environment interactions shape therapeutic outcomes. At the same time, real-world data from diverse patient populations is broadening the evidence base, underscoring the importance of inclusive datasets and population-specific algorithms to reduce health disparities. Despite challenges related to data harmonization, interpretability, and regulatory oversight, the synergy between multi-omics integration and AI-driven analytics holds relevant promise for revolutionizing clinical decision-making. In this review, we highlighted key technological advances, discussed current limitations, and outlined future directions for translating multi-omics plus AI innovations into routine personalized medicine.
{"title":"Artificial Intelligence and Multi-Omics in Pharmacogenomics: A New Era of Precision Medicine","authors":"Mike Zack MD, PhD, MPH, Danil N. Stupichev MSc, Alex J. Moore BSc, Ioan D. Slobodchikov MSc, David G. Sokolov MSc, Igor F. Trifonov MSc, Allan Gobbs MSc","doi":"10.1016/j.mcpdig.2025.100246","DOIUrl":"10.1016/j.mcpdig.2025.100246","url":null,"abstract":"<div><div>Pharmacogenomics is entering a transformative phase as high-throughput “omics” techniques become increasingly integrated with state-of-the-art artificial intelligence (AI) methods. Although early successes in single-gene pharmacogenetics reported clear clinical benefits, many drug response phenotypes are governed by intricate networks of genomic variants, epigenetic modifications, and metabolic pathways. Multi-omics approaches address this complexity by capturing genomic, transcriptomic, proteomic, and metabolomic data layers, offering a comprehensive view of patient-specific biology. Advanced AI models, including deep neural networks, graph neural networks, and representation learning techniques, further enhance this landscape by detecting hidden patterns, filling gaps in incomplete data sets, and enabling in silico simulations of treatment responses. Such capabilities not only improve predictive accuracy but also deepen mechanistic insights, revealing how gene–gene and gene–environment interactions shape therapeutic outcomes. At the same time, real-world data from diverse patient populations is broadening the evidence base, underscoring the importance of inclusive datasets and population-specific algorithms to reduce health disparities. Despite challenges related to data harmonization, interpretability, and regulatory oversight, the synergy between multi-omics integration and AI-driven analytics holds relevant promise for revolutionizing clinical decision-making. In this review, we highlighted key technological advances, discussed current limitations, and outlined future directions for translating multi-omics plus AI innovations into routine personalized medicine.</div></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"3 3","pages":"Article 100246"},"PeriodicalIF":0.0,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144662090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
To introduce a novel, artificial Intelligence (AI), deep learning-based application for automated diagnosis of female genital schistosomiasis (FGS), a disease estimated to affect around 56 million women and girls in sub-Saharan Africa.
Patients and Methods
This study focused on cervical images collected from a high endemic FGS area in Cameroon, from August 1, 2020 to August 31, 2021. We applied the You Only Look Once deep learning model and employed a 5-fold cross-validation approach, accompanied by sensitivity analysis, to optimize model performance.
Results
The model achieved a sensitivity of 0.96 (76/78) and an accuracy of 0.78 (97/125), demonstrating improved performance over an existing, non-AI-based, computerized image diagnostic method, which has a sensitivity of 0.94 (73/78) but an accuracy of 0.58 (73/125) on the same dataset. In addition, the AI model significantly reduced processing time, decreasing from 47 minutes to under 90 seconds for testing 250 images.
Conclusion
This study highlights the potential of deep learning-based models for automated diagnosis for FGS while reducing the reliance on specialized clinical expertise. It also underscores the need for further work to address current limitations of such AI-based methods for FGS diagnosis.
目的介绍一种基于人工智能(AI)深度学习的新型应用程序,用于自动诊断女性生殖器血吸虫病(FGS),这种疾病估计影响撒哈拉以南非洲约5600万妇女和女孩。患者和方法本研究集中于2020年8月1日至2021年8月31日在喀麦隆FGS高发地区采集的宫颈图像。我们采用You Only Look Once深度学习模型,并采用5倍交叉验证方法,并辅以敏感性分析,以优化模型性能。结果该模型的灵敏度为0.96(76/78),准确度为0.78(97/125),与现有的非基于人工智能的计算机图像诊断方法相比,该模型的性能有所提高,该方法在同一数据集上的灵敏度为0.94(73/78),准确度为0.58(73/125)。此外,AI模型显著缩短了处理时间,在测试250张图像时从47分钟减少到90秒以下。本研究强调了基于深度学习的模型在FGS自动诊断中的潜力,同时减少了对专业临床专家的依赖。它还强调需要进一步开展工作,以解决目前这种基于人工智能的FGS诊断方法的局限性。
{"title":"Artificial Intelligence Image-Diagnosis for Female Genital Schistosomiasis","authors":"Jiayuan Zhu MSc , J. Alison Noble DPhil , Mireille Gomes DPhil","doi":"10.1016/j.mcpdig.2025.100245","DOIUrl":"10.1016/j.mcpdig.2025.100245","url":null,"abstract":"<div><h3>Objective</h3><div>To introduce a novel, artificial Intelligence (AI), deep learning-based application for automated diagnosis of female genital schistosomiasis (FGS), a disease estimated to affect around 56 million women and girls in sub-Saharan Africa.</div></div><div><h3>Patients and Methods</h3><div>This study focused on cervical images collected from a high endemic FGS area in Cameroon, from August 1, 2020 to August 31, 2021. We applied the You Only Look Once deep learning model and employed a 5-fold cross-validation approach, accompanied by sensitivity analysis, to optimize model performance.</div></div><div><h3>Results</h3><div>The model achieved a sensitivity of 0.96 (76/78) and an accuracy of 0.78 (97/125), demonstrating improved performance over an existing, non-AI-based, computerized image diagnostic method, which has a sensitivity of 0.94 (73/78) but an accuracy of 0.58 (73/125) on the same dataset. In addition, the AI model significantly reduced processing time, decreasing from 47 minutes to under 90 seconds for testing 250 images.</div></div><div><h3>Conclusion</h3><div>This study highlights the potential of deep learning-based models for automated diagnosis for FGS while reducing the reliance on specialized clinical expertise. It also underscores the need for further work to address current limitations of such AI-based methods for FGS diagnosis.</div></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"3 3","pages":"Article 100245"},"PeriodicalIF":0.0,"publicationDate":"2025-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144605925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-11DOI: 10.1016/j.mcpdig.2025.100243
Samuel E. Antia MD, MSc , Collins N. Ugwu MD, MSc , Vishal Ghodka BE , Babangida S. Chori MSc , Muhammad S. Nazir MD, MSc , Chizoba A. Odili PhD , Godsent C. Isiguzo MD, PhD , Sri Vasireddy MS, MBA , Augustine N. Odili MD, PhD
Objective
To evaluate the feasibility, usability, and efficacy of innovative generative pretrained transformer chatbot in improving self-care in hypertensive patients in a resource-limited setting.
Patients and Methods
A single-arm nonblinded clinical trial was deployed in a busy cardiology clinic in a low-resource setting. Artificial intelligence–enabled chatbot (Healthy Heart Assistant) was activated in smartphones of 50 adults on treatment for hypertension. Participants were trained on how to use the Healthy Heart Assistant including setting medication and appointment reminders. Baseline questionnaires were administered at enrollment and 30 days later to explore acceptability, feasibility and usability of the bot. We used chatbot usability questionnaire and self-made Healthy Heart Assistant satisfaction questionnaire to assess bot usability and patients’ satisfaction, respectively. The study began on April 5, 2024, through July 15, 2024.
Results
Of 200 hypertensive clinic attendees, 70 (35%) had internet-enabled bot-compatible cell phones, of which 50 hypertensive patients were recruited to participate in the study. Among 50 participants, 2 (4%) were lost to follow-up; 19 (39.6%) were women; and 40 (83.3%) had attained tertiary level of education. Mean time of training to use bot was 5.7 minutes, with 35 (70.8%) of participants being able to use the bot within 5 minutes. The median frequency of chats for participants within the timeframe was an average of 1.5 chats/day. Chatbot usability questionnaire score was 69.5%, whereas self-made Healthy Heart Assistant satisfaction questionnaire score was 90%.
Conclusion
This proof-of-concept study shows that generative artificial intelligence can be applied with reasonable success in hypertension self-care in low-resource settings and has potential for being effective.
{"title":"Healthy Heart Assistant, a WhatsApp-Based Generative Pretrained Transformer Technology, for Self-Care in Hypertensive Patients","authors":"Samuel E. Antia MD, MSc , Collins N. Ugwu MD, MSc , Vishal Ghodka BE , Babangida S. Chori MSc , Muhammad S. Nazir MD, MSc , Chizoba A. Odili PhD , Godsent C. Isiguzo MD, PhD , Sri Vasireddy MS, MBA , Augustine N. Odili MD, PhD","doi":"10.1016/j.mcpdig.2025.100243","DOIUrl":"10.1016/j.mcpdig.2025.100243","url":null,"abstract":"<div><h3>Objective</h3><div>To evaluate the feasibility, usability, and efficacy of innovative generative pretrained transformer chatbot in improving self-care in hypertensive patients in a resource-limited setting.</div></div><div><h3>Patients and Methods</h3><div>A single-arm nonblinded clinical trial was deployed in a busy cardiology clinic in a low-resource setting. Artificial intelligence–enabled chatbot (Healthy Heart Assistant) was activated in smartphones of 50 adults on treatment for hypertension. Participants were trained on how to use the Healthy Heart Assistant including setting medication and appointment reminders. Baseline questionnaires were administered at enrollment and 30 days later to explore acceptability, feasibility and usability of the bot. We used chatbot usability questionnaire and self-made Healthy Heart Assistant satisfaction questionnaire to assess bot usability and patients’ satisfaction, respectively. The study began on April 5, 2024, through July 15, 2024.</div></div><div><h3>Results</h3><div>Of 200 hypertensive clinic attendees, 70 (35%) had internet-enabled bot-compatible cell phones, of which 50 hypertensive patients were recruited to participate in the study. Among 50 participants, 2 (4%) were lost to follow-up; 19 (39.6%) were women; and 40 (83.3%) had attained tertiary level of education. Mean time of training to use bot was 5.7 minutes, with 35 (70.8%) of participants being able to use the bot within 5 minutes. The median frequency of chats for participants within the timeframe was an average of 1.5 chats/day. Chatbot usability questionnaire score was 69.5%, whereas self-made Healthy Heart Assistant satisfaction questionnaire score was 90%.</div></div><div><h3>Conclusion</h3><div>This proof-of-concept study shows that generative artificial intelligence can be applied with reasonable success in hypertension self-care in low-resource settings and has potential for being effective.</div></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"3 3","pages":"Article 100243"},"PeriodicalIF":0.0,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144549292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Digital self-diagnosis tools, or symptom checkers, many of which incorporate artificial intelligence technology, are intended to provide diagnostic information and triage advice to lay users. This narrative overview of reviews explores the common themes and issues raised by existing evidence synthesis literature on these tools to establish a common ground for interdisciplinary research. We searched 3 bibliographic databases (PubMed, Scopus and Web of Science) and Google Scholar using keyword combinations of artificial, self-diagnosis, intelligence, and machine learning for publications from 2019 to 2023. We included systematic reviews, meta-analyses, scoping reviews, narrative syntheses, and opinion pieces that discussed tools where users proactively entered personal health information to acquire a predicted diagnosis of their symptoms or triage advice. This overview reveals significant gaps in understanding the key areas of development, implementation, impact, and oversight of digital self-diagnosis tools. Additionally, the terminology used to describe these tools and their underlying technologies varies widely, encompassing technologies ranging from simple branching logic algorithms to complex deep neural networks. Our interdisciplinary analysis identified gaps and critical areas for future research across all stages of the lifecycles of these tools. The diverse challenges uncovered highlight the necessity for multiagency and multidisciplinary efforts promoting responsible development and implementation.
数字自我诊断工具或症状检查器,其中许多都结合了人工智能技术,旨在为非专业用户提供诊断信息和分类建议。本综述探讨了现有证据综合文献中关于这些工具提出的共同主题和问题,以建立跨学科研究的共同基础。我们检索了3个书目数据库(PubMed、Scopus和Web of Science)和谷歌Scholar,使用人工、自我诊断、智能和机器学习的关键字组合检索了2019年至2023年的出版物。我们纳入了系统评价、荟萃分析、范围评价、叙述综合和意见片段,讨论了用户主动输入个人健康信息以获得对其症状的预测诊断或分诊建议的工具。这一概述揭示了在理解数字自我诊断工具的开发、实施、影响和监督的关键领域方面的重大差距。此外,用于描述这些工具及其底层技术的术语差异很大,包括从简单的分支逻辑算法到复杂的深度神经网络等技术。我们的跨学科分析确定了这些工具生命周期所有阶段的差距和未来研究的关键领域。所发现的各种挑战突出了多机构和多学科努力促进负责任的发展和执行的必要性。
{"title":"Artificial Intelligence in Digital Self-Diagnosis Tools: A Narrative Overview of Reviews","authors":"Aikaterini Mentzou PhD , Amy Rogers MD , Edzia Carvalho PhD , Angela Daly PhD , Maeve Malone HDip , Xaroula Kerasidou PhD","doi":"10.1016/j.mcpdig.2025.100242","DOIUrl":"10.1016/j.mcpdig.2025.100242","url":null,"abstract":"<div><div>Digital self-diagnosis tools, or symptom checkers, many of which incorporate artificial intelligence technology, are intended to provide diagnostic information and triage advice to lay users. This narrative overview of reviews explores the common themes and issues raised by existing evidence synthesis literature on these tools to establish a common ground for interdisciplinary research. We searched 3 bibliographic databases (PubMed, Scopus and Web of Science) and Google Scholar using keyword combinations of <em>artificial</em>, <em>self-diagnosis</em>, <em>intelligence</em>, and <em>machine learning</em> for publications from 2019 to 2023. We included systematic reviews, meta-analyses, scoping reviews, narrative syntheses, and opinion pieces that discussed tools where users proactively entered personal health information to acquire a predicted diagnosis of their symptoms or triage advice. This overview reveals significant gaps in understanding the key areas of development, implementation, impact, and oversight of digital self-diagnosis tools. Additionally, the terminology used to describe these tools and their underlying technologies varies widely, encompassing technologies ranging from simple branching logic algorithms to complex deep neural networks. Our interdisciplinary analysis identified gaps and critical areas for future research across all stages of the lifecycles of these tools. The diverse challenges uncovered highlight the necessity for multiagency and multidisciplinary efforts promoting responsible development and implementation.</div></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"3 3","pages":"Article 100242"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144534516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-10DOI: 10.1016/j.mcpdig.2025.100244
Juan Pablo Botero-Aguirre MS , Michael Andrés García-Rivera MS
Objective
To develop and validate a named entity recognition (NER) model based on BERT-based model trained on Spanish-language corpor, for extracting allergy-related information from unstructured electronic health records.
Patients and Methods
The model was fine-tuned using 16,176 manually annotated allergy-related entities from anonimized patient records (hospitalized patients between January 1, 2021, and June 30, 2024). The data set was divided into training (80%) and testing (20%) subsets, and model performance was evaluated using accuracy, recall, and F1 score. The validated model was applied to another data set with 80,917 medication prescriptions from 5859 hospitalized patients with at least one prescribed medication (during August and September 2024) to detect potential prescription errors. Sensitivity, specificity, and Cohen κ were calculated using manual expert review as the gold standard.
Results
The model achieved an accuracy of 87.28% and an F1 score of 0.80. It effectively identified medication names (F1=0.91) and adverse reactions (F1=0.85) but struggled with recommendation-related entities (F1=0.29). The model detected prescription errors in 0.96% of cases, with a sensitivity of 75.73% and specificity of 99.98%. The weighted κ score (0.7797) indicated substantial agreement with expert annotations.
Conclusion
The BERT-based model trained on Spanish-language corpora–based NER model demonstrated strong performance in identifying nonallergic cases (specificity, 99.98%; negative predictive value, 99.97%) and showed promise for clinical decision support. Despite moderate sensitivity (75.73%), these results highlight the feasibility of using Spanish-language NER models to enhance medication safety.
{"title":"Natural Language Processing for Enhanced Clinical Decision Support in Allergy Verification for Medication Prescriptions","authors":"Juan Pablo Botero-Aguirre MS , Michael Andrés García-Rivera MS","doi":"10.1016/j.mcpdig.2025.100244","DOIUrl":"10.1016/j.mcpdig.2025.100244","url":null,"abstract":"<div><h3>Objective</h3><div>To develop and validate a named entity recognition (NER) model based on BERT-based model trained on Spanish-language corpor, for extracting allergy-related information from unstructured electronic health records.</div></div><div><h3>Patients and Methods</h3><div>The model was fine-tuned using 16,176 manually annotated allergy-related entities from anonimized patient records (hospitalized patients between January 1, 2021, and June 30, 2024). The data set was divided into training (80%) and testing (20%) subsets, and model performance was evaluated using accuracy, recall, and F1 score. The validated model was applied to another data set with 80,917 medication prescriptions from 5859 hospitalized patients with at least one prescribed medication (during August and September 2024) to detect potential prescription errors. Sensitivity, specificity, and Cohen κ were calculated using manual expert review as the gold standard.</div></div><div><h3>Results</h3><div>The model achieved an accuracy of 87.28% and an F1 score of 0.80. It effectively identified medication names (F1=0.91) and adverse reactions (F1=0.85) but struggled with recommendation-related entities (F1=0.29). The model detected prescription errors in 0.96% of cases, with a sensitivity of 75.73% and specificity of 99.98%. The weighted κ score (0.7797) indicated substantial agreement with expert annotations.</div></div><div><h3>Conclusion</h3><div>The BERT-based model trained on Spanish-language corpora–based NER model demonstrated strong performance in identifying nonallergic cases (specificity, 99.98%; negative predictive value, 99.97%) and showed promise for clinical decision support. Despite moderate sensitivity (75.73%), these results highlight the feasibility of using Spanish-language NER models to enhance medication safety.</div></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"3 3","pages":"Article 100244"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144571685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-09DOI: 10.1016/j.mcpdig.2025.100239
Isabell Rabe DMD, MPH , Jan P. Ehlers DVM, MA
Objective
To show the effectiveness of cycle applications in both areas of application—contraception and intended pregnancy.
Methods
A systematic review based on the PubMed and Google Scholar databases, with the addition of a hand search, was conducted from May 11, 2023, through April 11, 2024, to objectively answer this question. Of 1539 sources with matching search terms, 19 sources remained after checking for inclusion criteria according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses scheme. These were analyzed according to an evaluation scale regarding their quality in various areas. The average quality ratings and pregnancy probabilities of the studies were compared.
Results
Comparability within and between the subquestions was hardly possible owing to different presentation of results, bias risks, and mostly uncontrolled study designs. Applications for those wishing to become pregnant provided better quality ratings in some cases. There were indications that cycle applications shorten the time to achieving a desired pregnancy in cases of reduced fertility. In addition, some seem to have a similar contraceptive safety as the contraceptive pill but require significantly more compliance.
Conclusion
Independent, controlled studies with a diverse clientele of test subjects are necessary for a scientific classification. In addition, social, structural, and political adjustments are needed to enable individuals to make informed decisions about the use of cycle and fertility applications.
{"title":"Reliability of Cycle Applications for Pregnancy Planning and Contraception: A Systematic Review","authors":"Isabell Rabe DMD, MPH , Jan P. Ehlers DVM, MA","doi":"10.1016/j.mcpdig.2025.100239","DOIUrl":"10.1016/j.mcpdig.2025.100239","url":null,"abstract":"<div><h3>Objective</h3><div>To show the effectiveness of cycle applications in both areas of application—contraception and intended pregnancy.</div></div><div><h3>Methods</h3><div>A systematic review based on the PubMed and Google Scholar databases, with the addition of a hand search, was conducted from May 11, 2023, through April 11, 2024, to objectively answer this question. Of 1539 sources with matching search terms, 19 sources remained after checking for inclusion criteria according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses scheme. These were analyzed according to an evaluation scale regarding their quality in various areas. The average quality ratings and pregnancy probabilities of the studies were compared.</div></div><div><h3>Results</h3><div>Comparability within and between the subquestions was hardly possible owing to different presentation of results, bias risks, and mostly uncontrolled study designs. Applications for those wishing to become pregnant provided better quality ratings in some cases. There were indications that cycle applications shorten the time to achieving a desired pregnancy in cases of reduced fertility. In addition, some seem to have a similar contraceptive safety as the contraceptive pill but require significantly more compliance.</div></div><div><h3>Conclusion</h3><div>Independent, controlled studies with a diverse clientele of test subjects are necessary for a scientific classification. In addition, social, structural, and political adjustments are needed to enable individuals to make informed decisions about the use of cycle and fertility applications.</div></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"3 4","pages":"Article 100239"},"PeriodicalIF":0.0,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145109430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}