Background: Prior authorization (PA) is a major source of administrative burden, treatment delay, and clinician burnout. Artificial intelligence (AI), particularly large language models (LLMs), is increasingly used to assist with clinical documentation, yet its reliability for payer-facing administrative tasks remains uncertain.
Objective: To evaluate the quality of PA letters drafted by ChatGPT-5 for commonly used medications requiring PA in nephrology. Quality was evaluated based on correctness and strength of clinical reasoning.
Methods: We created a single standardized prompt and applied it across 29 nephrology scenarios to generate PA letters. Each PA letter was reviewed against four criteria: 1) absence of false statements or hallucinations, 2) correctness of ICD-10 coding, 3) presence and validity of citations, and 4) clinical reasoning, rated on a 4-point Likert scale (illogical, weak, adequate and strong). FDA drug labels, KDIGO guidelines and related randomized controlled trials were used as reference standards.
Results: Out of 29 letters, one letter (3.5%) contained false statements mentioning an irrelevant clinical trial. The ICD-10 diagnosis code was correct in 23 letters (79.3%), most errors were related to chronic kidney disease (CKD) staging or internal diagnostic inconsistencies. 27 letters (93.1%) cited valid references, with one letter citing an incorrect trial and another one citing a correct KDIGO guideline with inaccessible link. Twenty-six letters (89.7%) demonstrated strong clinical reasoning, supported by guideline-oriented or FDA label-aligned justification. The remaining 3 letters were rated as adequate reasoning. The main areas for improvement involved citing relevant references and emphasizing special considerations, for example Risk Evaluation and Mitigation Strategy (REMS) compliance for eculizumab.
Conclusions: ChatGPT-5 can generate clinically coherent PA drafts for nephrology medications, but limitations in coding precision and citation reliability persist. With appropriate oversight, AI-assisted documentation may reduce administrative burden while maintaining safety and accuracy.
{"title":"Quality assessment of large language model-generated prior authorization letters in nephrology.","authors":"Noppawit Aiumtrakul, Charat Thongprayoon, Chutawat Kookanok, Methavee Poochanasri, Kitinan Phichedwanichskul, Wisit Cheungpasitporn","doi":"10.3389/fdgth.2026.1767648","DOIUrl":"https://doi.org/10.3389/fdgth.2026.1767648","url":null,"abstract":"<p><strong>Background: </strong>Prior authorization (PA) is a major source of administrative burden, treatment delay, and clinician burnout. Artificial intelligence (AI), particularly large language models (LLMs), is increasingly used to assist with clinical documentation, yet its reliability for payer-facing administrative tasks remains uncertain.</p><p><strong>Objective: </strong>To evaluate the quality of PA letters drafted by ChatGPT-5 for commonly used medications requiring PA in nephrology. Quality was evaluated based on correctness and strength of clinical reasoning.</p><p><strong>Methods: </strong>We created a single standardized prompt and applied it across 29 nephrology scenarios to generate PA letters. Each PA letter was reviewed against four criteria: 1) absence of false statements or hallucinations, 2) correctness of ICD-10 coding, 3) presence and validity of citations, and 4) clinical reasoning, rated on a 4-point Likert scale (illogical, weak, adequate and strong). FDA drug labels, KDIGO guidelines and related randomized controlled trials were used as reference standards.</p><p><strong>Results: </strong>Out of 29 letters, one letter (3.5%) contained false statements mentioning an irrelevant clinical trial. The ICD-10 diagnosis code was correct in 23 letters (79.3%), most errors were related to chronic kidney disease (CKD) staging or internal diagnostic inconsistencies. 27 letters (93.1%) cited valid references, with one letter citing an incorrect trial and another one citing a correct KDIGO guideline with inaccessible link. Twenty-six letters (89.7%) demonstrated strong clinical reasoning, supported by guideline-oriented or FDA label-aligned justification. The remaining 3 letters were rated as adequate reasoning. The main areas for improvement involved citing relevant references and emphasizing special considerations, for example Risk Evaluation and Mitigation Strategy (REMS) compliance for eculizumab.</p><p><strong>Conclusions: </strong>ChatGPT-5 can generate clinically coherent PA drafts for nephrology medications, but limitations in coding precision and citation reliability persist. With appropriate oversight, AI-assisted documentation may reduce administrative burden while maintaining safety and accuracy.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"8 ","pages":"1767648"},"PeriodicalIF":3.2,"publicationDate":"2026-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12992280/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147482637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
While most existing digital immunization systems lack mechanisms to capture high fidelity real-time data to respond to current needs; many others are not designed to support interoperability and data sharing across the continuum of care for the health Sustainable Development Goals (SDGs). In this paper, we used the World Health Organization (WHO) Digital Documentation of COVID-19 Certificates (DDCC) as a proxy to demonstrate and operationalize how an efficient digital immunization system could strengthen service delivery and optimize outcomes for the SDGs. This paper appraises the technical, ethical and cultural considerations for establishing DDCC and how it can be operationalized among national health systems. It demonstrates how digital health investments can support routine immunization for the SDGs and highlights the critical role global health leadership plays in shaping reforms for national digital transformation agenda. The adoption and institutionalization of digital immunization systems offer opportunities to bridge multiple information solutions and strengthen immunization service delivery towards sustainable outcomes for the SDGs. Thus, it is recommended that Development partners and implementers jointly work with governments to shape the national digital health ecosystem that connects multiple healthcare journeys for the sustainable immunization agenda 2030.
{"title":"Globalizing digital immunization systems for the sustainable development goals: a perspective.","authors":"Sunny Ibeneme, Sean Blaschke, Khin Devi Aung, Benson Droti, Ridwan Gustiana, Hillary Kipruto, Basil Rodriques","doi":"10.3389/fdgth.2026.1687302","DOIUrl":"https://doi.org/10.3389/fdgth.2026.1687302","url":null,"abstract":"<p><p>While most existing digital immunization systems lack mechanisms to capture high fidelity real-time data to respond to current needs; many others are not designed to support interoperability and data sharing across the continuum of care for the health Sustainable Development Goals (SDGs). In this paper, we used the World Health Organization (WHO) Digital Documentation of COVID-19 Certificates (DDCC) as a proxy to demonstrate and operationalize how an efficient digital immunization system could strengthen service delivery and optimize outcomes for the SDGs. This paper appraises the technical, ethical and cultural considerations for establishing DDCC and how it can be operationalized among national health systems. It demonstrates how digital health investments can support routine immunization for the SDGs and highlights the critical role global health leadership plays in shaping reforms for national digital transformation agenda. The adoption and institutionalization of digital immunization systems offer opportunities to bridge multiple information solutions and strengthen immunization service delivery towards sustainable outcomes for the SDGs. Thus, it is recommended that Development partners and implementers jointly work with governments to shape the national digital health ecosystem that connects multiple healthcare journeys for the sustainable immunization agenda 2030.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"8 ","pages":"1687302"},"PeriodicalIF":3.2,"publicationDate":"2026-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12993993/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147482668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-03eCollection Date: 2026-01-01DOI: 10.3389/fdgth.2026.1812402
Jan-Willem J R van 't Klooster, Michela Capasso, Daan van Gorssel, Elvis Vrolijk, Giorgio Rettagliata, Demy Gerritsen, Mirjam Hegeman, Emanuele Tauro, Enrico Gianluca Caiani, Harald E Vonkeman
[This corrects the article DOI: 10.3389/fdgth.2025.1653168.].
[这更正了文章DOI: 10.3389/fdgth.2025.1653168.]。
{"title":"Correction: A GPT-reinforced social robot for patient communication: a pilot study.","authors":"Jan-Willem J R van 't Klooster, Michela Capasso, Daan van Gorssel, Elvis Vrolijk, Giorgio Rettagliata, Demy Gerritsen, Mirjam Hegeman, Emanuele Tauro, Enrico Gianluca Caiani, Harald E Vonkeman","doi":"10.3389/fdgth.2026.1812402","DOIUrl":"10.3389/fdgth.2026.1812402","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.3389/fdgth.2025.1653168.].</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"8 ","pages":"1812402"},"PeriodicalIF":3.2,"publicationDate":"2026-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12994146/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147482577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-02eCollection Date: 2026-01-01DOI: 10.3389/fdgth.2026.1774844
Yiping Tong, Yang Li, Chenxi Liu, Xiang Chen, Linbo Xing, Zhiyuan Cao, Yanlei Wang
Background: Short videos have become a primary channel for Lumbar Disc Herniation (LDH) patients to obtain disease knowledge and rehabilitation guidance. Algorithm-driven personalized recommendations may expose patients to heterogeneous LDH-related content, affecting their health anxiety and rehabilitation trajectories.
Objective: This study explored the impacts of LDH-related short video exposure duration and content types on health anxiety and lumbar functional rehabilitation in LDH patients, and verified the mediating role of health anxiety.
Methods: A 6-month prospective cohort study enrolled 213 LDH outpatients from Luoyang Orthopedic-Traumatological Hospital (Jan-Apr 2025). Demographic, clinical and short video usage data were collected. Health anxiety (MCQ-HA) and lumbar function (JOA) were assessed at baseline and follow-up. Pearson correlation, multiple linear regression, subgroup analysis and Bootstrap mediation analysis (5,000 resamplings) were used.
Results: At 6-month follow-up, the mean JOA score decreased from 23.00 ± 1.59 at baseline to 21.96 ± 3.03, and the mean MCQ-HA score increased from 20.77 ± 4.57-21.86 ± 6.14. Pearson correlation analysis showed that daily viewing duration and exposure frequency to awareness-motivation content were significantly negatively correlated with ΔJOA (r = -0.36, r = -0.33; both P < 0.001) and positively correlated with ΔMCQ-HA (r = 0.31, r = 0.34; both P < 0.001). Multiple linear regression indicated that ΔJOA in the >60 min daily viewing group was significantly lower than that in the <30 min group; exposure frequency to awareness-motivation content was independently negatively associated with ΔJOA and positively associated with ΔMCQ-HA (both P < 0.001), with no significant associations found for other content categories (all P > 0.05). Subgroup analysis based on clinical efficacy criteria revealed significant differences in recovery outcomes across viewing duration groups (χ2 = 18.75, P = 0.004). Bootstrap mediation analysis confirmed that ΔMCQ-HA mediated 16.13% of the total effect of daily viewing duration on ΔJOA and 20.80% of the total effect of awareness-motivation content exposure frequency on ΔJOA.
Conclusion: Prolonged short video exposure and frequent awareness-motivation content viewing were associated with poorer rehabilitation and higher health anxiety, with health anxiety partially mediating these relationships, providing empirical evidence for digital health guidance.
背景:短视频已成为腰椎间盘突出症(LDH)患者获取疾病知识和康复指导的主要渠道。算法驱动的个性化推荐可能会使患者接触到与ldl相关的异质性内容,影响他们的健康焦虑和康复轨迹。目的:本研究探讨LDH相关短视频播放时长和内容类型对LDH患者健康焦虑和腰椎功能康复的影响,并验证健康焦虑的中介作用。方法:一项为期6个月的前瞻性队列研究纳入了洛阳骨科创伤医院门诊的213例LDH患者(2025年1 - 4月)。收集人口统计、临床和短视频使用数据。在基线和随访时评估健康焦虑(MCQ-HA)和腰椎功能(JOA)。采用Pearson相关、多元线性回归、亚组分析和Bootstrap中介分析(重采样5000次)。结果:随访6个月,平均JOA评分由基线时的23.00±1.59下降至21.96±3.03,MCQ-HA评分由20.77±4.57上升至21.86±6.14。Pearson相关分析显示,每日观看时长和意识动机内容暴露频率与ΔJOA呈显著负相关(r = -0.36, r = -0.33); P均为ΔMCQ-HA (r = 0.31, r = 0.34); >60分钟每日观看组P均为ΔJOA显著低于ΔJOA组,与ΔMCQ-HA呈正相关(P均为> 0.05)。基于临床疗效标准的亚组分析显示,不同观看时间组的恢复结果差异有统计学意义(χ 2 = 18.75, P = 0.004)。Bootstrap中介分析证实,ΔMCQ-HA介导了每日观看时长对ΔJOA的总影响的16.13%,以及意识-动机内容曝光频率对ΔJOA的总影响的20.80%。结论:长时间观看短视频和频繁观看意识-动机内容与较差的康复和较高的健康焦虑相关,健康焦虑在其中起部分中介作用,为数字健康指导提供了经验证据。
{"title":"The impact of algorithm-driven exposure to disease-related short videos on rehabilitation outcomes in lumbar disc herniation patients: content heterogeneity and psychological mediating mechanisms.","authors":"Yiping Tong, Yang Li, Chenxi Liu, Xiang Chen, Linbo Xing, Zhiyuan Cao, Yanlei Wang","doi":"10.3389/fdgth.2026.1774844","DOIUrl":"https://doi.org/10.3389/fdgth.2026.1774844","url":null,"abstract":"<p><strong>Background: </strong>Short videos have become a primary channel for Lumbar Disc Herniation (LDH) patients to obtain disease knowledge and rehabilitation guidance. Algorithm-driven personalized recommendations may expose patients to heterogeneous LDH-related content, affecting their health anxiety and rehabilitation trajectories.</p><p><strong>Objective: </strong>This study explored the impacts of LDH-related short video exposure duration and content types on health anxiety and lumbar functional rehabilitation in LDH patients, and verified the mediating role of health anxiety.</p><p><strong>Methods: </strong>A 6-month prospective cohort study enrolled 213 LDH outpatients from Luoyang Orthopedic-Traumatological Hospital (Jan-Apr 2025). Demographic, clinical and short video usage data were collected. Health anxiety (MCQ-HA) and lumbar function (JOA) were assessed at baseline and follow-up. Pearson correlation, multiple linear regression, subgroup analysis and Bootstrap mediation analysis (5,000 resamplings) were used.</p><p><strong>Results: </strong>At 6-month follow-up, the mean JOA score decreased from 23.00 ± 1.59 at baseline to 21.96 ± 3.03, and the mean MCQ-HA score increased from 20.77 ± 4.57-21.86 ± 6.14. Pearson correlation analysis showed that daily viewing duration and exposure frequency to awareness-motivation content were significantly negatively correlated with <i>Δ</i>JOA (<i>r</i> = -0.36, <i>r</i> = -0.33; both <i>P</i> < 0.001) and positively correlated with <i>Δ</i>MCQ-HA (<i>r</i> = 0.31, <i>r</i> = 0.34; both <i>P</i> < 0.001). Multiple linear regression indicated that <i>Δ</i>JOA in the >60 min daily viewing group was significantly lower than that in the <30 min group; exposure frequency to awareness-motivation content was independently negatively associated with <i>Δ</i>JOA and positively associated with <i>Δ</i>MCQ-HA (both <i>P</i> < 0.001), with no significant associations found for other content categories (all <i>P</i> > 0.05). Subgroup analysis based on clinical efficacy criteria revealed significant differences in recovery outcomes across viewing duration groups (<i>χ</i> <sup>2</sup> = 18.75, <i>P</i> = 0.004). Bootstrap mediation analysis confirmed that <i>Δ</i>MCQ-HA mediated 16.13% of the total effect of daily viewing duration on <i>Δ</i>JOA and 20.80% of the total effect of awareness-motivation content exposure frequency on <i>Δ</i>JOA.</p><p><strong>Conclusion: </strong>Prolonged short video exposure and frequent awareness-motivation content viewing were associated with poorer rehabilitation and higher health anxiety, with health anxiety partially mediating these relationships, providing empirical evidence for digital health guidance.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"8 ","pages":"1774844"},"PeriodicalIF":3.2,"publicationDate":"2026-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12989517/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-02eCollection Date: 2026-01-01DOI: 10.3389/fdgth.2026.1710349
Kaarin J Anstey, Brooke Brady, Lidan Zheng, Jana Koch, Md Hamidul Huque, Michelle K Lupton, Ralph Martins, Daniel Ashworth, Erin Goddard, Nikki-Anne Wilson, Claudia M Hillenbrand, Ralf B Loeffler, Maria Markoulli, Arun V Krishnan, Tanya Layton, Ranmalee Eramudugolla
Purpose: The Resilient Minds (ReMind) cohort was established to investigate cognitive and mental health resilience across the life course, addressing a gap in longitudinal evidence about resilience. The study collected data on traditional medical and lifestyle risk factors for chronic disease, genetics, and a range of mental health and cognitive outcomes. It also aimed to explore contemporary contextual influences on resilience, including internet use, social engagement, environmental exposures, and life course adversities such as perceived discrimination.
Participants: The cohort included 1,640 adults aged 18-93 years, recruited through social media and community groups, to participate in a fully remote, two-year health study. Participants completed online surveys, cognitive and sensory testing, and intensive "sprints" occurring approximately every three months, during which daily surveys and digital health data were collected. A brain-health substudy (BHS) is being conducted for participants aged 50 years and older (current n = 184/400 planned), involving to evaluate neuroimaging, blood and ocular biomarkers to assess resilience and cognitive decline.
Findings to date: Thirty percent of participants were born overseas, and the average years of education were 14.7, 15.0 and 14.1 for young, middle aged and older adults, respectively. Among adults aged 65 years and older, 41.9% reported hypertension, 39.1% high cholesterol, 7.1% diabetes, and 22.4% obesity. In the BHS, 18% met criteria for Subjective Cognitive Decline, and 15% met criteria for Mild Cognitive Impairment.
Future plans: The initial study duration is 2 years, with plans to seek funding for extended follow-up to identify long-term predictors of cognitive and mental health resilience and the development of cognitive impairment in ageing.
{"title":"Cohort profile: the Resilient Minds national study of mental health and cognitive resilience in community dwelling adults aged 18 to 93.","authors":"Kaarin J Anstey, Brooke Brady, Lidan Zheng, Jana Koch, Md Hamidul Huque, Michelle K Lupton, Ralph Martins, Daniel Ashworth, Erin Goddard, Nikki-Anne Wilson, Claudia M Hillenbrand, Ralf B Loeffler, Maria Markoulli, Arun V Krishnan, Tanya Layton, Ranmalee Eramudugolla","doi":"10.3389/fdgth.2026.1710349","DOIUrl":"https://doi.org/10.3389/fdgth.2026.1710349","url":null,"abstract":"<p><strong>Purpose: </strong>The Resilient Minds (ReMind) cohort was established to investigate cognitive and mental health resilience across the life course, addressing a gap in longitudinal evidence about resilience. The study collected data on traditional medical and lifestyle risk factors for chronic disease, genetics, and a range of mental health and cognitive outcomes. It also aimed to explore contemporary contextual influences on resilience, including internet use, social engagement, environmental exposures, and life course adversities such as perceived discrimination.</p><p><strong>Participants: </strong>The cohort included 1,640 adults aged 18-93 years, recruited through social media and community groups, to participate in a fully remote, two-year health study. Participants completed online surveys, cognitive and sensory testing, and intensive \"sprints\" occurring approximately every three months, during which daily surveys and digital health data were collected. A brain-health substudy (BHS) is being conducted for participants aged 50 years and older (current <i>n</i> = 184/400 planned), involving to evaluate neuroimaging, blood and ocular biomarkers to assess resilience and cognitive decline.</p><p><strong>Findings to date: </strong>Thirty percent of participants were born overseas, and the average years of education were 14.7, 15.0 and 14.1 for young, middle aged and older adults, respectively. Among adults aged 65 years and older, 41.9% reported hypertension, 39.1% high cholesterol, 7.1% diabetes, and 22.4% obesity. In the BHS, 18% met criteria for Subjective Cognitive Decline, and 15% met criteria for Mild Cognitive Impairment.</p><p><strong>Future plans: </strong>The initial study duration is 2 years, with plans to seek funding for extended follow-up to identify long-term predictors of cognitive and mental health resilience and the development of cognitive impairment in ageing.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"8 ","pages":"1710349"},"PeriodicalIF":3.2,"publicationDate":"2026-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12989522/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-02eCollection Date: 2026-01-01DOI: 10.3389/fdgth.2026.1728920
Albert Attom
This article provides a critical narrative synthesis of literature on healthcare management in Africa, focusing on two interconnected areas: the impact of managerial capability on shaping integrated healthcare ecosystems and the adoption, implementation, and governance of digital health innovations within diverse health system contexts. Based on health systems strengthening frameworks and socio-technical views on digital transformation, the article explores how managerial skills influence the development and outcomes of digital health initiatives across African settings. Rather than presenting new empirical data, it uses comparative analysis of existing studies to highlight opportunities and ongoing challenges, such as uneven managerial digital skills, resistance to change, system fragmentation, and unintended effects like digital exclusion. The article concludes with a clear and practical agenda for future research and policy, emphasising the vital role of digitally competent managers in fostering supportive organisational cultures, promoting system integration, and ensuring meaningful adoption of digital health innovations by frontline health workers and patient populations.
{"title":"Health care management in Africa: a debate for future research and agenda.","authors":"Albert Attom","doi":"10.3389/fdgth.2026.1728920","DOIUrl":"https://doi.org/10.3389/fdgth.2026.1728920","url":null,"abstract":"<p><p>This article provides a critical narrative synthesis of literature on healthcare management in Africa, focusing on two interconnected areas: the impact of managerial capability on shaping integrated healthcare ecosystems and the adoption, implementation, and governance of digital health innovations within diverse health system contexts. Based on health systems strengthening frameworks and socio-technical views on digital transformation, the article explores how managerial skills influence the development and outcomes of digital health initiatives across African settings. Rather than presenting new empirical data, it uses comparative analysis of existing studies to highlight opportunities and ongoing challenges, such as uneven managerial digital skills, resistance to change, system fragmentation, and unintended effects like digital exclusion. The article concludes with a clear and practical agenda for future research and policy, emphasising the vital role of digitally competent managers in fostering supportive organisational cultures, promoting system integration, and ensuring meaningful adoption of digital health innovations by frontline health workers and patient populations.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"8 ","pages":"1728920"},"PeriodicalIF":3.2,"publicationDate":"2026-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12989621/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-02eCollection Date: 2026-01-01DOI: 10.3389/fdgth.2026.1761601
Anna N Khoruzhaya, Mariya D Varyukhina, Rustam A Erizhokov, Ivan A Blokhin, Roman V Reshetnikov, Mariya R Kodenko, Anastasia P Pamova, Tikhon A Burtsev, Kirill M Arzamasov, Olga V Omelyanskaya, Anton V Vladzymyrskyy, Yuriy A Vasilev
Background: Medical text summarization using large language models (LLMs) has reached an inflection point in 2024-2025, with adapted models demonstrating capability to match or exceed human expert performance in specific tasks. However, critical gaps persist in safety validation, evaluation frameworks, and clinical deployment readiness. A comprehensive review revealed that only 7% of studies conducted external validation and 3% performed patient safety assessments, with hallucination rates ranging from 1.47% to 61.6%. Existing reporting guidelines, including CONSORT-AI, SPIRIT-AI, TRIPOD-LLM, and DEAL, do not adequately address the specific requirements of medical text summarization tasks.
Objective: to develop MEDAI-LLM-SUMM, the first specialized reporting checklist for research on medical text summarization using LLMs, addressing critical gaps in existing reporting standards.
Methods: A modified iterative consensus approach was employed, comprising three sequential stages: (1) a systematic literature review of 216 publications from PubMed and eLibrary (2023-2025) following PRISMA guidelines and an analysis of existing reporting standards (TRIPOD-LLM, DEAL, CONSORT-AI, SPIRIT-AI, TRIPOD + AI, CLAIM, STARD-AI); (2) development of an initial 44-item, 7-section checklist by a supervisory group; (3) three rounds of face-to-face consensus discussions with a multidisciplinary expert panel of 11 specialists (3 radiologists, 2 clinicians, 3 medical informatics experts, 1 biostatistician, and 2 medical LLM developers). The consensus criterion required unanimous agreement from all panel members.
Results: The final MEDAI-LLM-SUMM checklist comprises 24 items organized into six sections: (A) Clinical validity (4 items addressing clinical task definition, expert involvement, hypothesis formulation, and medical expertise requirements); (B) Model Selection (5 items covering model justification, system requirements, deployment environment, LLM-as-judge approach, and prompt documentation); (C) Data (3 items on datasets, reference summaries with expert consensus, and data stratification); (D) Quality Assessment (8 items including evaluation metrics, clinical metrics, expert evaluation, hallucination detection, LLM-judge assessment, sample size justification, pilot testing, and limitations documentation); (E) Safety (2 items on ethical approval and data anonymization); and (F) Data Availability (2 items on code and dataset accessibility). Comparative analysis with six existing reporting standards demonstrated that MEDAI-LLM-SUMM uniquely addresses hallucination assessment requirements, reference summary creation methodology, LLM-as-judge validation protocols, and detailed pilot testing specifications.
{"title":"MEDAI-LLM-SUMM: a reporting checklist for medical text summarization studies using large language models.","authors":"Anna N Khoruzhaya, Mariya D Varyukhina, Rustam A Erizhokov, Ivan A Blokhin, Roman V Reshetnikov, Mariya R Kodenko, Anastasia P Pamova, Tikhon A Burtsev, Kirill M Arzamasov, Olga V Omelyanskaya, Anton V Vladzymyrskyy, Yuriy A Vasilev","doi":"10.3389/fdgth.2026.1761601","DOIUrl":"https://doi.org/10.3389/fdgth.2026.1761601","url":null,"abstract":"<p><strong>Background: </strong>Medical text summarization using large language models (LLMs) has reached an inflection point in 2024-2025, with adapted models demonstrating capability to match or exceed human expert performance in specific tasks. However, critical gaps persist in safety validation, evaluation frameworks, and clinical deployment readiness. A comprehensive review revealed that only 7% of studies conducted external validation and 3% performed patient safety assessments, with hallucination rates ranging from 1.47% to 61.6%. Existing reporting guidelines, including CONSORT-AI, SPIRIT-AI, TRIPOD-LLM, and DEAL, do not adequately address the specific requirements of medical text summarization tasks.</p><p><strong>Objective: </strong>to develop MEDAI-LLM-SUMM, the first specialized reporting checklist for research on medical text summarization using LLMs, addressing critical gaps in existing reporting standards.</p><p><strong>Methods: </strong>A modified iterative consensus approach was employed, comprising three sequential stages: (1) a systematic literature review of 216 publications from PubMed and eLibrary (2023-2025) following PRISMA guidelines and an analysis of existing reporting standards (TRIPOD-LLM, DEAL, CONSORT-AI, SPIRIT-AI, TRIPOD + AI, CLAIM, STARD-AI); (2) development of an initial 44-item, 7-section checklist by a supervisory group; (3) three rounds of face-to-face consensus discussions with a multidisciplinary expert panel of 11 specialists (3 radiologists, 2 clinicians, 3 medical informatics experts, 1 biostatistician, and 2 medical LLM developers). The consensus criterion required unanimous agreement from all panel members.</p><p><strong>Results: </strong>The final MEDAI-LLM-SUMM checklist comprises 24 items organized into six sections: (A) Clinical validity (4 items addressing clinical task definition, expert involvement, hypothesis formulation, and medical expertise requirements); (B) Model Selection (5 items covering model justification, system requirements, deployment environment, LLM-as-judge approach, and prompt documentation); (C) Data (3 items on datasets, reference summaries with expert consensus, and data stratification); (D) Quality Assessment (8 items including evaluation metrics, clinical metrics, expert evaluation, hallucination detection, LLM-judge assessment, sample size justification, pilot testing, and limitations documentation); (E) Safety (2 items on ethical approval and data anonymization); and (F) Data Availability (2 items on code and dataset accessibility). Comparative analysis with six existing reporting standards demonstrated that MEDAI-LLM-SUMM uniquely addresses hallucination assessment requirements, reference summary creation methodology, LLM-as-judge validation protocols, and detailed pilot testing specifications.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"8 ","pages":"1761601"},"PeriodicalIF":3.2,"publicationDate":"2026-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12989547/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147476306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-27eCollection Date: 2026-01-01DOI: 10.3389/fdgth.2026.1633888
Mahmoud Yousef, Kareem Essam Aly, Mariam Ahmed, Fatimaelzahraa Ali Ahmed, Khalid Al Jalham, Shidin Balakrishnan
Background: Deep learning (DL) methods for surgical video analysis have expanded rapidly in minimally invasive surgery (MIS). However, a structured bibliometric overview focused on DL-based surgical instrument segmentation, detection, and tracking is lacking. The objective of this review is to systematically map the research landscape with this focus, by examining publication trends, influential authors, institutions, and countries, collaboration networks, keyword co-occurrence patterns, and the thematic trajectory of the discipline.
Methods: We performed a bibliometric analysis of original research articles on DL-based surgical instrument segmentation/detection/tracking in laparoscopic or robotic MIS, published between 2017 and 2024. Searches were conducted in six databases namely PubMed, Scopus, IEEE Xplore, Embase, Medline, and Web of Science. Records were de-duplicated in EndNote and analyzed using the Bibliometrix R package, with co-authorship, co-citation, and keyword networks visualized in VOSviewer. Citation counts were extracted from each study's respective database and interpreted cautiously given the influence of publication age.
Results: We included 217 articles. Annual output increased from 2017 to a peak in 2023, indicating sustained growth in DL research for MIS instrument analysis. The most productive countries included the United States and France, with major institutional contributions from the University of Strasbourg and Furtwangen University. Keyword analysis indicated continued dominance of convolutional neural networks alongside emerging themes including transformer-based architectures, multimodal learning, and real-time intraoperative applications.
Conclusions: This bibliometric study characterizes the evolution, leading contributors, collaboration patterns, and thematic trajectories of DL-based instrument segmentation/detection/tracking in MIS. While these findings can inform research prioritization and collaboration, this study does not evaluate clinical effectiveness. Future work should prioritize explainable and efficient real-time models, standardized annotation protocols, and broader global partnerships to support responsible clinical translation.
背景:用于外科手术视频分析的深度学习(DL)方法在微创外科(MIS)中迅速扩展。然而,缺乏结构化的文献计量学综述,侧重于基于dl的手术器械分割、检测和跟踪。本综述的目的是通过研究出版趋势、有影响力的作者、机构和国家、合作网络、关键词共现模式和学科的主题轨迹,系统地绘制这一重点的研究图景。方法:我们对2017年至2024年间发表的基于dl的腹腔镜或机器人MIS中手术器械分割/检测/跟踪的原创研究文章进行文献计量学分析。在PubMed、Scopus、IEEE Xplore、Embase、Medline和Web of Science六个数据库中进行了检索。在EndNote中对记录进行重复数据删除,并使用Bibliometrix R软件包进行分析,并在VOSviewer中可视化共同作者、共同被引和关键词网络。引用计数从每个研究各自的数据库中提取,并考虑到出版年龄的影响进行谨慎解释。结果:纳入217篇文献。从2017年的年产量增加到2023年的峰值,表明MIS仪器分析的DL研究持续增长。生产力最高的国家包括美国和法国,斯特拉斯堡大学和富特旺根大学提供了主要的机构捐助。关键词分析表明,卷积神经网络将继续占据主导地位,同时还有基于变压器的架构、多模态学习和实时术中应用等新兴主题。结论:这项文献计量学研究描述了MIS中基于dl的仪器分割/检测/跟踪的演变、主要贡献者、合作模式和主题轨迹。虽然这些发现可以为研究优先级和合作提供信息,但本研究并未评估临床效果。未来的工作应优先考虑可解释和高效的实时模型、标准化注释协议和更广泛的全球合作伙伴关系,以支持负责任的临床翻译。
{"title":"Bibliometric analysis of deep learning for surgical instrument segmentation, detection and tracking in minimally invasive surgery.","authors":"Mahmoud Yousef, Kareem Essam Aly, Mariam Ahmed, Fatimaelzahraa Ali Ahmed, Khalid Al Jalham, Shidin Balakrishnan","doi":"10.3389/fdgth.2026.1633888","DOIUrl":"https://doi.org/10.3389/fdgth.2026.1633888","url":null,"abstract":"<p><strong>Background: </strong>Deep learning (DL) methods for surgical video analysis have expanded rapidly in minimally invasive surgery (MIS). However, a structured bibliometric overview focused on DL-based surgical instrument segmentation, detection, and tracking is lacking. The objective of this review is to systematically map the research landscape with this focus, by examining publication trends, influential authors, institutions, and countries, collaboration networks, keyword co-occurrence patterns, and the thematic trajectory of the discipline.</p><p><strong>Methods: </strong>We performed a bibliometric analysis of original research articles on DL-based surgical instrument segmentation/detection/tracking in laparoscopic or robotic MIS, published between 2017 and 2024. Searches were conducted in six databases namely PubMed, Scopus, IEEE Xplore, Embase, Medline, and Web of Science. Records were de-duplicated in EndNote and analyzed using the Bibliometrix R package, with co-authorship, co-citation, and keyword networks visualized in VOSviewer. Citation counts were extracted from each study's respective database and interpreted cautiously given the influence of publication age.</p><p><strong>Results: </strong>We included 217 articles. Annual output increased from 2017 to a peak in 2023, indicating sustained growth in DL research for MIS instrument analysis. The most productive countries included the United States and France, with major institutional contributions from the University of Strasbourg and Furtwangen University. Keyword analysis indicated continued dominance of convolutional neural networks alongside emerging themes including transformer-based architectures, multimodal learning, and real-time intraoperative applications.</p><p><strong>Conclusions: </strong>This bibliometric study characterizes the evolution, leading contributors, collaboration patterns, and thematic trajectories of DL-based instrument segmentation/detection/tracking in MIS. While these findings can inform research prioritization and collaboration, this study does not evaluate clinical effectiveness. Future work should prioritize explainable and efficient real-time models, standardized annotation protocols, and broader global partnerships to support responsible clinical translation.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"8 ","pages":"1633888"},"PeriodicalIF":3.2,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12983469/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147470441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-27eCollection Date: 2026-01-01DOI: 10.3389/fdgth.2026.1771281
Abishek Ravichandran, Tamilarasi Kathirvel Murugan, Logeswari Govindaraj, Vishal M
Parkinson's disease (PD) is a progressive neurodegenerative disorder characterized by motor and non-motor impairments, where early diagnosis remains challenging due to reliance on subjective clinical assessments. Recent artificial intelligence (AI)-based approaches have demonstrated promise in identifying subtle PD biomarkers from individual modalities such as speech, gait, and handwriting; however, unimodal systems often fail to capture the heterogeneity of the disease and provide limited interpretability. To address these limitations, this study proposes a multimodal deep learning framework that integrates handwriting, gait, and speech modalities using an early feature fusion strategy for robust and interpretable PD detection. Each modality is processed through a dedicated feature extraction pipeline using deep neural networks, followed by static feature concatenation and classification using an XGBoost model. Model transparency is enhanced using explainable AI (XAI) techniques, including SHapley Additive exPlanations (SHAP) and Gradient-weighted Class Activation Mapping (Grad-CAM), enabling clinical interpretability of modality- and feature-level contributions. Experimental evaluation on benchmark datasets demonstrates that the proposed trimodal fusion model achieves an accuracy of 92%, outperforming unimodal handwriting (91%), gait (90%), and speech (74%) models. The fusion framework attains a macro F1-score of 0.89, an area under the ROC curve (AUC) of 0.95, and an average precision (AP) of 0.96, indicating strong discriminative capability and robustness. Confusion matrix analysis reveals balanced sensitivity (90%) and specificity (89%) across classes. Explainability analysis confirms that handwriting tremor patterns, gait force asymmetries, and speech spectral instabilities are key contributors to PD prediction. These results highlight the effectiveness of explainable multimodal AI in delivering accurate, reliable, and clinically interpretable solutions for early PD detection.
{"title":"Explainable multimodal feature fusion networks for Parkinson's disease prediction.","authors":"Abishek Ravichandran, Tamilarasi Kathirvel Murugan, Logeswari Govindaraj, Vishal M","doi":"10.3389/fdgth.2026.1771281","DOIUrl":"https://doi.org/10.3389/fdgth.2026.1771281","url":null,"abstract":"<p><p>Parkinson's disease (PD) is a progressive neurodegenerative disorder characterized by motor and non-motor impairments, where early diagnosis remains challenging due to reliance on subjective clinical assessments. Recent artificial intelligence (AI)-based approaches have demonstrated promise in identifying subtle PD biomarkers from individual modalities such as speech, gait, and handwriting; however, unimodal systems often fail to capture the heterogeneity of the disease and provide limited interpretability. To address these limitations, this study proposes a multimodal deep learning framework that integrates handwriting, gait, and speech modalities using an early feature fusion strategy for robust and interpretable PD detection. Each modality is processed through a dedicated feature extraction pipeline using deep neural networks, followed by static feature concatenation and classification using an XGBoost model. Model transparency is enhanced using explainable AI (XAI) techniques, including SHapley Additive exPlanations (SHAP) and Gradient-weighted Class Activation Mapping (Grad-CAM), enabling clinical interpretability of modality- and feature-level contributions. Experimental evaluation on benchmark datasets demonstrates that the proposed trimodal fusion model achieves an accuracy of 92%, outperforming unimodal handwriting (91%), gait (90%), and speech (74%) models. The fusion framework attains a macro F1-score of 0.89, an area under the ROC curve (AUC) of 0.95, and an average precision (AP) of 0.96, indicating strong discriminative capability and robustness. Confusion matrix analysis reveals balanced sensitivity (90%) and specificity (89%) across classes. Explainability analysis confirms that handwriting tremor patterns, gait force asymmetries, and speech spectral instabilities are key contributors to PD prediction. These results highlight the effectiveness of explainable multimodal AI in delivering accurate, reliable, and clinically interpretable solutions for early PD detection.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"8 ","pages":"1771281"},"PeriodicalIF":3.2,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12982457/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147470416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-27eCollection Date: 2026-01-01DOI: 10.3389/fdgth.2026.1743131
Juliana Basulo-Ribeiro, Ana Ferreira, Leonor Teixeira
The advance of medicine and technology has been a catalyst for the population's longevity, reflected in the increase in elderly citizens. However, this increase also comes a burden on caregivers. To address this gap between need and accessibility, the motivation for this study arises, highlighting the challenges faced by an ageing population. This work presents a preliminary proof of concept of an innovative digital tool (a mobile app prototype) to support older people to live more independently and safely, while facilitating communication between them, caregivers and health professionals. To develop the prototype for monitoring elderly health, the User-Centred Design methodology was applied, concluding with a usability evaluation. As a proof-of-concept, this study suggests that combining technology with human support may contribute to improved elderly care and empowerment; however, these implications remain preliminary and require validation in larger and more diverse evaluations. Theoretically, it uses a social determinant of health lens to outline potential ways in which health apps could support access to care in this age group, to be examined in future, larger-scale evaluations. From a practical perspective, it contributes with a preliminary proof of concept and prototype offering use-cases aimed at improving the quality of life of this population.
{"title":"SeniorHealth Tracker application to support the elderly: technological innovation leveraging humanisation.","authors":"Juliana Basulo-Ribeiro, Ana Ferreira, Leonor Teixeira","doi":"10.3389/fdgth.2026.1743131","DOIUrl":"https://doi.org/10.3389/fdgth.2026.1743131","url":null,"abstract":"<p><p>The advance of medicine and technology has been a catalyst for the population's longevity, reflected in the increase in elderly citizens. However, this increase also comes a burden on caregivers. To address this gap between need and accessibility, the motivation for this study arises, highlighting the challenges faced by an ageing population. This work presents a preliminary proof of concept of an innovative digital tool (a mobile app prototype) to support older people to live more independently and safely, while facilitating communication between them, caregivers and health professionals. To develop the prototype for monitoring elderly health, the User-Centred Design methodology was applied, concluding with a usability evaluation. As a proof-of-concept, this study suggests that combining technology with human support may contribute to improved elderly care and empowerment; however, these implications remain preliminary and require validation in larger and more diverse evaluations. Theoretically, it uses a social determinant of health lens to outline potential ways in which health apps could support access to care in this age group, to be examined in future, larger-scale evaluations. From a practical perspective, it contributes with a preliminary proof of concept and prototype offering use-cases aimed at improving the quality of life of this population.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"8 ","pages":"1743131"},"PeriodicalIF":3.2,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12982169/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147470394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}