PLOS digital health最新文献_第2页

A pilot feasibility study of human-centered design for cirrhosis care: Development and pilot testing of SMARTLiver prototype, a FHIR-based clinical decision support system for hepatology. 肝硬化以人为中心设计的中试可行性研究：基于fhr的肝病临床决策支持系统SMARTLiver原型的开发和中试测试

IF 7.7

PLOS digital health

Pub Date : 2026-01-20 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pdig.0000969

Keerthika Sunchu, Archita P Desai, Raj Vuppalanchi, Saptarshi Purkayastha

Management of cirrhosis suffers from poor guideline adherence due to fragmented electronic health record (EHR) systems that scatter critical patient data across multiple modules, creating cognitive burden for clinicians and impeding evidence-based care delivery. We developed SMARTLiver, a Substitutable Medical Applications and Reusable Technologies on Fast Healthcare Interoperability Resources (SMART-on-FHIR) clinical decision support application employing human-centered design principles to consolidate patient data, incorporate evidence-based guidelines, and enhance cirrhosis care workflows. Following literature reviews of cirrhosis management guidelines and clinical workflow analysis within our health system, we created a FHIR-based application integrating automated task management, prognostic scoring, patient-reported outcomes, and real-time clinical decision support features. Usability evaluation with five clinical staff members using Think-Aloud protocols and the validated Health-ITUES survey revealed high satisfaction scores for Clinical Utility (4.4-4.6/5.0) and User Interface design (4.2/5.0), with moderate scores for workflow integration (4.0/5.0) and decision support (3.8-4.0/5.0). Qualitative feedback aligned with quantitative results, identifying enhancement opportunities in customization controls and notification management. The SMARTLiver prototype demonstrated technical feasibility in aggregating fragmented clinical data into a unified interface, automating evidence-based task generation, and maintaining interoperability across healthcare systems. This pilot study provides initial evidence for the potential of SMART-on-FHIR technology to address EHR fragmentation in cirrhosis care, though clinical effectiveness remains to be demonstrated.

由于分散的电子健康记录（EHR）系统将关键患者数据分散在多个模块中，给临床医生带来认知负担，阻碍了循证护理的提供，肝硬化管理的指南依从性较差。我们开发了SMARTLiver，这是一款基于快速医疗互操作性资源（SMART-on-FHIR）的替代医疗应用和可重用技术临床决策支持应用程序，采用以人为本的设计原则来整合患者数据，纳入循证指南，并增强肝硬化护理工作流程。根据对肝硬化管理指南的文献回顾和我们卫生系统内的临床工作流程分析，我们创建了一个基于fhir的应用程序，集成了自动任务管理、预后评分、患者报告的结果和实时临床决策支持功能。对5名临床工作人员使用Think-Aloud方案和经过验证的Health-ITUES调查进行的可用性评估显示，临床效用（4.4-4.6/5.0）和用户界面设计（4.2/5.0）得分较高，工作流程集成（4.0/5.0）和决策支持（3.8-4.0/5.0）得分中等。定性反馈与定量结果一致，确定了自定义控制和通知管理方面的增强机会。SMARTLiver原型展示了将分散的临床数据聚合到统一界面、自动化循证任务生成以及保持医疗系统互操作性的技术可行性。这项初步研究为SMART-on-FHIR技术在肝硬化治疗中解决电子病历碎片化问题的潜力提供了初步证据，尽管临床有效性仍有待证实。

{"title":"A pilot feasibility study of human-centered design for cirrhosis care: Development and pilot testing of SMARTLiver prototype, a FHIR-based clinical decision support system for hepatology.","authors":"Keerthika Sunchu, Archita P Desai, Raj Vuppalanchi, Saptarshi Purkayastha","doi":"10.1371/journal.pdig.0000969","DOIUrl":"10.1371/journal.pdig.0000969","url":null,"abstract":"Management of cirrhosis suffers from poor guideline adherence due to fragmented electronic health record (EHR) systems that scatter critical patient data across multiple modules, creating cognitive burden for clinicians and impeding evidence-based care delivery. We developed SMARTLiver, a Substitutable Medical Applications and Reusable Technologies on Fast Healthcare Interoperability Resources (SMART-on-FHIR) clinical decision support application employing human-centered design principles to consolidate patient data, incorporate evidence-based guidelines, and enhance cirrhosis care workflows. Following literature reviews of cirrhosis management guidelines and clinical workflow analysis within our health system, we created a FHIR-based application integrating automated task management, prognostic scoring, patient-reported outcomes, and real-time clinical decision support features. Usability evaluation with five clinical staff members using Think-Aloud protocols and the validated Health-ITUES survey revealed high satisfaction scores for Clinical Utility (4.4-4.6/5.0) and User Interface design (4.2/5.0), with moderate scores for workflow integration (4.0/5.0) and decision support (3.8-4.0/5.0). Qualitative feedback aligned with quantitative results, identifying enhancement opportunities in customization controls and notification management. The SMARTLiver prototype demonstrated technical feasibility in aggregating fragmented clinical data into a unified interface, automating evidence-based task generation, and maintaining interoperability across healthcare systems. This pilot study provides initial evidence for the potential of SMART-on-FHIR technology to address EHR fragmentation in cirrhosis care, though clinical effectiveness remains to be demonstrated.","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"5 1","pages":"e0000969"},"PeriodicalIF":7.7,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12818595/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146013732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Decentralized clinical trials: A comprehensive analysis of trends, technologies, and global challenges. 分散临床试验：趋势、技术和全球挑战的综合分析。

IF 7.7

PLOS digital health

Pub Date : 2026-01-16 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pdig.0001191

Sara Kijewski, Claire McBride, Eric Owens, Elsa Bernheim, Effy Vayena

Decentralized clinical trials (DCTs), particularly in the U.S., gained substantial attention during the COVID-19 pandemic, enabling trial activities to be conducted from participants' homes or local healthcare facilities despite restrictions and lockdowns. Regardless of the growth in interest, many facets of the DCT landscape remain unexplored or nascent in their development. This study aims to explore the key characteristics and development of the U.S.-registered DCT landscape, adoption patterns across various clinical contexts, and the role of digital technologies. We analyzed 1370 decentralized trials from ClinicalTrials.gov, collected using a broad DCT-keyword search. The data were screened and coded manually, and analyzed descriptively for temporal trends, purpose of decentralization, intervention type, geographic representation, and digitalization. Our findings align with previous reports of a growing, heterogeneous landscape of DCTs, with behavioral interventions appearing more suitable for decentralization than other types of interventions. Notably, most DCTs still focus on evaluating decentralized methods rather than merely implementing them in their investigations. Often, studies integrate digital tools either as the interventions themselves or to enable the digital delivery of study activities. Although the trial registry used is U.S.-based, and a U.S. partner is part of more than 50% of the studies identified, many trials are done in multiple countries or countries outside of the U.S. (42%). Among these trials, the data revealed considerable differences, with digitalized DCTs in this sample concentrated in high-income countries. Despite rapid growth in DCTs, our findings suggest the presence of a field in development, very much focused on establishing a methodological foundation. To unlock the potential of DCTs locally and globally, four critical areas demand further attention: digital equity, regulatory frameworks for diverse technologies, establishment of methodological validation processes, and further research on barriers to implementation.

分散临床试验（dct），特别是在美国，在2019冠状病毒病大流行期间获得了大量关注，使试验活动能够在参与者家中或当地医疗机构进行，尽管有限制和封锁。尽管人们的兴趣在增长，但DCT领域的许多方面仍未被探索或处于发展初期。本研究旨在探讨美国注册DCT的主要特征和发展，不同临床背景下的采用模式，以及数字技术的作用。我们分析了ClinicalTrials.gov网站上1370个分散的试验，这些试验是通过广泛的dct关键字搜索收集的。对数据进行手动筛选和编码，并对时间趋势、分权目的、干预类型、地理代表性和数字化进行描述性分析。我们的研究结果与之前的报告一致，即dct的异质性日益增加，行为干预似乎比其他类型的干预更适合分散。值得注意的是，大多数dct仍然专注于评估分散的方法，而不仅仅是在调查中实施这些方法。通常，研究将数字工具作为干预措施本身或使研究活动的数字化交付成为可能。尽管所使用的试验注册是在美国，并且超过50%的研究是由美国合作伙伴参与的，但许多试验是在多个国家或美国以外的国家进行的（42%）。在这些试验中，数据显示出相当大的差异，该样本中的数字化dct集中在高收入国家。尽管dct快速增长，但我们的研究结果表明，在发展中存在一个领域，非常注重建立方法学基础。为了释放dct在本地和全球的潜力，需要进一步关注四个关键领域：数字公平、不同技术的监管框架、方法验证流程的建立以及对实施障碍的进一步研究。

{"title":"Decentralized clinical trials: A comprehensive analysis of trends, technologies, and global challenges.","authors":"Sara Kijewski, Claire McBride, Eric Owens, Elsa Bernheim, Effy Vayena","doi":"10.1371/journal.pdig.0001191","DOIUrl":"10.1371/journal.pdig.0001191","url":null,"abstract":"Decentralized clinical trials (DCTs), particularly in the U.S., gained substantial attention during the COVID-19 pandemic, enabling trial activities to be conducted from participants' homes or local healthcare facilities despite restrictions and lockdowns. Regardless of the growth in interest, many facets of the DCT landscape remain unexplored or nascent in their development. This study aims to explore the key characteristics and development of the U.S.-registered DCT landscape, adoption patterns across various clinical contexts, and the role of digital technologies. We analyzed 1370 decentralized trials from ClinicalTrials.gov, collected using a broad DCT-keyword search. The data were screened and coded manually, and analyzed descriptively for temporal trends, purpose of decentralization, intervention type, geographic representation, and digitalization. Our findings align with previous reports of a growing, heterogeneous landscape of DCTs, with behavioral interventions appearing more suitable for decentralization than other types of interventions. Notably, most DCTs still focus on evaluating decentralized methods rather than merely implementing them in their investigations. Often, studies integrate digital tools either as the interventions themselves or to enable the digital delivery of study activities. Although the trial registry used is U.S.-based, and a U.S. partner is part of more than 50% of the studies identified, many trials are done in multiple countries or countries outside of the U.S. (42%). Among these trials, the data revealed considerable differences, with digitalized DCTs in this sample concentrated in high-income countries. Despite rapid growth in DCTs, our findings suggest the presence of a field in development, very much focused on establishing a methodological foundation. To unlock the potential of DCTs locally and globally, four critical areas demand further attention: digital equity, regulatory frameworks for diverse technologies, establishment of methodological validation processes, and further research on barriers to implementation.","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"5 1","pages":"e0001191"},"PeriodicalIF":7.7,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12810901/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Impact of virtual ICU implementation on clinical outcomes across multiple critical care units: A before-and-after study. 虚拟ICU实施对多个重症监护病房临床结果的影响：一项前后研究。

IF 7.7

PLOS digital health

Pub Date : 2026-01-16 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pdig.0001186

Annemarie Nguyen, Sprague W Hazard, Anthony S Bonavia

Virtual intensive care units (vICUs) provide continuous remote monitoring and support for critically ill patients. Increasing patient complexity and staffing shortages have driven interest in vICUs, but evidence of their impact on clinical outcomes is limited. This study evaluated the effect of vICU implementation across critical care units in a large academic medical center. We conducted a before-and-after study comparing outcomes during the initial vICU implementation period (October 2022-April 2023) and the established program period (October 2023-April 2024), with a 6-month washout interval. Adult patients from a multispecialty surgical intensive care unit (ICU), neurocritical care unit, and ICU step-down unit were included if they had ICU stays longer than 6 h, hospital stays under 30 days, and mechanical ventilation for at least 12 h. The primary outcome was ICU length of stay, with secondary outcomes including hospital stay, ventilation time, vasopressor use, readmissions, and mortality. Among 530 patients (266 implementation, 264 established), ICU length of stay decreased from 232 to 198 h (p=0.011), ventilation time from 110 to 90 h (p=0.044), and vasopressor use for more than 12 h from 55% to 43% (p=0.007). Hospital stay, mortality, and readmission rates were unchanged. Subgroup analysis showed the greatest improvements in the surgical ICU, with reductions in ICU stay, ventilation time, and vasopressor use. These improvements may reflect earlier recognition of deterioration, better care coordination, and timely withdrawal of intensive therapies. Variation across units highlights the need to tailor vICU integration strategies to specific clinical workflows. These findings suggest that vICU implementation is feasible and may enhance critical care efficiency, though further multi-center studies are needed to determine generalizability and to assess patient-centered and economic outcomes.

虚拟重症监护病房（vicu）为危重患者提供持续的远程监测和支持。日益增加的患者复杂性和人员短缺促使人们对vicu产生了兴趣，但有关其对临床结果影响的证据有限。本研究评估了在一家大型学术医疗中心的重症监护病房实施vICU的效果。我们进行了一项前后研究，比较了初始vICU实施期间（2022年10月至2023年4月）和既定计划期间（2010月至2024年4月）的结果，并进行了6个月的洗脱期。来自多专科外科重症监护病房（ICU）、神经危重监护病房和ICU降压病房的成年患者，如果其ICU住院时间超过6小时、住院时间小于30天、机械通气至少12小时，则纳入研究。主要结局是ICU住院时间，次要结局包括住院时间、通气时间、血管加压剂使用、再入院和死亡率。在530例患者中（266例实施，264例已建立），ICU住院时间从232小时减少到198小时（p=0.011），通气时间从110小时减少到90小时（p=0.044），血管加压药使用超过12小时从55%减少到43% （p=0.007）。住院时间、死亡率和再入院率没有变化。亚组分析显示，外科ICU的改善最大，ICU住院时间、通气时间和血管加压药的使用都减少了。这些改善可能反映了对恶化的早期认识、更好的护理协调以及及时退出强化治疗。各单位之间的差异突出了为特定临床工作流程量身定制vICU集成策略的必要性。这些发现表明，vICU的实施是可行的，并可能提高重症监护效率，但需要进一步的多中心研究来确定其普遍性，并评估以患者为中心的经济结果。

{"title":"Impact of virtual ICU implementation on clinical outcomes across multiple critical care units: A before-and-after study.","authors":"Annemarie Nguyen, Sprague W Hazard, Anthony S Bonavia","doi":"10.1371/journal.pdig.0001186","DOIUrl":"10.1371/journal.pdig.0001186","url":null,"abstract":"Virtual intensive care units (vICUs) provide continuous remote monitoring and support for critically ill patients. Increasing patient complexity and staffing shortages have driven interest in vICUs, but evidence of their impact on clinical outcomes is limited. This study evaluated the effect of vICU implementation across critical care units in a large academic medical center. We conducted a before-and-after study comparing outcomes during the initial vICU implementation period (October 2022-April 2023) and the established program period (October 2023-April 2024), with a 6-month washout interval. Adult patients from a multispecialty surgical intensive care unit (ICU), neurocritical care unit, and ICU step-down unit were included if they had ICU stays longer than 6 h, hospital stays under 30 days, and mechanical ventilation for at least 12 h. The primary outcome was ICU length of stay, with secondary outcomes including hospital stay, ventilation time, vasopressor use, readmissions, and mortality. Among 530 patients (266 implementation, 264 established), ICU length of stay decreased from 232 to 198 h (p=0.011), ventilation time from 110 to 90 h (p=0.044), and vasopressor use for more than 12 h from 55% to 43% (p=0.007). Hospital stay, mortality, and readmission rates were unchanged. Subgroup analysis showed the greatest improvements in the surgical ICU, with reductions in ICU stay, ventilation time, and vasopressor use. These improvements may reflect earlier recognition of deterioration, better care coordination, and timely withdrawal of intensive therapies. Variation across units highlights the need to tailor vICU integration strategies to specific clinical workflows. These findings suggest that vICU implementation is feasible and may enhance critical care efficiency, though further multi-center studies are needed to determine generalizability and to assess patient-centered and economic outcomes.","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"5 1","pages":"e0001186"},"PeriodicalIF":7.7,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12810791/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Immersive virtual reality-based intervention for psychological wellbeing among older adults: A systematic review and meta-analysis. 基于沉浸式虚拟现实的老年人心理健康干预：系统回顾和荟萃分析。

IF 7.7

PLOS digital health

Pub Date : 2026-01-15 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pdig.0001110

Jing Jing Su, Chi-Keung Chan, Ladislav Batalik, Wai Chung Chung, Chen Lei, Rick Yiu Cho Kwan

Immersive virtual reality (IVR) is an emerging therapeutic modality that engages older adults in psychological therapeutically oriented activities developed to improve their psychological well-being. This systematic review aims to investigate the effects of IVR psychological intervention on psychological symptoms and well-being. A systematic review and meta-analysis was conducted following the Cochrane Handbook for Systematic Reviews of Interventions. Six databases were searched, including Embase, PubMed, Web of Science, Scopus, CINAHL, and PsycINFO, covering the period from 2010 to December 2024. RevMan 5.3 was utilized for meta-analysis, and the Cochrane Risk of Bias tool was employed for quality assessment. Ten randomized controlled trials of 746 older adults were included. The IVR interventions employed reminiscence (40%), garden/forest therapy (40%), cognitive stimulation (10%), and multi-sensory stimulation to reduce psychological symptoms and improve self-perception (10%). Data pooling suggested that IVR interventions have significantly reduced depressive symptoms [n = 5; SMD = -0.83, 95%CI (-1.05, -0.60), I2 = 21%, p < .001]; anxiety [n = 5, SMD = -0.77, 95% CI (-1.32, -0.22), I2 = 70%, p = .006]. Synthesis without meta-analysis (SWiM) was conducted for stress and affect outcomes following SWiM guidance. In all three studies (100%), IVR produced statistically significant reductions in stress versus usual/standard care, and in both studies (100%), it yielded statistically significant improvements in affect-higher positive and lower negative affect-compared with the respective control conditions. IVR-based interventions could be an alternative method for alleviating the psychological symptoms of older adults. Registration: PROSPERO CRD42024575387.

沉浸式虚拟现实（IVR）是一种新兴的治疗方式，使老年人参与心理治疗导向的活动，以改善他们的心理健康。本系统综述旨在探讨IVR心理干预对心理症状和幸福感的影响。根据Cochrane干预措施系统评价手册进行了系统评价和荟萃分析。检索了Embase、PubMed、Web of Science、Scopus、CINAHL、PsycINFO等6个数据库，检索时间为2010年至2024年12月。meta分析采用RevMan 5.3，质量评价采用Cochrane偏倚风险工具。10项随机对照试验纳入746名老年人。IVR干预采用回忆（40%）、花园/森林疗法（40%）、认知刺激（10%）和多感官刺激来减少心理症状和改善自我知觉（10%）。数据汇总显示IVR干预可显著减少抑郁症状[n = 5；SMD = -0.83, 95%CI (-1.05, -0.60), I2 = 21%, p

{"title":"Immersive virtual reality-based intervention for psychological wellbeing among older adults: A systematic review and meta-analysis.","authors":"Jing Jing Su, Chi-Keung Chan, Ladislav Batalik, Wai Chung Chung, Chen Lei, Rick Yiu Cho Kwan","doi":"10.1371/journal.pdig.0001110","DOIUrl":"10.1371/journal.pdig.0001110","url":null,"abstract":"Immersive virtual reality (IVR) is an emerging therapeutic modality that engages older adults in psychological therapeutically oriented activities developed to improve their psychological well-being. This systematic review aims to investigate the effects of IVR psychological intervention on psychological symptoms and well-being. A systematic review and meta-analysis was conducted following the Cochrane Handbook for Systematic Reviews of Interventions. Six databases were searched, including Embase, PubMed, Web of Science, Scopus, CINAHL, and PsycINFO, covering the period from 2010 to December 2024. RevMan 5.3 was utilized for meta-analysis, and the Cochrane Risk of Bias tool was employed for quality assessment. Ten randomized controlled trials of 746 older adults were included. The IVR interventions employed reminiscence (40%), garden/forest therapy (40%), cognitive stimulation (10%), and multi-sensory stimulation to reduce psychological symptoms and improve self-perception (10%). Data pooling suggested that IVR interventions have significantly reduced depressive symptoms [n = 5; SMD = -0.83, 95%CI (-1.05, -0.60), I2 = 21%, p < .001]; anxiety [n = 5, SMD = -0.77, 95% CI (-1.32, -0.22), I2 = 70%, p = .006]. Synthesis without meta-analysis (SWiM) was conducted for stress and affect outcomes following SWiM guidance. In all three studies (100%), IVR produced statistically significant reductions in stress versus usual/standard care, and in both studies (100%), it yielded statistically significant improvements in affect-higher positive and lower negative affect-compared with the respective control conditions. IVR-based interventions could be an alternative method for alleviating the psychological symptoms of older adults. Registration: PROSPERO CRD42024575387.","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"5 1","pages":"e0001110"},"PeriodicalIF":7.7,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12806852/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145986032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Lessons learned from implementing FAIRification workflows in diabetes research in Germany. 在德国糖尿病研究中实施farification工作流程的经验教训。

IF 7.7

PLOS digital health

Pub Date : 2026-01-13 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pdig.0001139

Esther Thea Inau, Angela Dedié, Ivona Anastasova, Renate Schick, Brigitte Fröhlich, Michael Roden, Andreas L Birkenfeld, Martin Hrabě de Angelis, Martin Preusse, Dagmar Waltemath, Atinkut Alamirrew Zeleke

The FAIR principles guide data stewardship towards maximizing the value of scientific data while offering a high level of flexibility to accommodate differences in standards and scientific practices. Research communities have developed and implemented domain-specific workflows to make their data FAIR. This work compares the implementation of two externally developed structured FAIRification workflows-a generic workflow and a domain-specific workflow- using the example of metadata captured in diabetes research in Germany and applying the FAIR data maturity model developed by the Research Data Alliance. Interestingly, the implementation of both workflows required similar resources and led us to achieve the same FAIRness rating. We therefore conclude that the adaptations made in the FAIRification workflow for health research data improve efficiency but do not necessarily lead to higher FAIRness scores when applied to core data sets. Based on the results of our workflow comparison, we identified a list of requirements that should be met for the FAIRification of a core data set regardless of the workflow employed. In the future, FAIR data strategies and infrastructure should be planned and implemented as early as possible in the FAIRification journey. It is anticipated that this comparative analysis will help establish standard operating procedures for the FAIRification of core data sets for health studies.

公平原则指导数据管理，使科学数据的价值最大化，同时提供高度的灵活性，以适应标准和科学实践的差异。研究团体已经开发并实现了特定于领域的工作流，以使他们的数据公平。本研究以德国糖尿病研究中捕获的元数据为例，并应用研究数据联盟开发的FAIR数据成熟度模型，比较了两种外部开发的结构化FAIR工作流（通用工作流和特定领域工作流）的实施情况。有趣的是，这两个工作流的实现需要相似的资源，并使我们获得相同的公平评级。因此，我们得出结论，在卫生研究数据的公平化工作流程中所做的调整提高了效率，但在应用于核心数据集时并不一定导致更高的公平得分。基于工作流比较的结果，我们确定了一个需求列表，无论采用何种工作流，核心数据集的标准化都应该满足这些需求。未来，FAIR数据策略和基础设施应该在FAIR化过程中尽早规划和实施。预计这种比较分析将有助于建立标准作业程序，以统一卫生研究的核心数据集。

{"title":"Lessons learned from implementing FAIRification workflows in diabetes research in Germany.","authors":"Esther Thea Inau, Angela Dedié, Ivona Anastasova, Renate Schick, Brigitte Fröhlich, Michael Roden, Andreas L Birkenfeld, Martin Hrabě de Angelis, Martin Preusse, Dagmar Waltemath, Atinkut Alamirrew Zeleke","doi":"10.1371/journal.pdig.0001139","DOIUrl":"10.1371/journal.pdig.0001139","url":null,"abstract":"The FAIR principles guide data stewardship towards maximizing the value of scientific data while offering a high level of flexibility to accommodate differences in standards and scientific practices. Research communities have developed and implemented domain-specific workflows to make their data FAIR. This work compares the implementation of two externally developed structured FAIRification workflows-a generic workflow and a domain-specific workflow- using the example of metadata captured in diabetes research in Germany and applying the FAIR data maturity model developed by the Research Data Alliance. Interestingly, the implementation of both workflows required similar resources and led us to achieve the same FAIRness rating. We therefore conclude that the adaptations made in the FAIRification workflow for health research data improve efficiency but do not necessarily lead to higher FAIRness scores when applied to core data sets. Based on the results of our workflow comparison, we identified a list of requirements that should be met for the FAIRification of a core data set regardless of the workflow employed. In the future, FAIR data strategies and infrastructure should be planned and implemented as early as possible in the FAIRification journey. It is anticipated that this comparative analysis will help establish standard operating procedures for the FAIRification of core data sets for health studies.","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"5 1","pages":"e0001139"},"PeriodicalIF":7.7,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12799184/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145968073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Moving beyond the empty cell: The threat of decontextualized healthcare data. 超越空单元格：脱离上下文的医疗保健数据的威胁。

IF 7.7

PLOS digital health

Pub Date : 2026-01-13 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pdig.0001194

Aya El Mir, Eric Bezerra de Sousa, Ignacio Mesina-Estarrón, Leo Anthony Celi, Moad Hani, Mohammed Benjelloun, Neha Nageswaran, Saïd Mahmoudi, Shaheen Siddiqui, Sreeram Sadasivam, William Greig Mitchell

Missing, inaccurate, or poorly documented data in healthcare is often treated as a technical problem to be statistically resolved via imputation, deletion, or modeling assumptions about randomness. However, such inaccuracies relate to far more complex socioeconomic and geopolitical issues, rather than "errors of data entry" to be ameliorated with statistical modeling techniques. We outline that what is really missing or inaccurate is the context in which the data is collected-and that only by understanding this context can we begin to prevent artificial intelligence's (AIs) amplification of misleading, decontextualized data. We critically examine how traditional modeling methods fail to account for the factors that influence what data gets recorded, and for whom. We show how AI systems trained on decontextualized data reinforce health inequities at scale. And, we review recent literature on context-aware approaches to understanding data, that incorporate metadata, social determinants of health, fairness constraints, and participatory governance to build more ethical and representative systems. Our analysis urges the AI and healthcare communities to move beyond the traditional emphasis on statistical convenience, toward socially grounded and interdisciplinary strategies for handling decontextualized data.

医疗保健中缺少、不准确或记录不良的数据通常被视为一个技术问题，需要通过对随机性的插入、删除或建模假设来统计解决。然而，这种不准确与更复杂的社会经济和地缘政治问题有关，而不是用统计建模技术来改善的“数据输入错误”。我们概述说，真正缺失或不准确的是收集数据的背景，只有理解了这一背景，我们才能开始防止人工智能（ai）放大误导性的、脱离背景的数据。我们批判性地研究了传统建模方法如何无法解释影响记录数据的因素，以及为谁记录。我们展示了在非情境化数据上训练的人工智能系统如何大规模地加剧了卫生不平等。此外，我们回顾了最近关于上下文感知方法来理解数据的文献，这些方法包括元数据、健康的社会决定因素、公平约束和参与式治理，以建立更具道德和代表性的系统。我们的分析敦促人工智能和医疗保健社区超越传统上对统计便利性的强调，转向基于社会和跨学科的策略来处理非上下文数据。

{"title":"Moving beyond the empty cell: The threat of decontextualized healthcare data.","authors":"Aya El Mir, Eric Bezerra de Sousa, Ignacio Mesina-Estarrón, Leo Anthony Celi, Moad Hani, Mohammed Benjelloun, Neha Nageswaran, Saïd Mahmoudi, Shaheen Siddiqui, Sreeram Sadasivam, William Greig Mitchell","doi":"10.1371/journal.pdig.0001194","DOIUrl":"10.1371/journal.pdig.0001194","url":null,"abstract":"Missing, inaccurate, or poorly documented data in healthcare is often treated as a technical problem to be statistically resolved via imputation, deletion, or modeling assumptions about randomness. However, such inaccuracies relate to far more complex socioeconomic and geopolitical issues, rather than \"errors of data entry\" to be ameliorated with statistical modeling techniques. We outline that what is really missing or inaccurate is the context in which the data is collected-and that only by understanding this context can we begin to prevent artificial intelligence's (AIs) amplification of misleading, decontextualized data. We critically examine how traditional modeling methods fail to account for the factors that influence what data gets recorded, and for whom. We show how AI systems trained on decontextualized data reinforce health inequities at scale. And, we review recent literature on context-aware approaches to understanding data, that incorporate metadata, social determinants of health, fairness constraints, and participatory governance to build more ethical and representative systems. Our analysis urges the AI and healthcare communities to move beyond the traditional emphasis on statistical convenience, toward socially grounded and interdisciplinary strategies for handling decontextualized data.","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"5 1","pages":"e0001194"},"PeriodicalIF":7.7,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12798993/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145968075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Remote home cardiotocography: A systematic review and meta-analysis. 远程家庭心脏造影：系统回顾和荟萃分析。

IF 7.7

PLOS digital health

Pub Date : 2026-01-12 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pdig.0001184

Jack Le Vance, Adekunle Adeoye, Rebecca Man, Nashwa Eltaweel, Leo Gurney, R Katie Morris, Victoria Hodgetts Morton

Cardiotocography (CTG) is a common investigative modality in obstetrics to evaluate the fetal condition. Advancements in digital technology has enabled the innovation of CTG monitoring for usage in the home setting. This review aims to comprehensively examine the current evidence on the effectiveness and applicability of home antenatal CTG monitoring. MEDLINE, EMBASE, Cochrane, Web of Science, and PubMed databases were searched from inception to June 2025. Primary studies examining home antenatal CTG were included. For randomised controlled trials (RCTs), the joint primary outcomes were perinatal mortality and emergency caesarean section. For observational studies, the feasibility, diagnostic accuracy, qualitative and economic burden of home CTG were evaluated. RCTs were eligible for meta-analysis using risk ratio or mean difference, with 95% confidence intervals. Included observational studies were narratively described due to significant methodological heterogeneity. 39 studies (28 observational, seven RCTs and four qualitative studies), comprising of 7240 participants were included. Home antenatal CTG monitoring was non-inferior to conventional care across all meta-analysed maternal, perinatal and healthcare usage outcomes. GRADE assessments were low/very low quality of evidence. Home CTG monitoring was feasible in several settings and remote interpretation was graded as moderate to excellent. Transmission failures were frequently low but commonly occurred due to infrastructure and/or equipment errors. Remote CTG monitoring demonstrated comparative capabilities to conventional CTG with respect to coincidence and beat-to-beat variability. Overall acceptability ratings were high for patient and providers. Often implementation costs were high but accrued back by non-fixed savings when compared against routine care. High-quality studies were underrepresented, particularly when assessing service-led and safety outcomes. Home antenatal CTG monitoring demonstrates noninferiority to conventional care across several outcomes, representing a promising avenue for antenatal management However, current evidence is of low quality and additional high-quality evidence with sufficient methodological detail and standardised outcome assessment is required prior to making definitive recommendations.

心脏摄影（CTG）是一种常见的调查方式，在产科评估胎儿状况。数字技术的进步使CTG监测的创新能够在家庭环境中使用。本综述旨在全面审查目前的证据对家庭产前CTG监测的有效性和适用性。检索了MEDLINE、EMBASE、Cochrane、Web of Science和PubMed数据库，检索时间从创立到2025年6月。包括对家庭产前CTG的初步研究。在随机对照试验（RCTs）中，联合主要结局是围产期死亡率和紧急剖腹产。在观察性研究中，评估了家庭CTG的可行性、诊断准确性、定性和经济负担。随机对照试验采用风险比或平均差进行meta分析，置信区间为95%。由于方法学的异质性，纳入的观察性研究采用叙述性描述。纳入39项研究（28项观察性研究、7项随机对照试验和4项定性研究），包括7240名受试者。在所有荟萃分析的孕产妇、围产期和医疗保健使用结果中，家庭产前CTG监测并不逊于传统护理。GRADE评价证据质量低/非常低。家庭CTG监测在一些情况下是可行的，远程口译被评为中等到优秀。传输故障通常较低，但通常是由于基础设施和/或设备错误造成的。远程CTG监测显示了与常规CTG相比，在一致性和拍间变异性方面的能力。患者和提供者的总体接受度评分都很高。通常，实施成本很高，但与常规护理相比，非固定节余可累计回来。高质量的研究代表性不足，特别是在评估服务导向和安全结果时。家庭产前CTG监测显示，在几个结果上优于传统护理，代表了产前管理的一个有希望的途径。然而，目前的证据质量较低，在提出明确建议之前，需要额外的高质量证据，包括足够的方法细节和标准化的结果评估。

{"title":"Remote home cardiotocography: A systematic review and meta-analysis.","authors":"Jack Le Vance, Adekunle Adeoye, Rebecca Man, Nashwa Eltaweel, Leo Gurney, R Katie Morris, Victoria Hodgetts Morton","doi":"10.1371/journal.pdig.0001184","DOIUrl":"10.1371/journal.pdig.0001184","url":null,"abstract":"Cardiotocography (CTG) is a common investigative modality in obstetrics to evaluate the fetal condition. Advancements in digital technology has enabled the innovation of CTG monitoring for usage in the home setting. This review aims to comprehensively examine the current evidence on the effectiveness and applicability of home antenatal CTG monitoring. MEDLINE, EMBASE, Cochrane, Web of Science, and PubMed databases were searched from inception to June 2025. Primary studies examining home antenatal CTG were included. For randomised controlled trials (RCTs), the joint primary outcomes were perinatal mortality and emergency caesarean section. For observational studies, the feasibility, diagnostic accuracy, qualitative and economic burden of home CTG were evaluated. RCTs were eligible for meta-analysis using risk ratio or mean difference, with 95% confidence intervals. Included observational studies were narratively described due to significant methodological heterogeneity. 39 studies (28 observational, seven RCTs and four qualitative studies), comprising of 7240 participants were included. Home antenatal CTG monitoring was non-inferior to conventional care across all meta-analysed maternal, perinatal and healthcare usage outcomes. GRADE assessments were low/very low quality of evidence. Home CTG monitoring was feasible in several settings and remote interpretation was graded as moderate to excellent. Transmission failures were frequently low but commonly occurred due to infrastructure and/or equipment errors. Remote CTG monitoring demonstrated comparative capabilities to conventional CTG with respect to coincidence and beat-to-beat variability. Overall acceptability ratings were high for patient and providers. Often implementation costs were high but accrued back by non-fixed savings when compared against routine care. High-quality studies were underrepresented, particularly when assessing service-led and safety outcomes. Home antenatal CTG monitoring demonstrates noninferiority to conventional care across several outcomes, representing a promising avenue for antenatal management However, current evidence is of low quality and additional high-quality evidence with sufficient methodological detail and standardised outcome assessment is required prior to making definitive recommendations.","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"5 1","pages":"e0001184"},"PeriodicalIF":7.7,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12795381/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145960893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Economic cost of strategic implementation approaches to increase uptake of digital therapeutics for substance use disorders in a large integrated health system. 在大型综合卫生系统中增加数字治疗药物使用障碍的战略实施方法的经济成本。

IF 7.7

PLOS digital health

Pub Date : 2026-01-08 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pdig.0001145

Edwin S Wong, Caitlin N Dorsey, Tara C Beatty, Jennifer F Bobb, Kelsey Stefanik-Guizlo, Dustin L Key, Arvind Ramaprasan, Abisola E Idu, John C Fortney, Jessica Mogk, Lorella Palazzo, Ryan M Caldeiro, Deborah King, Angela Garza McWethy, Joseph E Glass

Evidence-based digital therapeutics are a promising approach for the scale-up of substance use disorder (SUD) treatments. Despite demonstrated efficacy, utilization of digital therapeutics is low. Strategic implementation approaches have potential for increasing digital therapeutic use. Applicability to health systems depends, in part, on the economic costs. The objective of this study was to describe implementation and intervention costs of implementation strategies to increase uptake of an evidence-based digital treatment for SUD. We conducted an economic evaluation alongside a hybrid type III cluster-randomized trial within a large integrated health system. All clinics implemented a standard implementation (SI) strategy, and clinics were assigned using 2x2 factorial randomization to additionally receive practice facilitation (PF) and/or health coaching (HC). Implementation costs included the cost of time devoted to implementation activities and direct operating costs. Time devoted to implementation activities was ascertained through structured meeting logs and time use surveys. Operating costs were captured using project budget reports. Intervention costs included expenses for prescriptions and healthcare encounters related to the digital therapeutic, measured using electronic health record data. Univariate statistics were calculated for cost estimates with comparisons presented by trial arm, implementation activity, staff role and study month. Analyses were conducted from a health system perspective. Twenty-one primary care sites participated in the trial. Over the 50-month study period, the total cost of all implementation activities was $748,088. Implementation costs per clinic were highest in the SI + PF + HC arm ($48,029), followed by SI + HC ($36,544), SI + PF ($30,665) and SI alone ($24,774). Intervention costs were highest in the SI + PF + HC arm ($18,051), followed by SI + PF ($11,492), SI + HC ($967) and SI alone ($1,879). Findings from this study can guide health systems by informing the economic investment required to employ implementation strategies demonstrated to increase uptake of evidence-based practices for behavioral health conditions. Trial Registration: NCT05160233.

基于证据的数字疗法是扩大物质使用障碍（SUD）治疗的一种有前途的方法。尽管显示出疗效，但数字疗法的使用率很低。战略实施方法具有增加数字治疗使用的潜力。对卫生系统的适用性部分取决于经济成本。本研究的目的是描述实施策略的实施和干预成本，以增加对SUD的循证数字治疗的吸收。我们在一个大型综合卫生系统中进行了一项混合III型集群随机试验的经济评估。所有诊所都实施了标准实施（SI）策略，并采用2x2因子随机化分配诊所，以额外接受实践指导（PF）和/或健康指导（HC）。执行费用包括用于执行活动的时间费用和直接业务费用。通过结构化的会议记录和时间使用调查确定了用于执行活动的时间。使用项目预算报告获取运营成本。干预成本包括处方费用和与数字治疗相关的医疗保健就诊费用，使用电子健康记录数据进行测量。用试验组、执行活动、工作人员作用和研究月份进行比较，计算费用概算的单变量统计。从卫生系统的角度进行了分析。21个初级保健站点参与了试验。在50个月的研究期间，所有执行活动的费用总额为748 088美元。每家诊所的实施成本在SI + PF + HC组最高（48,029美元），其次是SI + HC（36,544美元），SI + PF（30,665美元）和单独SI（24,774美元）。SI + PF + HC组的干预成本最高（18,051美元），其次是SI + PF（11,492美元），SI + HC（967美元）和单独SI（1,879美元）。本研究的结果可以指导卫生系统，为采用已证明的实施战略所需的经济投资提供信息，以增加对行为健康状况的循证实践的吸收。试验注册：NCT05160233。

{"title":"Economic cost of strategic implementation approaches to increase uptake of digital therapeutics for substance use disorders in a large integrated health system.","authors":"Edwin S Wong, Caitlin N Dorsey, Tara C Beatty, Jennifer F Bobb, Kelsey Stefanik-Guizlo, Dustin L Key, Arvind Ramaprasan, Abisola E Idu, John C Fortney, Jessica Mogk, Lorella Palazzo, Ryan M Caldeiro, Deborah King, Angela Garza McWethy, Joseph E Glass","doi":"10.1371/journal.pdig.0001145","DOIUrl":"10.1371/journal.pdig.0001145","url":null,"abstract":"Evidence-based digital therapeutics are a promising approach for the scale-up of substance use disorder (SUD) treatments. Despite demonstrated efficacy, utilization of digital therapeutics is low. Strategic implementation approaches have potential for increasing digital therapeutic use. Applicability to health systems depends, in part, on the economic costs. The objective of this study was to describe implementation and intervention costs of implementation strategies to increase uptake of an evidence-based digital treatment for SUD. We conducted an economic evaluation alongside a hybrid type III cluster-randomized trial within a large integrated health system. All clinics implemented a standard implementation (SI) strategy, and clinics were assigned using 2x2 factorial randomization to additionally receive practice facilitation (PF) and/or health coaching (HC). Implementation costs included the cost of time devoted to implementation activities and direct operating costs. Time devoted to implementation activities was ascertained through structured meeting logs and time use surveys. Operating costs were captured using project budget reports. Intervention costs included expenses for prescriptions and healthcare encounters related to the digital therapeutic, measured using electronic health record data. Univariate statistics were calculated for cost estimates with comparisons presented by trial arm, implementation activity, staff role and study month. Analyses were conducted from a health system perspective. Twenty-one primary care sites participated in the trial. Over the 50-month study period, the total cost of all implementation activities was $748,088. Implementation costs per clinic were highest in the SI + PF + HC arm ($48,029), followed by SI + HC ($36,544), SI + PF ($30,665) and SI alone ($24,774). Intervention costs were highest in the SI + PF + HC arm ($18,051), followed by SI + PF ($11,492), SI + HC ($967) and SI alone ($1,879). Findings from this study can guide health systems by informing the economic investment required to employ implementation strategies demonstrated to increase uptake of evidence-based practices for behavioral health conditions. Trial Registration: NCT05160233.","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"5 1","pages":"e0001145"},"PeriodicalIF":7.7,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12782369/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145936640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Evaluating few-shot prompting for spectrogram-based lung sound classification using a multimodal language model. 使用多模态语言模型评估基于声谱图的肺音分类的少针提示。

IF 7.7

PLOS digital health

Pub Date : 2026-01-07 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pdig.0001179

Nicholas Dietrich, David McShannon, Mark F Rzepka

Traditional deep learning models for lung sound analysis require large, labeled datasets, whereas multimodal large language models (LLMs) may offer a flexible, prompt-based alternative. This study aimed to evaluate the utility of a general-purpose multimodal LLM, GPT-4o, for lung sound classification from mel-spectrograms and assess whether a few-shot prompt approach improves performance over zero-shot prompting. Using the ICBHI 2017 Respiratory Sound Database, 6898 annotated respiratory cycles were converted into mel-spectrograms. GPT-4o was prompted to classify each spectrogram using both zero-shot and few-shot strategies. Model outputs were evaluated against ground truth labels using performance metrics including accuracy, precision, recall, and F1-score. Few-shot prompting improved overall accuracy (0.363 vs. 0.320) and yielded modest gains in precision (0.316 vs. 0.283), recall (0.300 vs. 0.287), and F1-score (0.308 vs. 0.285) across labels. McNemar's test indicated a statistically significant difference in performance between prompting strategies (p < 0.001). Model repeatability analysis demonstrated high agreement (κ = 0.76-0.88; agreement: 89-96%), indicating excellent consistency. GPT-4o demonstrated limited but statistically significant performance gains using few-shot prompting for lung sound classification. While current performance remains insufficient for clinical deployment, this prompt-based approach provides a baseline for spectrogram-based multimodal tasks and a foundation for future exploration of prompt-based multimodal inference.

用于肺音分析的传统深度学习模型需要大型标记数据集，而多模态大语言模型（llm）可能提供灵活的、基于提示的替代方案。本研究旨在评估通用多模态LLM gpt - 40从mel谱图中进行肺音分类的效用，并评估少量提示方法是否比零提示方法提高了性能。使用ICBHI 2017呼吸声数据库，将6898个注释呼吸周期转换为mel谱图。提示gpt - 40使用零射击和少射击策略对每个频谱图进行分类。模型输出通过使用包括准确性、精密度、召回率和f1分数在内的性能指标来评估真实值标签。几次提示提高了总体准确率（0.363 vs. 0.320），并在各标签上获得了适度的精度（0.316 vs. 0.283）、召回率（0.300 vs. 0.287）和f1分数（0.308 vs. 0.285）。McNemar的测验显示，不同的提示策略在表现上有显著的统计学差异(p

{"title":"Evaluating few-shot prompting for spectrogram-based lung sound classification using a multimodal language model.","authors":"Nicholas Dietrich, David McShannon, Mark F Rzepka","doi":"10.1371/journal.pdig.0001179","DOIUrl":"10.1371/journal.pdig.0001179","url":null,"abstract":"Traditional deep learning models for lung sound analysis require large, labeled datasets, whereas multimodal large language models (LLMs) may offer a flexible, prompt-based alternative. This study aimed to evaluate the utility of a general-purpose multimodal LLM, GPT-4o, for lung sound classification from mel-spectrograms and assess whether a few-shot prompt approach improves performance over zero-shot prompting. Using the ICBHI 2017 Respiratory Sound Database, 6898 annotated respiratory cycles were converted into mel-spectrograms. GPT-4o was prompted to classify each spectrogram using both zero-shot and few-shot strategies. Model outputs were evaluated against ground truth labels using performance metrics including accuracy, precision, recall, and F1-score. Few-shot prompting improved overall accuracy (0.363 vs. 0.320) and yielded modest gains in precision (0.316 vs. 0.283), recall (0.300 vs. 0.287), and F1-score (0.308 vs. 0.285) across labels. McNemar's test indicated a statistically significant difference in performance between prompting strategies (p < 0.001). Model repeatability analysis demonstrated high agreement (κ = 0.76-0.88; agreement: 89-96%), indicating excellent consistency. GPT-4o demonstrated limited but statistically significant performance gains using few-shot prompting for lung sound classification. While current performance remains insufficient for clinical deployment, this prompt-based approach provides a baseline for spectrogram-based multimodal tasks and a foundation for future exploration of prompt-based multimodal inference.","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"5 1","pages":"e0001179"},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12779135/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145918953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Development and validation of an artificial intelligence model based on liver CSE-MRI fat maps for predicting dyslipidemia. 基于肝脏CSE-MRI脂肪图预测血脂异常的人工智能模型的开发和验证。

IF 7.7

PLOS digital health

Pub Date : 2026-01-07 eCollection Date: 2026-01-01 DOI: 10.1371/journal.pdig.0001119

Bo Jiang, Weijun Situ, Zhichao Feng, Jianmin Yuan, Yina Wang, Xiaofan Chen, Xiong Wu, Kai Deng, Haitao Yang, Xiao Xiao, Xi Guo, Junjiao Hu

This study aimed to develop and validate an artificial intelligence (AI) model for the non-invasive early detection of dyslipidemia using liver chemical shift-encoded MRI (CSE-MRI) fat maps. An automated AI pipeline was developed to predict abnormalities in four lipid indicators: triglyceride, total cholesterol, low-density lipoprotein cholesterol, and high-density lipoprotein cholesterol. The study utilized 1,757 liver CSE-MRI fat images from 89 patients who underwent MRI scans and contemporaneous blood lipid testing. Transfer learning was applied using several pre-trained networks, including ResNet18, MobileNet, DenseNet, AlexNet, and SqueezeNet. Model performance was evaluated via 8-fold cross-validation, with the optimal model further assessed on a held-out test set using confusion matrices and derived metrics. Significant performance differences were observed among models. The optimal model, based on ResNet18, demonstrated high accuracy in the internal validation set. On the independent test set, this model achieved accuracies of 0.853 for triglyceride, 0.833 for total cholesterol, 0.937 for low-density lipoprotein cholesterol, and 0.936 for high-density lipoprotein cholesterol, with corresponding F1-Scores of 0.885, 0.571, 0.886, and 0.897. The AI model based on liver CSE-MRI fat maps shows high accuracy and generalization in predicting abnormalities for three key lipid indices, validating its potential as an early warning tool for dyslipidemia. Expanding the training dataset could further enhance performance for all lipid indices.

本研究旨在开发和验证一种人工智能（AI）模型，用于使用肝脏化学移位编码MRI （CSE-MRI）脂肪图无创早期检测血脂异常。开发了一个自动化的AI管道来预测四种脂质指标的异常：甘油三酯、总胆固醇、低密度脂蛋白胆固醇和高密度脂蛋白胆固醇。该研究利用了89名患者的1757张肝脏CSE-MRI脂肪图像，这些患者接受了MRI扫描和同期血脂检测。迁移学习应用于几个预训练的网络，包括ResNet18、MobileNet、DenseNet、AlexNet和SqueezeNet。通过8倍交叉验证来评估模型的性能，并使用混淆矩阵和衍生指标在持续测试集上进一步评估最佳模型。模型之间的性能差异显著。基于ResNet18的最优模型在内部验证集中显示出较高的准确率。在独立测试集上，该模型对甘油三酯、总胆固醇、低密度脂蛋白胆固醇和高密度脂蛋白胆固醇的准确率分别为0.853、0.833、0.937和0.936，对应的f1 - score分别为0.885、0.571、0.886和0.897。基于肝脏CSE-MRI脂肪图的AI模型在预测三个关键脂质指标异常方面显示出较高的准确性和通用性，验证了其作为血脂异常早期预警工具的潜力。扩展训练数据集可以进一步提高所有脂质指标的性能。

{"title":"Development and validation of an artificial intelligence model based on liver CSE-MRI fat maps for predicting dyslipidemia.","authors":"Bo Jiang, Weijun Situ, Zhichao Feng, Jianmin Yuan, Yina Wang, Xiaofan Chen, Xiong Wu, Kai Deng, Haitao Yang, Xiao Xiao, Xi Guo, Junjiao Hu","doi":"10.1371/journal.pdig.0001119","DOIUrl":"10.1371/journal.pdig.0001119","url":null,"abstract":"This study aimed to develop and validate an artificial intelligence (AI) model for the non-invasive early detection of dyslipidemia using liver chemical shift-encoded MRI (CSE-MRI) fat maps. An automated AI pipeline was developed to predict abnormalities in four lipid indicators: triglyceride, total cholesterol, low-density lipoprotein cholesterol, and high-density lipoprotein cholesterol. The study utilized 1,757 liver CSE-MRI fat images from 89 patients who underwent MRI scans and contemporaneous blood lipid testing. Transfer learning was applied using several pre-trained networks, including ResNet18, MobileNet, DenseNet, AlexNet, and SqueezeNet. Model performance was evaluated via 8-fold cross-validation, with the optimal model further assessed on a held-out test set using confusion matrices and derived metrics. Significant performance differences were observed among models. The optimal model, based on ResNet18, demonstrated high accuracy in the internal validation set. On the independent test set, this model achieved accuracies of 0.853 for triglyceride, 0.833 for total cholesterol, 0.937 for low-density lipoprotein cholesterol, and 0.936 for high-density lipoprotein cholesterol, with corresponding F1-Scores of 0.885, 0.571, 0.886, and 0.897. The AI model based on liver CSE-MRI fat maps shows high accuracy and generalization in predicting abnormalities for three key lipid indices, validating its potential as an early warning tool for dyslipidemia. Expanding the training dataset could further enhance performance for all lipid indices.","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"5 1","pages":"e0001119"},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12779152/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145919029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0