NPJ Digital Medicine最新文献

英文中文

Detecting early cognitive deficits in preclinical Alzheimer’s disease using a remote digital multi-day learning paradigm 使用远程数字多日学习模式检测临床前阿尔茨海默病的早期认知缺陷

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2025-01-13 DOI: 10.1038/s41746-024-01347-7

Roos J. Jutten, Daniel Soberanes, Cassidy P. Molinare, Stephanie Hsieh, Michelle E. Farrell, Aaron S. Schultz, Dorene M. Rentz, Gad A. Marshall, Keith A. Johnson, Reisa A. Sperling, Rebecca E. Amariglio, Kathryn V. Papp

Remote, digital cognitive testing on an individual’s own device provides the opportunity to deploy previously understudied but promising cognitive paradigms in preclinical Alzheimer’s disease (AD). The Boston Remote Assessment for NeuroCognitive Health (BRANCH) captures a personalized learning curve for the same information presented over seven consecutive days. Here, we examined BRANCH multi-day learning curves (MDLCs) in 167 cognitively unimpaired older adults (age = 74.3 ± 7.5, 63% female) with different amyloid-β (A) and tau (T) biomarker profiles on positron emission tomography. MDLC scores decreased across ascending biomarker groups, with the A + T- group performing numerically worse (β = –0.24, 95%CI[–0.55,0.07], p = 0.128) and the A + T+ group performing significantly worse (β = –0.58, 95%CI[–1.06,–0.10], p = 0.018) than the A-T- group. Further, lower MDLC scores were associated with greater cortical thinning (β = 0.18, 95%CI[0.04,0.34], p = 0.013). Our results suggest that diminished MDLCs track with advanced AD pathophysiology, and demonstrate how a digital multi-day learning paradigm can provide novel insights about cognitive decline during preclinical AD.

在个人自己的设备上进行远程数字认知测试，为临床前阿尔茨海默病（AD）中部署以前未被充分研究但有前途的认知范式提供了机会。波士顿神经认知健康远程评估（BRANCH）捕获了连续7天提供的相同信息的个性化学习曲线。在这里，我们检测了167名认知功能正常的老年人（年龄= 74.3±7.5,63%为女性）的BRANCH多日学习曲线（mdlc），这些老年人在正电子发射断层扫描上具有不同的淀粉样蛋白-β (A)和tau (T)生物标志物。MDLC评分在升高的生物标志物组中下降，A + T-组的数值表现较差（β = - 0.24, 95%CI[- 0.55,0.07], p = 0.128）， A + T+组的表现明显较A-T-组差（β = - 0.58, 95%CI[- 1.06, - 0.10], p = 0.018）。此外，较低的MDLC评分与较大的皮质变薄相关（β = 0.18, 95%CI[0.04,0.34], p = 0.013）。我们的研究结果表明，mdlc的减少与阿尔茨海默病的晚期病理生理有关，并证明了数字多日学习范式如何为临床前阿尔茨海默病的认知衰退提供新的见解。

{"title":"Detecting early cognitive deficits in preclinical Alzheimer’s disease using a remote digital multi-day learning paradigm","authors":"Roos J. Jutten, Daniel Soberanes, Cassidy P. Molinare, Stephanie Hsieh, Michelle E. Farrell, Aaron S. Schultz, Dorene M. Rentz, Gad A. Marshall, Keith A. Johnson, Reisa A. Sperling, Rebecca E. Amariglio, Kathryn V. Papp","doi":"10.1038/s41746-024-01347-7","DOIUrl":"https://doi.org/10.1038/s41746-024-01347-7","url":null,"abstract":"Remote, digital cognitive testing on an individual’s own device provides the opportunity to deploy previously understudied but promising cognitive paradigms in preclinical Alzheimer’s disease (AD). The Boston Remote Assessment for NeuroCognitive Health (BRANCH) captures a personalized learning curve for the same information presented over seven consecutive days. Here, we examined BRANCH multi-day learning curves (MDLCs) in 167 cognitively unimpaired older adults (age = 74.3 ± 7.5, 63% female) with different amyloid-β (A) and tau (T) biomarker profiles on positron emission tomography. MDLC scores decreased across ascending biomarker groups, with the A + T- group performing numerically worse (β = –0.24, 95%CI[–0.55,0.07], p = 0.128) and the A + T+ group performing significantly worse (β = –0.58, 95%CI[–1.06,–0.10], p = 0.018) than the A-T- group. Further, lower MDLC scores were associated with greater cortical thinning (β = 0.18, 95%CI[0.04,0.34], p = 0.013). Our results suggest that diminished MDLCs track with advanced AD pathophysiology, and demonstrate how a digital multi-day learning paradigm can provide novel insights about cognitive decline during preclinical AD.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"11 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142968195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Unsupervised deep learning of electrocardiograms enables scalable human disease profiling 无监督的心电图深度学习使可扩展的人类疾病分析成为可能

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2025-01-12 DOI: 10.1038/s41746-024-01418-9

Sam F. Friedman, Shaan Khurshid, Rachael A. Venn, Xin Wang, Nate Diamant, Paolo Di Achille, Lu-Chen Weng, Seung Hoan Choi, Christopher Reeder, James P. Pirruccello, Pulkit Singh, Emily S. Lau, Anthony Philippakis, Christopher D. Anderson, Mahnaz Maddah, Puneet Batra, Patrick T. Ellinor, Jennifer E. Ho, Steven A. Lubitz

The 12-lead electrocardiogram (ECG) is inexpensive and widely available. Whether conditions across the human disease landscape can be detected using the ECG is unclear. We developed a deep learning denoising autoencoder and systematically evaluated associations between ECG encodings and ~1,600 Phecode-based diseases in three datasets separate from model development, and meta-analyzed the results. The latent space ECG model identified associations with 645 prevalent and 606 incident Phecodes. Associations were most enriched in the circulatory (n = 140, 82% of category-specific Phecodes), respiratory (n = 53, 62%) and endocrine/metabolic (n = 73, 45%) categories, with additional associations across the phenome. The strongest ECG association was with hypertension (p < 2.2×10^-308). The ECG latent space model demonstrated more associations than models using standard ECG intervals, and offered favorable discrimination of prevalent disease compared to models comprising age, sex, and race. We further demonstrate how latent space models can be used to generate disease-specific ECG waveforms and facilitate individual disease profiling.

12导联心电图（ECG）价格低廉，可广泛使用。目前尚不清楚是否可以使用ECG检测人类疾病状况。我们开发了一种深度学习去噪自动编码器，并系统地评估了ECG编码与约1,600种基于phecode的疾病之间的关联，这些数据集与模型开发分开，并对结果进行了meta分析。潜在空间ECG模型确定了与645个流行和606个事件的关联。关联在循环（n = 140, 82%的类别特异性Phecodes），呼吸（n = 53, 62%）和内分泌/代谢（n = 73, 45%）类别中最为丰富，其他关联在整个表型中。与高血压的ECG相关性最强（p < 2.2×10-308）。与使用标准心电图间隔的模型相比，ECG潜伏空间模型显示出更多的关联，与包含年龄、性别和种族的模型相比，该模型对流行疾病有更好的区分。我们进一步展示了潜伏空间模型如何用于生成疾病特异性ECG波形并促进个体疾病分析。

{"title":"Unsupervised deep learning of electrocardiograms enables scalable human disease profiling","authors":"Sam F. Friedman, Shaan Khurshid, Rachael A. Venn, Xin Wang, Nate Diamant, Paolo Di Achille, Lu-Chen Weng, Seung Hoan Choi, Christopher Reeder, James P. Pirruccello, Pulkit Singh, Emily S. Lau, Anthony Philippakis, Christopher D. Anderson, Mahnaz Maddah, Puneet Batra, Patrick T. Ellinor, Jennifer E. Ho, Steven A. Lubitz","doi":"10.1038/s41746-024-01418-9","DOIUrl":"https://doi.org/10.1038/s41746-024-01418-9","url":null,"abstract":"The 12-lead electrocardiogram (ECG) is inexpensive and widely available. Whether conditions across the human disease landscape can be detected using the ECG is unclear. We developed a deep learning denoising autoencoder and systematically evaluated associations between ECG encodings and ~1,600 Phecode-based diseases in three datasets separate from model development, and meta-analyzed the results. The latent space ECG model identified associations with 645 prevalent and 606 incident Phecodes. Associations were most enriched in the circulatory (n = 140, 82% of category-specific Phecodes), respiratory (n = 53, 62%) and endocrine/metabolic (n = 73, 45%) categories, with additional associations across the phenome. The strongest ECG association was with hypertension (p < 2.2×10-308). The ECG latent space model demonstrated more associations than models using standard ECG intervals, and offered favorable discrimination of prevalent disease compared to models comprising age, sex, and race. We further demonstrate how latent space models can be used to generate disease-specific ECG waveforms and facilitate individual disease profiling.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"10 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142967754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Identification of cardiac wall motion abnormalities in diverse populations by deep learning of the electrocardiogram 通过心电图的深度学习识别不同人群的心壁运动异常

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2025-01-11 DOI: 10.1038/s41746-024-01407-y

Albert J. Rogers, Neal K. Bhatia, Sabyasachi Bandyopadhyay, James Tooley, Rayan Ansari, Vyom Thakkar, Justin Xu, Jessica Torres Soto, Jagteshwar S. Tung, Mahmood I. Alhusseini, Paul Clopton, Reza Sameni, Gari D. Clifford, J. Weston Hughes, Euan A. Ashley, Marco V. Perez, Matei Zaharia, Sanjiv M. Narayan

Cardiac wall motion abnormalities (WMA) are strong predictors of mortality, but current screening methods using Q waves from electrocardiograms (ECGs) have limited accuracy and vary across racial and ethnic groups. This study aimed to identify novel ECG features using deep learning to enhance WMA detection, referencing echocardiography as the gold standard. We collected ECG and echocardiogram data from 35,210 patients in California and labeled WMA using unstructured language parsing of echocardiographic reports. A deep neural network (ECG-WMA-Net) was trained and outperformed both expert ECG interpretation and Q-wave indices, achieving an AUROC of 0.781 (CI: 0.762–0.799). The model was externally validated in a diverse cohort from Georgia (n = 2338), with an AUC of 0.723 (CI: 0.685–0.757). Explainability analysis revealed significant contributions from QRS and T-wave regions. This deep learning approach improves WMA screening accuracy, potentially addressing physiological differences not captured by standard ECG-based methods.

心壁运动异常（WMA）是死亡率的有力预测指标，但目前使用心电图（ECGs） Q波的筛查方法准确性有限，并且在种族和民族群体中存在差异。本研究旨在利用深度学习来识别新的心电图特征，以增强WMA检测，参考超声心动图作为金标准。我们收集了加利福尼亚35210名患者的心电图和超声心动图数据，并使用超声心动图报告的非结构化语言解析标记了WMA。经过训练的深度神经网络（ECG- wma - net）优于专家心电解释和q波指数，AUROC为0.781 （CI: 0.762-0.799）。该模型在乔治亚州的不同队列中进行了外部验证（n = 2338）， AUC为0.723 （CI: 0.685-0.757）。可解释性分析显示，QRS和t波区域的贡献显著。这种深度学习方法提高了WMA筛查的准确性，潜在地解决了标准的基于ecg的方法无法捕捉到的生理差异。

{"title":"Identification of cardiac wall motion abnormalities in diverse populations by deep learning of the electrocardiogram","authors":"Albert J. Rogers, Neal K. Bhatia, Sabyasachi Bandyopadhyay, James Tooley, Rayan Ansari, Vyom Thakkar, Justin Xu, Jessica Torres Soto, Jagteshwar S. Tung, Mahmood I. Alhusseini, Paul Clopton, Reza Sameni, Gari D. Clifford, J. Weston Hughes, Euan A. Ashley, Marco V. Perez, Matei Zaharia, Sanjiv M. Narayan","doi":"10.1038/s41746-024-01407-y","DOIUrl":"https://doi.org/10.1038/s41746-024-01407-y","url":null,"abstract":"Cardiac wall motion abnormalities (WMA) are strong predictors of mortality, but current screening methods using Q waves from electrocardiograms (ECGs) have limited accuracy and vary across racial and ethnic groups. This study aimed to identify novel ECG features using deep learning to enhance WMA detection, referencing echocardiography as the gold standard. We collected ECG and echocardiogram data from 35,210 patients in California and labeled WMA using unstructured language parsing of echocardiographic reports. A deep neural network (ECG-WMA-Net) was trained and outperformed both expert ECG interpretation and Q-wave indices, achieving an AUROC of 0.781 (CI: 0.762–0.799). The model was externally validated in a diverse cohort from Georgia (n = 2338), with an AUC of 0.723 (CI: 0.685–0.757). Explainability analysis revealed significant contributions from QRS and T-wave regions. This deep learning approach improves WMA screening accuracy, potentially addressing physiological differences not captured by standard ECG-based methods.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"49 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142961435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Whole examination AI estimation of fetal biometrics from 20-week ultrasound scans 20周超声扫描胎儿生物特征的全检AI估计

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2025-01-11 DOI: 10.1038/s41746-024-01406-z

Lorenzo Venturini, Samuel Budd, Alfonso Farruggia, Robert Wright, Jacqueline Matthew, Thomas G. Day, Bernhard Kainz, Reza Razavi, Jo V. Hajnal

The current approach to fetal anomaly screening is based on biometric measurements derived from individually selected ultrasound images. In this paper, we introduce a paradigm shift that attains human-level performance in biometric measurement by aggregating automatically extracted biometrics from every frame across an entire scan, with no need for operator intervention. We use a neural network to classify each frame of an ultrasound video recording. We then measure fetal biometrics in every frame where appropriate anatomy is visible. We use a Bayesian method to estimate the true value of each biometric from a large number of measurements and probabilistically reject outliers. We performed a retrospective experiment on 1457 recordings (comprising 48 million frames) of 20-week ultrasound scans, estimated fetal biometrics in those scans and compared our estimates to real-time manual measurements. Our method achieves human-level performance in estimating fetal biometrics and estimates well-calibrated credible intervals for the true biometric value.

目前胎儿异常筛查的方法是基于个体选择的超声图像衍生的生物测量。在本文中，我们介绍了一种范式转换，通过在整个扫描过程中从每帧自动提取生物特征，而无需操作员干预，从而实现生物特征测量中的人类水平性能。我们使用神经网络对超声录像的每一帧进行分类。然后，我们在每一帧中测量胎儿的生物特征，在适当的解剖可见。我们使用贝叶斯方法从大量测量中估计每个生物特征的真实值，并概率地拒绝异常值。我们对1457份20周超声扫描记录（包括4800万帧）进行了回顾性实验，估计了这些扫描中的胎儿生物特征，并将我们的估计与实时人工测量进行了比较。我们的方法在估计胎儿生物特征方面达到了人类水平的性能，并为真实的生物特征值估计了良好校准的可信区间。

{"title":"Whole examination AI estimation of fetal biometrics from 20-week ultrasound scans","authors":"Lorenzo Venturini, Samuel Budd, Alfonso Farruggia, Robert Wright, Jacqueline Matthew, Thomas G. Day, Bernhard Kainz, Reza Razavi, Jo V. Hajnal","doi":"10.1038/s41746-024-01406-z","DOIUrl":"https://doi.org/10.1038/s41746-024-01406-z","url":null,"abstract":"The current approach to fetal anomaly screening is based on biometric measurements derived from individually selected ultrasound images. In this paper, we introduce a paradigm shift that attains human-level performance in biometric measurement by aggregating automatically extracted biometrics from every frame across an entire scan, with no need for operator intervention. We use a neural network to classify each frame of an ultrasound video recording. We then measure fetal biometrics in every frame where appropriate anatomy is visible. We use a Bayesian method to estimate the true value of each biometric from a large number of measurements and probabilistically reject outliers. We performed a retrospective experiment on 1457 recordings (comprising 48 million frames) of 20-week ultrasound scans, estimated fetal biometrics in those scans and compared our estimates to real-time manual measurements. Our method achieves human-level performance in estimating fetal biometrics and estimates well-calibrated credible intervals for the true biometric value.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"40 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142967755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Impact of digital health interventions on glycemic control and weight management 数字健康干预对血糖控制和体重管理的影响

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2025-01-09 DOI: 10.1038/s41746-025-01430-7

Arvind Veluvali, Ashkan Dehghani Zahedani, Amir Hosseinian, Nima Aghaeepour, Tracey McLaughlin, Mark Woodward, Alex DiTullio, Noosheen Hashemi, Michael P. Snyder

This retrospective cohort study evaluates the impact of an AI-supported continuous glucose monitoring (CGM) mobile app (“January V2”) on glycemic control and weight management in 944 users, including healthy individuals and those with prediabetes or type 2 diabetes (T2D). The app, leveraging AI to personalize feedback, tracked users’ food intake, activity, and glucose responses over 14 days. Significant improvements in time in range (TIR) were observed, particularly in users with lower baseline TIR. Healthy users’ TIR increased from 74.7% to 85.5% (p < 0.0001), while T2D users’ TIR improved from 49.7% to 57.4% (p < 0.0004). Higher app engagement correlated with greater TIR improvements. Users also experienced an average weight reduction of 3.3 lbs over 33 days. These findings suggest that AI-enhanced digital health interventions can improve glycemic control and promote weight loss, particularly when users are actively engaged.

本回顾性队列研究评估了人工智能支持的连续血糖监测（CGM）移动应用程序（“January V2”）对944名用户的血糖控制和体重管理的影响，包括健康个体和糖尿病前期或2型糖尿病（T2D）患者。该应用程序利用人工智能来个性化反馈，在14天内跟踪用户的食物摄入量、活动和血糖反应。观察到范围内时间（TIR）的显著改善，特别是在基线TIR较低的用户中。健康用户的TIR从74.7%提高到85.5% (p < 0.0001)， T2D用户的TIR从49.7%提高到57.4% （p < 0.0004）。更高的应用粘性与更高的TIR改善相关。使用者在33天内平均体重减轻了3.3磅。这些发现表明，人工智能增强的数字健康干预措施可以改善血糖控制并促进减肥，尤其是在用户积极参与的情况下。

引用次数: 0

A novel model for retinal imaging in the diagnosis of Alzheimer’s disease 一种诊断阿尔茨海默病的视网膜成像新模型

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2025-01-09 DOI: 10.1038/s41746-025-01437-0

Kimia Heydari, Elizabeth J. Enichen, Serena Wang, Grace C. Nickel, Joseph C. Kvedar

Alzheimer’s disease is the fifth-leading cause of death for adults over the age of 65. Retinal imaging has emerged to find more accurate diagnostic tool for Alzheimer’s Disease. This paper highlights Hao et al.’s development of a new deep learning tool, EyeAD, which studies Optical Coherence Tomography Angiography (OCT-A) of patients with Alzheimer’s. Integrating this model into clinical workflows may offer novel insights into the progression of this disease.

阿尔茨海默病是65岁以上成年人死亡的第五大原因。视网膜成像的出现为阿尔茨海默病找到了更准确的诊断工具。本文重点介绍了Hao等人开发的一种新的深度学习工具EyeAD，该工具用于研究阿尔茨海默病患者的光学相干断层扫描血管造影（OCT-A）。将该模型整合到临床工作流程中可能会为这种疾病的进展提供新的见解。

引用次数: 0

Standardized patient profile review using large language models for case adjudication in observational research 在观察性研究中使用大型语言模型进行病例裁决的标准化患者档案审查

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2025-01-09 DOI: 10.1038/s41746-025-01433-4

Martijn J. Schuemie, Anna Ostropolets, Aleh Zhuk, Uladzislau Korsik, Seung In Seo, Marc A. Suchard, George Hripcsak, Patrick B. Ryan

Using administrative claims and electronic health records for observational studies is common but challenging due to data limitations. Researchers rely on phenotype algorithms, requiring labor-intensive chart reviews for validation. This study investigates whether case adjudication using the previously introduced Knowledge-Enhanced Electronic Profile Review (KEEPER) system with large language models (LLMs) is feasible and could serve as a viable alternative to manual chart review. The task involves adjudicating cases identified by a phenotype algorithm, with KEEPER extracting predefined findings such as symptoms, comorbidities, and treatments from structured data. LLMs then evaluate KEEPER outputs to determine whether a patient truly qualifies as a case. We tested four LLMs including GPT-4, hosted locally to ensure privacy. Using zero-shot prompting and iterative prompt optimization, we found LLM performance, across ten diseases, varied by prompt and model, with sensitivities from 78 to 98% and specificities from 48 to 98%, indicating promise for automating phenotype evaluation.

在观察性研究中使用行政索赔和电子健康记录是很常见的，但由于数据的限制，这具有挑战性。研究人员依靠表现型算法，需要劳动密集型的图表审查验证。本研究调查了使用先前引入的具有大型语言模型（llm）的知识增强电子档案审查（KEEPER）系统进行案件裁决是否可行，是否可以作为人工图表审查的可行替代方案。这项任务包括对由表型算法确定的病例进行裁决，KEEPER从结构化数据中提取预定义的发现，如症状、合并症和治疗方法。llm然后评估KEEPER输出，以确定患者是否真正符合病例。我们测试了四个llm，包括GPT-4，它们在本地托管以确保隐私。使用零提示和迭代提示优化，我们发现LLM在10种疾病中的表现因提示和模型而异，敏感性从78%到98%，特异性从48%到98%，表明有望实现自动化表型评估。

{"title":"Standardized patient profile review using large language models for case adjudication in observational research","authors":"Martijn J. Schuemie, Anna Ostropolets, Aleh Zhuk, Uladzislau Korsik, Seung In Seo, Marc A. Suchard, George Hripcsak, Patrick B. Ryan","doi":"10.1038/s41746-025-01433-4","DOIUrl":"https://doi.org/10.1038/s41746-025-01433-4","url":null,"abstract":"Using administrative claims and electronic health records for observational studies is common but challenging due to data limitations. Researchers rely on phenotype algorithms, requiring labor-intensive chart reviews for validation. This study investigates whether case adjudication using the previously introduced Knowledge-Enhanced Electronic Profile Review (KEEPER) system with large language models (LLMs) is feasible and could serve as a viable alternative to manual chart review. The task involves adjudicating cases identified by a phenotype algorithm, with KEEPER extracting predefined findings such as symptoms, comorbidities, and treatments from structured data. LLMs then evaluate KEEPER outputs to determine whether a patient truly qualifies as a case. We tested four LLMs including GPT-4, hosted locally to ensure privacy. Using zero-shot prompting and iterative prompt optimization, we found LLM performance, across ten diseases, varied by prompt and model, with sensitivities from 78 to 98% and specificities from 48 to 98%, indicating promise for automating phenotype evaluation.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"37 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142936695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Scaling convolutional neural networks achieves expert level seizure detection in neonatal EEG 缩放卷积神经网络实现了专家级的新生儿脑电图癫痫检测

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2025-01-08 DOI: 10.1038/s41746-024-01416-x

Robert Hogan, Sean R. Mathieson, Aurel Luca, Soraia Ventura, Sean Griffin, Geraldine B. Boylan, John M. O’Toole

Neonatal seizures require urgent treatment, but often go undetected without expert EEG monitoring. We have developed and validated a seizure detection model using retrospective EEG data from 332 neonates. A convolutional neural network was trained and tested on over 50,000 hours (n = 202) of annotated single-channel EEG containing 12,402 seizure events. This model was then validated on two independent multi-reviewer datasets (n = 51 and n = 79). Increasing data and model size improved performance: Matthews correlation coefficient (MCC) and Pearson’s correlation (r) increased by up to 50% (15%) with data (model) scaling. The largest model (21m parameters) achieved state-of-the-art on an open-access dataset (MCC = 0.764, r = 0.824, and AUC = 0.982). This model also attained expert-level performance on both validation sets, a first in this field, with no significant difference in inter-rater agreement when the model replaces an expert (∣Δκ∣ < 0.094, p > 0.05).

新生儿癫痫发作需要紧急治疗，但通常在没有专家脑电图监测的情况下未被发现。我们利用332名新生儿的回顾性脑电图数据开发并验证了癫痫发作检测模型。卷积神经网络在包含12,402个癫痫事件的超过50,000小时（n = 202）的注释单通道EEG上进行训练和测试。然后在两个独立的多审稿人数据集（n = 51和n = 79）上验证该模型。增加数据和模型大小可以提高性能：随着数据（模型）的缩放，马修斯相关系数（MCC）和皮尔逊相关系数(r)增加了50%（15%）。最大的模型（21个参数）在开放获取数据集上达到了最先进的水平（MCC = 0.764, r = 0.824, AUC = 0.982）。该模型在两个验证集上也达到了专家级别的性能，这是该领域的第一个，当模型取代专家时，评分者之间的一致性没有显着差异（∣Δκ∣< 0.094, p > 0.05）。

{"title":"Scaling convolutional neural networks achieves expert level seizure detection in neonatal EEG","authors":"Robert Hogan, Sean R. Mathieson, Aurel Luca, Soraia Ventura, Sean Griffin, Geraldine B. Boylan, John M. O’Toole","doi":"10.1038/s41746-024-01416-x","DOIUrl":"https://doi.org/10.1038/s41746-024-01416-x","url":null,"abstract":"Neonatal seizures require urgent treatment, but often go undetected without expert EEG monitoring. We have developed and validated a seizure detection model using retrospective EEG data from 332 neonates. A convolutional neural network was trained and tested on over 50,000 hours (n = 202) of annotated single-channel EEG containing 12,402 seizure events. This model was then validated on two independent multi-reviewer datasets (n = 51 and n = 79). Increasing data and model size improved performance: Matthews correlation coefficient (MCC) and Pearson’s correlation (r) increased by up to 50% (15%) with data (model) scaling. The largest model (21m parameters) achieved state-of-the-art on an open-access dataset (MCC = 0.764, r = 0.824, and AUC = 0.982). This model also attained expert-level performance on both validation sets, a first in this field, with no significant difference in inter-rater agreement when the model replaces an expert (∣Δκ∣ < 0.094, p > 0.05).","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"74 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142935709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Device based monitoring in digital care and its impact on hospital service use 数字医疗中基于设备的监测及其对医院服务使用的影响

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2025-01-08 DOI: 10.1038/s41746-024-01427-8

Anne-Jet S. Jansen, Guido M. Peters, Laura Kooij, Carine J. M. Doggen, Wim H. van Harten

This systematic review examines four themes of device-based remote monitoring (DRM): technology, patient monitoring and support, integration of DRM into clinical care, and patient engagement, and their impact on hospital service use. We included randomized controlled trials (RCTs) until 2024 comparing hospital service use in DRM with usual care. Hospital service use decreased in DRM in 72% of the 116 included RCTs. Non-implantable devices were most commonly used to measure data, but showed a lower decrease in hospital service use than implanted or mobile devices (69% vs 89% and 76%). Providing 24/7 support for patients led to a decrease (81% of the studies). DRM replacing usual care, involving designated healthcare providers, and patient-performed data transmission led to a greater decrease in hospital service use. DRM has the potential to further reduce hospital service use with additional factors such as sufficient support, automated processes, and optimized care redesign.

本系统综述研究了基于设备的远程监测（DRM）的四个主题：技术、患者监测和支持、DRM与临床护理的集成、患者参与及其对医院服务使用的影响。我们纳入了2024年前的随机对照试验（rct），比较DRM与常规护理的医院服务使用情况。在116项纳入的随机对照试验中，72%的医院服务使用降低了DRM。非植入式设备最常用于测量数据，但显示医院服务使用率的下降幅度低于植入式或移动设备（69%对89%和76%）。为患者提供24/7全天候支持导致死亡率下降（81%的研究）。DRM取代了常规护理（包括指定的医疗保健提供者）和患者执行的数据传输，导致医院服务的使用大幅减少。DRM具有进一步减少医院服务使用的潜力，如充分的支持、自动化流程和优化的护理重新设计。

引用次数: 0

A hybrid machine learning approach for the personalized prognostication of aggressive skin cancers 一种用于侵袭性皮肤癌个性化预测的混合机器学习方法

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2025-01-08 DOI: 10.1038/s41746-024-01329-9

Tom W. Andrew, Mogdad Alrawi, Ruth Plummer, Nick Reynolds, Vern Sondak, Isaac Brownell, Penny E. Lovat, Aidan Rose, Sophia Z. Shalhout

Accurate prognostication guides optimal clinical management in skin cancer. Merkel cell carcinoma (MCC) is the most aggressive form of skin cancer that often presents in advanced stages and is associated with poor survival rates. There are no personalized prognostic tools in use in MCC. We employed explainability analysis to reveal new insights into mortality risk factors for this highly aggressive cancer. We then combined deep learning feature selection with a modified XGBoost framework, to develop a web-based prognostic tool for MCC termed ‘DeepMerkel’. DeepMerkel can make accurate personalised, time-dependent survival predictions for MCC from readily available clinical information. It demonstrated generalizability through high predictive performance in an international clinical cohort, out-performing current population-based prognostic staging systems. MCC and DeepMerkel provide the exemplar model of personalised machine learning prognostic tools in aggressive skin cancers.

准确的预后指导皮肤癌的最佳临床管理。默克尔细胞癌（MCC）是最具侵袭性的皮肤癌，通常出现在晚期，生存率较低。在MCC中没有使用个性化的预后工具。我们采用可解释性分析来揭示这种高度侵袭性癌症的死亡危险因素的新见解。然后，我们将深度学习特征选择与改进的XGBoost框架相结合，开发了一种基于网络的MCC预测工具，称为“DeepMerkel”。DeepMerkel可以根据现成的临床信息对MCC做出准确的个性化、随时间变化的生存预测。它通过在国际临床队列中的高预测性能证明了通用性，优于当前基于人群的预后分期系统。MCC和DeepMerkel为侵袭性皮肤癌的个性化机器学习预后工具提供了范例模型。

引用次数: 0

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

NPJ Digital Medicine

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀