Pub Date : 2025-01-13DOI: 10.1038/s41746-024-01347-7
Roos J. Jutten, Daniel Soberanes, Cassidy P. Molinare, Stephanie Hsieh, Michelle E. Farrell, Aaron S. Schultz, Dorene M. Rentz, Gad A. Marshall, Keith A. Johnson, Reisa A. Sperling, Rebecca E. Amariglio, Kathryn V. Papp
Remote, digital cognitive testing on an individual’s own device provides the opportunity to deploy previously understudied but promising cognitive paradigms in preclinical Alzheimer’s disease (AD). The Boston Remote Assessment for NeuroCognitive Health (BRANCH) captures a personalized learning curve for the same information presented over seven consecutive days. Here, we examined BRANCH multi-day learning curves (MDLCs) in 167 cognitively unimpaired older adults (age = 74.3 ± 7.5, 63% female) with different amyloid-β (A) and tau (T) biomarker profiles on positron emission tomography. MDLC scores decreased across ascending biomarker groups, with the A + T- group performing numerically worse (β = –0.24, 95%CI[–0.55,0.07], p = 0.128) and the A + T+ group performing significantly worse (β = –0.58, 95%CI[–1.06,–0.10], p = 0.018) than the A-T- group. Further, lower MDLC scores were associated with greater cortical thinning (β = 0.18, 95%CI[0.04,0.34], p = 0.013). Our results suggest that diminished MDLCs track with advanced AD pathophysiology, and demonstrate how a digital multi-day learning paradigm can provide novel insights about cognitive decline during preclinical AD.
在个人自己的设备上进行远程数字认知测试,为临床前阿尔茨海默病(AD)中部署以前未被充分研究但有前途的认知范式提供了机会。波士顿神经认知健康远程评估(BRANCH)捕获了连续7天提供的相同信息的个性化学习曲线。在这里,我们检测了167名认知功能正常的老年人(年龄= 74.3±7.5,63%为女性)的BRANCH多日学习曲线(mdlc),这些老年人在正电子发射断层扫描上具有不同的淀粉样蛋白-β (A)和tau (T)生物标志物。MDLC评分在升高的生物标志物组中下降,A + T-组的数值表现较差(β = - 0.24, 95%CI[- 0.55,0.07], p = 0.128), A + T+组的表现明显较A-T-组差(β = - 0.58, 95%CI[- 1.06, - 0.10], p = 0.018)。此外,较低的MDLC评分与较大的皮质变薄相关(β = 0.18, 95%CI[0.04,0.34], p = 0.013)。我们的研究结果表明,mdlc的减少与阿尔茨海默病的晚期病理生理有关,并证明了数字多日学习范式如何为临床前阿尔茨海默病的认知衰退提供新的见解。
{"title":"Detecting early cognitive deficits in preclinical Alzheimer’s disease using a remote digital multi-day learning paradigm","authors":"Roos J. Jutten, Daniel Soberanes, Cassidy P. Molinare, Stephanie Hsieh, Michelle E. Farrell, Aaron S. Schultz, Dorene M. Rentz, Gad A. Marshall, Keith A. Johnson, Reisa A. Sperling, Rebecca E. Amariglio, Kathryn V. Papp","doi":"10.1038/s41746-024-01347-7","DOIUrl":"https://doi.org/10.1038/s41746-024-01347-7","url":null,"abstract":"<p>Remote, digital cognitive testing on an individual’s own device provides the opportunity to deploy previously understudied but promising cognitive paradigms in preclinical Alzheimer’s disease (AD). The Boston Remote Assessment for NeuroCognitive Health (BRANCH) captures a personalized learning curve for the same information presented over seven consecutive days. Here, we examined BRANCH multi-day learning curves (MDLCs) in 167 cognitively unimpaired older adults (age = 74.3 ± 7.5, 63% female) with different amyloid-β (A) and tau (T) biomarker profiles on positron emission tomography. MDLC scores decreased across ascending biomarker groups, with the A + T- group performing numerically worse (β = –0.24, 95%CI[–0.55,0.07], <i>p</i> = 0.128) and the A + T+ group performing significantly worse (β = –0.58, 95%CI[–1.06,–0.10], <i>p</i> = 0.018) than the A-T- group. Further, lower MDLC scores were associated with greater cortical thinning (β = 0.18, 95%CI[0.04,0.34], <i>p</i> = 0.013). Our results suggest that diminished MDLCs track with advanced AD pathophysiology, and demonstrate how a digital multi-day learning paradigm can provide novel insights about cognitive decline during preclinical AD.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"11 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142968195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-12DOI: 10.1038/s41746-024-01418-9
Sam F. Friedman, Shaan Khurshid, Rachael A. Venn, Xin Wang, Nate Diamant, Paolo Di Achille, Lu-Chen Weng, Seung Hoan Choi, Christopher Reeder, James P. Pirruccello, Pulkit Singh, Emily S. Lau, Anthony Philippakis, Christopher D. Anderson, Mahnaz Maddah, Puneet Batra, Patrick T. Ellinor, Jennifer E. Ho, Steven A. Lubitz
The 12-lead electrocardiogram (ECG) is inexpensive and widely available. Whether conditions across the human disease landscape can be detected using the ECG is unclear. We developed a deep learning denoising autoencoder and systematically evaluated associations between ECG encodings and ~1,600 Phecode-based diseases in three datasets separate from model development, and meta-analyzed the results. The latent space ECG model identified associations with 645 prevalent and 606 incident Phecodes. Associations were most enriched in the circulatory (n = 140, 82% of category-specific Phecodes), respiratory (n = 53, 62%) and endocrine/metabolic (n = 73, 45%) categories, with additional associations across the phenome. The strongest ECG association was with hypertension (p < 2.2×10-308). The ECG latent space model demonstrated more associations than models using standard ECG intervals, and offered favorable discrimination of prevalent disease compared to models comprising age, sex, and race. We further demonstrate how latent space models can be used to generate disease-specific ECG waveforms and facilitate individual disease profiling.
{"title":"Unsupervised deep learning of electrocardiograms enables scalable human disease profiling","authors":"Sam F. Friedman, Shaan Khurshid, Rachael A. Venn, Xin Wang, Nate Diamant, Paolo Di Achille, Lu-Chen Weng, Seung Hoan Choi, Christopher Reeder, James P. Pirruccello, Pulkit Singh, Emily S. Lau, Anthony Philippakis, Christopher D. Anderson, Mahnaz Maddah, Puneet Batra, Patrick T. Ellinor, Jennifer E. Ho, Steven A. Lubitz","doi":"10.1038/s41746-024-01418-9","DOIUrl":"https://doi.org/10.1038/s41746-024-01418-9","url":null,"abstract":"<p>The 12-lead electrocardiogram (ECG) is inexpensive and widely available. Whether conditions across the human disease landscape can be detected using the ECG is unclear. We developed a deep learning denoising autoencoder and systematically evaluated associations between ECG encodings and ~1,600 Phecode-based diseases in three datasets separate from model development, and meta-analyzed the results. The latent space ECG model identified associations with 645 prevalent and 606 incident Phecodes. Associations were most enriched in the circulatory (<i>n</i> = 140, 82% of category-specific Phecodes), respiratory (<i>n</i> = 53, 62%) and endocrine/metabolic (<i>n</i> = 73, 45%) categories, with additional associations across the phenome. The strongest ECG association was with hypertension (p < 2.2×10<sup>-308</sup>). The ECG latent space model demonstrated more associations than models using standard ECG intervals, and offered favorable discrimination of prevalent disease compared to models comprising age, sex, and race. We further demonstrate how latent space models can be used to generate disease-specific ECG waveforms and facilitate individual disease profiling.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"10 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142967754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-11DOI: 10.1038/s41746-024-01407-y
Albert J. Rogers, Neal K. Bhatia, Sabyasachi Bandyopadhyay, James Tooley, Rayan Ansari, Vyom Thakkar, Justin Xu, Jessica Torres Soto, Jagteshwar S. Tung, Mahmood I. Alhusseini, Paul Clopton, Reza Sameni, Gari D. Clifford, J. Weston Hughes, Euan A. Ashley, Marco V. Perez, Matei Zaharia, Sanjiv M. Narayan
Cardiac wall motion abnormalities (WMA) are strong predictors of mortality, but current screening methods using Q waves from electrocardiograms (ECGs) have limited accuracy and vary across racial and ethnic groups. This study aimed to identify novel ECG features using deep learning to enhance WMA detection, referencing echocardiography as the gold standard. We collected ECG and echocardiogram data from 35,210 patients in California and labeled WMA using unstructured language parsing of echocardiographic reports. A deep neural network (ECG-WMA-Net) was trained and outperformed both expert ECG interpretation and Q-wave indices, achieving an AUROC of 0.781 (CI: 0.762–0.799). The model was externally validated in a diverse cohort from Georgia (n = 2338), with an AUC of 0.723 (CI: 0.685–0.757). Explainability analysis revealed significant contributions from QRS and T-wave regions. This deep learning approach improves WMA screening accuracy, potentially addressing physiological differences not captured by standard ECG-based methods.
{"title":"Identification of cardiac wall motion abnormalities in diverse populations by deep learning of the electrocardiogram","authors":"Albert J. Rogers, Neal K. Bhatia, Sabyasachi Bandyopadhyay, James Tooley, Rayan Ansari, Vyom Thakkar, Justin Xu, Jessica Torres Soto, Jagteshwar S. Tung, Mahmood I. Alhusseini, Paul Clopton, Reza Sameni, Gari D. Clifford, J. Weston Hughes, Euan A. Ashley, Marco V. Perez, Matei Zaharia, Sanjiv M. Narayan","doi":"10.1038/s41746-024-01407-y","DOIUrl":"https://doi.org/10.1038/s41746-024-01407-y","url":null,"abstract":"<p>Cardiac wall motion abnormalities (WMA) are strong predictors of mortality, but current screening methods using Q waves from electrocardiograms (ECGs) have limited accuracy and vary across racial and ethnic groups. This study aimed to identify novel ECG features using deep learning to enhance WMA detection, referencing echocardiography as the gold standard. We collected ECG and echocardiogram data from 35,210 patients in California and labeled WMA using unstructured language parsing of echocardiographic reports. A deep neural network (ECG-WMA-Net) was trained and outperformed both expert ECG interpretation and Q-wave indices, achieving an AUROC of 0.781 (CI: 0.762–0.799). The model was externally validated in a diverse cohort from Georgia (<i>n</i> = 2338), with an AUC of 0.723 (CI: 0.685–0.757). Explainability analysis revealed significant contributions from QRS and T-wave regions. This deep learning approach improves WMA screening accuracy, potentially addressing physiological differences not captured by standard ECG-based methods.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"49 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142961435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-11DOI: 10.1038/s41746-024-01406-z
Lorenzo Venturini, Samuel Budd, Alfonso Farruggia, Robert Wright, Jacqueline Matthew, Thomas G. Day, Bernhard Kainz, Reza Razavi, Jo V. Hajnal
The current approach to fetal anomaly screening is based on biometric measurements derived from individually selected ultrasound images. In this paper, we introduce a paradigm shift that attains human-level performance in biometric measurement by aggregating automatically extracted biometrics from every frame across an entire scan, with no need for operator intervention. We use a neural network to classify each frame of an ultrasound video recording. We then measure fetal biometrics in every frame where appropriate anatomy is visible. We use a Bayesian method to estimate the true value of each biometric from a large number of measurements and probabilistically reject outliers. We performed a retrospective experiment on 1457 recordings (comprising 48 million frames) of 20-week ultrasound scans, estimated fetal biometrics in those scans and compared our estimates to real-time manual measurements. Our method achieves human-level performance in estimating fetal biometrics and estimates well-calibrated credible intervals for the true biometric value.
{"title":"Whole examination AI estimation of fetal biometrics from 20-week ultrasound scans","authors":"Lorenzo Venturini, Samuel Budd, Alfonso Farruggia, Robert Wright, Jacqueline Matthew, Thomas G. Day, Bernhard Kainz, Reza Razavi, Jo V. Hajnal","doi":"10.1038/s41746-024-01406-z","DOIUrl":"https://doi.org/10.1038/s41746-024-01406-z","url":null,"abstract":"<p>The current approach to fetal anomaly screening is based on biometric measurements derived from individually selected ultrasound images. In this paper, we introduce a paradigm shift that attains human-level performance in biometric measurement by aggregating automatically extracted biometrics from every frame across an entire scan, with no need for operator intervention. We use a neural network to classify each frame of an ultrasound video recording. We then measure fetal biometrics in every frame where appropriate anatomy is visible. We use a Bayesian method to estimate the true value of each biometric from a large number of measurements and probabilistically reject outliers. We performed a retrospective experiment on 1457 recordings (comprising 48 million frames) of 20-week ultrasound scans, estimated fetal biometrics in those scans and compared our estimates to real-time manual measurements. Our method achieves human-level performance in estimating fetal biometrics and estimates well-calibrated credible intervals for the true biometric value.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"40 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142967755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-09DOI: 10.1038/s41746-025-01430-7
Arvind Veluvali, Ashkan Dehghani Zahedani, Amir Hosseinian, Nima Aghaeepour, Tracey McLaughlin, Mark Woodward, Alex DiTullio, Noosheen Hashemi, Michael P. Snyder
This retrospective cohort study evaluates the impact of an AI-supported continuous glucose monitoring (CGM) mobile app (“January V2”) on glycemic control and weight management in 944 users, including healthy individuals and those with prediabetes or type 2 diabetes (T2D). The app, leveraging AI to personalize feedback, tracked users’ food intake, activity, and glucose responses over 14 days. Significant improvements in time in range (TIR) were observed, particularly in users with lower baseline TIR. Healthy users’ TIR increased from 74.7% to 85.5% (p < 0.0001), while T2D users’ TIR improved from 49.7% to 57.4% (p < 0.0004). Higher app engagement correlated with greater TIR improvements. Users also experienced an average weight reduction of 3.3 lbs over 33 days. These findings suggest that AI-enhanced digital health interventions can improve glycemic control and promote weight loss, particularly when users are actively engaged.
{"title":"Impact of digital health interventions on glycemic control and weight management","authors":"Arvind Veluvali, Ashkan Dehghani Zahedani, Amir Hosseinian, Nima Aghaeepour, Tracey McLaughlin, Mark Woodward, Alex DiTullio, Noosheen Hashemi, Michael P. Snyder","doi":"10.1038/s41746-025-01430-7","DOIUrl":"https://doi.org/10.1038/s41746-025-01430-7","url":null,"abstract":"<p>This retrospective cohort study evaluates the impact of an AI-supported continuous glucose monitoring (CGM) mobile app (“January V2”) on glycemic control and weight management in 944 users, including healthy individuals and those with prediabetes or type 2 diabetes (T2D). The app, leveraging AI to personalize feedback, tracked users’ food intake, activity, and glucose responses over 14 days. Significant improvements in time in range (TIR) were observed, particularly in users with lower baseline TIR. Healthy users’ TIR increased from 74.7% to 85.5% (<i>p</i> < 0.0001), while T2D users’ TIR improved from 49.7% to 57.4% (<i>p</i> < 0.0004). Higher app engagement correlated with greater TIR improvements. Users also experienced an average weight reduction of 3.3 lbs over 33 days. These findings suggest that AI-enhanced digital health interventions can improve glycemic control and promote weight loss, particularly when users are actively engaged.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"30 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142936696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-09DOI: 10.1038/s41746-025-01437-0
Kimia Heydari, Elizabeth J. Enichen, Serena Wang, Grace C. Nickel, Joseph C. Kvedar
Alzheimer’s disease is the fifth-leading cause of death for adults over the age of 65. Retinal imaging has emerged to find more accurate diagnostic tool for Alzheimer’s Disease. This paper highlights Hao et al.’s development of a new deep learning tool, EyeAD, which studies Optical Coherence Tomography Angiography (OCT-A) of patients with Alzheimer’s. Integrating this model into clinical workflows may offer novel insights into the progression of this disease.
{"title":"A novel model for retinal imaging in the diagnosis of Alzheimer’s disease","authors":"Kimia Heydari, Elizabeth J. Enichen, Serena Wang, Grace C. Nickel, Joseph C. Kvedar","doi":"10.1038/s41746-025-01437-0","DOIUrl":"https://doi.org/10.1038/s41746-025-01437-0","url":null,"abstract":"Alzheimer’s disease is the fifth-leading cause of death for adults over the age of 65. Retinal imaging has emerged to find more accurate diagnostic tool for Alzheimer’s Disease. This paper highlights Hao et al.’s development of a new deep learning tool, EyeAD, which studies Optical Coherence Tomography Angiography (OCT-A) of patients with Alzheimer’s. Integrating this model into clinical workflows may offer novel insights into the progression of this disease.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"6 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142936694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-09DOI: 10.1038/s41746-025-01433-4
Martijn J. Schuemie, Anna Ostropolets, Aleh Zhuk, Uladzislau Korsik, Seung In Seo, Marc A. Suchard, George Hripcsak, Patrick B. Ryan
Using administrative claims and electronic health records for observational studies is common but challenging due to data limitations. Researchers rely on phenotype algorithms, requiring labor-intensive chart reviews for validation. This study investigates whether case adjudication using the previously introduced Knowledge-Enhanced Electronic Profile Review (KEEPER) system with large language models (LLMs) is feasible and could serve as a viable alternative to manual chart review. The task involves adjudicating cases identified by a phenotype algorithm, with KEEPER extracting predefined findings such as symptoms, comorbidities, and treatments from structured data. LLMs then evaluate KEEPER outputs to determine whether a patient truly qualifies as a case. We tested four LLMs including GPT-4, hosted locally to ensure privacy. Using zero-shot prompting and iterative prompt optimization, we found LLM performance, across ten diseases, varied by prompt and model, with sensitivities from 78 to 98% and specificities from 48 to 98%, indicating promise for automating phenotype evaluation.
{"title":"Standardized patient profile review using large language models for case adjudication in observational research","authors":"Martijn J. Schuemie, Anna Ostropolets, Aleh Zhuk, Uladzislau Korsik, Seung In Seo, Marc A. Suchard, George Hripcsak, Patrick B. Ryan","doi":"10.1038/s41746-025-01433-4","DOIUrl":"https://doi.org/10.1038/s41746-025-01433-4","url":null,"abstract":"<p>Using administrative claims and electronic health records for observational studies is common but challenging due to data limitations. Researchers rely on phenotype algorithms, requiring labor-intensive chart reviews for validation. This study investigates whether case adjudication using the previously introduced Knowledge-Enhanced Electronic Profile Review (KEEPER) system with large language models (LLMs) is feasible and could serve as a viable alternative to manual chart review. The task involves adjudicating cases identified by a phenotype algorithm, with KEEPER extracting predefined findings such as symptoms, comorbidities, and treatments from structured data. LLMs then evaluate KEEPER outputs to determine whether a patient truly qualifies as a case. We tested four LLMs including GPT-4, hosted locally to ensure privacy. Using zero-shot prompting and iterative prompt optimization, we found LLM performance, across ten diseases, varied by prompt and model, with sensitivities from 78 to 98% and specificities from 48 to 98%, indicating promise for automating phenotype evaluation.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"37 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142936695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-08DOI: 10.1038/s41746-024-01416-x
Robert Hogan, Sean R. Mathieson, Aurel Luca, Soraia Ventura, Sean Griffin, Geraldine B. Boylan, John M. O’Toole
Neonatal seizures require urgent treatment, but often go undetected without expert EEG monitoring. We have developed and validated a seizure detection model using retrospective EEG data from 332 neonates. A convolutional neural network was trained and tested on over 50,000 hours (n = 202) of annotated single-channel EEG containing 12,402 seizure events. This model was then validated on two independent multi-reviewer datasets (n = 51 and n = 79). Increasing data and model size improved performance: Matthews correlation coefficient (MCC) and Pearson’s correlation (r) increased by up to 50% (15%) with data (model) scaling. The largest model (21m parameters) achieved state-of-the-art on an open-access dataset (MCC = 0.764, r = 0.824, and AUC = 0.982). This model also attained expert-level performance on both validation sets, a first in this field, with no significant difference in inter-rater agreement when the model replaces an expert (∣Δκ∣ < 0.094, p > 0.05).
新生儿癫痫发作需要紧急治疗,但通常在没有专家脑电图监测的情况下未被发现。我们利用332名新生儿的回顾性脑电图数据开发并验证了癫痫发作检测模型。卷积神经网络在包含12,402个癫痫事件的超过50,000小时(n = 202)的注释单通道EEG上进行训练和测试。然后在两个独立的多审稿人数据集(n = 51和n = 79)上验证该模型。增加数据和模型大小可以提高性能:随着数据(模型)的缩放,马修斯相关系数(MCC)和皮尔逊相关系数(r)增加了50%(15%)。最大的模型(21个参数)在开放获取数据集上达到了最先进的水平(MCC = 0.764, r = 0.824, AUC = 0.982)。该模型在两个验证集上也达到了专家级别的性能,这是该领域的第一个,当模型取代专家时,评分者之间的一致性没有显着差异(∣Δκ∣< 0.094, p > 0.05)。
{"title":"Scaling convolutional neural networks achieves expert level seizure detection in neonatal EEG","authors":"Robert Hogan, Sean R. Mathieson, Aurel Luca, Soraia Ventura, Sean Griffin, Geraldine B. Boylan, John M. O’Toole","doi":"10.1038/s41746-024-01416-x","DOIUrl":"https://doi.org/10.1038/s41746-024-01416-x","url":null,"abstract":"<p>Neonatal seizures require urgent treatment, but often go undetected without expert EEG monitoring. We have developed and validated a seizure detection model using retrospective EEG data from 332 neonates. A convolutional neural network was trained and tested on over 50,000 hours (<i>n</i> = 202) of annotated single-channel EEG containing 12,402 seizure events. This model was then validated on two independent multi-reviewer datasets (<i>n</i> = 51 and <i>n</i> = 79). Increasing data and model size improved performance: Matthews correlation coefficient (MCC) and Pearson’s correlation (<i>r</i>) increased by up to 50% (15%) with data (model) scaling. The largest model (21m parameters) achieved state-of-the-art on an open-access dataset (MCC = 0.764, <i>r</i> = 0.824, and AUC = 0.982). This model also attained expert-level performance on both validation sets, a first in this field, with no significant difference in inter-rater agreement when the model replaces an expert (<span>∣</span><i>Δ</i><i>κ</i><span>∣</span> < 0.094, <i>p</i> > 0.05).</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"74 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142935709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-08DOI: 10.1038/s41746-024-01427-8
Anne-Jet S. Jansen, Guido M. Peters, Laura Kooij, Carine J. M. Doggen, Wim H. van Harten
This systematic review examines four themes of device-based remote monitoring (DRM): technology, patient monitoring and support, integration of DRM into clinical care, and patient engagement, and their impact on hospital service use. We included randomized controlled trials (RCTs) until 2024 comparing hospital service use in DRM with usual care. Hospital service use decreased in DRM in 72% of the 116 included RCTs. Non-implantable devices were most commonly used to measure data, but showed a lower decrease in hospital service use than implanted or mobile devices (69% vs 89% and 76%). Providing 24/7 support for patients led to a decrease (81% of the studies). DRM replacing usual care, involving designated healthcare providers, and patient-performed data transmission led to a greater decrease in hospital service use. DRM has the potential to further reduce hospital service use with additional factors such as sufficient support, automated processes, and optimized care redesign.
{"title":"Device based monitoring in digital care and its impact on hospital service use","authors":"Anne-Jet S. Jansen, Guido M. Peters, Laura Kooij, Carine J. M. Doggen, Wim H. van Harten","doi":"10.1038/s41746-024-01427-8","DOIUrl":"https://doi.org/10.1038/s41746-024-01427-8","url":null,"abstract":"<p>This systematic review examines four themes of device-based remote monitoring (DRM): technology, patient monitoring and support, integration of DRM into clinical care, and patient engagement, and their impact on hospital service use. We included randomized controlled trials (RCTs) until 2024 comparing hospital service use in DRM with usual care. Hospital service use decreased in DRM in 72% of the 116 included RCTs. Non-implantable devices were most commonly used to measure data, but showed a lower decrease in hospital service use than implanted or mobile devices (69% vs 89% and 76%). Providing 24/7 support for patients led to a decrease (81% of the studies). DRM replacing usual care, involving designated healthcare providers, and patient-performed data transmission led to a greater decrease in hospital service use. DRM has the potential to further reduce hospital service use with additional factors such as sufficient support, automated processes, and optimized care redesign.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"1 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142935708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-08DOI: 10.1038/s41746-024-01329-9
Tom W. Andrew, Mogdad Alrawi, Ruth Plummer, Nick Reynolds, Vern Sondak, Isaac Brownell, Penny E. Lovat, Aidan Rose, Sophia Z. Shalhout
Accurate prognostication guides optimal clinical management in skin cancer. Merkel cell carcinoma (MCC) is the most aggressive form of skin cancer that often presents in advanced stages and is associated with poor survival rates. There are no personalized prognostic tools in use in MCC. We employed explainability analysis to reveal new insights into mortality risk factors for this highly aggressive cancer. We then combined deep learning feature selection with a modified XGBoost framework, to develop a web-based prognostic tool for MCC termed ‘DeepMerkel’. DeepMerkel can make accurate personalised, time-dependent survival predictions for MCC from readily available clinical information. It demonstrated generalizability through high predictive performance in an international clinical cohort, out-performing current population-based prognostic staging systems. MCC and DeepMerkel provide the exemplar model of personalised machine learning prognostic tools in aggressive skin cancers.
{"title":"A hybrid machine learning approach for the personalized prognostication of aggressive skin cancers","authors":"Tom W. Andrew, Mogdad Alrawi, Ruth Plummer, Nick Reynolds, Vern Sondak, Isaac Brownell, Penny E. Lovat, Aidan Rose, Sophia Z. Shalhout","doi":"10.1038/s41746-024-01329-9","DOIUrl":"https://doi.org/10.1038/s41746-024-01329-9","url":null,"abstract":"<p>Accurate prognostication guides optimal clinical management in skin cancer. Merkel cell carcinoma (MCC) is the most aggressive form of skin cancer that often presents in advanced stages and is associated with poor survival rates. There are no personalized prognostic tools in use in MCC. We employed explainability analysis to reveal new insights into mortality risk factors for this highly aggressive cancer. We then combined deep learning feature selection with a modified XGBoost framework, to develop a web-based prognostic tool for MCC termed ‘DeepMerkel’. DeepMerkel can make accurate personalised, time-dependent survival predictions for MCC from readily available clinical information. It demonstrated generalizability through high predictive performance in an international clinical cohort, out-performing current population-based prognostic staging systems. MCC and DeepMerkel provide the exemplar model of personalised machine learning prognostic tools in aggressive skin cancers.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"35 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142935710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}