使用卷积神经网络 U-Sleep 对儿科睡眠阶段自动分类进行评估

Ajay Kevat, Rylan Steinkey, Sadasivam Suresh, Warren R Ruehland, Jasneek Chawla, Philip I Terrill, Andrew Collaro, Kartik Iyer
{"title":"使用卷积神经网络 U-Sleep 对儿科睡眠阶段自动分类进行评估","authors":"Ajay Kevat, Rylan Steinkey, Sadasivam Suresh, Warren R Ruehland, Jasneek Chawla, Philip I Terrill, Andrew Collaro, Kartik Iyer","doi":"10.1101/2024.08.18.24312174","DOIUrl":null,"url":null,"abstract":"Study Objectives U-Sleep is a publicly-available automated sleep stager, but has not been independently validated using pediatric data. We aimed to a) test the hypothesis that U-Sleep performance is equivalent to trained humans, using a concordance dataset of 50 pediatric polysomnogram excerpts scored by multiple trained scorers, and b) identify clinical and demographic characteristics that impact U-Sleep accuracy, using a clinical dataset of 3114 polysomnograms from a tertiary center. Methods Agreement between U-Sleep and gold 30-second epoch sleep staging was determined across both datasets. Utilizing the concordance dataset, the hypothesis of equivalence between human scorers and U-Sleep was tested using a Wilcoxon two one-sided test (TOST). Multivariable regression and generalized additive modelling were used on the clinical dataset to estimate the effects of age, comorbidities and polysomnographic findings on U-Sleep performance. Results The median (interquartile range) Cohens kappa agreement of U-Sleep and individual trained humans relative to gold scoring for 5-stage sleep staging in the concordance dataset were similar, kappa=0.79(0.19) vs 0.78(0.13) respectively, and satisfied statistical equivalence (TOST p<0.01). Median (interquartile range) kappa agreement between U-Sleep 2.0 and clinical sleep-staging was kappa=0.69(0.22). Modelling indicated lower performance for children <2 years, those with medical comorbidities possibly altering sleep electroencephalography (kappa reduction=0.07-0.15) and those with decreased sleep efficiency or sleep-disordered breathing (kappa reduction=0.1). Conclusion While U-Sleep algorithms showed statistically equivalent performance to trained scorers, accuracy was lower in children <2 years and those with sleep-disordered breathing or comorbidities affecting electroencephalography. U-Sleep is suitable for pediatric clinical utilization provided automated staging is followed by expert clinician review.","PeriodicalId":501549,"journal":{"name":"medRxiv - Pediatrics","volume":"111 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluation of automated pediatric sleep stage classification using U-Sleep - a convolutional neural network\",\"authors\":\"Ajay Kevat, Rylan Steinkey, Sadasivam Suresh, Warren R Ruehland, Jasneek Chawla, Philip I Terrill, Andrew Collaro, Kartik Iyer\",\"doi\":\"10.1101/2024.08.18.24312174\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Study Objectives U-Sleep is a publicly-available automated sleep stager, but has not been independently validated using pediatric data. We aimed to a) test the hypothesis that U-Sleep performance is equivalent to trained humans, using a concordance dataset of 50 pediatric polysomnogram excerpts scored by multiple trained scorers, and b) identify clinical and demographic characteristics that impact U-Sleep accuracy, using a clinical dataset of 3114 polysomnograms from a tertiary center. Methods Agreement between U-Sleep and gold 30-second epoch sleep staging was determined across both datasets. Utilizing the concordance dataset, the hypothesis of equivalence between human scorers and U-Sleep was tested using a Wilcoxon two one-sided test (TOST). Multivariable regression and generalized additive modelling were used on the clinical dataset to estimate the effects of age, comorbidities and polysomnographic findings on U-Sleep performance. Results The median (interquartile range) Cohens kappa agreement of U-Sleep and individual trained humans relative to gold scoring for 5-stage sleep staging in the concordance dataset were similar, kappa=0.79(0.19) vs 0.78(0.13) respectively, and satisfied statistical equivalence (TOST p<0.01). Median (interquartile range) kappa agreement between U-Sleep 2.0 and clinical sleep-staging was kappa=0.69(0.22). Modelling indicated lower performance for children <2 years, those with medical comorbidities possibly altering sleep electroencephalography (kappa reduction=0.07-0.15) and those with decreased sleep efficiency or sleep-disordered breathing (kappa reduction=0.1). Conclusion While U-Sleep algorithms showed statistically equivalent performance to trained scorers, accuracy was lower in children <2 years and those with sleep-disordered breathing or comorbidities affecting electroencephalography. U-Sleep is suitable for pediatric clinical utilization provided automated staging is followed by expert clinician review.\",\"PeriodicalId\":501549,\"journal\":{\"name\":\"medRxiv - Pediatrics\",\"volume\":\"111 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv - Pediatrics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.08.18.24312174\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Pediatrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.08.18.24312174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

研究目的 U-Sleep 是一种公开的自动睡眠分级器,但尚未使用儿科数据进行独立验证。我们的目的是:(a)使用由多名训练有素的评分员评分的 50 个儿科多导睡眠图节选的一致性数据集,检验 U-Sleep 的性能是否与训练有素的人类相当;(b)使用来自一家三级中心的 3114 个多导睡眠图的临床数据集,确定影响 U-Sleep 准确性的临床和人口特征。方法 在两个数据集中确定 U-Sleep 与黄金 30 秒历时睡眠分期之间的一致性。利用一致性数据集,采用 Wilcoxon 双单侧检验 (TOST) 测试了人类评分员与 U-Sleep 之间的等效性假设。在临床数据集上使用多变量回归和广义相加模型来估计年龄、合并症和多导睡眠图检查结果对 U-Sleep 性能的影响。结果 在一致性数据集中,U-Sleep 和经过训练的人类个体相对于五阶段睡眠分期黄金评分的中位数(四分位数间距)科恩斯卡帕一致性相似,卡帕分别为 0.79(0.19) vs 0.78(0.13),符合统计学等效性(TOST p<0.01)。U-Sleep 2.0 与临床睡眠分期的 kappa 一致度中位数(四分位数间距)为 kappa=0.69(0.22)。建模结果表明,2 岁儿童、患有可能改变睡眠脑电图的并发症的儿童(kappa 值减小=0.07-0.15)以及睡眠效率下降或睡眠呼吸紊乱的儿童(kappa 值减小=0.1)的睡眠分级效果较差。结论 虽然 U-Sleep 算法在统计学上与训练有素的评分员表现相当,但在 2 岁儿童和有睡眠呼吸障碍或合并症影响脑电图的儿童中准确率较低。U-Sleep 适合儿科临床使用,但自动分期需经临床专家审核。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Evaluation of automated pediatric sleep stage classification using U-Sleep - a convolutional neural network
Study Objectives U-Sleep is a publicly-available automated sleep stager, but has not been independently validated using pediatric data. We aimed to a) test the hypothesis that U-Sleep performance is equivalent to trained humans, using a concordance dataset of 50 pediatric polysomnogram excerpts scored by multiple trained scorers, and b) identify clinical and demographic characteristics that impact U-Sleep accuracy, using a clinical dataset of 3114 polysomnograms from a tertiary center. Methods Agreement between U-Sleep and gold 30-second epoch sleep staging was determined across both datasets. Utilizing the concordance dataset, the hypothesis of equivalence between human scorers and U-Sleep was tested using a Wilcoxon two one-sided test (TOST). Multivariable regression and generalized additive modelling were used on the clinical dataset to estimate the effects of age, comorbidities and polysomnographic findings on U-Sleep performance. Results The median (interquartile range) Cohens kappa agreement of U-Sleep and individual trained humans relative to gold scoring for 5-stage sleep staging in the concordance dataset were similar, kappa=0.79(0.19) vs 0.78(0.13) respectively, and satisfied statistical equivalence (TOST p<0.01). Median (interquartile range) kappa agreement between U-Sleep 2.0 and clinical sleep-staging was kappa=0.69(0.22). Modelling indicated lower performance for children <2 years, those with medical comorbidities possibly altering sleep electroencephalography (kappa reduction=0.07-0.15) and those with decreased sleep efficiency or sleep-disordered breathing (kappa reduction=0.1). Conclusion While U-Sleep algorithms showed statistically equivalent performance to trained scorers, accuracy was lower in children <2 years and those with sleep-disordered breathing or comorbidities affecting electroencephalography. U-Sleep is suitable for pediatric clinical utilization provided automated staging is followed by expert clinician review.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Evaluating the Effectiveness of Nirsevimab in Reducing Pediatric RSV Hospitalizations in Spain The Impact of Chlorinated Drinking Water Exposure on Gut Microbiota Development in Infants: a Randomised Controlled Trial. Development of a Novel Dynamic Leak Model to Simulate Leak for Performance Testing of Manual Neonatal Resuscitation Devices. Does Leak Matter? A Bench Study How young people experienced Long COVID services: a qualitative analysis. Knowledge of Childhood and Adolescent Cancer Among Primary Caregivers: An Observational and Prospective Study
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1