Data-driven characterization of individuals with delayed autism diagnosis

Dan Aizenberg, Ido Shalev, Florina Uzefovsky, Alal Eran
{"title":"Data-driven characterization of individuals with delayed autism diagnosis","authors":"Dan Aizenberg, Ido Shalev, Florina Uzefovsky, Alal Eran","doi":"10.1101/2024.07.26.24311003","DOIUrl":null,"url":null,"abstract":"<strong>Importance:</strong> Despite tremendous improvement in early identification of autism, ~25% of children receive their diagnosis after the age of six. Since evidence-based practices are more effective when started early, delayed diagnosis prevents many children from receiving optimal support. <strong>Objective:</strong> To identify and comparatively characterize groups of individuals diagnosed with Autism Spectrum Disorder (ASD) after the age of six.\n<strong>Design:</strong> This cross-sectional study used various machine learning approaches to classify, characterize, and compare individuals from the Simons Foundation Powering Autism Research for Knowledge (SPARK) cohort, recruited between 2015-2020.\n<strong>Setting:</strong> Analyses of medical histories and behavioral instruments. Participants: 23,632 SPARK participants. <strong>Exposure:</strong> ASD diagnosis upon registration to SPARK.\n<strong>Main Outcomes and Measures:</strong> Clusters of individuals diagnosed after the age of six (delayed ASD diagnosis) and their defining characteristics, as compared to individuals diagnosed before the age of six (timely ASD diagnosis). Odds and mean ratios were used for feature comparisons. Shapley values were used to assess the predictive value of these features, and correlation-based cliques were used to understand their interconnectedness. <strong>Results:</strong> Two robust subgroups of individuals with delayed ASD diagnosis were detected. The first, D1, included 3,612 individuals with lower support needs as compared to 17,992 individuals with a timely diagnosis. The second subgroup, D2, included 2,028 individuals with higher support needs, as consistently reflected by all commonly-used behavioral instruments, the greatest being repetitive and restrictive behaviors measured by the Repetitive Behavior Scale - Revised (RBS-R; D1: MR = 0.6854, 95% CI = [0.6848, 0.686]; D2: MR = 1.4223, 95% CI = [1.4210,1.4238], P = 3.54x10^-134). Moreover, individuals belonging to D1 had fewer comorbidities as compared to individuals with a timely ASD diagnosis, while D2 individuals had more (D1: mean = 3.47, t = 15.21; D2: mean = 8.12, t = 48.26, p &lt; 2.23x10^-308). A Random Forest classifier trained on the groups' characteristics achieved an AUC of 0.94. Further connectivity analysis of the groups' most informative characteristics demonstrated their distinct topological differences.\n<strong>Conclusions and Relevance:</strong> This analysis identified two opposite groups of individuals with delayed ASD diagnosis, thereby providing valuable insights for the development of targeted diagnostic strategies.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"20 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.07.26.24311003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Importance: Despite tremendous improvement in early identification of autism, ~25% of children receive their diagnosis after the age of six. Since evidence-based practices are more effective when started early, delayed diagnosis prevents many children from receiving optimal support. Objective: To identify and comparatively characterize groups of individuals diagnosed with Autism Spectrum Disorder (ASD) after the age of six. Design: This cross-sectional study used various machine learning approaches to classify, characterize, and compare individuals from the Simons Foundation Powering Autism Research for Knowledge (SPARK) cohort, recruited between 2015-2020. Setting: Analyses of medical histories and behavioral instruments. Participants: 23,632 SPARK participants. Exposure: ASD diagnosis upon registration to SPARK. Main Outcomes and Measures: Clusters of individuals diagnosed after the age of six (delayed ASD diagnosis) and their defining characteristics, as compared to individuals diagnosed before the age of six (timely ASD diagnosis). Odds and mean ratios were used for feature comparisons. Shapley values were used to assess the predictive value of these features, and correlation-based cliques were used to understand their interconnectedness. Results: Two robust subgroups of individuals with delayed ASD diagnosis were detected. The first, D1, included 3,612 individuals with lower support needs as compared to 17,992 individuals with a timely diagnosis. The second subgroup, D2, included 2,028 individuals with higher support needs, as consistently reflected by all commonly-used behavioral instruments, the greatest being repetitive and restrictive behaviors measured by the Repetitive Behavior Scale - Revised (RBS-R; D1: MR = 0.6854, 95% CI = [0.6848, 0.686]; D2: MR = 1.4223, 95% CI = [1.4210,1.4238], P = 3.54x10^-134). Moreover, individuals belonging to D1 had fewer comorbidities as compared to individuals with a timely ASD diagnosis, while D2 individuals had more (D1: mean = 3.47, t = 15.21; D2: mean = 8.12, t = 48.26, p < 2.23x10^-308). A Random Forest classifier trained on the groups' characteristics achieved an AUC of 0.94. Further connectivity analysis of the groups' most informative characteristics demonstrated their distinct topological differences. Conclusions and Relevance: This analysis identified two opposite groups of individuals with delayed ASD diagnosis, thereby providing valuable insights for the development of targeted diagnostic strategies.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
自闭症诊断延迟者的数据驱动特征描述
重要性:尽管在早期识别自闭症方面取得了巨大进步,但仍有约 25% 的儿童在六岁后才得到诊断。由于循证实践在早期开始时更为有效,延迟诊断使许多儿童无法获得最佳支持。目标:识别并比较六岁后被诊断为自闭症谱系障碍(ASD)的人群特征:这项横断面研究采用了多种机器学习方法,对西蒙斯基金会自闭症知识研究(SPARK)队列中的个体进行分类、特征描述和比较:分析病史和行为工具。参与者:23632 名 SPARK 参与者。暴露:主要结果和测量:与六岁前确诊(及时确诊)的个体相比,六岁后确诊(延迟确诊)的个体集群及其定义特征。在进行特征比较时使用了比值比和均值比。沙普利值用于评估这些特征的预测价值,基于相关性的群组用于了解它们之间的相互联系。研究结果发现了两个具有延迟 ASD 诊断的强大亚群。第一个亚群(D1)包括 3,612 名需要较少支持的个体,而及时诊断的个体则有 17,992 名。第二个亚组 D2 包括 2028 名需要更多支持的个体,所有常用的行为测量工具都一致反映了这一点,其中最大的需求是重复和限制性行为,由重复行为量表-修订版(RBS-R;D1.MR = 0.6854,D2.MR = 0.6854)测量:D1:MR = 0.6854,95% CI = [0.6848,0.686];D2:MR = 1.4223,95% CI = [1.4210,1.4238],P = 3.54x10^-134)。此外,与及时确诊的 ASD 患者相比,D1 患者的合并症较少,而 D2 患者的合并症较多(D1:平均值 = 3.47,t = 15.21;D2:平均值 = 8.12,t = 48.26,P <2.23x10^-308)。根据各组特征训练的随机森林分类器的 AUC 为 0.94。对两组信息量最大的特征进行的进一步连通性分析表明,它们之间存在明显的拓扑差异:这项分析确定了两组具有延迟 ASD 诊断的相反群体,从而为制定有针对性的诊断策略提供了有价值的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A case is not a case is not a case - challenges and solutions in determining urolithiasis caseloads using the digital infrastructure of a clinical data warehouse Reliable Online Auditory Cognitive Testing: An observational study Federated Multiple Imputation for Variables that Are Missing Not At Random in Distributed Electronic Health Records Characterizing the connection between Parkinson's disease progression and healthcare utilization Generative AI and Large Language Models in Reducing Medication Related Harm and Adverse Drug Events - A Scoping Review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1