Dan Aizenberg, Ido Shalev, Florina Uzefovsky, Alal Eran
{"title":"Data-driven characterization of individuals with delayed autism diagnosis","authors":"Dan Aizenberg, Ido Shalev, Florina Uzefovsky, Alal Eran","doi":"10.1101/2024.07.26.24311003","DOIUrl":null,"url":null,"abstract":"<strong>Importance:</strong> Despite tremendous improvement in early identification of autism, ~25% of children receive their diagnosis after the age of six. Since evidence-based practices are more effective when started early, delayed diagnosis prevents many children from receiving optimal support. <strong>Objective:</strong> To identify and comparatively characterize groups of individuals diagnosed with Autism Spectrum Disorder (ASD) after the age of six.\n<strong>Design:</strong> This cross-sectional study used various machine learning approaches to classify, characterize, and compare individuals from the Simons Foundation Powering Autism Research for Knowledge (SPARK) cohort, recruited between 2015-2020.\n<strong>Setting:</strong> Analyses of medical histories and behavioral instruments. Participants: 23,632 SPARK participants. <strong>Exposure:</strong> ASD diagnosis upon registration to SPARK.\n<strong>Main Outcomes and Measures:</strong> Clusters of individuals diagnosed after the age of six (delayed ASD diagnosis) and their defining characteristics, as compared to individuals diagnosed before the age of six (timely ASD diagnosis). Odds and mean ratios were used for feature comparisons. Shapley values were used to assess the predictive value of these features, and correlation-based cliques were used to understand their interconnectedness. <strong>Results:</strong> Two robust subgroups of individuals with delayed ASD diagnosis were detected. The first, D1, included 3,612 individuals with lower support needs as compared to 17,992 individuals with a timely diagnosis. The second subgroup, D2, included 2,028 individuals with higher support needs, as consistently reflected by all commonly-used behavioral instruments, the greatest being repetitive and restrictive behaviors measured by the Repetitive Behavior Scale - Revised (RBS-R; D1: MR = 0.6854, 95% CI = [0.6848, 0.686]; D2: MR = 1.4223, 95% CI = [1.4210,1.4238], P = 3.54x10^-134). Moreover, individuals belonging to D1 had fewer comorbidities as compared to individuals with a timely ASD diagnosis, while D2 individuals had more (D1: mean = 3.47, t = 15.21; D2: mean = 8.12, t = 48.26, p < 2.23x10^-308). A Random Forest classifier trained on the groups' characteristics achieved an AUC of 0.94. Further connectivity analysis of the groups' most informative characteristics demonstrated their distinct topological differences.\n<strong>Conclusions and Relevance:</strong> This analysis identified two opposite groups of individuals with delayed ASD diagnosis, thereby providing valuable insights for the development of targeted diagnostic strategies.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"20 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.07.26.24311003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Importance: Despite tremendous improvement in early identification of autism, ~25% of children receive their diagnosis after the age of six. Since evidence-based practices are more effective when started early, delayed diagnosis prevents many children from receiving optimal support. Objective: To identify and comparatively characterize groups of individuals diagnosed with Autism Spectrum Disorder (ASD) after the age of six.
Design: This cross-sectional study used various machine learning approaches to classify, characterize, and compare individuals from the Simons Foundation Powering Autism Research for Knowledge (SPARK) cohort, recruited between 2015-2020.
Setting: Analyses of medical histories and behavioral instruments. Participants: 23,632 SPARK participants. Exposure: ASD diagnosis upon registration to SPARK.
Main Outcomes and Measures: Clusters of individuals diagnosed after the age of six (delayed ASD diagnosis) and their defining characteristics, as compared to individuals diagnosed before the age of six (timely ASD diagnosis). Odds and mean ratios were used for feature comparisons. Shapley values were used to assess the predictive value of these features, and correlation-based cliques were used to understand their interconnectedness. Results: Two robust subgroups of individuals with delayed ASD diagnosis were detected. The first, D1, included 3,612 individuals with lower support needs as compared to 17,992 individuals with a timely diagnosis. The second subgroup, D2, included 2,028 individuals with higher support needs, as consistently reflected by all commonly-used behavioral instruments, the greatest being repetitive and restrictive behaviors measured by the Repetitive Behavior Scale - Revised (RBS-R; D1: MR = 0.6854, 95% CI = [0.6848, 0.686]; D2: MR = 1.4223, 95% CI = [1.4210,1.4238], P = 3.54x10^-134). Moreover, individuals belonging to D1 had fewer comorbidities as compared to individuals with a timely ASD diagnosis, while D2 individuals had more (D1: mean = 3.47, t = 15.21; D2: mean = 8.12, t = 48.26, p < 2.23x10^-308). A Random Forest classifier trained on the groups' characteristics achieved an AUC of 0.94. Further connectivity analysis of the groups' most informative characteristics demonstrated their distinct topological differences.
Conclusions and Relevance: This analysis identified two opposite groups of individuals with delayed ASD diagnosis, thereby providing valuable insights for the development of targeted diagnostic strategies.