大脑表型对语言和执行功能的预测可以在不同的真实世界数据中存活：发育人群中的数据集转移。

IF 4.6 2区医学 Q1 NEUROSCIENCES Developmental Cognitive Neuroscience Pub Date : 2024-10-16 DOI:10.1016/j.dcn.2024.101464

Brendan D. Adkinson , Matthew Rosenblatt , Javid Dadashkarimi , Link Tejavibulya , Rongtao Jiang , Stephanie Noble , Dustin Scheinost

{"title":"大脑表型对语言和执行功能的预测可以在不同的真实世界数据中存活：发育人群中的数据集转移。","authors":"Brendan D. Adkinson , Matthew Rosenblatt , Javid Dadashkarimi , Link Tejavibulya , Rongtao Jiang , Stephanie Noble , Dustin Scheinost","doi":"10.1016/j.dcn.2024.101464","DOIUrl":null,"url":null,"abstract":"<div><div>Predictive modeling potentially increases the reproducibility and generalizability of neuroimaging brain-phenotype associations. Yet, the evaluation of a model in another dataset is underutilized. Among studies that undertake external validation, there is a notable lack of attention to generalization across dataset-specific idiosyncrasies (i.e., dataset shifts). Research settings, by design, remove the between-site variations that real-world and, eventually, clinical applications demand. Here, we rigorously test the ability of a range of predictive models to generalize across three diverse, unharmonized developmental samples: the Philadelphia Neurodevelopmental Cohort (n=1291), the Healthy Brain Network (n=1110), and the Human Connectome Project in Development (n=428). These datasets have high inter-dataset heterogeneity, encompassing substantial variations in age distribution, sex, racial and ethnic minority representation, recruitment geography, clinical symptom burdens, fMRI tasks, sequences, and behavioral measures. Through advanced methodological approaches, we demonstrate that reproducible and generalizable brain-behavior associations can be realized across diverse dataset features. Results indicate the potential of functional connectome-based predictive models to be robust despite substantial inter-dataset variability. Notably, for the HCPD and HBN datasets, the best predictions were not from training and testing in the same dataset (i.e., cross-validation) but across datasets. This result suggests that training on diverse data may improve prediction in specific cases. Overall, this work provides a critical foundation for future work evaluating the generalizability of brain-phenotype associations in real-world scenarios and clinical settings.</div></div>","PeriodicalId":49083,"journal":{"name":"Developmental Cognitive Neuroscience","volume":"70 ","pages":"Article 101464"},"PeriodicalIF":4.6000,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Brain-phenotype predictions of language and executive function can survive across diverse real-world data: Dataset shifts in developmental populations\",\"authors\":\"Brendan D. Adkinson , Matthew Rosenblatt , Javid Dadashkarimi , Link Tejavibulya , Rongtao Jiang , Stephanie Noble , Dustin Scheinost\",\"doi\":\"10.1016/j.dcn.2024.101464\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Predictive modeling potentially increases the reproducibility and generalizability of neuroimaging brain-phenotype associations. Yet, the evaluation of a model in another dataset is underutilized. Among studies that undertake external validation, there is a notable lack of attention to generalization across dataset-specific idiosyncrasies (i.e., dataset shifts). Research settings, by design, remove the between-site variations that real-world and, eventually, clinical applications demand. Here, we rigorously test the ability of a range of predictive models to generalize across three diverse, unharmonized developmental samples: the Philadelphia Neurodevelopmental Cohort (n=1291), the Healthy Brain Network (n=1110), and the Human Connectome Project in Development (n=428). These datasets have high inter-dataset heterogeneity, encompassing substantial variations in age distribution, sex, racial and ethnic minority representation, recruitment geography, clinical symptom burdens, fMRI tasks, sequences, and behavioral measures. Through advanced methodological approaches, we demonstrate that reproducible and generalizable brain-behavior associations can be realized across diverse dataset features. Results indicate the potential of functional connectome-based predictive models to be robust despite substantial inter-dataset variability. Notably, for the HCPD and HBN datasets, the best predictions were not from training and testing in the same dataset (i.e., cross-validation) but across datasets. This result suggests that training on diverse data may improve prediction in specific cases. Overall, this work provides a critical foundation for future work evaluating the generalizability of brain-phenotype associations in real-world scenarios and clinical settings.</div></div>\",\"PeriodicalId\":49083,\"journal\":{\"name\":\"Developmental Cognitive Neuroscience\",\"volume\":\"70 \",\"pages\":\"Article 101464\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-10-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Developmental Cognitive Neuroscience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1878929324001257\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Developmental Cognitive Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1878929324001257","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

预测模型有可能提高神经成像脑表型关联的可重复性和可推广性。然而，在另一个数据集中对模型的评估却未得到充分利用。在进行外部验证的研究中，明显缺乏对跨数据集特异性（即数据集偏移）通用性的关注。研究环境在设计上消除了真实世界和最终临床应用所需的站点间差异。在这里，我们严格测试了一系列预测模型在三种不同的、未协调的发育样本中的泛化能力：费城神经发育队列（n=1291）、健康大脑网络（n=1110）和发育中的人类连接组项目（n=428）。这些数据集在年龄分布、性别、种族和少数族裔代表性、招募地域、临床症状负担、fMRI 任务、序列和行为测量等方面存在很大差异，数据集之间的异质性很高。通过先进的方法论，我们证明了在不同的数据集特征中可以实现可重复和可推广的大脑行为关联。研究结果表明，尽管数据集之间存在很大差异，但基于功能连接组的预测模型仍具有稳健性。值得注意的是，就 HCPD 和 HBN 数据集而言，最佳预测不是来自同一数据集的训练和测试（即交叉验证），而是来自跨数据集的训练和测试。这一结果表明，在特定情况下，在不同数据上进行训练可能会提高预测效果。总之，这项工作为今后评估大脑表型关联在现实世界和临床环境中的可推广性奠定了重要基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Brain-phenotype predictions of language and executive function can survive across diverse real-world data: Dataset shifts in developmental populations

Predictive modeling potentially increases the reproducibility and generalizability of neuroimaging brain-phenotype associations. Yet, the evaluation of a model in another dataset is underutilized. Among studies that undertake external validation, there is a notable lack of attention to generalization across dataset-specific idiosyncrasies (i.e., dataset shifts). Research settings, by design, remove the between-site variations that real-world and, eventually, clinical applications demand. Here, we rigorously test the ability of a range of predictive models to generalize across three diverse, unharmonized developmental samples: the Philadelphia Neurodevelopmental Cohort (n=1291), the Healthy Brain Network (n=1110), and the Human Connectome Project in Development (n=428). These datasets have high inter-dataset heterogeneity, encompassing substantial variations in age distribution, sex, racial and ethnic minority representation, recruitment geography, clinical symptom burdens, fMRI tasks, sequences, and behavioral measures. Through advanced methodological approaches, we demonstrate that reproducible and generalizable brain-behavior associations can be realized across diverse dataset features. Results indicate the potential of functional connectome-based predictive models to be robust despite substantial inter-dataset variability. Notably, for the HCPD and HBN datasets, the best predictions were not from training and testing in the same dataset (i.e., cross-validation) but across datasets. This result suggests that training on diverse data may improve prediction in specific cases. Overall, this work provides a critical foundation for future work evaluating the generalizability of brain-phenotype associations in real-world scenarios and clinical settings.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Developmental Cognitive Neuroscience NEUROSCIENCES-

CiteScore

7.60

自引率

10.60%

发文量

124

审稿时长

6-12 weeks

期刊介绍： The journal publishes theoretical and research papers on cognitive brain development, from infancy through childhood and adolescence and into adulthood. It covers neurocognitive development and neurocognitive processing in both typical and atypical development, including social and affective aspects. Appropriate methodologies for the journal include, but are not limited to, functional neuroimaging (fMRI and MEG), electrophysiology (EEG and ERP), NIRS and transcranial magnetic stimulation, as well as other basic neuroscience approaches using cellular and animal models that directly address cognitive brain development, patient studies, case studies, post-mortem studies and pharmacological studies.