Population modeling with machine learning can enhance measures of mental health - Open-data replication

Q4 Neuroscience Neuroimage. Reports Pub Date : 2023-06-01 DOI:10.1016/j.ynirp.2023.100163
Ty Easley , Ruiqi Chen , Kayla Hannon , Rosie Dutt , Janine Bijsterbosch
{"title":"Population modeling with machine learning can enhance measures of mental health - Open-data replication","authors":"Ty Easley ,&nbsp;Ruiqi Chen ,&nbsp;Kayla Hannon ,&nbsp;Rosie Dutt ,&nbsp;Janine Bijsterbosch","doi":"10.1016/j.ynirp.2023.100163","DOIUrl":null,"url":null,"abstract":"<div><p>Efforts to predict trait phenotypes based on functional MRI data from large cohorts have been hampered by low prediction accuracy and/or small effect sizes. Although these findings are highly replicable, the small effect sizes are somewhat surprising given the presumed brain basis of phenotypic traits such as neuroticism and fluid intelligence. We aim to replicate previous work and additionally test multiple data manipulations that may improve prediction accuracy by addressing data pollution challenges. Specifically, we added additional fMRI features, averaged the target phenotype across multiple measurements to obtain more accurate estimates of the underlying trait, balanced the target phenotype's distribution through undersampling of majority scores, and identified data-driven subtypes to investigate the impact of between-participant heterogeneity. Our results replicated prior results from Dadi et al. (2021) in a larger sample. Each data manipulation further led to small but consistent improvements in prediction accuracy, which were largely additive when combining multiple data manipulations. Combining data manipulations (i.e., extended fMRI features, averaged target phenotype, balanced target phenotype distribution) led to a three-fold increase in prediction accuracy for fluid intelligence compared to prior work. These findings highlight the benefit of several relatively easy and low-cost data manipulations, which may positively impact future work.</p></div>","PeriodicalId":74277,"journal":{"name":"Neuroimage. Reports","volume":"3 2","pages":"Article 100163"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroimage. Reports","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666956023000089","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Neuroscience","Score":null,"Total":0}
引用次数: 1

Abstract

Efforts to predict trait phenotypes based on functional MRI data from large cohorts have been hampered by low prediction accuracy and/or small effect sizes. Although these findings are highly replicable, the small effect sizes are somewhat surprising given the presumed brain basis of phenotypic traits such as neuroticism and fluid intelligence. We aim to replicate previous work and additionally test multiple data manipulations that may improve prediction accuracy by addressing data pollution challenges. Specifically, we added additional fMRI features, averaged the target phenotype across multiple measurements to obtain more accurate estimates of the underlying trait, balanced the target phenotype's distribution through undersampling of majority scores, and identified data-driven subtypes to investigate the impact of between-participant heterogeneity. Our results replicated prior results from Dadi et al. (2021) in a larger sample. Each data manipulation further led to small but consistent improvements in prediction accuracy, which were largely additive when combining multiple data manipulations. Combining data manipulations (i.e., extended fMRI features, averaged target phenotype, balanced target phenotype distribution) led to a three-fold increase in prediction accuracy for fluid intelligence compared to prior work. These findings highlight the benefit of several relatively easy and low-cost data manipulations, which may positively impact future work.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用机器学习进行人口建模可以增强心理健康指标-开放数据复制
基于来自大队列的功能性MRI数据预测性状表型的努力受到了预测准确性低和/或效应大小小的阻碍。尽管这些发现是高度可复制的,但考虑到神经质和流体智力等表型特征的假定大脑基础,这种小的效应大小有些令人惊讶。我们的目标是复制以前的工作,并额外测试多种数据操作,这些操作可以通过解决数据污染挑战来提高预测准确性。具体而言,我们添加了额外的功能磁共振成像特征,在多个测量中对目标表型进行平均,以获得对潜在特征的更准确估计,通过对多数分数的欠采样来平衡目标表型的分布,并确定数据驱动的亚型,以调查参与者之间异质性的影响。我们的结果在更大的样本中复制了Dadi等人之前的结果。(2021)。每一次数据操作都进一步导致了预测精度的微小但一致的提高,当组合多个数据操作时,这在很大程度上是相加的。与先前的工作相比,结合数据处理(即扩展的fMRI特征、平均目标表型、平衡的目标表型分布)使流体智能的预测准确性提高了三倍。这些发现突出了几种相对简单和低成本的数据操作的好处,这可能会对未来的工作产生积极影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Neuroimage. Reports
Neuroimage. Reports Neuroscience (General)
CiteScore
1.90
自引率
0.00%
发文量
0
审稿时长
87 days
期刊最新文献
Associations between structural brain alterations and dysfunction across cognitive domains in cerebral small vessel disease: A systematic review and meta-analysis Volumetric brain analysis and associated retinal thinning in autosomal dominant optic atrophy patients PyCaret machine learning library with three preprocessing steps after eLORETA source estimation predicts Alzheimer's disease Early diagnosis of Alzheimer's disease based on brain morphological changes: A comprehensive approach combining voxel-based morphometry and deep learning Visuomotor training induces network reorganisation of frontal eye field and cuneus connectivity: A task-based fMRI study
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1