Jiaqi Li, Ari Segel, Xinyang Feng, Jiaxin Cindy Tu, Andy Eck, Kelsey T King, Babatunde Adeyemo, Nicole R Karcher, Likai Chen, Adam T Eggebrecht, Muriah D Wheelock
{"title":"网络级富集为机器学习结果的生物学解释提供了一个框架。","authors":"Jiaqi Li, Ari Segel, Xinyang Feng, Jiaxin Cindy Tu, Andy Eck, Kelsey T King, Babatunde Adeyemo, Nicole R Karcher, Likai Chen, Adam T Eggebrecht, Muriah D Wheelock","doi":"10.1162/netn_a_00383","DOIUrl":null,"url":null,"abstract":"<p><p>Machine learning algorithms are increasingly being utilized to identify brain connectivity biomarkers linked to behavioral and clinical outcomes. However, research often prioritizes prediction accuracy at the expense of biological interpretability, and inconsistent implementation of ML methods may hinder model accuracy. To address this, our paper introduces a network-level enrichment approach, which integrates brain system organization in the context of connectome-wide statistical analysis to reveal network-level links between brain connectivity and behavior. To demonstrate the efficacy of this approach, we used linear support vector regression (LSVR) models to examine the relationship between resting-state functional connectivity networks and chronological age. We compared network-level associations based on raw LSVR weights to those produced from the forward and inverse models. Results indicated that not accounting for shared family variance inflated prediction performance, the k-best feature selection via Pearson correlation reduced accuracy and reliability, and raw LSVR model weights produced network-level associations that deviated from the significant brain systems identified by forward and inverse models. Our findings offer crucial insights for applying machine learning to neuroimaging data, emphasizing the value of network enrichment for biological interpretation.</p>","PeriodicalId":48520,"journal":{"name":"Network Neuroscience","volume":"8 3","pages":"762-790"},"PeriodicalIF":3.6000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11349033/pdf/","citationCount":"0","resultStr":"{\"title\":\"Network-level enrichment provides a framework for biological interpretation of machine learning results.\",\"authors\":\"Jiaqi Li, Ari Segel, Xinyang Feng, Jiaxin Cindy Tu, Andy Eck, Kelsey T King, Babatunde Adeyemo, Nicole R Karcher, Likai Chen, Adam T Eggebrecht, Muriah D Wheelock\",\"doi\":\"10.1162/netn_a_00383\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Machine learning algorithms are increasingly being utilized to identify brain connectivity biomarkers linked to behavioral and clinical outcomes. However, research often prioritizes prediction accuracy at the expense of biological interpretability, and inconsistent implementation of ML methods may hinder model accuracy. To address this, our paper introduces a network-level enrichment approach, which integrates brain system organization in the context of connectome-wide statistical analysis to reveal network-level links between brain connectivity and behavior. To demonstrate the efficacy of this approach, we used linear support vector regression (LSVR) models to examine the relationship between resting-state functional connectivity networks and chronological age. We compared network-level associations based on raw LSVR weights to those produced from the forward and inverse models. Results indicated that not accounting for shared family variance inflated prediction performance, the k-best feature selection via Pearson correlation reduced accuracy and reliability, and raw LSVR model weights produced network-level associations that deviated from the significant brain systems identified by forward and inverse models. Our findings offer crucial insights for applying machine learning to neuroimaging data, emphasizing the value of network enrichment for biological interpretation.</p>\",\"PeriodicalId\":48520,\"journal\":{\"name\":\"Network Neuroscience\",\"volume\":\"8 3\",\"pages\":\"762-790\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2024-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11349033/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Network Neuroscience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1162/netn_a_00383\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Network Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1162/netn_a_00383","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0
摘要
人们越来越多地利用机器学习算法来识别与行为和临床结果相关的大脑连接生物标记物。然而,研究往往以牺牲生物可解释性为代价来优先考虑预测的准确性,而且机器学习方法的实施不一致可能会妨碍模型的准确性。为了解决这个问题,我们的论文介绍了一种网络级富集方法,该方法在全连接体统计分析的背景下整合了大脑系统组织,以揭示大脑连接性与行为之间的网络级联系。为了证明这种方法的有效性,我们使用线性支持向量回归(LSVR)模型来研究静息态功能连接网络与年龄之间的关系。我们将基于原始 LSVR 权重的网络级关联与正向和逆向模型产生的关联进行了比较。结果表明,不考虑共享族变异会提高预测性能,通过皮尔逊相关性进行的 k 最佳特征选择降低了准确性和可靠性,原始 LSVR 模型权重产生的网络级关联偏离了正向和逆向模型确定的重要大脑系统。我们的研究结果为将机器学习应用于神经成像数据提供了重要启示,强调了网络丰富性对生物学解释的价值。
Network-level enrichment provides a framework for biological interpretation of machine learning results.
Machine learning algorithms are increasingly being utilized to identify brain connectivity biomarkers linked to behavioral and clinical outcomes. However, research often prioritizes prediction accuracy at the expense of biological interpretability, and inconsistent implementation of ML methods may hinder model accuracy. To address this, our paper introduces a network-level enrichment approach, which integrates brain system organization in the context of connectome-wide statistical analysis to reveal network-level links between brain connectivity and behavior. To demonstrate the efficacy of this approach, we used linear support vector regression (LSVR) models to examine the relationship between resting-state functional connectivity networks and chronological age. We compared network-level associations based on raw LSVR weights to those produced from the forward and inverse models. Results indicated that not accounting for shared family variance inflated prediction performance, the k-best feature selection via Pearson correlation reduced accuracy and reliability, and raw LSVR model weights produced network-level associations that deviated from the significant brain systems identified by forward and inverse models. Our findings offer crucial insights for applying machine learning to neuroimaging data, emphasizing the value of network enrichment for biological interpretation.