从生物约束基因组解码多表型层遗传标记到表型贝叶斯稀疏回归

Frontiers in molecular medicine Pub Date : 2022-03-30 eCollection Date: 2022-01-01 DOI:10.3389/fmmed.2022.830956
Marie Deprez, Julien Moreira, Maxime Sermesant, Marco Lorenzi
{"title":"从生物约束基因组解码多表型层遗传标记到表型贝叶斯稀疏回归","authors":"Marie Deprez, Julien Moreira, Maxime Sermesant, Marco Lorenzi","doi":"10.3389/fmmed.2022.830956","DOIUrl":null,"url":null,"abstract":"<p><p>The applicability of multivariate approaches for the joint analysis of genomics and phenomics information is currently limited by the lack of scalability, and by the difficulty of interpreting the related findings from a biological perspective. To tackle these limitations, we present Bayesian Genome-to-Phenome Sparse Regression (G2PSR), a novel multivariate regression method based on sparse SNP-gene constraints. The statistical framework of G2PSR is based on a Bayesian neural network, were constraints on SNPs-genes associations are integrated by incorporating <i>a priori</i> knowledge linking variants to their respective genes, to then reconstruct the phenotypic data in the output layer. Interpretability is promoted by inducing sparsity on the genes through variational dropout, allowing to estimate the uncertainty associated with each gene, and related SNPs, in the reconstruction task. Ultimately, G2PSR is conceived to prevent multiple testing correction and to assess the combined effect of SNPs, thus increasing the statistical power in detecting genome-to-phenome associations. The effectiveness of G2PSR was demonstrated on synthetic and real data, with respect to state-of-the-art methods based on group-wise sparsity constraints. The application on real data consisted in an imaging-genetics analysis on the Alzheimer's Disease Neuroimaging Initiative data, relating SNPs from more than 3,500 genes to clinical and multi-variate brain volumetric information. The experimental results show that our method can provide accurate selection of relevant genes in dataset with large SNPs-to-samples ratio, thus overcoming the main limitations of current genome-to-phenome association methods.</p>","PeriodicalId":73090,"journal":{"name":"Frontiers in molecular medicine","volume":" ","pages":"830956"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11285669/pdf/","citationCount":"0","resultStr":"{\"title\":\"Decoding Genetic Markers of Multiple Phenotypic Layers Through Biologically Constrained Genome-To-Phenome Bayesian Sparse Regression.\",\"authors\":\"Marie Deprez, Julien Moreira, Maxime Sermesant, Marco Lorenzi\",\"doi\":\"10.3389/fmmed.2022.830956\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The applicability of multivariate approaches for the joint analysis of genomics and phenomics information is currently limited by the lack of scalability, and by the difficulty of interpreting the related findings from a biological perspective. To tackle these limitations, we present Bayesian Genome-to-Phenome Sparse Regression (G2PSR), a novel multivariate regression method based on sparse SNP-gene constraints. The statistical framework of G2PSR is based on a Bayesian neural network, were constraints on SNPs-genes associations are integrated by incorporating <i>a priori</i> knowledge linking variants to their respective genes, to then reconstruct the phenotypic data in the output layer. Interpretability is promoted by inducing sparsity on the genes through variational dropout, allowing to estimate the uncertainty associated with each gene, and related SNPs, in the reconstruction task. Ultimately, G2PSR is conceived to prevent multiple testing correction and to assess the combined effect of SNPs, thus increasing the statistical power in detecting genome-to-phenome associations. The effectiveness of G2PSR was demonstrated on synthetic and real data, with respect to state-of-the-art methods based on group-wise sparsity constraints. The application on real data consisted in an imaging-genetics analysis on the Alzheimer's Disease Neuroimaging Initiative data, relating SNPs from more than 3,500 genes to clinical and multi-variate brain volumetric information. The experimental results show that our method can provide accurate selection of relevant genes in dataset with large SNPs-to-samples ratio, thus overcoming the main limitations of current genome-to-phenome association methods.</p>\",\"PeriodicalId\":73090,\"journal\":{\"name\":\"Frontiers in molecular medicine\",\"volume\":\" \",\"pages\":\"830956\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11285669/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in molecular medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fmmed.2022.830956\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2022/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in molecular medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fmmed.2022.830956","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目前,由于缺乏可扩展性以及难以从生物学角度解释相关发现,多元方法在基因组学和表型信息联合分析中的适用性受到限制。为了解决这些局限性,我们提出了贝叶斯基因组-表型稀疏回归(G2PSR),这是一种基于稀疏SNP基因约束的新的多元回归方法。G2PSR的统计框架基于贝叶斯神经网络,通过结合将变体连接到其各自基因的先验知识来整合SNPs基因关联的约束,然后在输出层中重建表型数据。通过变分缺失诱导基因的稀疏性,从而在重建任务中估计与每个基因和相关SNPs相关的不确定性,从而提高了可解释性。最终,G2PSR旨在防止多重检测校正,并评估SNPs的综合效应,从而提高检测基因组与现象关联的统计能力。G2PSR的有效性在合成和真实数据上得到了证明,与基于组稀疏性约束的最先进方法相比。实际数据的应用包括对阿尔茨海默病神经成像倡议数据的成像遗传学分析,将3500多个基因的SNPs与临床和多变量大脑体积信息联系起来。实验结果表明,我们的方法可以在SNPs与样本比例较大的数据集中准确选择相关基因,从而克服了当前基因组与现象关联方法的主要局限性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Decoding Genetic Markers of Multiple Phenotypic Layers Through Biologically Constrained Genome-To-Phenome Bayesian Sparse Regression.

The applicability of multivariate approaches for the joint analysis of genomics and phenomics information is currently limited by the lack of scalability, and by the difficulty of interpreting the related findings from a biological perspective. To tackle these limitations, we present Bayesian Genome-to-Phenome Sparse Regression (G2PSR), a novel multivariate regression method based on sparse SNP-gene constraints. The statistical framework of G2PSR is based on a Bayesian neural network, were constraints on SNPs-genes associations are integrated by incorporating a priori knowledge linking variants to their respective genes, to then reconstruct the phenotypic data in the output layer. Interpretability is promoted by inducing sparsity on the genes through variational dropout, allowing to estimate the uncertainty associated with each gene, and related SNPs, in the reconstruction task. Ultimately, G2PSR is conceived to prevent multiple testing correction and to assess the combined effect of SNPs, thus increasing the statistical power in detecting genome-to-phenome associations. The effectiveness of G2PSR was demonstrated on synthetic and real data, with respect to state-of-the-art methods based on group-wise sparsity constraints. The application on real data consisted in an imaging-genetics analysis on the Alzheimer's Disease Neuroimaging Initiative data, relating SNPs from more than 3,500 genes to clinical and multi-variate brain volumetric information. The experimental results show that our method can provide accurate selection of relevant genes in dataset with large SNPs-to-samples ratio, thus overcoming the main limitations of current genome-to-phenome association methods.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Editorial: Gene therapy and genome editing for metabolic liver disorders. An artificial transcription factor that activates potent interferon-γ expression in human Jurkat T Cells. Immune-checkpoint-inhibitor therapy directed against PD-L1 is tolerated in the heart without manifestation of cardiac inflammation in a preclinical reversible melanoma mouse model. Human-specific gene ARHGAP11B-potentially an additional tool in the treatment of neurodegenerative diseases? DeltaRex-G, tumor targeted retrovector encoding a CCNG1 inhibitor, for CAR-T cell therapy induced cytokine release syndrome.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1