重构基因内变异基因型相关性矩阵,联合分析推算数据和测序数据

IF 0.6 4区 生物学 Q4 GENETICS & HEREDITY Russian Journal of Genetics Pub Date : 2024-07-27 DOI:10.1134/s1022795424700418
G. R. Svishcheva, A. V. Kirichenko, N. M. Belonogova, E. E. Elgaeva, Ya. A. Tsepilov, I. V. Zorkoltseva, T. I. Axenovich
{"title":"重构基因内变异基因型相关性矩阵,联合分析推算数据和测序数据","authors":"G. R. Svishcheva, A. V. Kirichenko, N. M. Belonogova, E. E. Elgaeva, Ya. A. Tsepilov, I. V. Zorkoltseva, T. I. Axenovich","doi":"10.1134/s1022795424700418","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">\n<b>Abstract</b>—</h3><p>When combining imputed and sequenced data in a single gene-based association analysis, the problem of reconstructing genetic correlation matrices arises. It is related to the fact that the correlations between genotypes of all imputed variants and the correlations between genotypes of all sequenced variants are known for a gene but we do not know the correlations between genotypes of variants, one of which is imputed, and the other is sequenced. To recover these correlations, we propose an efficient method based on maximising the determinant of the matrix. This method has a number of useful properties and an analytical solution for our task. Approbation of the proposed method was performed by comparing reconstructed and real correlation matrices constructed on individual genotypes from the UK Biobank. Comparison of the results of gene-based association analysis performed by the SKAT, BT, and PCA methods on reconstructed and real matrices using modelled summary statistics and calculated summary statistics on real phenotypes showed high quality of reconstruction and robustness of the method to different gene structures.</p>","PeriodicalId":21441,"journal":{"name":"Russian Journal of Genetics","volume":"17 1","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reconstruction of a Matrix of Genotypic Correlations between Variants within a Gene for Joint Analysis of Imputed and Sequenced Data\",\"authors\":\"G. R. Svishcheva, A. V. Kirichenko, N. M. Belonogova, E. E. Elgaeva, Ya. A. Tsepilov, I. V. Zorkoltseva, T. I. Axenovich\",\"doi\":\"10.1134/s1022795424700418\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3 data-test=\\\"abstract-sub-heading\\\">\\n<b>Abstract</b>—</h3><p>When combining imputed and sequenced data in a single gene-based association analysis, the problem of reconstructing genetic correlation matrices arises. It is related to the fact that the correlations between genotypes of all imputed variants and the correlations between genotypes of all sequenced variants are known for a gene but we do not know the correlations between genotypes of variants, one of which is imputed, and the other is sequenced. To recover these correlations, we propose an efficient method based on maximising the determinant of the matrix. This method has a number of useful properties and an analytical solution for our task. Approbation of the proposed method was performed by comparing reconstructed and real correlation matrices constructed on individual genotypes from the UK Biobank. Comparison of the results of gene-based association analysis performed by the SKAT, BT, and PCA methods on reconstructed and real matrices using modelled summary statistics and calculated summary statistics on real phenotypes showed high quality of reconstruction and robustness of the method to different gene structures.</p>\",\"PeriodicalId\":21441,\"journal\":{\"name\":\"Russian Journal of Genetics\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":0.6000,\"publicationDate\":\"2024-07-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Russian Journal of Genetics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1134/s1022795424700418\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Russian Journal of Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1134/s1022795424700418","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

摘要-在基于单个基因的关联分析中结合估算数据和测序数据时,会出现重建遗传相关矩阵的问题。这与以下事实有关:对于一个基因,所有估算变异体的基因型之间的相关性和所有测序变异体的基因型之间的相关性是已知的,但我们不知道变异体(其中一个是估算的,另一个是测序的)的基因型之间的相关性。为了恢复这些相关性,我们提出了一种基于矩阵行列式最大化的有效方法。该方法具有许多有用的特性,并为我们的任务提供了一个分析解决方案。通过比较在英国生物库的单个基因型上构建的重建相关矩阵和真实相关矩阵,对所提出的方法进行了验证。通过比较 SKAT、BT 和 PCA 方法对重建矩阵和真实矩阵进行的基于基因的关联分析结果(使用建模的汇总统计量和计算的真实表型汇总统计量),显示了该方法重建的高质量和对不同基因结构的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Reconstruction of a Matrix of Genotypic Correlations between Variants within a Gene for Joint Analysis of Imputed and Sequenced Data

Abstract

When combining imputed and sequenced data in a single gene-based association analysis, the problem of reconstructing genetic correlation matrices arises. It is related to the fact that the correlations between genotypes of all imputed variants and the correlations between genotypes of all sequenced variants are known for a gene but we do not know the correlations between genotypes of variants, one of which is imputed, and the other is sequenced. To recover these correlations, we propose an efficient method based on maximising the determinant of the matrix. This method has a number of useful properties and an analytical solution for our task. Approbation of the proposed method was performed by comparing reconstructed and real correlation matrices constructed on individual genotypes from the UK Biobank. Comparison of the results of gene-based association analysis performed by the SKAT, BT, and PCA methods on reconstructed and real matrices using modelled summary statistics and calculated summary statistics on real phenotypes showed high quality of reconstruction and robustness of the method to different gene structures.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Russian Journal of Genetics
Russian Journal of Genetics 生物-遗传学
CiteScore
1.00
自引率
33.30%
发文量
126
审稿时长
1 months
期刊介绍: Russian Journal of Genetics is a journal intended to make significant contribution to the development of genetics. The journal publishes reviews and experimental papers in the areas of theoretical and applied genetics. It presents fundamental research on genetic processes at molecular, cell, organism, and population levels, including problems of the conservation and rational management of genetic resources and the functional genomics, evolutionary genomics and medical genetics.
期刊最新文献
The Molecular Genetic Characteristics of the Mutant Strain B-162/2 of the Bacteria Pseudomonas chlororaphis subsp. aurantiaca Genome-Wide Analysis in the Study of the Fetal Growth Restriction Pathogenetics Differentiation and Taxonomic Identification of Roburoid Oaks in the Caucasian and Crimean Regions Using Nuclear Microsatellite Markers Analysis of the Calpastatin (CAST  ) and Androgen Receptor (AR) Gene Polymorphisms as Biomarkers for Meat Quality Traits in Reindeer Rangifer tarandus Analysis of the Genetic Structures of 29 Horse Breeds of Russian Selection by STR Markers
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1