Peter Orchard, Thomas W Blackwell, Linda Kachuri, Peter J Castaldi, Michael H Cho, Stephanie A Christenson, Peter Durda, Stacey Gabriel, Craig P Hersh, Scott Huntsman, Seungyong Hwang, Roby Joehanes, Mari Johnson, Xingnan Li, Honghuang Lin, Ching-Ti Liu, Yongmei Liu, Angel C Y Mak, Ani W Manichaikul, David Paik, Aabida Saferali, Joshua D Smith, Kent D Taylor, Russell P Tracy, Jiongming Wang, Mingqiang Wang, Joshua S Weinstock, Jeffrey Weiss, Heather E Wheeler, Ying Zhou, Sebastian Zoellner, Joseph C Wu, Luisa Mestroni, Sharon Graw, Matthew R G Taylor, Victor E Ortega, Craig W Johnson, Weiniu Gan, Goncalo Abecasis, Deborah A Nickerson, Namrata Gupta, Kristin Ardlie, Prescott G Woodruff, Yinan Zheng, Russell P Bowler, Deborah A Meyers, Alex Reiner, Charles Kooperberg, Elad Ziv, Vasan S Ramachandran, Martin G Larson, L Adrienne Cupples, Esteban G Burchard, Edwin K Silverman, Stephen S Rich, Nancy Heard-Costa, Hua Tang, Jerome I Rotter, Albert V Smith, Daniel Levy, François Aguet, Laura Scott, Laura M Raffield, Stephen C J Parker
{"title":"TOPMed中数量性状位点表达与剪接的交叉队列分析。","authors":"Peter Orchard, Thomas W Blackwell, Linda Kachuri, Peter J Castaldi, Michael H Cho, Stephanie A Christenson, Peter Durda, Stacey Gabriel, Craig P Hersh, Scott Huntsman, Seungyong Hwang, Roby Joehanes, Mari Johnson, Xingnan Li, Honghuang Lin, Ching-Ti Liu, Yongmei Liu, Angel C Y Mak, Ani W Manichaikul, David Paik, Aabida Saferali, Joshua D Smith, Kent D Taylor, Russell P Tracy, Jiongming Wang, Mingqiang Wang, Joshua S Weinstock, Jeffrey Weiss, Heather E Wheeler, Ying Zhou, Sebastian Zoellner, Joseph C Wu, Luisa Mestroni, Sharon Graw, Matthew R G Taylor, Victor E Ortega, Craig W Johnson, Weiniu Gan, Goncalo Abecasis, Deborah A Nickerson, Namrata Gupta, Kristin Ardlie, Prescott G Woodruff, Yinan Zheng, Russell P Bowler, Deborah A Meyers, Alex Reiner, Charles Kooperberg, Elad Ziv, Vasan S Ramachandran, Martin G Larson, L Adrienne Cupples, Esteban G Burchard, Edwin K Silverman, Stephen S Rich, Nancy Heard-Costa, Hua Tang, Jerome I Rotter, Albert V Smith, Daniel Levy, François Aguet, Laura Scott, Laura M Raffield, Stephen C J Parker","doi":"10.1101/2025.02.19.25322561","DOIUrl":null,"url":null,"abstract":"<p><p>Most genetic variants associated with complex traits and diseases occur in non-coding genomic regions and are hypothesized to regulate gene expression. To understand the genetics underlying gene expression variability, we characterize 14,324 ancestrally diverse RNA-sequencing samples from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and integrate whole genome sequencing data to perform <i>cis</i> and <i>trans</i> expression and splicing quantitative trait locus (<i>cis</i>-/trans-e/sQTL) analyses in six tissues and cell types, most notably whole blood (N=6,454) and lung (N=1,291). We show this dataset enables greater detection of secondary cis-e/sQTL signals than was achieved in previous studies, and that secondary cis-eQTL and primary trans-eQTL signal discovery is not saturated even though eGene discovery is. Most TOPMed trans-eQTL signals colocalize with cis-e/sQTL signals, suggesting many trans signals are mediated by cis signals. We fine-map European UK BioBank GWAS signals from 164 traits and colocalize the resulting 34,107 fine-mapped GWAS signals with TOPMed e/sQTL signals, finding that of 10,611 GWAS signals with a colocalization, 7,096 GWAS signals colocalize with at least one secondary e/sQTL signal. These results demonstrate that larger e/sQTL analyses will continue to uncover secondary e/sQTL signals, and that these new signals will benefit GWAS interpretation.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11875316/pdf/","citationCount":"0","resultStr":"{\"title\":\"Cross-cohort analysis of expression and splicing quantitative trait loci in TOPMed.\",\"authors\":\"Peter Orchard, Thomas W Blackwell, Linda Kachuri, Peter J Castaldi, Michael H Cho, Stephanie A Christenson, Peter Durda, Stacey Gabriel, Craig P Hersh, Scott Huntsman, Seungyong Hwang, Roby Joehanes, Mari Johnson, Xingnan Li, Honghuang Lin, Ching-Ti Liu, Yongmei Liu, Angel C Y Mak, Ani W Manichaikul, David Paik, Aabida Saferali, Joshua D Smith, Kent D Taylor, Russell P Tracy, Jiongming Wang, Mingqiang Wang, Joshua S Weinstock, Jeffrey Weiss, Heather E Wheeler, Ying Zhou, Sebastian Zoellner, Joseph C Wu, Luisa Mestroni, Sharon Graw, Matthew R G Taylor, Victor E Ortega, Craig W Johnson, Weiniu Gan, Goncalo Abecasis, Deborah A Nickerson, Namrata Gupta, Kristin Ardlie, Prescott G Woodruff, Yinan Zheng, Russell P Bowler, Deborah A Meyers, Alex Reiner, Charles Kooperberg, Elad Ziv, Vasan S Ramachandran, Martin G Larson, L Adrienne Cupples, Esteban G Burchard, Edwin K Silverman, Stephen S Rich, Nancy Heard-Costa, Hua Tang, Jerome I Rotter, Albert V Smith, Daniel Levy, François Aguet, Laura Scott, Laura M Raffield, Stephen C J Parker\",\"doi\":\"10.1101/2025.02.19.25322561\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Most genetic variants associated with complex traits and diseases occur in non-coding genomic regions and are hypothesized to regulate gene expression. To understand the genetics underlying gene expression variability, we characterize 14,324 ancestrally diverse RNA-sequencing samples from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and integrate whole genome sequencing data to perform <i>cis</i> and <i>trans</i> expression and splicing quantitative trait locus (<i>cis</i>-/trans-e/sQTL) analyses in six tissues and cell types, most notably whole blood (N=6,454) and lung (N=1,291). We show this dataset enables greater detection of secondary cis-e/sQTL signals than was achieved in previous studies, and that secondary cis-eQTL and primary trans-eQTL signal discovery is not saturated even though eGene discovery is. Most TOPMed trans-eQTL signals colocalize with cis-e/sQTL signals, suggesting many trans signals are mediated by cis signals. We fine-map European UK BioBank GWAS signals from 164 traits and colocalize the resulting 34,107 fine-mapped GWAS signals with TOPMed e/sQTL signals, finding that of 10,611 GWAS signals with a colocalization, 7,096 GWAS signals colocalize with at least one secondary e/sQTL signal. These results demonstrate that larger e/sQTL analyses will continue to uncover secondary e/sQTL signals, and that these new signals will benefit GWAS interpretation.</p>\",\"PeriodicalId\":94281,\"journal\":{\"name\":\"medRxiv : the preprint server for health sciences\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-02-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11875316/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv : the preprint server for health sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2025.02.19.25322561\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.02.19.25322561","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
大多数与复杂性状和疾病相关的遗传变异发生在非编码基因组区域,并被假设调节基因表达。为了了解基因表达变异的遗传学基础,我们对来自NHLBI trans- omics for Precision Medicine (TOPMed)项目的14324个祖先不同的rna测序样本进行了表征,并整合了全基因组测序数据,在6种组织和细胞类型中进行了顺式和反式表达和剪接数量性状位点(cis -/trans-e/sQTL)分析,其中最著名的是全血(N=6,454)和肺(N=1,291)。我们发现,该数据集比以前的研究能够更好地检测次要顺式-e/sQTL信号,并且次要顺式- eqtl和主要反式- eqtl信号发现并不饱和,即使eGene发现是饱和的。大多数TOPMed trans- eqtl信号与cis-e/sQTL信号共定位,表明许多trans信号是由cis信号介导的。我们对来自欧洲UK BioBank的164个性状的GWAS信号进行了精细定位,并将得到的34,107个GWAS信号与TOPMed e/sQTL信号进行了共定位,发现在10,611个具有共定位的GWAS信号中,有7,096个GWAS信号与至少一个二级e/sQTL信号共定位。这些结果表明,更大的e/sQTL分析将继续发现次要的e/sQTL信号,这些新信号将有利于GWAS的解释。
Cross-cohort analysis of expression and splicing quantitative trait loci in TOPMed.
Most genetic variants associated with complex traits and diseases occur in non-coding genomic regions and are hypothesized to regulate gene expression. To understand the genetics underlying gene expression variability, we characterize 14,324 ancestrally diverse RNA-sequencing samples from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and integrate whole genome sequencing data to perform cis and trans expression and splicing quantitative trait locus (cis-/trans-e/sQTL) analyses in six tissues and cell types, most notably whole blood (N=6,454) and lung (N=1,291). We show this dataset enables greater detection of secondary cis-e/sQTL signals than was achieved in previous studies, and that secondary cis-eQTL and primary trans-eQTL signal discovery is not saturated even though eGene discovery is. Most TOPMed trans-eQTL signals colocalize with cis-e/sQTL signals, suggesting many trans signals are mediated by cis signals. We fine-map European UK BioBank GWAS signals from 164 traits and colocalize the resulting 34,107 fine-mapped GWAS signals with TOPMed e/sQTL signals, finding that of 10,611 GWAS signals with a colocalization, 7,096 GWAS signals colocalize with at least one secondary e/sQTL signal. These results demonstrate that larger e/sQTL analyses will continue to uncover secondary e/sQTL signals, and that these new signals will benefit GWAS interpretation.