Pub Date : 2024-11-07Epub Date: 2024-10-14DOI: 10.1016/j.ajhg.2024.09.005
Daniel W Bellott, Jennifer F Hughes, Helen Skaletsky, Erik C Owen, David C Page
A recent publication describing the assembly of the Y chromosomes of 43 males was remarkable not only for its ambitious technical scope but also for the startling suggestion that the boundary of the pseudoautosomal region 1 (PAR1), where the human X and Y chromosomes engage in crossing-over during male meiosis, lies 500 kb distal to its previously reported location. Where is the boundary of the human PAR1? We first review the evidence that mapped the PAR boundary, or PAB, before the human genome draft sequence was produced, then examine post-genomic datasets for evidence of crossing-over between the X and Y, and lastly re-examine contiguous sequence assemblies of the PAR-NPY boundary to see whether they support a more distal PAB. We find ample evidence of X-Y crossovers throughout the 500 kb in question, some as close as 246 bp to the previously reported PAB. Our new analyses, combined with previous studies over the past 40 years, provide overwhelming evidence to support the original position and narrow the probable location of the PAB to a 201-bp window.
最近发表的一篇文章描述了 43 条男性 Y 染色体的组装过程,这篇文章不仅因其雄心勃勃的技术范围而引人注目,而且还令人吃惊地发现,人类 X 染色体和 Y 染色体在男性减数分裂过程中发生交叉的假常染色体区 1(PAR1)的边界,与之前报道的位置相差 500 kb。人类 PAR1 的边界在哪里?我们首先回顾了在人类基因组草案序列产生之前绘制 PAR 边界或 PAB 的证据,然后检查了基因组之后的数据集,以寻找 X 和 Y 染色体交叉的证据,最后重新检查了 PAR-NPY 边界的连续序列组装,以确定它们是否支持更远的 PAB。我们发现在整个 500 kb 的范围内存在大量 X-Y 交叉的证据,其中一些证据与之前报道的 PAB 相差 246 bp。我们的新分析与过去 40 年的研究相结合,提供了大量证据支持最初的位置,并将 PAB 的可能位置缩小到 201 bp 窗口。
{"title":"Where is the boundary of the human pseudoautosomal region?","authors":"Daniel W Bellott, Jennifer F Hughes, Helen Skaletsky, Erik C Owen, David C Page","doi":"10.1016/j.ajhg.2024.09.005","DOIUrl":"10.1016/j.ajhg.2024.09.005","url":null,"abstract":"<p><p>A recent publication describing the assembly of the Y chromosomes of 43 males was remarkable not only for its ambitious technical scope but also for the startling suggestion that the boundary of the pseudoautosomal region 1 (PAR1), where the human X and Y chromosomes engage in crossing-over during male meiosis, lies 500 kb distal to its previously reported location. Where is the boundary of the human PAR1? We first review the evidence that mapped the PAR boundary, or PAB, before the human genome draft sequence was produced, then examine post-genomic datasets for evidence of crossing-over between the X and Y, and lastly re-examine contiguous sequence assemblies of the PAR-NPY boundary to see whether they support a more distal PAB. We find ample evidence of X-Y crossovers throughout the 500 kb in question, some as close as 246 bp to the previously reported PAB. Our new analyses, combined with previous studies over the past 40 years, provide overwhelming evidence to support the original position and narrow the probable location of the PAB to a 201-bp window.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2530-2541"},"PeriodicalIF":5.4,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568759/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142455978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-07Epub Date: 2024-10-28DOI: 10.1016/j.ajhg.2024.09.009
Morten Dybdahl Krebs, Kajsa-Lotta Georgii Hellberg, Mischa Lundberg, Vivek Appadurai, Henrik Ohlsson, Emil Pedersen, Jette Steinbach, Jamie Matthews, Richard Border, Sonja LaBianca, Xabier Calle, Joeri J Meijsen, Andrés Ingason, Alfonso Buil, Bjarni J Vilhjálmsson, Jonathan Flint, Silviu-Alin Bacanu, Na Cai, Andy Dahl, Noah Zaitlen, Thomas Werge, Kenneth S Kendler, Andrew J Schork
Large biobank samples provide an opportunity to integrate broad phenotyping, familial records, and molecular genetics data to study complex traits and diseases. We introduce Pearson-Aitken Family Genetic Risk Scores (PA-FGRS), a method for estimating disease liability from patterns of diagnoses in extended, age-censored genealogical records. We then apply the method to study a paradigmatic complex disorder, major depressive disorder (MDD), using the iPSYCH2015 case-cohort study of 30,949 MDD cases, 39,655 random population controls, and more than 2 million relatives. We show that combining PA-FGRS liabilities estimated from family records with molecular genotypes of probands improves three lines of inquiry. Incorporating PA-FGRS liabilities improves classification of MDD over and above polygenic scores, identifies robust genetic contributions to clinical heterogeneity in MDD associated with comorbidity, recurrence, and severity and can improve the power of genome-wide association studies. Our method is flexible and easy to use, and our study approaches are generalizable to other datasets and other complex traits and diseases.
{"title":"Genetic liability estimated from large-scale family data improves genetic prediction, risk score profiling, and gene mapping for major depression.","authors":"Morten Dybdahl Krebs, Kajsa-Lotta Georgii Hellberg, Mischa Lundberg, Vivek Appadurai, Henrik Ohlsson, Emil Pedersen, Jette Steinbach, Jamie Matthews, Richard Border, Sonja LaBianca, Xabier Calle, Joeri J Meijsen, Andrés Ingason, Alfonso Buil, Bjarni J Vilhjálmsson, Jonathan Flint, Silviu-Alin Bacanu, Na Cai, Andy Dahl, Noah Zaitlen, Thomas Werge, Kenneth S Kendler, Andrew J Schork","doi":"10.1016/j.ajhg.2024.09.009","DOIUrl":"10.1016/j.ajhg.2024.09.009","url":null,"abstract":"<p><p>Large biobank samples provide an opportunity to integrate broad phenotyping, familial records, and molecular genetics data to study complex traits and diseases. We introduce Pearson-Aitken Family Genetic Risk Scores (PA-FGRS), a method for estimating disease liability from patterns of diagnoses in extended, age-censored genealogical records. We then apply the method to study a paradigmatic complex disorder, major depressive disorder (MDD), using the iPSYCH2015 case-cohort study of 30,949 MDD cases, 39,655 random population controls, and more than 2 million relatives. We show that combining PA-FGRS liabilities estimated from family records with molecular genotypes of probands improves three lines of inquiry. Incorporating PA-FGRS liabilities improves classification of MDD over and above polygenic scores, identifies robust genetic contributions to clinical heterogeneity in MDD associated with comorbidity, recurrence, and severity and can improve the power of genome-wide association studies. Our method is flexible and easy to use, and our study approaches are generalizable to other datasets and other complex traits and diseases.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2494-2509"},"PeriodicalIF":5.4,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568754/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142543072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-07DOI: 10.1016/j.ajhg.2024.10.001
Milena Deal, Asha Kar, Seung Hyuk T Lee, Marcus Alvarez, Sandhya Rajkumar, Uma Thanigai Arasu, Dorota Kaminska, Ville Männistö, Sini Heinonen, Birgitta W van der Kolk, Ulla Säiläkivi, Tuure Saarinen, Anne Juuti, Jussi Pihlajamäki, Minna U Kaikkonen, Markku Laakso, Kirsi H Pietiläinen, Päivi Pajukanta
Mechanisms of abdominal obesity GWAS variants have remained largely unknown. To elucidate these mechanisms, we leveraged subcutaneous adipose tissue (SAT) single nucleus RNA-sequencing and genomics data. After discovering that heritability of abdominal obesity is enriched in adipocytes, we focused on a SAT unique adipocyte marker gene, the transcription factor TBX15, and its abdominal obesity-associated deleterious missense variant, rs10494217. The allele frequency of rs10494217 revealed a north-to-south decreasing gradient, with consistent significant FST values observed for 25 different populations when compared to Finns, a population with a history of genetic isolation. Given the role of Tbx15 in mouse thermogenesis, the frequency may have increased as an adaptation to cold in Finns. Our selection analysis provided significant evidence of selection for the abdominal obesity risk allele T of rs10494217 in Finns, with a north-to-south decreasing trend in other populations, and demonstrated that latitude significantly predicts the allele frequency. We also discovered that the risk allele status significantly affects SAT adipocyte expression of multiple adipocyte marker genes in trans in two cohorts. Two of these trans genes have been connected to thermogenesis, supporting the thermogenic effect of the TBX15 missense variant as a possible cause of its selection. Adipose expression of one trans gene, a lncRNA, AC002066.1, was strongly associated with adipocyte size, implicating it in metabolically unhealthy adipocyte hypertrophy. In summary, the abdominal obesity variant rs10494217 was selected in Finns, and individuals with the risk allele have trans effects on adipocyte expression of genes relating to thermogenesis and adipocyte hypertrophy.
{"title":"An abdominal obesity missense variant in the adipocyte thermogenesis gene TBX15 is implicated in adaptation to cold in Finns.","authors":"Milena Deal, Asha Kar, Seung Hyuk T Lee, Marcus Alvarez, Sandhya Rajkumar, Uma Thanigai Arasu, Dorota Kaminska, Ville Männistö, Sini Heinonen, Birgitta W van der Kolk, Ulla Säiläkivi, Tuure Saarinen, Anne Juuti, Jussi Pihlajamäki, Minna U Kaikkonen, Markku Laakso, Kirsi H Pietiläinen, Päivi Pajukanta","doi":"10.1016/j.ajhg.2024.10.001","DOIUrl":"10.1016/j.ajhg.2024.10.001","url":null,"abstract":"<p><p>Mechanisms of abdominal obesity GWAS variants have remained largely unknown. To elucidate these mechanisms, we leveraged subcutaneous adipose tissue (SAT) single nucleus RNA-sequencing and genomics data. After discovering that heritability of abdominal obesity is enriched in adipocytes, we focused on a SAT unique adipocyte marker gene, the transcription factor TBX15, and its abdominal obesity-associated deleterious missense variant, rs10494217. The allele frequency of rs10494217 revealed a north-to-south decreasing gradient, with consistent significant F<sub>ST</sub> values observed for 25 different populations when compared to Finns, a population with a history of genetic isolation. Given the role of Tbx15 in mouse thermogenesis, the frequency may have increased as an adaptation to cold in Finns. Our selection analysis provided significant evidence of selection for the abdominal obesity risk allele T of rs10494217 in Finns, with a north-to-south decreasing trend in other populations, and demonstrated that latitude significantly predicts the allele frequency. We also discovered that the risk allele status significantly affects SAT adipocyte expression of multiple adipocyte marker genes in trans in two cohorts. Two of these trans genes have been connected to thermogenesis, supporting the thermogenic effect of the TBX15 missense variant as a possible cause of its selection. Adipose expression of one trans gene, a lncRNA, AC002066.1, was strongly associated with adipocyte size, implicating it in metabolically unhealthy adipocyte hypertrophy. In summary, the abdominal obesity variant rs10494217 was selected in Finns, and individuals with the risk allele have trans effects on adipocyte expression of genes relating to thermogenesis and adipocyte hypertrophy.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":"111 11","pages":"2542-2560"},"PeriodicalIF":5.4,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568758/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142602784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-07Epub Date: 2024-10-02DOI: 10.1016/j.ajhg.2024.09.001
Yu Chen, Sihan Liu, Zongyao Ren, Feiran Wang, Qiuman Liang, Yi Jiang, Rujia Dai, Fangyuan Duan, Cong Han, Zhilin Ning, Yan Xia, Miao Li, Kai Yuan, Wenying Qiu, Xiao-Xin Yan, Jiapei Dai, Richard F Kopp, Jufang Huang, Shuhua Xu, Beisha Tang, Lingqian Wu, Eric R Gamazon, Tim Bigdeli, Elliot Gershon, Hailiang Huang, Chao Ma, Chunyu Liu, Chao Chen
Research on brain expression quantitative trait loci (eQTLs) has illuminated the genetic underpinnings of schizophrenia (SCZ). Yet most of these studies have been centered on European populations, leading to a constrained understanding of population diversities and disease risks. To address this gap, we examined genotype and RNA-seq data from African Americans (AA, n = 158), Europeans (EUR, n = 408), and East Asians (EAS, n = 217). When comparing eQTLs between EUR and non-EUR populations, we observed concordant patterns of genetic regulatory effect, particularly in terms of the effect sizes of the eQTLs. However, 343,737 cis-eQTLs linked to 1,276 genes and 198,769 SNPs were found to be specific to non-EUR populations. Over 90% of observed population differences in eQTLs could be traced back to differences in allele frequency. Furthermore, 35% of these eQTLs were notably rare in the EUR population. Integrating brain eQTLs with SCZ signals from diverse populations, we observed a higher disease heritability enrichment of brain eQTLs in matched populations compared to mismatched ones. Prioritization analysis identified five risk genes (SFXN2, VPS37B, DENR, FTCDNL1, and NT5DC2) and three potential regulatory variants in known risk genes (CNNM2, MTRFR, and MPHOSPH9) that were missed in the EUR dataset. Our findings underscore that increasing genetic ancestral diversity is more efficient for power improvement than merely increasing the sample size within single-ancestry eQTLs datasets. Such a strategy will not only improve our understanding of the biological underpinnings of population structures but also pave the way for the identification of risk genes in SCZ.
{"title":"Cross-ancestry analysis of brain QTLs enhances interpretation of schizophrenia genome-wide association studies.","authors":"Yu Chen, Sihan Liu, Zongyao Ren, Feiran Wang, Qiuman Liang, Yi Jiang, Rujia Dai, Fangyuan Duan, Cong Han, Zhilin Ning, Yan Xia, Miao Li, Kai Yuan, Wenying Qiu, Xiao-Xin Yan, Jiapei Dai, Richard F Kopp, Jufang Huang, Shuhua Xu, Beisha Tang, Lingqian Wu, Eric R Gamazon, Tim Bigdeli, Elliot Gershon, Hailiang Huang, Chao Ma, Chunyu Liu, Chao Chen","doi":"10.1016/j.ajhg.2024.09.001","DOIUrl":"10.1016/j.ajhg.2024.09.001","url":null,"abstract":"<p><p>Research on brain expression quantitative trait loci (eQTLs) has illuminated the genetic underpinnings of schizophrenia (SCZ). Yet most of these studies have been centered on European populations, leading to a constrained understanding of population diversities and disease risks. To address this gap, we examined genotype and RNA-seq data from African Americans (AA, n = 158), Europeans (EUR, n = 408), and East Asians (EAS, n = 217). When comparing eQTLs between EUR and non-EUR populations, we observed concordant patterns of genetic regulatory effect, particularly in terms of the effect sizes of the eQTLs. However, 343,737 cis-eQTLs linked to 1,276 genes and 198,769 SNPs were found to be specific to non-EUR populations. Over 90% of observed population differences in eQTLs could be traced back to differences in allele frequency. Furthermore, 35% of these eQTLs were notably rare in the EUR population. Integrating brain eQTLs with SCZ signals from diverse populations, we observed a higher disease heritability enrichment of brain eQTLs in matched populations compared to mismatched ones. Prioritization analysis identified five risk genes (SFXN2, VPS37B, DENR, FTCDNL1, and NT5DC2) and three potential regulatory variants in known risk genes (CNNM2, MTRFR, and MPHOSPH9) that were missed in the EUR dataset. Our findings underscore that increasing genetic ancestral diversity is more efficient for power improvement than merely increasing the sample size within single-ancestry eQTLs datasets. Such a strategy will not only improve our understanding of the biological underpinnings of population structures but also pave the way for the identification of risk genes in SCZ.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2444-2457"},"PeriodicalIF":8.1,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568756/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142370763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-07Epub Date: 2024-10-01DOI: 10.1016/j.ajhg.2024.09.002
Xiaoyu Yin, Marcy Richardson, Andreas Laner, Xuemei Shi, Elisabet Ognedal, Valeria Vasta, Thomas V O Hansen, Marta Pineda, Deborah Ritter, Johan de Dunnen, Emadeldin Hassanin, Wencong Lyman Lin, Ester Borras, Karl Krahn, Margareta Nordling, Alexandra Martins, Khalid Mahmood, Emily Nadeau, Victoria Beshay, Carli Tops, Maurizio Genuardi, Tina Pesaran, Ian M Frayling, Gabriel Capellá, Andrew Latchford, Sean V Tavtigian, Carlo Maj, Sharon E Plon, Marc S Greenblatt, Finlay A Macrae, Isabel Spier, Stefan Aretz
Pathogenic constitutional APC variants underlie familial adenomatous polyposis, the most common hereditary gastrointestinal polyposis syndrome. To improve variant classification and resolve the interpretative challenges of variants of uncertain significance (VUSs), APC-specific variant classification criteria were developed by the ClinGen-InSiGHT Hereditary Colorectal Cancer/Polyposis Variant Curation Expert Panel (VCEP) based on the criteria of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP). A streamlined algorithm using the APC-specific criteria was developed and applied to assess all APC variants in ClinVar and the International Society for Gastrointestinal Hereditary Tumours (InSiGHT) international reference APC Leiden Open Variation Database (LOVD) variant database, which included a total of 10,228 unique APC variants. Among the ClinVar and LOVD variants with an initial classification of (likely) benign or (likely) pathogenic, 94% and 96% remained in their original categories, respectively. In contrast, 41% ClinVar and 61% LOVD VUSs were reclassified into clinically meaningful classes, the vast majority as (likely) benign. The total number of VUSs was reduced by 37%. In 24 out of 37 (65%) promising APC variants that remained VUS despite evidence for pathogenicity, a data-mining-driven work-up allowed their reclassification as (likely) pathogenic. These results demonstrated that the application of APC-specific criteria substantially reduced the number of VUSs in ClinVar and LOVD. The study also demonstrated the feasibility of a systematic approach to variant classification in large datasets, which might serve as a generalizable model for other gene- or disease-specific variant interpretation initiatives. It also allowed for the prioritization of VUSs that will benefit from in-depth evidence collection. This subset of APC variants was approved by the VCEP and made publicly available through ClinVar and LOVD for widespread clinical use.
{"title":"Large-scale application of ClinGen-InSiGHT APC-specific ACMG/AMP variant classification criteria leads to substantial reduction in VUS.","authors":"Xiaoyu Yin, Marcy Richardson, Andreas Laner, Xuemei Shi, Elisabet Ognedal, Valeria Vasta, Thomas V O Hansen, Marta Pineda, Deborah Ritter, Johan de Dunnen, Emadeldin Hassanin, Wencong Lyman Lin, Ester Borras, Karl Krahn, Margareta Nordling, Alexandra Martins, Khalid Mahmood, Emily Nadeau, Victoria Beshay, Carli Tops, Maurizio Genuardi, Tina Pesaran, Ian M Frayling, Gabriel Capellá, Andrew Latchford, Sean V Tavtigian, Carlo Maj, Sharon E Plon, Marc S Greenblatt, Finlay A Macrae, Isabel Spier, Stefan Aretz","doi":"10.1016/j.ajhg.2024.09.002","DOIUrl":"10.1016/j.ajhg.2024.09.002","url":null,"abstract":"<p><p>Pathogenic constitutional APC variants underlie familial adenomatous polyposis, the most common hereditary gastrointestinal polyposis syndrome. To improve variant classification and resolve the interpretative challenges of variants of uncertain significance (VUSs), APC-specific variant classification criteria were developed by the ClinGen-InSiGHT Hereditary Colorectal Cancer/Polyposis Variant Curation Expert Panel (VCEP) based on the criteria of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP). A streamlined algorithm using the APC-specific criteria was developed and applied to assess all APC variants in ClinVar and the International Society for Gastrointestinal Hereditary Tumours (InSiGHT) international reference APC Leiden Open Variation Database (LOVD) variant database, which included a total of 10,228 unique APC variants. Among the ClinVar and LOVD variants with an initial classification of (likely) benign or (likely) pathogenic, 94% and 96% remained in their original categories, respectively. In contrast, 41% ClinVar and 61% LOVD VUSs were reclassified into clinically meaningful classes, the vast majority as (likely) benign. The total number of VUSs was reduced by 37%. In 24 out of 37 (65%) promising APC variants that remained VUS despite evidence for pathogenicity, a data-mining-driven work-up allowed their reclassification as (likely) pathogenic. These results demonstrated that the application of APC-specific criteria substantially reduced the number of VUSs in ClinVar and LOVD. The study also demonstrated the feasibility of a systematic approach to variant classification in large datasets, which might serve as a generalizable model for other gene- or disease-specific variant interpretation initiatives. It also allowed for the prioritization of VUSs that will benefit from in-depth evidence collection. This subset of APC variants was approved by the VCEP and made publicly available through ClinVar and LOVD for widespread clinical use.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2427-2443"},"PeriodicalIF":8.1,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568752/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142363999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-21DOI: 10.1016/j.ajhg.2024.09.008
Yosuke Tanigawa,Manolis Kellis
Balancing the tradeoff between quantity and quality of phenotypic data is critical in omics studies. Measurements below the limit of quantification (BLQ) are often tagged in quality control fields, but these flags are currently underutilized in human genetics studies. Extreme phenotype sampling is advantageous for mapping rare variant effects. We hypothesize that genetic drivers, along with environmental and technical factors, contribute to the presence of BLQ flags. Here, we introduce "hypometric genetics" (hMG) analysis and uncover a genetic basis for BLQ flags, indicating an additional source of genetic signal for genetic discovery, especially from phenotypic extremes. Applying our hMG approach to n = 227,469 UK Biobank individuals with metabolomic profiles, we reveal more than 5% heritability for BLQ flags and report biologically relevant associations, for example, at APOC3, APOA5, and PDE3B loci. For common variants, polygenic scores trained only for BLQ flags predict the corresponding quantitative traits with 91% accuracy, validating the genetic basis. For rare coding variant associations, we find an asymmetric 65.4% higher enrichment of metabolite-lowering associations for BLQ flags, highlighting the impact of putative loss-of-function variants with large effects on phenotypic extremes. Joint analysis of binarized BLQ flags and the corresponding quantitative metabolite measurements improves power in Bayesian rare variant aggregation tests, resulting in an average of 181% more prioritized genes. Our approach is broadly applicable to omics profiling. Overall, our results underscore the benefit of integrating quality control flags and quantitative measurements and highlight the advantage of joint analysis of population-based samples and phenotypic extremes in human genetics studies.
{"title":"Hypometric genetics: Improved power in genetic discovery by incorporating quality control flags.","authors":"Yosuke Tanigawa,Manolis Kellis","doi":"10.1016/j.ajhg.2024.09.008","DOIUrl":"https://doi.org/10.1016/j.ajhg.2024.09.008","url":null,"abstract":"Balancing the tradeoff between quantity and quality of phenotypic data is critical in omics studies. Measurements below the limit of quantification (BLQ) are often tagged in quality control fields, but these flags are currently underutilized in human genetics studies. Extreme phenotype sampling is advantageous for mapping rare variant effects. We hypothesize that genetic drivers, along with environmental and technical factors, contribute to the presence of BLQ flags. Here, we introduce \"hypometric genetics\" (hMG) analysis and uncover a genetic basis for BLQ flags, indicating an additional source of genetic signal for genetic discovery, especially from phenotypic extremes. Applying our hMG approach to n = 227,469 UK Biobank individuals with metabolomic profiles, we reveal more than 5% heritability for BLQ flags and report biologically relevant associations, for example, at APOC3, APOA5, and PDE3B loci. For common variants, polygenic scores trained only for BLQ flags predict the corresponding quantitative traits with 91% accuracy, validating the genetic basis. For rare coding variant associations, we find an asymmetric 65.4% higher enrichment of metabolite-lowering associations for BLQ flags, highlighting the impact of putative loss-of-function variants with large effects on phenotypic extremes. Joint analysis of binarized BLQ flags and the corresponding quantitative metabolite measurements improves power in Bayesian rare variant aggregation tests, resulting in an average of 181% more prioritized genes. Our approach is broadly applicable to omics profiling. Overall, our results underscore the benefit of integrating quality control flags and quantitative measurements and highlight the advantage of joint analysis of population-based samples and phenotypic extremes in human genetics studies.","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":"25 1","pages":""},"PeriodicalIF":9.8,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142489504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-03Epub Date: 2024-09-05DOI: 10.1016/j.ajhg.2024.08.008
Luoying Jiang, Shao Wei Hu, Zijing Wang, Yi Zhou, Honghai Tang, Yuxin Chen, Daqi Wang, Xintai Fan, Lei Han, Huawei Li, Dazhi Shi, Yingzi He, Yilai Shu
Gene therapy has made significant progress in the treatment of hereditary hearing loss. However, most research has focused on deafness-related genes that are primarily expressed in hair cells with less attention given to multisite-expressed deafness genes. MPZL2, the second leading cause of mild-to-moderate hereditary deafness, is widely expressed in different inner ear cells. We generated a mouse model with a deletion in the Mpzl2 gene, which displayed moderate and slowly progressive hearing loss, mimicking the phenotype of individuals with DFNB111. We developed a gene replacement therapy system mediated by AAV-ie for efficient transduction in various types of cochlear cells. AAV-ie-Mpzl2 administration significantly lowered the auditory brainstem response and distortion product otoacoustic emission thresholds of Mpzl2-/- mice for at least seven months. AAV-ie-Mpzl2 delivery restored the structural integrity in both outer hair cells and Deiters cells. This study suggests the potential of gene therapy for MPZL2-related deafness and provides a proof of concept for gene therapy targeting other deafness-related genes that are expressed in different cell populations in the cochlea.
{"title":"Hearing restoration by gene replacement therapy for a multisite-expressed gene in a mouse model of human DFNB111 deafness.","authors":"Luoying Jiang, Shao Wei Hu, Zijing Wang, Yi Zhou, Honghai Tang, Yuxin Chen, Daqi Wang, Xintai Fan, Lei Han, Huawei Li, Dazhi Shi, Yingzi He, Yilai Shu","doi":"10.1016/j.ajhg.2024.08.008","DOIUrl":"10.1016/j.ajhg.2024.08.008","url":null,"abstract":"<p><p>Gene therapy has made significant progress in the treatment of hereditary hearing loss. However, most research has focused on deafness-related genes that are primarily expressed in hair cells with less attention given to multisite-expressed deafness genes. MPZL2, the second leading cause of mild-to-moderate hereditary deafness, is widely expressed in different inner ear cells. We generated a mouse model with a deletion in the Mpzl2 gene, which displayed moderate and slowly progressive hearing loss, mimicking the phenotype of individuals with DFNB111. We developed a gene replacement therapy system mediated by AAV-ie for efficient transduction in various types of cochlear cells. AAV-ie-Mpzl2 administration significantly lowered the auditory brainstem response and distortion product otoacoustic emission thresholds of Mpzl2<sup>-/-</sup> mice for at least seven months. AAV-ie-Mpzl2 delivery restored the structural integrity in both outer hair cells and Deiters cells. This study suggests the potential of gene therapy for MPZL2-related deafness and provides a proof of concept for gene therapy targeting other deafness-related genes that are expressed in different cell populations in the cochlea.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2253-2264"},"PeriodicalIF":8.1,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11480802/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142144956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-03Epub Date: 2024-08-26DOI: 10.1016/j.ajhg.2024.07.021
Juehan Wang, Zixuan Zhang, Zeyun Lu, Nicholas Mancuso, Steven Gazal
Multi-ancestry genome-wide association studies (GWASs) have highlighted the existence of variants with ancestry-specific effect sizes. Understanding where and why these ancestry-specific effects occur is fundamental to understanding the genetic basis of human diseases and complex traits. Here, we characterized genes differentially expressed across ancestries (ancDE genes) at the cell-type level by leveraging single-cell RNA-sequencing data in peripheral blood mononuclear cells for 21 individuals with East Asian (EAS) ancestry and 23 individuals with European (EUR) ancestry (172,385 cells); then, we tested whether variants surrounding those genes were enriched in disease variants with ancestry-specific effect sizes by leveraging ancestry-matched GWASs of 31 diseases and complex traits (average n ∼ 90,000 and ∼ 267,000 in EAS and EUR, respectively). We observed that ancDE genes tended to be cell-type specific and enriched in genes interacting with the environment and in variants with ancestry-specific disease effect sizes, which suggests cell-type-specific, gene-by-environment interactions shared between regulatory and disease architectures. Finally, we illustrated how different environments might have led to ancestry-specific myeloid cell leukemia 1 (MCL1) expression in B cells and ancestry-specific allele effect sizes in lymphocyte count GWASs for variants surrounding MCL1. Our results imply that large single-cell and GWAS datasets from diverse ancestries are required to improve our understanding of human diseases.
{"title":"Genes with differential expression across ancestries are enriched in ancestry-specific disease effects likely due to gene-by-environment interactions.","authors":"Juehan Wang, Zixuan Zhang, Zeyun Lu, Nicholas Mancuso, Steven Gazal","doi":"10.1016/j.ajhg.2024.07.021","DOIUrl":"10.1016/j.ajhg.2024.07.021","url":null,"abstract":"<p><p>Multi-ancestry genome-wide association studies (GWASs) have highlighted the existence of variants with ancestry-specific effect sizes. Understanding where and why these ancestry-specific effects occur is fundamental to understanding the genetic basis of human diseases and complex traits. Here, we characterized genes differentially expressed across ancestries (ancDE genes) at the cell-type level by leveraging single-cell RNA-sequencing data in peripheral blood mononuclear cells for 21 individuals with East Asian (EAS) ancestry and 23 individuals with European (EUR) ancestry (172,385 cells); then, we tested whether variants surrounding those genes were enriched in disease variants with ancestry-specific effect sizes by leveraging ancestry-matched GWASs of 31 diseases and complex traits (average n ∼ 90,000 and ∼ 267,000 in EAS and EUR, respectively). We observed that ancDE genes tended to be cell-type specific and enriched in genes interacting with the environment and in variants with ancestry-specific disease effect sizes, which suggests cell-type-specific, gene-by-environment interactions shared between regulatory and disease architectures. Finally, we illustrated how different environments might have led to ancestry-specific myeloid cell leukemia 1 (MCL1) expression in B cells and ancestry-specific allele effect sizes in lymphocyte count GWASs for variants surrounding MCL1. Our results imply that large single-cell and GWAS datasets from diverse ancestries are required to improve our understanding of human diseases.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2117-2128"},"PeriodicalIF":8.1,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11480800/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142078889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-03Epub Date: 2024-09-02DOI: 10.1016/j.ajhg.2024.08.002
Patricia J Sullivan, Julian M W Quinn, Weilin Wu, Mark Pinese, Mark J Cowley
Variants that alter gene splicing are estimated to comprise up to a third of all disease-causing variants, yet they are hard to predict from DNA sequencing data alone. To overcome this, many groups are incorporating RNA-based analyses, which are resource intensive, particularly for diagnostic laboratories. There are thousands of functionally validated variants that induce mis-splicing; however, this information is not consolidated, and they are under-represented in ClinVar, which presents a barrier to variant interpretation and can result in duplication of validation efforts. To address this issue, we developed SpliceVarDB, an online database consolidating over 50,000 variants assayed for their effects on splicing in over 8,000 human genes. We evaluated over 500 published data sources and established a spliceogenicity scale to standardize, harmonize, and consolidate variant validation data generated by a range of experimental protocols. According to the strength of their supporting evidence, variants were classified as "splice-altering" (∼25%), "not splice-altering" (∼25%), and "low-frequency splice-altering" (∼50%), which correspond to weak or indeterminate evidence of spliceogenicity. Importantly, 55% of the splice-altering variants in SpliceVarDB are outside the canonical splice sites (5.6% are deep intronic). These variants can support the variant curation diagnostic pathway and can be used to provide the high-quality data necessary to develop more accurate in silico splicing predictors. The variants are accessible through an online platform, SpliceVarDB, with additional features for visualization, variant information, in silico predictions, and validation metrics. SpliceVarDB is a very large collection of splice-altering variants and is available at https://splicevardb.org.
{"title":"SpliceVarDB: A comprehensive database of experimentally validated human splicing variants.","authors":"Patricia J Sullivan, Julian M W Quinn, Weilin Wu, Mark Pinese, Mark J Cowley","doi":"10.1016/j.ajhg.2024.08.002","DOIUrl":"10.1016/j.ajhg.2024.08.002","url":null,"abstract":"<p><p>Variants that alter gene splicing are estimated to comprise up to a third of all disease-causing variants, yet they are hard to predict from DNA sequencing data alone. To overcome this, many groups are incorporating RNA-based analyses, which are resource intensive, particularly for diagnostic laboratories. There are thousands of functionally validated variants that induce mis-splicing; however, this information is not consolidated, and they are under-represented in ClinVar, which presents a barrier to variant interpretation and can result in duplication of validation efforts. To address this issue, we developed SpliceVarDB, an online database consolidating over 50,000 variants assayed for their effects on splicing in over 8,000 human genes. We evaluated over 500 published data sources and established a spliceogenicity scale to standardize, harmonize, and consolidate variant validation data generated by a range of experimental protocols. According to the strength of their supporting evidence, variants were classified as \"splice-altering\" (∼25%), \"not splice-altering\" (∼25%), and \"low-frequency splice-altering\" (∼50%), which correspond to weak or indeterminate evidence of spliceogenicity. Importantly, 55% of the splice-altering variants in SpliceVarDB are outside the canonical splice sites (5.6% are deep intronic). These variants can support the variant curation diagnostic pathway and can be used to provide the high-quality data necessary to develop more accurate in silico splicing predictors. The variants are accessible through an online platform, SpliceVarDB, with additional features for visualization, variant information, in silico predictions, and validation metrics. SpliceVarDB is a very large collection of splice-altering variants and is available at https://splicevardb.org.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2164-2175"},"PeriodicalIF":8.1,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11480807/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142124576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-03Epub Date: 2024-09-02DOI: 10.1016/j.ajhg.2024.08.001
Sara Mansoorshahi, Anji T Yetman, Malenka M Bissell, Yuli Y Kim, Hector I Michelena, Julie De Backer, Laura Muiño Mosquera, Dawn S Hui, Anthony Caffarelli, Maria G Andreassi, Ilenia Foffa, Dongchuan Guo, Rodolfo Citro, Margot De Marco, Justin T Tretter, Shaine A Morris, Simon C Body, Jessica X Chong, Michael J Bamshad, Dianna M Milewicz, Siddharth K Prakash
Bicuspid aortic valve (BAV) is the most common congenital heart lesion with an estimated population prevalence of 1%. We hypothesize that specific gene variants predispose to early-onset complications of BAV (EBAV). We analyzed whole-exome sequences (WESs) to identify rare coding variants that contribute to BAV disease in 215 EBAV-affected families. Predicted damaging variants in candidate genes with moderate or strong supportive evidence to cause developmental cardiac phenotypes were present in 107 EBAV-affected families (50% of total), including genes that cause BAV (9%) or heritable thoracic aortic disease (HTAD, 19%). After appropriate filtration, we also identified 129 variants in 54 candidate genes that are associated with autosomal-dominant congenital heart phenotypes, including recurrent deleterious variation of FBN2, MYH6, channelopathy genes, and type 1 and 5 collagen genes. These findings confirm our hypothesis that unique rare genetic variants drive early-onset presentations of BAV disease.
{"title":"Whole-exome sequencing uncovers the genetic complexity of bicuspid aortic valve in families with early-onset complications.","authors":"Sara Mansoorshahi, Anji T Yetman, Malenka M Bissell, Yuli Y Kim, Hector I Michelena, Julie De Backer, Laura Muiño Mosquera, Dawn S Hui, Anthony Caffarelli, Maria G Andreassi, Ilenia Foffa, Dongchuan Guo, Rodolfo Citro, Margot De Marco, Justin T Tretter, Shaine A Morris, Simon C Body, Jessica X Chong, Michael J Bamshad, Dianna M Milewicz, Siddharth K Prakash","doi":"10.1016/j.ajhg.2024.08.001","DOIUrl":"10.1016/j.ajhg.2024.08.001","url":null,"abstract":"<p><p>Bicuspid aortic valve (BAV) is the most common congenital heart lesion with an estimated population prevalence of 1%. We hypothesize that specific gene variants predispose to early-onset complications of BAV (EBAV). We analyzed whole-exome sequences (WESs) to identify rare coding variants that contribute to BAV disease in 215 EBAV-affected families. Predicted damaging variants in candidate genes with moderate or strong supportive evidence to cause developmental cardiac phenotypes were present in 107 EBAV-affected families (50% of total), including genes that cause BAV (9%) or heritable thoracic aortic disease (HTAD, 19%). After appropriate filtration, we also identified 129 variants in 54 candidate genes that are associated with autosomal-dominant congenital heart phenotypes, including recurrent deleterious variation of FBN2, MYH6, channelopathy genes, and type 1 and 5 collagen genes. These findings confirm our hypothesis that unique rare genetic variants drive early-onset presentations of BAV disease.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2219-2231"},"PeriodicalIF":8.1,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11480851/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142124577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}