首页 > 最新文献

American journal of human genetics最新文献

英文 中文
Local genetic correlation via knockoffs reduces confounding due to cross-trait assortative mating. 通过 "山寨 "产生的局部遗传相关性减少了跨性状同配所造成的混杂。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-11-07 DOI: 10.1016/j.ajhg.2024.10.012
Shiyang Ma, Fan Wang, Richard Border, Joseph Buxbaum, Noah Zaitlen, Iuliana Ionita-Laza

Local genetic correlation analysis is an important tool for identifying genetic loci with shared biology across traits. Recently, Border et al. have shown that the results of these analyses are confounded by cross-trait assortative mating (xAM), leading to many false-positive findings. Here, we describe LAVA-Knock, a local genetic correlation method that builds off an existing genetic correlation method, LAVA, and augments it by generating synthetic data in a way that preserves local and long-range linkage disequilibrium (LD), allowing us to reduce the confounding induced by xAM. We show in simulations based on a realistic xAM model and in genome-wide association study (GWAS) applications for 630 trait pairs that LAVA-Knock can greatly reduce the bias due to xAM relative to LAVA. Furthermore, we show a significant positive correlation between the reduction in local genetic correlations and estimates in the literature of cross-mate phenotype correlations; in particular, pairs of traits that are known to have high cross-mate phenotype correlation values have a significantly higher reduction in the number of local genetic correlations compared with other trait pairs. A few representative examples include education and intelligence, education and alcohol consumption, and attention-deficit hyperactivity disorder and depression. These results suggest that LAVA-Knock can reduce confounding due to both short-range LD and long-range LD induced by xAM.

局部遗传相关性分析是确定具有跨性状共同生物学特性的遗传位点的重要工具。最近,Border 等人的研究表明,这些分析的结果会受到跨性状同配(xAM)的干扰,从而导致许多假阳性结果。在这里,我们介绍一种局部遗传相关方法 LAVA-Knock,它以现有的遗传相关方法 LAVA 为基础,并通过生成合成数据的方式对其进行增强,从而保留局部和长程连锁不平衡(LD),使我们能够减少 xAM 引起的混杂。我们在基于现实 xAM 模型的模拟和针对 630 个性状对的全基因组关联研究(GWAS)应用中表明,相对于 LAVA,LAVA-Knock 能大大减少 xAM 带来的偏差。此外,我们还发现,局部遗传相关性的降低与文献中对跨配偶表型相关性的估计之间存在显著的正相关;特别是,与其他性状对相比,已知具有较高跨配偶表型相关性值的性状对的局部遗传相关性数量的降低幅度明显更高。一些有代表性的例子包括教育与智力、教育与饮酒、注意力缺陷多动障碍与抑郁。这些结果表明,LAVA-Knock 可以减少由 xAM 引起的短程 LD 和长程 LD 所造成的混杂。
{"title":"Local genetic correlation via knockoffs reduces confounding due to cross-trait assortative mating.","authors":"Shiyang Ma, Fan Wang, Richard Border, Joseph Buxbaum, Noah Zaitlen, Iuliana Ionita-Laza","doi":"10.1016/j.ajhg.2024.10.012","DOIUrl":"https://doi.org/10.1016/j.ajhg.2024.10.012","url":null,"abstract":"<p><p>Local genetic correlation analysis is an important tool for identifying genetic loci with shared biology across traits. Recently, Border et al. have shown that the results of these analyses are confounded by cross-trait assortative mating (xAM), leading to many false-positive findings. Here, we describe LAVA-Knock, a local genetic correlation method that builds off an existing genetic correlation method, LAVA, and augments it by generating synthetic data in a way that preserves local and long-range linkage disequilibrium (LD), allowing us to reduce the confounding induced by xAM. We show in simulations based on a realistic xAM model and in genome-wide association study (GWAS) applications for 630 trait pairs that LAVA-Knock can greatly reduce the bias due to xAM relative to LAVA. Furthermore, we show a significant positive correlation between the reduction in local genetic correlations and estimates in the literature of cross-mate phenotype correlations; in particular, pairs of traits that are known to have high cross-mate phenotype correlation values have a significantly higher reduction in the number of local genetic correlations compared with other trait pairs. A few representative examples include education and intelligence, education and alcohol consumption, and attention-deficit hyperactivity disorder and depression. These results suggest that LAVA-Knock can reduce confounding due to both short-range LD and long-range LD induced by xAM.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":""},"PeriodicalIF":8.1,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142638545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large-scale application of ClinGen-InSiGHT APC-specific ACMG/AMP variant classification criteria leads to substantial reduction in VUS. 大规模应用 ClinGen-InSiGHT APC 特异性 ACMG/AMP 变异分类标准可大幅减少 VUS。
IF 5.4 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-11-07 Epub Date: 2024-10-01 DOI: 10.1016/j.ajhg.2024.09.002
Xiaoyu Yin, Marcy Richardson, Andreas Laner, Xuemei Shi, Elisabet Ognedal, Valeria Vasta, Thomas V O Hansen, Marta Pineda, Deborah Ritter, Johan de Dunnen, Emadeldin Hassanin, Wencong Lyman Lin, Ester Borras, Karl Krahn, Margareta Nordling, Alexandra Martins, Khalid Mahmood, Emily Nadeau, Victoria Beshay, Carli Tops, Maurizio Genuardi, Tina Pesaran, Ian M Frayling, Gabriel Capellá, Andrew Latchford, Sean V Tavtigian, Carlo Maj, Sharon E Plon, Marc S Greenblatt, Finlay A Macrae, Isabel Spier, Stefan Aretz

Pathogenic constitutional APC variants underlie familial adenomatous polyposis, the most common hereditary gastrointestinal polyposis syndrome. To improve variant classification and resolve the interpretative challenges of variants of uncertain significance (VUSs), APC-specific variant classification criteria were developed by the ClinGen-InSiGHT Hereditary Colorectal Cancer/Polyposis Variant Curation Expert Panel (VCEP) based on the criteria of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP). A streamlined algorithm using the APC-specific criteria was developed and applied to assess all APC variants in ClinVar and the International Society for Gastrointestinal Hereditary Tumours (InSiGHT) international reference APC Leiden Open Variation Database (LOVD) variant database, which included a total of 10,228 unique APC variants. Among the ClinVar and LOVD variants with an initial classification of (likely) benign or (likely) pathogenic, 94% and 96% remained in their original categories, respectively. In contrast, 41% ClinVar and 61% LOVD VUSs were reclassified into clinically meaningful classes, the vast majority as (likely) benign. The total number of VUSs was reduced by 37%. In 24 out of 37 (65%) promising APC variants that remained VUS despite evidence for pathogenicity, a data-mining-driven work-up allowed their reclassification as (likely) pathogenic. These results demonstrated that the application of APC-specific criteria substantially reduced the number of VUSs in ClinVar and LOVD. The study also demonstrated the feasibility of a systematic approach to variant classification in large datasets, which might serve as a generalizable model for other gene- or disease-specific variant interpretation initiatives. It also allowed for the prioritization of VUSs that will benefit from in-depth evidence collection. This subset of APC variants was approved by the VCEP and made publicly available through ClinVar and LOVD for widespread clinical use.

家族性腺瘤性息肉病是最常见的遗传性胃肠道息肉病综合征,其致病性宪制性 APC 变异是该综合征的基础。为了改进变异分类并解决意义不确定变异 (VUS) 的解释难题,ClinGen-InSiGHT 遗传性结直肠癌/息肉病变异整理专家小组 (VCEP) 根据美国医学遗传学和基因组学学院及分子病理学协会 (ACMG/AMP) 的标准制定了 APC 特异性变异分类标准。我们开发了一种使用 APC 特异性标准的简化算法,并将其应用于评估 ClinVar 和国际胃肠道遗传性肿瘤学会 (InSiGHT) 国际参考 APC 莱顿开放变异数据库 (LOVD) 变异数据库中的所有 APC 变异,该数据库共包含 10,228 个独特的 APC 变异。在初始分类为(可能)良性或(可能)致病的 ClinVar 和 LOVD 变异中,分别有 94% 和 96% 的变异仍保留在原来的类别中。相比之下,41% 的 ClinVar 和 61% 的 LOVD VUS 被重新分类为有临床意义的类别,其中绝大多数为(可能)良性。VUS 总数减少了 37%。在 37 个有希望的 APC 变体中,有 24 个(65%)尽管有致病性证据,但仍然是 VUS,数据挖掘驱动的工作使它们被重新分类为(可能)致病性变体。这些结果表明,应用 APC 特异性标准大大减少了 ClinVar 和 LOVD 中 VUS 的数量。该研究还证明了在大型数据集中进行变异体分类的系统方法的可行性,该方法可作为其他基因或疾病特异性变异体解释计划的通用模型。此外,该方法还能优先考虑将受益于深入证据收集的 VUS。这一 APC 变异子集已获得 VCEP 批准,并通过 ClinVar 和 LOVD 公开发布,供临床广泛使用。
{"title":"Large-scale application of ClinGen-InSiGHT APC-specific ACMG/AMP variant classification criteria leads to substantial reduction in VUS.","authors":"Xiaoyu Yin, Marcy Richardson, Andreas Laner, Xuemei Shi, Elisabet Ognedal, Valeria Vasta, Thomas V O Hansen, Marta Pineda, Deborah Ritter, Johan de Dunnen, Emadeldin Hassanin, Wencong Lyman Lin, Ester Borras, Karl Krahn, Margareta Nordling, Alexandra Martins, Khalid Mahmood, Emily Nadeau, Victoria Beshay, Carli Tops, Maurizio Genuardi, Tina Pesaran, Ian M Frayling, Gabriel Capellá, Andrew Latchford, Sean V Tavtigian, Carlo Maj, Sharon E Plon, Marc S Greenblatt, Finlay A Macrae, Isabel Spier, Stefan Aretz","doi":"10.1016/j.ajhg.2024.09.002","DOIUrl":"10.1016/j.ajhg.2024.09.002","url":null,"abstract":"<p><p>Pathogenic constitutional APC variants underlie familial adenomatous polyposis, the most common hereditary gastrointestinal polyposis syndrome. To improve variant classification and resolve the interpretative challenges of variants of uncertain significance (VUSs), APC-specific variant classification criteria were developed by the ClinGen-InSiGHT Hereditary Colorectal Cancer/Polyposis Variant Curation Expert Panel (VCEP) based on the criteria of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP). A streamlined algorithm using the APC-specific criteria was developed and applied to assess all APC variants in ClinVar and the International Society for Gastrointestinal Hereditary Tumours (InSiGHT) international reference APC Leiden Open Variation Database (LOVD) variant database, which included a total of 10,228 unique APC variants. Among the ClinVar and LOVD variants with an initial classification of (likely) benign or (likely) pathogenic, 94% and 96% remained in their original categories, respectively. In contrast, 41% ClinVar and 61% LOVD VUSs were reclassified into clinically meaningful classes, the vast majority as (likely) benign. The total number of VUSs was reduced by 37%. In 24 out of 37 (65%) promising APC variants that remained VUS despite evidence for pathogenicity, a data-mining-driven work-up allowed their reclassification as (likely) pathogenic. These results demonstrated that the application of APC-specific criteria substantially reduced the number of VUSs in ClinVar and LOVD. The study also demonstrated the feasibility of a systematic approach to variant classification in large datasets, which might serve as a generalizable model for other gene- or disease-specific variant interpretation initiatives. It also allowed for the prioritization of VUSs that will benefit from in-depth evidence collection. This subset of APC variants was approved by the VCEP and made publicly available through ClinVar and LOVD for widespread clinical use.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2427-2443"},"PeriodicalIF":5.4,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568752/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142363999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Resolution of ring chromosomes, Robertsonian translocations, and complex structural variants from long-read sequencing and telomere-to-telomere assembly. 通过长线程测序和端粒到端粒组装解析环状染色体、罗伯逊易位和复杂结构变异。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-11-05 DOI: 10.1016/j.ajhg.2024.10.006
Yulia Mostovoy, Philip M Boone, Yongqing Huang, Kiran V Garimella, Kar-Tong Tan, Bianca E Russell, Monica Salani, Celine E F de Esch, John Lemanski, Benjamin Curall, Jen Hauenstein, Diane Lucente, Tera Bowers, Tim DeSmet, Stacey Gabriel, Cynthia C Morton, Matthew Meyerson, Alex R Hastie, James Gusella, Fabiola Quintero-Rivera, Harrison Brand, Michael E Talkowski

Delineation of structural variants (SVs) at sequence resolution in highly repetitive genomic regions has long been intractable. The sequence properties, origins, and functional effects of classes of genomic rearrangements such as ring chromosomes and Robertsonian translocations thus remain unknown. To resolve these complex structures, we leveraged several recent milestones in the field, including (1) the emergence of long-read sequencing, (2) the gapless telomere-to-telomere (T2T) assembly, and (3) a tool (BigClipper) to discover chromosomal rearrangements from long reads. We applied these technologies across 13 cases with ring chromosomes, Robertsonian translocations, and complex SVs that were unresolved by short reads, followed by validation using optical genome mapping (OGM). Our analyses resolved 10 of 13 cases, including a Robertsonian translocation and all ring chromosomes. Multiple breakpoints were localized to genomic regions previously recalcitrant to sequencing such as acrocentric p-arms, ribosomal DNA arrays, and telomeric repeats, and involved complex structures such as a deletion-inversion and interchromosomal dispersed duplications. We further performed methylation profiling from long-read data to discover phased differential methylation in a gene promoter proximal to a ring fusion, suggesting a long-range position effect (LRPE) with heterochromatin spreading. Breakpoint sequences suggested mechanisms of SV formation such as microhomology-mediated and non-homologous end-joining, as well as non-allelic homologous recombination. These methods provide some of the first glimpses into the sequence resolution of Robertsonian translocations and illuminate the structural diversity of ring chromosomes and complex chromosomal rearrangements with implications for genome biology, prediction of LRPEs from integrated multi-omics technologies, and molecular diagnostics in rare disease cases.

长期以来,在高度重复的基因组区域以序列分辨率划分结构变异(SV)一直是个难题。因此,环状染色体和罗伯逊易位等各类基因组重排的序列特性、起源和功能影响仍然未知。为了解决这些复杂的结构,我们利用了该领域最近的几个里程碑,包括:(1)长读数测序的出现;(2)无间隙端粒到端粒(T2T)组装;(3)从长读数中发现染色体重排的工具(BigClipper)。我们将这些技术应用于 13 个具有环状染色体、罗伯逊易位和复杂 SV 的病例,这些病例都是短读数无法解决的,随后我们使用光学基因组图谱(OGM)进行了验证。我们的分析解决了 13 个病例中的 10 个,包括一个罗伯逊易位和所有环状染色体。多个断点被定位在以前难以测序的基因组区域,如同心p臂、核糖体DNA阵列和端粒重复序列,并涉及复杂的结构,如缺失-倒位和染色体间分散重复。我们进一步利用长读数数据进行了甲基化分析,发现在环状融合近端基因启动子中存在阶段性差异甲基化,这表明存在异染色质扩散的长程位置效应(LRPE)。断点序列表明了 SV 的形成机制,如微组学介导和非同源末端连接,以及非等位同源重组。这些方法首次揭示了罗伯逊易位的序列解析,阐明了环状染色体和复杂染色体重排的结构多样性,对基因组生物学、综合多组学技术预测 LRPEs 以及罕见病例的分子诊断具有重要意义。
{"title":"Resolution of ring chromosomes, Robertsonian translocations, and complex structural variants from long-read sequencing and telomere-to-telomere assembly.","authors":"Yulia Mostovoy, Philip M Boone, Yongqing Huang, Kiran V Garimella, Kar-Tong Tan, Bianca E Russell, Monica Salani, Celine E F de Esch, John Lemanski, Benjamin Curall, Jen Hauenstein, Diane Lucente, Tera Bowers, Tim DeSmet, Stacey Gabriel, Cynthia C Morton, Matthew Meyerson, Alex R Hastie, James Gusella, Fabiola Quintero-Rivera, Harrison Brand, Michael E Talkowski","doi":"10.1016/j.ajhg.2024.10.006","DOIUrl":"https://doi.org/10.1016/j.ajhg.2024.10.006","url":null,"abstract":"<p><p>Delineation of structural variants (SVs) at sequence resolution in highly repetitive genomic regions has long been intractable. The sequence properties, origins, and functional effects of classes of genomic rearrangements such as ring chromosomes and Robertsonian translocations thus remain unknown. To resolve these complex structures, we leveraged several recent milestones in the field, including (1) the emergence of long-read sequencing, (2) the gapless telomere-to-telomere (T2T) assembly, and (3) a tool (BigClipper) to discover chromosomal rearrangements from long reads. We applied these technologies across 13 cases with ring chromosomes, Robertsonian translocations, and complex SVs that were unresolved by short reads, followed by validation using optical genome mapping (OGM). Our analyses resolved 10 of 13 cases, including a Robertsonian translocation and all ring chromosomes. Multiple breakpoints were localized to genomic regions previously recalcitrant to sequencing such as acrocentric p-arms, ribosomal DNA arrays, and telomeric repeats, and involved complex structures such as a deletion-inversion and interchromosomal dispersed duplications. We further performed methylation profiling from long-read data to discover phased differential methylation in a gene promoter proximal to a ring fusion, suggesting a long-range position effect (LRPE) with heterochromatin spreading. Breakpoint sequences suggested mechanisms of SV formation such as microhomology-mediated and non-homologous end-joining, as well as non-allelic homologous recombination. These methods provide some of the first glimpses into the sequence resolution of Robertsonian translocations and illuminate the structural diversity of ring chromosomes and complex chromosomal rearrangements with implications for genome biology, prediction of LRPEs from integrated multi-omics technologies, and molecular diagnostics in rare disease cases.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":""},"PeriodicalIF":8.1,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142612272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GPS-Net: Discovering prognostic pathway modules based on network regularized kernel learning. GPS-Net:基于网络正则化核学习发现预后路径模块
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-10-30 DOI: 10.1016/j.ajhg.2024.10.004
Sijie Yao, Kaiqiao Li, Tingyi Li, Xiaoqing Yu, Pei Fen Kuan, Xuefeng Wang

The search for prognostic biomarkers capable of predicting patient outcomes, by analyzing gene expression in tissue samples and other molecular profiles, remains largely focused on single-gene-based or global-gene-search approaches. Gene-centric approaches, while foundational, fail to capture the higher-order dependencies that reflect the activities of co-regulated processes, pathway alterations, and regulatory networks, all of which are crucial in determining the patient outcomes in complex diseases like cancer. Here, we introduce GPS-Net, a computational framework that fills the gap in efficiently identifying prognostic modules by incorporating the holistic pathway structures and the network of gene interactions. By innovatively incorporating advanced multiple kernel learning techniques and network-based regularization, the proposed method not only enhances the accuracy of biomarker and pathway identification but also significantly reduces computational complexity, as demonstrated by extensive simulation studies. Applying GPS-Net, we identified key pathways that are predictive of patient outcomes in a cancer immunotherapy study. Overall, our approach provides a novel framework that renders genome-wide pathway-level prognostic analysis both feasible and scalable, synergizing both mechanism-driven and data-driven methodologies for precision genomics.

通过分析组织样本中的基因表达和其他分子特征,寻找能够预测患者预后的预后生物标志物,在很大程度上仍然集中在基于单基因或全局基因的研究方法上。以基因为中心的方法虽然具有基础性,但却无法捕捉到反映共调过程、通路改变和调控网络活动的高阶依赖性,而所有这些都是决定癌症等复杂疾病患者预后的关键。在此,我们介绍一种计算框架 GPS-Net,该框架通过整合整体通路结构和基因相互作用网络,填补了有效识别预后模块的空白。通过创新性地结合先进的多核学习技术和基于网络的正则化,所提出的方法不仅提高了生物标记物和通路识别的准确性,还显著降低了计算复杂性,这一点已在大量的模拟研究中得到证实。应用 GPS-Net,我们在一项癌症免疫疗法研究中确定了可预测患者预后的关键通路。总之,我们的方法提供了一个新颖的框架,使全基因组通路级预后分析变得既可行又可扩展,为精准基因组学的机制驱动和数据驱动方法提供了协同效应。
{"title":"GPS-Net: Discovering prognostic pathway modules based on network regularized kernel learning.","authors":"Sijie Yao, Kaiqiao Li, Tingyi Li, Xiaoqing Yu, Pei Fen Kuan, Xuefeng Wang","doi":"10.1016/j.ajhg.2024.10.004","DOIUrl":"10.1016/j.ajhg.2024.10.004","url":null,"abstract":"<p><p>The search for prognostic biomarkers capable of predicting patient outcomes, by analyzing gene expression in tissue samples and other molecular profiles, remains largely focused on single-gene-based or global-gene-search approaches. Gene-centric approaches, while foundational, fail to capture the higher-order dependencies that reflect the activities of co-regulated processes, pathway alterations, and regulatory networks, all of which are crucial in determining the patient outcomes in complex diseases like cancer. Here, we introduce GPS-Net, a computational framework that fills the gap in efficiently identifying prognostic modules by incorporating the holistic pathway structures and the network of gene interactions. By innovatively incorporating advanced multiple kernel learning techniques and network-based regularization, the proposed method not only enhances the accuracy of biomarker and pathway identification but also significantly reduces computational complexity, as demonstrated by extensive simulation studies. Applying GPS-Net, we identified key pathways that are predictive of patient outcomes in a cancer immunotherapy study. Overall, our approach provides a novel framework that renders genome-wide pathway-level prognostic analysis both feasible and scalable, synergizing both mechanism-driven and data-driven methodologies for precision genomics.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":""},"PeriodicalIF":8.1,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142602781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genomes and epigenomes of matched normal and tumor breast tissue reveal diverse evolutionary trajectories and tumor-host interactions. 匹配的正常和肿瘤乳腺组织的基因组和表观基因组揭示了不同的进化轨迹和肿瘤-宿主相互作用。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-10-29 DOI: 10.1016/j.ajhg.2024.10.005
Bin Zhu, Avraam Tapinos, Hela Koka, Priscilla Ming Yi Lee, Tongwu Zhang, Wei Zhu, Xiaoyu Wang, Alyssa Klein, DongHyuk Lee, Gary M Tse, Koon-Ho Tsang, Cherry Wu, Min Hua, Chad A Highfill, Petra Lenz, Weiyin Zhou, Difei Wang, Wen Luo, Kristine Jones, Amy Hutchinson, Belynda Hicks, Montserrat Garcia-Closas, Stephen Chanock, Lap Ah Tse, David C Wedge, Xiaohong R Yang

Normal tissues adjacent to the tumor (NATs) may harbor early breast carcinogenesis events driven by field cancerization. Although previous studies have characterized copy-number (CN) and transcriptomic alterations, the evolutionary history of NATs in breast cancer (BC) remains poorly characterized. Utilizing whole-genome sequencing (WGS), methylation profiling, and RNA sequencing (RNA-seq), we analyzed paired germline, NATs, and tumor samples from 43 individuals with BC in Hong Kong (HK). We found that single-nucleotide variants (SNVs) were common in NATs, with one-third of NAT samples exhibiting SNVs in driver genes, many of which were present in paired tumor samples. The most frequently mutated genes in both tumor and NAT samples were PIK3CA, TP53, GATA3, and AKT1. In contrast, large-scale aberrations such as somatic CN alterations (SCNAs) and structural variants (SVs) were rarely detected in NAT samples. We generated phylogenetic trees to investigate the evolutionary history of paired NAT and tumor samples. They could be categorized into tumor only, shared, and multiple-tree groups, the last of which is concordant with non-genetic field cancerization. These groups exhibited distinct genomic and epigenomic characteristics in both NAT and tumor samples. Specifically, NAT samples in the shared-tree group showed higher number of mutations, while NAT samples belonging to the multiple-tree group showed a less inflammatory tumor microenvironment (TME), characterized by a higher proportion of regulatory T cells (Tregs) and lower presence of CD14 cell populations. In summary, our findings highlight the diverse evolutionary history in BC NAT/tumor pairs and the impact of field cancerization and TME in shaping the genomic evolutionary history of tumors.

与肿瘤相邻的正常组织(NATs)可能蕴藏着由现场癌化驱动的早期乳腺癌发生事件。尽管以前的研究已经描述了拷贝数(CN)和转录组的改变,但对乳腺癌(BC)中NATs的进化史仍然缺乏描述。利用全基因组测序(WGS)、甲基化分析和 RNA 测序(RNA-seq),我们分析了香港 43 名乳腺癌患者的配对种系、NATs 和肿瘤样本。我们发现,单核苷酸变异(SNVs)在NATs中很常见,三分之一的NAT样本显示出驱动基因中的SNVs,其中许多变异出现在配对的肿瘤样本中。肿瘤和 NAT 样本中最常见的突变基因是 PIK3CA、TP53、GATA3 和 AKT1。相比之下,NAT样本中很少检测到体细胞CN改变(SCNA)和结构变异(SV)等大规模畸变。我们生成了系统发生树来研究配对的 NAT 和肿瘤样本的进化史。它们可被分为仅肿瘤组、共享组和多树组,其中最后一组与非基因领域的癌症化是一致的。这些组别在 NAT 和肿瘤样本中都表现出不同的基因组和表观基因组特征。具体来说,共享树组的 NAT 样本突变较多,而多树组的 NAT 样本炎症性肿瘤微环境(TME)较低,调节性 T 细胞(Tregs)比例较高,CD14 细胞群较少。总之,我们的研究结果凸显了BC NAT/肿瘤配对的多样化进化史,以及野外癌化和TME在塑造肿瘤基因组进化史方面的影响。
{"title":"Genomes and epigenomes of matched normal and tumor breast tissue reveal diverse evolutionary trajectories and tumor-host interactions.","authors":"Bin Zhu, Avraam Tapinos, Hela Koka, Priscilla Ming Yi Lee, Tongwu Zhang, Wei Zhu, Xiaoyu Wang, Alyssa Klein, DongHyuk Lee, Gary M Tse, Koon-Ho Tsang, Cherry Wu, Min Hua, Chad A Highfill, Petra Lenz, Weiyin Zhou, Difei Wang, Wen Luo, Kristine Jones, Amy Hutchinson, Belynda Hicks, Montserrat Garcia-Closas, Stephen Chanock, Lap Ah Tse, David C Wedge, Xiaohong R Yang","doi":"10.1016/j.ajhg.2024.10.005","DOIUrl":"https://doi.org/10.1016/j.ajhg.2024.10.005","url":null,"abstract":"<p><p>Normal tissues adjacent to the tumor (NATs) may harbor early breast carcinogenesis events driven by field cancerization. Although previous studies have characterized copy-number (CN) and transcriptomic alterations, the evolutionary history of NATs in breast cancer (BC) remains poorly characterized. Utilizing whole-genome sequencing (WGS), methylation profiling, and RNA sequencing (RNA-seq), we analyzed paired germline, NATs, and tumor samples from 43 individuals with BC in Hong Kong (HK). We found that single-nucleotide variants (SNVs) were common in NATs, with one-third of NAT samples exhibiting SNVs in driver genes, many of which were present in paired tumor samples. The most frequently mutated genes in both tumor and NAT samples were PIK3CA, TP53, GATA3, and AKT1. In contrast, large-scale aberrations such as somatic CN alterations (SCNAs) and structural variants (SVs) were rarely detected in NAT samples. We generated phylogenetic trees to investigate the evolutionary history of paired NAT and tumor samples. They could be categorized into tumor only, shared, and multiple-tree groups, the last of which is concordant with non-genetic field cancerization. These groups exhibited distinct genomic and epigenomic characteristics in both NAT and tumor samples. Specifically, NAT samples in the shared-tree group showed higher number of mutations, while NAT samples belonging to the multiple-tree group showed a less inflammatory tumor microenvironment (TME), characterized by a higher proportion of regulatory T cells (Tregs) and lower presence of CD14 cell populations. In summary, our findings highlight the diverse evolutionary history in BC NAT/tumor pairs and the impact of field cancerization and TME in shaping the genomic evolutionary history of tumors.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":""},"PeriodicalIF":8.1,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142567635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
3D genome topology distinguishes molecular subgroups of medulloblastoma. 三维基因组拓扑学可区分髓母细胞瘤分子亚群。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-10-22 DOI: 10.1016/j.ajhg.2024.10.003
John J Y Lee, Michael J Johnston, Hamza Farooq, Huey-Miin Chen, Subhi Talal Younes, Raul Suarez, Melissa Zwaig, Nikoleta Juretic, William A Weiss, Jiannis Ragoussis, Nada Jabado, Michael D Taylor, Marco Gallo

Four main medulloblastoma (MB) molecular subtypes have been identified based on transcriptional, DNA methylation, and genetic profiles. However, it is currently not known whether 3D genome architecture differs between MB subtypes. To address this question, we performed in situ Hi-C to reconstruct the 3D genome architecture of MB subtypes. In total, we generated Hi-C and matching transcriptome data for 28 surgical specimens and Hi-C data for one patient-derived xenograft. The average resolution of the Hi-C maps was 6,833 bp. Using these data, we found that insulation scores of topologically associating domains (TADs) were effective at distinguishing MB molecular subgroups. TAD insulation score differences between subtypes were globally not associated with differential gene expression, although we identified few exceptions near genes expressed in the lineages of origin of specific MB subtypes. Our study therefore supports the notion that TAD insulation scores can distinguish MB subtypes independently of their transcriptional differences.

根据转录、DNA甲基化和遗传特征,已确定了四种主要的髓母细胞瘤(MB)分子亚型。然而,目前尚不清楚髓母细胞瘤亚型之间的三维基因组结构是否存在差异。为了解决这个问题,我们进行了原位 Hi-C 重建 MB 亚型的三维基因组结构。我们总共生成了 28 个手术标本的 Hi-C 和匹配的转录组数据,以及一个患者来源异种移植的 Hi-C 数据。Hi-C 图谱的平均分辨率为 6,833 bp。利用这些数据,我们发现拓扑关联结构域(TAD)的绝缘分数能有效区分 MB 分子亚群。尽管我们在特定 MB 亚型的起源谱系附近发现了一些表达基因,但亚型之间的 TAD 绝缘分数差异总体上与基因表达差异无关。因此,我们的研究支持这样一种观点,即 TAD 绝缘分数可以独立于转录差异区分 MB 亚型。
{"title":"3D genome topology distinguishes molecular subgroups of medulloblastoma.","authors":"John J Y Lee, Michael J Johnston, Hamza Farooq, Huey-Miin Chen, Subhi Talal Younes, Raul Suarez, Melissa Zwaig, Nikoleta Juretic, William A Weiss, Jiannis Ragoussis, Nada Jabado, Michael D Taylor, Marco Gallo","doi":"10.1016/j.ajhg.2024.10.003","DOIUrl":"https://doi.org/10.1016/j.ajhg.2024.10.003","url":null,"abstract":"<p><p>Four main medulloblastoma (MB) molecular subtypes have been identified based on transcriptional, DNA methylation, and genetic profiles. However, it is currently not known whether 3D genome architecture differs between MB subtypes. To address this question, we performed in situ Hi-C to reconstruct the 3D genome architecture of MB subtypes. In total, we generated Hi-C and matching transcriptome data for 28 surgical specimens and Hi-C data for one patient-derived xenograft. The average resolution of the Hi-C maps was 6,833 bp. Using these data, we found that insulation scores of topologically associating domains (TADs) were effective at distinguishing MB molecular subgroups. TAD insulation score differences between subtypes were globally not associated with differential gene expression, although we identified few exceptions near genes expressed in the lineages of origin of specific MB subtypes. Our study therefore supports the notion that TAD insulation scores can distinguish MB subtypes independently of their transcriptional differences.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":""},"PeriodicalIF":8.1,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142556951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hypometric genetics: Improved power in genetic discovery by incorporating quality control flags. 超计量遗传学:纳入质量控制标志,提高基因发现的能力。
IF 9.8 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-10-21 DOI: 10.1016/j.ajhg.2024.09.008
Yosuke Tanigawa,Manolis Kellis
Balancing the tradeoff between quantity and quality of phenotypic data is critical in omics studies. Measurements below the limit of quantification (BLQ) are often tagged in quality control fields, but these flags are currently underutilized in human genetics studies. Extreme phenotype sampling is advantageous for mapping rare variant effects. We hypothesize that genetic drivers, along with environmental and technical factors, contribute to the presence of BLQ flags. Here, we introduce "hypometric genetics" (hMG) analysis and uncover a genetic basis for BLQ flags, indicating an additional source of genetic signal for genetic discovery, especially from phenotypic extremes. Applying our hMG approach to n = 227,469 UK Biobank individuals with metabolomic profiles, we reveal more than 5% heritability for BLQ flags and report biologically relevant associations, for example, at APOC3, APOA5, and PDE3B loci. For common variants, polygenic scores trained only for BLQ flags predict the corresponding quantitative traits with 91% accuracy, validating the genetic basis. For rare coding variant associations, we find an asymmetric 65.4% higher enrichment of metabolite-lowering associations for BLQ flags, highlighting the impact of putative loss-of-function variants with large effects on phenotypic extremes. Joint analysis of binarized BLQ flags and the corresponding quantitative metabolite measurements improves power in Bayesian rare variant aggregation tests, resulting in an average of 181% more prioritized genes. Our approach is broadly applicable to omics profiling. Overall, our results underscore the benefit of integrating quality control flags and quantitative measurements and highlight the advantage of joint analysis of population-based samples and phenotypic extremes in human genetics studies.
在表型数据的数量和质量之间权衡利弊,是omics 研究的关键。低于定量限(BLQ)的测量值通常会被标记在质量控制字段中,但这些标记目前在人类遗传学研究中还未得到充分利用。极端表型取样有利于绘制罕见变异效应图。我们假设,遗传驱动因素以及环境和技术因素是导致出现 BLQ 标志的原因。在这里,我们引入了 "低计量遗传学"(hMG)分析,发现了 BLQ 标志的遗传基础,为遗传发现提供了额外的遗传信号源,尤其是来自极端表型的信号源。我们对 n = 227,469 名英国生物库中具有代谢组学特征的个体采用了 hMG 方法,发现 BLQ 标志的遗传率超过 5%,并报告了与生物学相关的关联,例如在 APOC3、APOA5 和 PDE3B 位点上的关联。对于常见变异,仅针对 BLQ 标志训练的多基因评分预测相应数量性状的准确率为 91%,验证了遗传基础。对于罕见编码变异的关联,我们发现 BLQ 标志的代谢物降低关联的富集度不对称地高达 65.4%,突出了对表型极端影响较大的假定功能缺失变异的影响。对二进制 BLQ 标志和相应的定量代谢物测量结果进行联合分析,提高了贝叶斯稀有变异聚合测试的能力,使优先考虑的基因平均增加了 181%。我们的方法广泛适用于 omics 图谱分析。总之,我们的研究结果强调了整合质量控制标志和定量测量的好处,并突出了在人类遗传学研究中联合分析基于人群的样本和表型极端的优势。
{"title":"Hypometric genetics: Improved power in genetic discovery by incorporating quality control flags.","authors":"Yosuke Tanigawa,Manolis Kellis","doi":"10.1016/j.ajhg.2024.09.008","DOIUrl":"https://doi.org/10.1016/j.ajhg.2024.09.008","url":null,"abstract":"Balancing the tradeoff between quantity and quality of phenotypic data is critical in omics studies. Measurements below the limit of quantification (BLQ) are often tagged in quality control fields, but these flags are currently underutilized in human genetics studies. Extreme phenotype sampling is advantageous for mapping rare variant effects. We hypothesize that genetic drivers, along with environmental and technical factors, contribute to the presence of BLQ flags. Here, we introduce \"hypometric genetics\" (hMG) analysis and uncover a genetic basis for BLQ flags, indicating an additional source of genetic signal for genetic discovery, especially from phenotypic extremes. Applying our hMG approach to n = 227,469 UK Biobank individuals with metabolomic profiles, we reveal more than 5% heritability for BLQ flags and report biologically relevant associations, for example, at APOC3, APOA5, and PDE3B loci. For common variants, polygenic scores trained only for BLQ flags predict the corresponding quantitative traits with 91% accuracy, validating the genetic basis. For rare coding variant associations, we find an asymmetric 65.4% higher enrichment of metabolite-lowering associations for BLQ flags, highlighting the impact of putative loss-of-function variants with large effects on phenotypic extremes. Joint analysis of binarized BLQ flags and the corresponding quantitative metabolite measurements improves power in Bayesian rare variant aggregation tests, resulting in an average of 181% more prioritized genes. Our approach is broadly applicable to omics profiling. Overall, our results underscore the benefit of integrating quality control flags and quantitative measurements and highlight the advantage of joint analysis of population-based samples and phenotypic extremes in human genetics studies.","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":"25 1","pages":""},"PeriodicalIF":9.8,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142489504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hearing restoration by gene replacement therapy for a multisite-expressed gene in a mouse model of human DFNB111 deafness. 在人类 DFNB111 耳聋小鼠模型中通过多位点表达基因的基因替代疗法恢复听力。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-10-03 Epub Date: 2024-09-05 DOI: 10.1016/j.ajhg.2024.08.008
Luoying Jiang, Shao Wei Hu, Zijing Wang, Yi Zhou, Honghai Tang, Yuxin Chen, Daqi Wang, Xintai Fan, Lei Han, Huawei Li, Dazhi Shi, Yingzi He, Yilai Shu

Gene therapy has made significant progress in the treatment of hereditary hearing loss. However, most research has focused on deafness-related genes that are primarily expressed in hair cells with less attention given to multisite-expressed deafness genes. MPZL2, the second leading cause of mild-to-moderate hereditary deafness, is widely expressed in different inner ear cells. We generated a mouse model with a deletion in the Mpzl2 gene, which displayed moderate and slowly progressive hearing loss, mimicking the phenotype of individuals with DFNB111. We developed a gene replacement therapy system mediated by AAV-ie for efficient transduction in various types of cochlear cells. AAV-ie-Mpzl2 administration significantly lowered the auditory brainstem response and distortion product otoacoustic emission thresholds of Mpzl2-/- mice for at least seven months. AAV-ie-Mpzl2 delivery restored the structural integrity in both outer hair cells and Deiters cells. This study suggests the potential of gene therapy for MPZL2-related deafness and provides a proof of concept for gene therapy targeting other deafness-related genes that are expressed in different cell populations in the cochlea.

基因疗法在治疗遗传性听力损失方面取得了重大进展。然而,大多数研究都集中在主要在毛细胞中表达的耳聋相关基因上,而较少关注多位点表达的耳聋基因。MPZL2是导致轻度至中度遗传性耳聋的第二大原因,它在不同的内耳细胞中广泛表达。我们建立了一个 Mpzl2 基因缺失的小鼠模型,该模型表现出中度和缓慢进行性听力损失,与 DFNB111 患者的表型相似。我们开发了一种由 AAV-ie 介导的基因替代疗法系统,可高效转导各种类型的耳蜗细胞。AAV-ie-Mpzl2能显著降低Mpzl2-/-小鼠的听性脑干反应和畸变产物耳声发射阈值,持续时间至少7个月。AAV-ie-Mpzl2 的递送恢复了外毛细胞和 Deiters 细胞的结构完整性。这项研究表明了基因疗法治疗MPZL2相关耳聋的潜力,并为针对耳蜗中不同细胞群中表达的其他耳聋相关基因的基因疗法提供了概念验证。
{"title":"Hearing restoration by gene replacement therapy for a multisite-expressed gene in a mouse model of human DFNB111 deafness.","authors":"Luoying Jiang, Shao Wei Hu, Zijing Wang, Yi Zhou, Honghai Tang, Yuxin Chen, Daqi Wang, Xintai Fan, Lei Han, Huawei Li, Dazhi Shi, Yingzi He, Yilai Shu","doi":"10.1016/j.ajhg.2024.08.008","DOIUrl":"10.1016/j.ajhg.2024.08.008","url":null,"abstract":"<p><p>Gene therapy has made significant progress in the treatment of hereditary hearing loss. However, most research has focused on deafness-related genes that are primarily expressed in hair cells with less attention given to multisite-expressed deafness genes. MPZL2, the second leading cause of mild-to-moderate hereditary deafness, is widely expressed in different inner ear cells. We generated a mouse model with a deletion in the Mpzl2 gene, which displayed moderate and slowly progressive hearing loss, mimicking the phenotype of individuals with DFNB111. We developed a gene replacement therapy system mediated by AAV-ie for efficient transduction in various types of cochlear cells. AAV-ie-Mpzl2 administration significantly lowered the auditory brainstem response and distortion product otoacoustic emission thresholds of Mpzl2<sup>-/-</sup> mice for at least seven months. AAV-ie-Mpzl2 delivery restored the structural integrity in both outer hair cells and Deiters cells. This study suggests the potential of gene therapy for MPZL2-related deafness and provides a proof of concept for gene therapy targeting other deafness-related genes that are expressed in different cell populations in the cochlea.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2253-2264"},"PeriodicalIF":8.1,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11480802/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142144956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genes with differential expression across ancestries are enriched in ancestry-specific disease effects likely due to gene-by-environment interactions. 在不同血统中表达不同的基因富集在血统特异性疾病效应中,这可能是由于基因与环境的相互作用。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-10-03 Epub Date: 2024-08-26 DOI: 10.1016/j.ajhg.2024.07.021
Juehan Wang, Zixuan Zhang, Zeyun Lu, Nicholas Mancuso, Steven Gazal

Multi-ancestry genome-wide association studies (GWASs) have highlighted the existence of variants with ancestry-specific effect sizes. Understanding where and why these ancestry-specific effects occur is fundamental to understanding the genetic basis of human diseases and complex traits. Here, we characterized genes differentially expressed across ancestries (ancDE genes) at the cell-type level by leveraging single-cell RNA-sequencing data in peripheral blood mononuclear cells for 21 individuals with East Asian (EAS) ancestry and 23 individuals with European (EUR) ancestry (172,385 cells); then, we tested whether variants surrounding those genes were enriched in disease variants with ancestry-specific effect sizes by leveraging ancestry-matched GWASs of 31 diseases and complex traits (average n ∼ 90,000 and ∼ 267,000 in EAS and EUR, respectively). We observed that ancDE genes tended to be cell-type specific and enriched in genes interacting with the environment and in variants with ancestry-specific disease effect sizes, which suggests cell-type-specific, gene-by-environment interactions shared between regulatory and disease architectures. Finally, we illustrated how different environments might have led to ancestry-specific myeloid cell leukemia 1 (MCL1) expression in B cells and ancestry-specific allele effect sizes in lymphocyte count GWASs for variants surrounding MCL1. Our results imply that large single-cell and GWAS datasets from diverse ancestries are required to improve our understanding of human diseases.

多祖先全基因组关联研究(GWAS)凸显了具有祖先特异性效应大小的变异的存在。要想了解人类疾病和复杂性状的遗传基础,就必须了解这些祖先特异性效应发生的位置和原因。在这里,我们利用 21 名东亚(EAS)血统个体和 23 名欧洲(EUR)血统个体(172,385 个细胞)外周血单核细胞中的单细胞 RNA 序列数据,在细胞类型水平上描述了不同血统差异表达的基因(ancDE 基因);然后,我们利用 31 种疾病和复杂性状的祖先匹配 GWAS(EAS 和 EUR 的平均 n ∼ 90,000 和 ∼ 267,000),检测了这些基因周围的变异是否富集了具有祖先特异效应大小的疾病变异。我们观察到,ancDE 基因往往具有细胞类型特异性,并富集于与环境相互作用的基因和具有祖先特异性疾病效应大小的变异中,这表明细胞类型特异性、基因与环境的相互作用在调控和疾病结构之间具有共享性。最后,我们说明了不同的环境是如何导致 B 细胞中具有祖先特异性的髓细胞白血病 1(MCL1)表达,以及淋巴细胞计数 GWAS 中围绕 MCL1 的变异具有祖先特异性的等位基因效应大小的。我们的研究结果表明,要提高我们对人类疾病的认识,需要来自不同祖先的大型单细胞和 GWAS 数据集。
{"title":"Genes with differential expression across ancestries are enriched in ancestry-specific disease effects likely due to gene-by-environment interactions.","authors":"Juehan Wang, Zixuan Zhang, Zeyun Lu, Nicholas Mancuso, Steven Gazal","doi":"10.1016/j.ajhg.2024.07.021","DOIUrl":"10.1016/j.ajhg.2024.07.021","url":null,"abstract":"<p><p>Multi-ancestry genome-wide association studies (GWASs) have highlighted the existence of variants with ancestry-specific effect sizes. Understanding where and why these ancestry-specific effects occur is fundamental to understanding the genetic basis of human diseases and complex traits. Here, we characterized genes differentially expressed across ancestries (ancDE genes) at the cell-type level by leveraging single-cell RNA-sequencing data in peripheral blood mononuclear cells for 21 individuals with East Asian (EAS) ancestry and 23 individuals with European (EUR) ancestry (172,385 cells); then, we tested whether variants surrounding those genes were enriched in disease variants with ancestry-specific effect sizes by leveraging ancestry-matched GWASs of 31 diseases and complex traits (average n ∼ 90,000 and ∼ 267,000 in EAS and EUR, respectively). We observed that ancDE genes tended to be cell-type specific and enriched in genes interacting with the environment and in variants with ancestry-specific disease effect sizes, which suggests cell-type-specific, gene-by-environment interactions shared between regulatory and disease architectures. Finally, we illustrated how different environments might have led to ancestry-specific myeloid cell leukemia 1 (MCL1) expression in B cells and ancestry-specific allele effect sizes in lymphocyte count GWASs for variants surrounding MCL1. Our results imply that large single-cell and GWAS datasets from diverse ancestries are required to improve our understanding of human diseases.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2117-2128"},"PeriodicalIF":8.1,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11480800/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142078889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SpliceVarDB: A comprehensive database of experimentally validated human splicing variants. SpliceVarDB:经实验验证的人类剪接变体综合数据库。
IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2024-10-03 Epub Date: 2024-09-02 DOI: 10.1016/j.ajhg.2024.08.002
Patricia J Sullivan, Julian M W Quinn, Weilin Wu, Mark Pinese, Mark J Cowley

Variants that alter gene splicing are estimated to comprise up to a third of all disease-causing variants, yet they are hard to predict from DNA sequencing data alone. To overcome this, many groups are incorporating RNA-based analyses, which are resource intensive, particularly for diagnostic laboratories. There are thousands of functionally validated variants that induce mis-splicing; however, this information is not consolidated, and they are under-represented in ClinVar, which presents a barrier to variant interpretation and can result in duplication of validation efforts. To address this issue, we developed SpliceVarDB, an online database consolidating over 50,000 variants assayed for their effects on splicing in over 8,000 human genes. We evaluated over 500 published data sources and established a spliceogenicity scale to standardize, harmonize, and consolidate variant validation data generated by a range of experimental protocols. According to the strength of their supporting evidence, variants were classified as "splice-altering" (∼25%), "not splice-altering" (∼25%), and "low-frequency splice-altering" (∼50%), which correspond to weak or indeterminate evidence of spliceogenicity. Importantly, 55% of the splice-altering variants in SpliceVarDB are outside the canonical splice sites (5.6% are deep intronic). These variants can support the variant curation diagnostic pathway and can be used to provide the high-quality data necessary to develop more accurate in silico splicing predictors. The variants are accessible through an online platform, SpliceVarDB, with additional features for visualization, variant information, in silico predictions, and validation metrics. SpliceVarDB is a very large collection of splice-altering variants and is available at https://splicevardb.org.

据估计,改变基因剪接的变异占所有致病变异的三分之一,但仅靠 DNA 测序数据很难预测这些变异。为了克服这一问题,许多研究小组正在采用基于 RNA 的分析方法,这种方法需要大量资源,尤其是对诊断实验室而言。有成千上万个经过功能验证的变体会诱发错误剪接;然而,这些信息并没有整合在一起,在 ClinVar 中的代表性也不足,这给变体解释带来了障碍,并可能导致重复验证工作。为了解决这个问题,我们开发了 SpliceVarDB,这是一个在线数据库,整合了 8000 多个人类基因中 50,000 多个对剪接有影响的变异。我们评估了 500 多个已发表的数据源,并建立了剪接致病性量表,以标准化、统一和整合各种实验方案产生的变体验证数据。根据其支持证据的强度,变体被分为 "剪接改变"(∼25%)、"不剪接改变"(∼25%)和 "低频剪接改变"(∼50%),它们对应于剪接致性的弱证据或不确定证据。重要的是,SpliceVarDB 中 55% 的剪接改变变异位于典型剪接位点之外(5.6% 位于深内含子)。这些变体可支持变体整理诊断途径,并可用于提供必要的高质量数据,以开发更准确的硅剪接预测工具。这些变异可通过在线平台 SpliceVarDB 访问,该平台还具有可视化、变异信息、硅学预测和验证指标等附加功能。SpliceVarDB 是一个非常大的剪接改变变体集合,可在 https://splicevardb.org 上查阅。
{"title":"SpliceVarDB: A comprehensive database of experimentally validated human splicing variants.","authors":"Patricia J Sullivan, Julian M W Quinn, Weilin Wu, Mark Pinese, Mark J Cowley","doi":"10.1016/j.ajhg.2024.08.002","DOIUrl":"10.1016/j.ajhg.2024.08.002","url":null,"abstract":"<p><p>Variants that alter gene splicing are estimated to comprise up to a third of all disease-causing variants, yet they are hard to predict from DNA sequencing data alone. To overcome this, many groups are incorporating RNA-based analyses, which are resource intensive, particularly for diagnostic laboratories. There are thousands of functionally validated variants that induce mis-splicing; however, this information is not consolidated, and they are under-represented in ClinVar, which presents a barrier to variant interpretation and can result in duplication of validation efforts. To address this issue, we developed SpliceVarDB, an online database consolidating over 50,000 variants assayed for their effects on splicing in over 8,000 human genes. We evaluated over 500 published data sources and established a spliceogenicity scale to standardize, harmonize, and consolidate variant validation data generated by a range of experimental protocols. According to the strength of their supporting evidence, variants were classified as \"splice-altering\" (∼25%), \"not splice-altering\" (∼25%), and \"low-frequency splice-altering\" (∼50%), which correspond to weak or indeterminate evidence of spliceogenicity. Importantly, 55% of the splice-altering variants in SpliceVarDB are outside the canonical splice sites (5.6% are deep intronic). These variants can support the variant curation diagnostic pathway and can be used to provide the high-quality data necessary to develop more accurate in silico splicing predictors. The variants are accessible through an online platform, SpliceVarDB, with additional features for visualization, variant information, in silico predictions, and validation metrics. SpliceVarDB is a very large collection of splice-altering variants and is available at https://splicevardb.org.</p>","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":" ","pages":"2164-2175"},"PeriodicalIF":8.1,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11480807/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142124576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
American journal of human genetics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1