Pub Date : 2024-07-18Epub Date: 2024-05-21DOI: 10.1016/j.xhgg.2024.100301
Christopher J Yoon, Chang Hyun Nam, Taewoo Kim, Jeong Seok Lee, Ryul Kim, Kijong Yi, June-Young Koh, Jiye Kim, Hyein Won, Ji Won Oh, Obi L Griffith, Malachi Griffith, Joohon Sung, Tae Yeul Kim, Duck Cho, Ji Seon Choi, Young Seok Ju
While most dizygotic twins have a dichorionic placenta, rare cases of dizygotic twins with a monochorionic placenta have been reported. The monochorionic placenta in dizygotic twins allows in utero exchange of embryonic cells, resulting in chimerism in the twins. In practice, this chimerism is incidentally identified in mixed ABO blood types or in the presence of cells with a discordant sex chromosome. Here, we applied whole-genome sequencing to one triplet and one twin family to precisely understand their zygotic compositions, using millions of genomic variants as barcodes of zygotic origins. Peripheral blood showed asymmetrical contributions from two sister zygotes, where one of the zygotes was the major clone in both twins. Single-cell RNA sequencing of peripheral blood tissues further showed differential contributions from the two sister zygotes across blood cell types. In contrast, buccal tissues were pure in genetic composition, suggesting that in utero cellular exchanges were confined to the blood tissues. Our study illustrates the cellular history of twinning during human development, which is critical for managing the health of chimeric individuals in the era of genomic medicine.
{"title":"Whole-genome sequences reveal zygotic composition in chimeric twins.","authors":"Christopher J Yoon, Chang Hyun Nam, Taewoo Kim, Jeong Seok Lee, Ryul Kim, Kijong Yi, June-Young Koh, Jiye Kim, Hyein Won, Ji Won Oh, Obi L Griffith, Malachi Griffith, Joohon Sung, Tae Yeul Kim, Duck Cho, Ji Seon Choi, Young Seok Ju","doi":"10.1016/j.xhgg.2024.100301","DOIUrl":"10.1016/j.xhgg.2024.100301","url":null,"abstract":"<p><p>While most dizygotic twins have a dichorionic placenta, rare cases of dizygotic twins with a monochorionic placenta have been reported. The monochorionic placenta in dizygotic twins allows in utero exchange of embryonic cells, resulting in chimerism in the twins. In practice, this chimerism is incidentally identified in mixed ABO blood types or in the presence of cells with a discordant sex chromosome. Here, we applied whole-genome sequencing to one triplet and one twin family to precisely understand their zygotic compositions, using millions of genomic variants as barcodes of zygotic origins. Peripheral blood showed asymmetrical contributions from two sister zygotes, where one of the zygotes was the major clone in both twins. Single-cell RNA sequencing of peripheral blood tissues further showed differential contributions from the two sister zygotes across blood cell types. In contrast, buccal tissues were pure in genetic composition, suggesting that in utero cellular exchanges were confined to the blood tissues. Our study illustrates the cellular history of twinning during human development, which is critical for managing the health of chimeric individuals in the era of genomic medicine.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100301"},"PeriodicalIF":3.3,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11201346/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141076827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-18Epub Date: 2024-05-03DOI: 10.1016/j.xhgg.2024.100302
Helen Shang, Yi Ding, Vidhya Venkateswaran, Kristin Boulier, Nikhita Kathuria-Prakash, Parisa Boodaghi Malidarreh, Jacob M Luber, Bogdan Pasaniuc
Polygenic scores (PGSs) summarize the combined effect of common risk variants and are associated with breast cancer risk in patients without identifiable monogenic risk factors. One of the most well-validated PGSs in breast cancer to date is PGS313, which was developed from a Northern European biobank but has shown attenuated performance in non-European ancestries. We further investigate the generalizability of the PGS313 for American women of European (EA), African (AFR), Asian (EAA), and Latinx (HL) ancestry within one institution with a singular electronic health record (EHR) system, genotyping platform, and quality control process. We found that the PGS313 achieved overlapping areas under the receiver operator characteristic (ROC) curve (AUCs) in females of HL (AUC = 0.68, 95% confidence interval [CI] = 0.65-0.71) and EA ancestry (AUC = 0.70, 95% CI = 0.69-0.71) but lower AUCs for the AFR and EAA populations (AFR: AUC = 0.61, 95% CI = 0.56-0.65; EAA: AUC = 0.64, 95% CI = 0.60-0.680). While PGS313 is associated with hormone-receptor-positive (HR+) disease in EA Americans (odds ratio [OR] = 1.42, 95% CI = 1.16-1.64), this association is lost in African, Latinx, and Asian Americans. In summary, we found that PGS313 was significantly associated with breast cancer but with attenuated accuracy in women of AFR and EAA descent within a singular health system in Los Angeles. Our work further highlights the need for additional validation in diverse cohorts prior to the clinical implementation of PGSs.
{"title":"Generalizability of PGS<sub>313</sub> for breast cancer risk in a Los Angeles biobank.","authors":"Helen Shang, Yi Ding, Vidhya Venkateswaran, Kristin Boulier, Nikhita Kathuria-Prakash, Parisa Boodaghi Malidarreh, Jacob M Luber, Bogdan Pasaniuc","doi":"10.1016/j.xhgg.2024.100302","DOIUrl":"10.1016/j.xhgg.2024.100302","url":null,"abstract":"<p><p>Polygenic scores (PGSs) summarize the combined effect of common risk variants and are associated with breast cancer risk in patients without identifiable monogenic risk factors. One of the most well-validated PGSs in breast cancer to date is PGS<sub>313</sub>, which was developed from a Northern European biobank but has shown attenuated performance in non-European ancestries. We further investigate the generalizability of the PGS<sub>313</sub> for American women of European (EA), African (AFR), Asian (EAA), and Latinx (HL) ancestry within one institution with a singular electronic health record (EHR) system, genotyping platform, and quality control process. We found that the PGS<sub>313</sub> achieved overlapping areas under the receiver operator characteristic (ROC) curve (AUCs) in females of HL (AUC = 0.68, 95% confidence interval [CI] = 0.65-0.71) and EA ancestry (AUC = 0.70, 95% CI = 0.69-0.71) but lower AUCs for the AFR and EAA populations (AFR: AUC = 0.61, 95% CI = 0.56-0.65; EAA: AUC = 0.64, 95% CI = 0.60-0.680). While PGS<sub>313</sub> is associated with hormone-receptor-positive (HR+) disease in EA Americans (odds ratio [OR] = 1.42, 95% CI = 1.16-1.64), this association is lost in African, Latinx, and Asian Americans. In summary, we found that PGS<sub>313</sub> was significantly associated with breast cancer but with attenuated accuracy in women of AFR and EAA descent within a singular health system in Los Angeles. Our work further highlights the need for additional validation in diverse cohorts prior to the clinical implementation of PGSs.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100302"},"PeriodicalIF":3.3,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11137525/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140867594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-18Epub Date: 2024-06-06DOI: 10.1016/j.xhgg.2024.100315
Dorothy M Chen, Ruocheng Dong, Linda Kachuri, Thomas J Hoffmann, Yu Jiang, Sonja I Berndt, John P Shelley, Kerry R Schaffer, Mitchell J Machiela, Neal D Freedman, Wen-Yi Huang, Shengchao A Li, Hans Lilja, Amy C Justice, Ravi K Madduri, Alex A Rodriguez, Stephen K Van Den Eeden, Stephen J Chanock, Christopher A Haiman, David V Conti, Robert J Klein, Jonathan D Mosley, John S Witte, Rebecca E Graff
Deciphering the genetic basis of prostate-specific antigen (PSA) levels may improve their utility for prostate cancer (PCa) screening. Using genome-wide association study (GWAS) summary statistics from 95,768 PCa-free men, we conducted a transcriptome-wide association study (TWAS) to examine impacts of genetically predicted gene expression on PSA. Analyses identified 41 statistically significant (p < 0.05/12,192 = 4.10 × 10-6) associations in whole blood and 39 statistically significant (p < 0.05/13,844 = 3.61 × 10-6) associations in prostate tissue, with 18 genes associated in both tissues. Cross-tissue analyses identified 155 statistically significantly (p < 0.05/22,249 = 2.25 × 10-6) genes. Out of 173 unique PSA-associated genes across analyses, we replicated 151 (87.3%) in a TWAS of 209,318 PCa-free individuals from the Million Veteran Program. Based on conditional analyses, we found 20 genes (11 single tissue, nine cross-tissue) that were associated with PSA levels in the discovery TWAS that were not attributable to a lead variant from a GWAS. Ten of these 20 genes replicated, and two of the replicated genes had colocalization probability of >0.5: CCNA2 and HIST1H2BN. Six of the 20 identified genes are not known to impact PCa risk. Fine-mapping based on whole blood and prostate tissue revealed five protein-coding genes with evidence of causal relationships with PSA levels. Of these five genes, four exhibited evidence of colocalization and one was conditionally independent of previous GWAS findings. These results yield hypotheses that should be further explored to improve understanding of genetic factors underlying PSA levels.
{"title":"Transcriptome-wide association analysis identifies candidate susceptibility genes for prostate-specific antigen levels in men without prostate cancer.","authors":"Dorothy M Chen, Ruocheng Dong, Linda Kachuri, Thomas J Hoffmann, Yu Jiang, Sonja I Berndt, John P Shelley, Kerry R Schaffer, Mitchell J Machiela, Neal D Freedman, Wen-Yi Huang, Shengchao A Li, Hans Lilja, Amy C Justice, Ravi K Madduri, Alex A Rodriguez, Stephen K Van Den Eeden, Stephen J Chanock, Christopher A Haiman, David V Conti, Robert J Klein, Jonathan D Mosley, John S Witte, Rebecca E Graff","doi":"10.1016/j.xhgg.2024.100315","DOIUrl":"10.1016/j.xhgg.2024.100315","url":null,"abstract":"<p><p>Deciphering the genetic basis of prostate-specific antigen (PSA) levels may improve their utility for prostate cancer (PCa) screening. Using genome-wide association study (GWAS) summary statistics from 95,768 PCa-free men, we conducted a transcriptome-wide association study (TWAS) to examine impacts of genetically predicted gene expression on PSA. Analyses identified 41 statistically significant (p < 0.05/12,192 = 4.10 × 10<sup>-6</sup>) associations in whole blood and 39 statistically significant (p < 0.05/13,844 = 3.61 × 10<sup>-6</sup>) associations in prostate tissue, with 18 genes associated in both tissues. Cross-tissue analyses identified 155 statistically significantly (p < 0.05/22,249 = 2.25 × 10<sup>-6</sup>) genes. Out of 173 unique PSA-associated genes across analyses, we replicated 151 (87.3%) in a TWAS of 209,318 PCa-free individuals from the Million Veteran Program. Based on conditional analyses, we found 20 genes (11 single tissue, nine cross-tissue) that were associated with PSA levels in the discovery TWAS that were not attributable to a lead variant from a GWAS. Ten of these 20 genes replicated, and two of the replicated genes had colocalization probability of >0.5: CCNA2 and HIST1H2BN. Six of the 20 identified genes are not known to impact PCa risk. Fine-mapping based on whole blood and prostate tissue revealed five protein-coding genes with evidence of causal relationships with PSA levels. Of these five genes, four exhibited evidence of colocalization and one was conditionally independent of previous GWAS findings. These results yield hypotheses that should be further explored to improve understanding of genetic factors underlying PSA levels.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100315"},"PeriodicalIF":3.3,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11262184/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141284896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-18Epub Date: 2024-06-13DOI: 10.1016/j.xhgg.2024.100318
Muhammed Hasan Çelik, Julien Gagneur, Ryan G Lim, Jie Wu, Leslie M Thompson, Xiaohui Xie
The high heritability of amyotrophic lateral sclerosis (ALS) contrasts with its low molecular diagnosis rate post-genetic testing, pointing to potential undiscovered genetic factors. To aid the exploration of these factors, we introduced EpiOut, an algorithm to identify chromatin accessibility outliers that are regions exhibiting divergent accessibility from the population baseline in a single or few samples. Annotation of accessible regions with histone chromatin immunoprecipitation sequencing and Hi-C indicates that outliers are concentrated in functional loci, especially among promoters interacting with active enhancers. Across different omics levels, outliers are robustly replicated, and chromatin accessibility outliers are reliable predictors of gene expression outliers and aberrant protein levels. When promoter accessibility does not align with gene expression, our results indicate that molecular aberrations are more likely to be linked to post-transcriptional regulation rather than transcriptional regulation. Our findings demonstrate that the outlier detection paradigm can uncover dysregulated regions in rare diseases. EpiOut is available at github.com/uci-cbcl/EpiOut.
{"title":"Identifying dysregulated regions in amyotrophic lateral sclerosis through chromatin accessibility outliers.","authors":"Muhammed Hasan Çelik, Julien Gagneur, Ryan G Lim, Jie Wu, Leslie M Thompson, Xiaohui Xie","doi":"10.1016/j.xhgg.2024.100318","DOIUrl":"10.1016/j.xhgg.2024.100318","url":null,"abstract":"<p><p>The high heritability of amyotrophic lateral sclerosis (ALS) contrasts with its low molecular diagnosis rate post-genetic testing, pointing to potential undiscovered genetic factors. To aid the exploration of these factors, we introduced EpiOut, an algorithm to identify chromatin accessibility outliers that are regions exhibiting divergent accessibility from the population baseline in a single or few samples. Annotation of accessible regions with histone chromatin immunoprecipitation sequencing and Hi-C indicates that outliers are concentrated in functional loci, especially among promoters interacting with active enhancers. Across different omics levels, outliers are robustly replicated, and chromatin accessibility outliers are reliable predictors of gene expression outliers and aberrant protein levels. When promoter accessibility does not align with gene expression, our results indicate that molecular aberrations are more likely to be linked to post-transcriptional regulation rather than transcriptional regulation. Our findings demonstrate that the outlier detection paradigm can uncover dysregulated regions in rare diseases. EpiOut is available at github.com/uci-cbcl/EpiOut.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100318"},"PeriodicalIF":3.3,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11260578/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141318511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-18Epub Date: 2024-03-23DOI: 10.1016/j.xhgg.2024.100286
Anne M Slavotinek, Michelle L Thompson, Lisa J Martin, Bruce D Gelb
Genetic testing with exome sequencing and genome sequencing is increasingly offered to infants and children with cardiovascular diseases. However, the rates of positive diagnoses after genetic testing within the different categories of cardiac disease and phenotypic subtypes of congenital heart disease (CHD) have been little studied. We report the diagnostic yield after next-generation sequencing in 500 patients with CHD from diverse population subgroups that were enrolled at three different sites in the Clinical Sequencing Evidence-Generating Research consortium. Patients were ascertained due to a primary cardiovascular issue comprising arrhythmia, cardiomyopathy, and/or CHD, and corresponding human phenotype ontology terms were selected to describe the cardiac and extracardiac findings. We examined the diagnostic yield for patients with arrhythmia, cardiomyopathy, and/or CHD and phenotypic subtypes of CHD comprising conotruncal defects, heterotaxy, left ventricular outflow tract obstruction, septal defects, and "other" heart defects. We found a significant increase in the frequency of positive findings for patients who underwent genome sequencing compared to exome sequencing and for syndromic cardiac defects compared to isolated cardiac defects. We also found significantly higher diagnostic rates for patients who presented with isolated cardiomyopathy compared to isolated CHD. For patients with syndromic presentations who underwent genome sequencing, there were significant differences in the numbers of positive diagnoses for phenotypic subcategories of CHD, ranging from 31.7% for septal defects to 60% for "other". Despite variation in the diagnostic yield at each site, our results support genetic testing in pediatric patients with syndromic and isolated cardiovascular issues and in all subtypes of CHD.
{"title":"Diagnostic yield after next-generation sequencing in pediatric cardiovascular disease.","authors":"Anne M Slavotinek, Michelle L Thompson, Lisa J Martin, Bruce D Gelb","doi":"10.1016/j.xhgg.2024.100286","DOIUrl":"10.1016/j.xhgg.2024.100286","url":null,"abstract":"<p><p>Genetic testing with exome sequencing and genome sequencing is increasingly offered to infants and children with cardiovascular diseases. However, the rates of positive diagnoses after genetic testing within the different categories of cardiac disease and phenotypic subtypes of congenital heart disease (CHD) have been little studied. We report the diagnostic yield after next-generation sequencing in 500 patients with CHD from diverse population subgroups that were enrolled at three different sites in the Clinical Sequencing Evidence-Generating Research consortium. Patients were ascertained due to a primary cardiovascular issue comprising arrhythmia, cardiomyopathy, and/or CHD, and corresponding human phenotype ontology terms were selected to describe the cardiac and extracardiac findings. We examined the diagnostic yield for patients with arrhythmia, cardiomyopathy, and/or CHD and phenotypic subtypes of CHD comprising conotruncal defects, heterotaxy, left ventricular outflow tract obstruction, septal defects, and \"other\" heart defects. We found a significant increase in the frequency of positive findings for patients who underwent genome sequencing compared to exome sequencing and for syndromic cardiac defects compared to isolated cardiac defects. We also found significantly higher diagnostic rates for patients who presented with isolated cardiomyopathy compared to isolated CHD. For patients with syndromic presentations who underwent genome sequencing, there were significant differences in the numbers of positive diagnoses for phenotypic subcategories of CHD, ranging from 31.7% for septal defects to 60% for \"other\". Despite variation in the diagnostic yield at each site, our results support genetic testing in pediatric patients with syndromic and isolated cardiovascular issues and in all subtypes of CHD.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100286"},"PeriodicalIF":3.3,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11024993/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140194673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-18Epub Date: 2024-03-29DOI: 10.1016/j.xhgg.2024.100287
Sadegheh Haghshenas, Hidde J Bout, Josephine M Schijns, Michael A Levy, Jennifer Kerkhof, Pratibha Bhai, Haley McConkey, Zandra A Jenkins, Ella M Williams, Benjamin J Halliday, Sylvia A Huisman, Peter Lauffer, Vivian de Waard, Laura Witteveen, Siddharth Banka, Angela F Brady, Elena Galazzi, Julien van Gils, Anna C E Hurst, Frank J Kaiser, Didier Lacombe, Antonio F Martinez-Monseny, Patricia Fergelot, Fabíola P Monteiro, Ilaria Parenti, Luca Persani, Fernando Santos-Simarro, Brittany N Simpson, Mariëlle Alders, Stephen P Robertson, Bekim Sadikovic, Leonie A Menke
CREB-binding protein (CBP, encoded by CREBBP) and its paralog E1A-associated protein (p300, encoded by EP300) are involved in histone acetylation and transcriptional regulation. Variants that produce a null allele or disrupt the catalytic domain of either protein cause Rubinstein-Taybi syndrome (RSTS), while pathogenic missense and in-frame indel variants in parts of exons 30 and 31 cause phenotypes recently described as Menke-Hennekam syndrome (MKHK). To distinguish MKHK subtypes and define their characteristics, molecular and extended clinical data on 82 individuals (54 unpublished) with variants affecting CBP (n = 71) or p300 (n = 11) (NP_004371.2 residues 1,705-1,875 and NP_001420.2 residues 1,668-1,833, respectively) were summarized. Additionally, genome-wide DNA methylation profiles were assessed in DNA extracted from whole peripheral blood from 54 individuals. Most variants clustered closely around the zinc-binding residues of two zinc-finger domains (ZZ and TAZ2) and within the first α helix of the fourth intrinsically disordered linker (ID4) of CBP/p300. Domain-specific methylation profiles were discerned for the ZZ domain in CBP/p300 (found in nine out of 10 tested individuals) and TAZ2 domain in CBP (in 14 out of 20), while a domain-specific diagnostic episignature was refined for the ID4 domain in CBP/p300 (in 21 out of 21). Phenotypes including intellectual disability of varying degree and distinct physical features were defined for each of the regions. These findings demonstrate existence of at least three MKHK subtypes, which are domain specific (MKHK-ZZ, MKHK-TAZ2, and MKHK-ID4) rather than gene specific (CREBBP/EP300). DNA methylation episignatures enable stratification of molecular pathophysiologic entities within a gene or across a family of paralogous genes.
{"title":"Menke-Hennekam syndrome; delineation of domain-specific subtypes with distinct clinical and DNA methylation profiles.","authors":"Sadegheh Haghshenas, Hidde J Bout, Josephine M Schijns, Michael A Levy, Jennifer Kerkhof, Pratibha Bhai, Haley McConkey, Zandra A Jenkins, Ella M Williams, Benjamin J Halliday, Sylvia A Huisman, Peter Lauffer, Vivian de Waard, Laura Witteveen, Siddharth Banka, Angela F Brady, Elena Galazzi, Julien van Gils, Anna C E Hurst, Frank J Kaiser, Didier Lacombe, Antonio F Martinez-Monseny, Patricia Fergelot, Fabíola P Monteiro, Ilaria Parenti, Luca Persani, Fernando Santos-Simarro, Brittany N Simpson, Mariëlle Alders, Stephen P Robertson, Bekim Sadikovic, Leonie A Menke","doi":"10.1016/j.xhgg.2024.100287","DOIUrl":"10.1016/j.xhgg.2024.100287","url":null,"abstract":"<p><p>CREB-binding protein (CBP, encoded by CREBBP) and its paralog E1A-associated protein (p300, encoded by EP300) are involved in histone acetylation and transcriptional regulation. Variants that produce a null allele or disrupt the catalytic domain of either protein cause Rubinstein-Taybi syndrome (RSTS), while pathogenic missense and in-frame indel variants in parts of exons 30 and 31 cause phenotypes recently described as Menke-Hennekam syndrome (MKHK). To distinguish MKHK subtypes and define their characteristics, molecular and extended clinical data on 82 individuals (54 unpublished) with variants affecting CBP (n = 71) or p300 (n = 11) (NP_004371.2 residues 1,705-1,875 and NP_001420.2 residues 1,668-1,833, respectively) were summarized. Additionally, genome-wide DNA methylation profiles were assessed in DNA extracted from whole peripheral blood from 54 individuals. Most variants clustered closely around the zinc-binding residues of two zinc-finger domains (ZZ and TAZ2) and within the first α helix of the fourth intrinsically disordered linker (ID4) of CBP/p300. Domain-specific methylation profiles were discerned for the ZZ domain in CBP/p300 (found in nine out of 10 tested individuals) and TAZ2 domain in CBP (in 14 out of 20), while a domain-specific diagnostic episignature was refined for the ID4 domain in CBP/p300 (in 21 out of 21). Phenotypes including intellectual disability of varying degree and distinct physical features were defined for each of the regions. These findings demonstrate existence of at least three MKHK subtypes, which are domain specific (MKHK-ZZ, MKHK-TAZ2, and MKHK-ID4) rather than gene specific (CREBBP/EP300). DNA methylation episignatures enable stratification of molecular pathophysiologic entities within a gene or across a family of paralogous genes.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100287"},"PeriodicalIF":3.3,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11040166/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140327217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-18Epub Date: 2024-04-24DOI: 10.1016/j.xhgg.2024.100299
Rachel Y Oh, Ali AlMail, David Cheerie, George Guirguis, Huayun Hou, Kyoko E Yuki, Bushra Haque, Bhooma Thiruvahindrapuram, Christian R Marshall, Roberto Mendoza-Londono, Adam Shlien, Lianna G Kyriakopoulou, Susan Walker, James J Dowling, Michael D Wilson, Gregory Costain
Canonical splice site variants (CSSVs) are often presumed to cause loss-of-function (LoF) and are assigned very strong evidence of pathogenicity (according to American College of Medical Genetics/Association for Molecular Pathology criterion PVS1). The exact nature and predictability of splicing effects of unselected rare CSSVs in blood-expressed genes are poorly understood. We identified 168 rare CSSVs in blood-expressed genes in 112 individuals using genome sequencing, and studied their impact on splicing using RNA sequencing (RNA-seq). There was no evidence of a frameshift, nor of reduced expression consistent with nonsense-mediated decay, for 25.6% of CSSVs: 17.9% had wildtype splicing only and normal junction depths, 3.6% resulted in cryptic splice site usage and in-frame insertions or deletions, 3.6% resulted in full exon skipping (in frame), and 0.6% resulted in full intron inclusion (in frame). Blind to these RNA-seq data, we attempted to predict the precise impact of CSSVs by applying in silico tools and the ClinGen Sequence Variant Interpretation Working Group 2018 guidelines for applying PVS1 criterion. The predicted impact on splicing using (1) SpliceAI, (2) MaxEntScan, and (3) AutoPVS1, an automatic classification tool for PVS1 interpretation of null variants that utilizes Ensembl Variant Effect Predictor and MaxEntScan, was concordant with RNA-seq analyses for 65%, 63%, and 61% of CSSVs, respectively. In summary, approximately one in four rare CSSVs did not show evidence for LoF based on analysis of RNA-seq data. Predictions from in silico methods were often discordant with findings from RNA-seq. More caution may be warranted in applying PVS1-level evidence to CSSVs in the absence of functional data.
{"title":"A systematic assessment of the impact of rare canonical splice site variants on splicing using functional and in silico methods.","authors":"Rachel Y Oh, Ali AlMail, David Cheerie, George Guirguis, Huayun Hou, Kyoko E Yuki, Bushra Haque, Bhooma Thiruvahindrapuram, Christian R Marshall, Roberto Mendoza-Londono, Adam Shlien, Lianna G Kyriakopoulou, Susan Walker, James J Dowling, Michael D Wilson, Gregory Costain","doi":"10.1016/j.xhgg.2024.100299","DOIUrl":"10.1016/j.xhgg.2024.100299","url":null,"abstract":"<p><p>Canonical splice site variants (CSSVs) are often presumed to cause loss-of-function (LoF) and are assigned very strong evidence of pathogenicity (according to American College of Medical Genetics/Association for Molecular Pathology criterion PVS1). The exact nature and predictability of splicing effects of unselected rare CSSVs in blood-expressed genes are poorly understood. We identified 168 rare CSSVs in blood-expressed genes in 112 individuals using genome sequencing, and studied their impact on splicing using RNA sequencing (RNA-seq). There was no evidence of a frameshift, nor of reduced expression consistent with nonsense-mediated decay, for 25.6% of CSSVs: 17.9% had wildtype splicing only and normal junction depths, 3.6% resulted in cryptic splice site usage and in-frame insertions or deletions, 3.6% resulted in full exon skipping (in frame), and 0.6% resulted in full intron inclusion (in frame). Blind to these RNA-seq data, we attempted to predict the precise impact of CSSVs by applying in silico tools and the ClinGen Sequence Variant Interpretation Working Group 2018 guidelines for applying PVS1 criterion. The predicted impact on splicing using (1) SpliceAI, (2) MaxEntScan, and (3) AutoPVS1, an automatic classification tool for PVS1 interpretation of null variants that utilizes Ensembl Variant Effect Predictor and MaxEntScan, was concordant with RNA-seq analyses for 65%, 63%, and 61% of CSSVs, respectively. In summary, approximately one in four rare CSSVs did not show evidence for LoF based on analysis of RNA-seq data. Predictions from in silico methods were often discordant with findings from RNA-seq. More caution may be warranted in applying PVS1-level evidence to CSSVs in the absence of functional data.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100299"},"PeriodicalIF":3.3,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11144818/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140859278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-18Epub Date: 2024-06-06DOI: 10.1016/j.xhgg.2024.100316
Sara Azidane, Xavier Gallego, Lynn Durham, Mario Cáceres, Emre Guney, Laura Pérez-Cano
Copy-number variants (CNVs) are genome-wide structural variations involving the duplication or deletion of large nucleotide sequences. While these types of variations can be commonly found in humans, large and rare CNVs are known to contribute to the development of various neurodevelopmental disorders (NDDs), including autism spectrum disorder (ASD). Nevertheless, given that these NDD-risk CNVs cover broad regions of the genome, it is particularly challenging to pinpoint the critical gene(s) responsible for the manifestation of the phenotype. In this study, we performed a meta-analysis of CNV data from 11,614 affected individuals with NDDs and 4,031 control individuals from SFARI database to identify 41 NDD-risk CNV loci, including 24 novel regions. We also found evidence for dosage-sensitive genes within these regions being significantly enriched for known NDD-risk genes and pathways. In addition, a significant proportion of these genes was found to (1) converge in protein-protein interaction networks, (2) be among most expressed genes in the brain across all developmental stages, and (3) be hit by deletions that are significantly over-transmitted to individuals with ASD within multiplex ASD families from the iHART cohort. Finally, we conducted a burden analysis using 4,281 NDD cases from Decipher and iHART cohorts, and 2,504 neurotypical control individuals from 1000 Genomes and iHART, which resulted in the validation of the association of 162 dosage-sensitive genes driving risk for NDDs, including 22 novel NDD-risk genes. Importantly, most NDD-risk CNV loci entail multiple NDD-risk genes in agreement with a polygenic model associated with the majority of NDD cases.
{"title":"Identification of novel driver risk genes in CNV loci associated with neurodevelopmental disorders.","authors":"Sara Azidane, Xavier Gallego, Lynn Durham, Mario Cáceres, Emre Guney, Laura Pérez-Cano","doi":"10.1016/j.xhgg.2024.100316","DOIUrl":"10.1016/j.xhgg.2024.100316","url":null,"abstract":"<p><p>Copy-number variants (CNVs) are genome-wide structural variations involving the duplication or deletion of large nucleotide sequences. While these types of variations can be commonly found in humans, large and rare CNVs are known to contribute to the development of various neurodevelopmental disorders (NDDs), including autism spectrum disorder (ASD). Nevertheless, given that these NDD-risk CNVs cover broad regions of the genome, it is particularly challenging to pinpoint the critical gene(s) responsible for the manifestation of the phenotype. In this study, we performed a meta-analysis of CNV data from 11,614 affected individuals with NDDs and 4,031 control individuals from SFARI database to identify 41 NDD-risk CNV loci, including 24 novel regions. We also found evidence for dosage-sensitive genes within these regions being significantly enriched for known NDD-risk genes and pathways. In addition, a significant proportion of these genes was found to (1) converge in protein-protein interaction networks, (2) be among most expressed genes in the brain across all developmental stages, and (3) be hit by deletions that are significantly over-transmitted to individuals with ASD within multiplex ASD families from the iHART cohort. Finally, we conducted a burden analysis using 4,281 NDD cases from Decipher and iHART cohorts, and 2,504 neurotypical control individuals from 1000 Genomes and iHART, which resulted in the validation of the association of 162 dosage-sensitive genes driving risk for NDDs, including 22 novel NDD-risk genes. Importantly, most NDD-risk CNV loci entail multiple NDD-risk genes in agreement with a polygenic model associated with the majority of NDD cases.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100316"},"PeriodicalIF":3.3,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11264174/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141288712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-18Epub Date: 2024-06-19DOI: 10.1016/j.xhgg.2024.100320
Xinan Wang, Kangcheng Hou, Biagio Ricciuti, Joao V Alessi, Xihao Li, Federica Pecci, Rounak Dey, Jia Luo, Mark M Awad, Alexander Gusev, Xihong Lin, Bruce E Johnson, David C Christiani
The KRAS mutation is the most common oncogenic driver in patients with non-small cell lung cancer (NSCLC). However, a detailed understanding of how self-reported race and/or ethnicity (SIRE), genetically inferred ancestry (GIA), and their interaction affect KRAS mutation is largely unknown. Here, we investigated the associations between SIRE, quantitative GIA, and KRAS mutation and its allele-specific subtypes in a multi-ethnic cohort of 3,918 patients from the Boston Lung Cancer Survival cohort and the Chinese OrigiMed cohort with an independent validation cohort of 1,450 patients with NSCLC. This comprehensive analysis included detailed covariates such as age at diagnosis, sex, clinical stage, cancer histology, and smoking status. We report that SIRE is significantly associated with KRAS mutations, modified by sex, with SIRE-Asian patients showing lower rates of KRAS mutation, transversion substitution, and the allele-specific subtype KRASG12C compared to SIRE-White patients after adjusting for potential confounders. Moreover, GIA was found to correlate with KRAS mutations, where patients with a higher proportion of European ancestry had an increased risk of KRAS mutations, especially more transition substitutions and KRASG12D. Notably, among SIRE-White patients, an increase in European ancestry was linked to a higher likelihood of KRAS mutations, whereas an increase in admixed American ancestry was associated with a reduced likelihood, suggesting that quantitative GIA offers additional information beyond SIRE. The association of SIRE, GIA, and their interplay with KRAS driver mutations in NSCLC highlights the importance of incorporating both into population-based cancer research, aiming to refine clinical decision-making processes and mitigate health disparities.
{"title":"Additional impact of genetic ancestry over race/ethnicity to prevalence of KRAS mutations and allele-specific subtypes in non-small cell lung cancer.","authors":"Xinan Wang, Kangcheng Hou, Biagio Ricciuti, Joao V Alessi, Xihao Li, Federica Pecci, Rounak Dey, Jia Luo, Mark M Awad, Alexander Gusev, Xihong Lin, Bruce E Johnson, David C Christiani","doi":"10.1016/j.xhgg.2024.100320","DOIUrl":"10.1016/j.xhgg.2024.100320","url":null,"abstract":"<p><p>The KRAS mutation is the most common oncogenic driver in patients with non-small cell lung cancer (NSCLC). However, a detailed understanding of how self-reported race and/or ethnicity (SIRE), genetically inferred ancestry (GIA), and their interaction affect KRAS mutation is largely unknown. Here, we investigated the associations between SIRE, quantitative GIA, and KRAS mutation and its allele-specific subtypes in a multi-ethnic cohort of 3,918 patients from the Boston Lung Cancer Survival cohort and the Chinese OrigiMed cohort with an independent validation cohort of 1,450 patients with NSCLC. This comprehensive analysis included detailed covariates such as age at diagnosis, sex, clinical stage, cancer histology, and smoking status. We report that SIRE is significantly associated with KRAS mutations, modified by sex, with SIRE-Asian patients showing lower rates of KRAS mutation, transversion substitution, and the allele-specific subtype KRAS<sup>G12C</sup> compared to SIRE-White patients after adjusting for potential confounders. Moreover, GIA was found to correlate with KRAS mutations, where patients with a higher proportion of European ancestry had an increased risk of KRAS mutations, especially more transition substitutions and KRAS<sup>G12D</sup>. Notably, among SIRE-White patients, an increase in European ancestry was linked to a higher likelihood of KRAS mutations, whereas an increase in admixed American ancestry was associated with a reduced likelihood, suggesting that quantitative GIA offers additional information beyond SIRE. The association of SIRE, GIA, and their interplay with KRAS driver mutations in NSCLC highlights the importance of incorporating both into population-based cancer research, aiming to refine clinical decision-making processes and mitigate health disparities.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100320"},"PeriodicalIF":3.3,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11452329/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141432989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}