{"title":"A framework for associating structural variants with cell-specific transcription factors and histone modifications in defect phenotypes","authors":"T. Becker, D. Bayarsaihan, Dong-Guk Shin","doi":"10.1109/BIBM55620.2022.9995130","DOIUrl":null,"url":null,"abstract":"Structural Variations (SVs) naturally occur in healthy human populations and account for more genomic difference than Single Nucleotide Variations (SNVs). However, because of the low prevalence in gene coding regions and the ubiquity in the intergenic regions, determining SV association for defect phenotypes remains challenging. We developed a framework that classifies five categories of SVs using defect associated genes along with cell-specific histone modifications (HM) concurrently with transcription factor binding sites (TFBS). We completed a comprehensive structural variation analysis of 17 family trios consisting of healthy paternal and maternal genomes along with the non-syndromic Orofacial cleft (OFC) child genomes which are publicly available from the Kids First Data Resource Portal. For the integrative analysis, we used ChIP-seq data from Neural Crest and Mesenchymal Stem Cell types from ENCODE along with TFBS taken from JASPAR. We found that the OFC children had elevated regulatory SVs when classified with our framework, aligning with prior studies regarding the complex non-syndromic phenotype. This result supports the use of a defect-specific gene paralog lists integrated with HMs and TFBS to identify plausible SV regions connected to a defect phenotype.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":" 47","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM55620.2022.9995130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Structural Variations (SVs) naturally occur in healthy human populations and account for more genomic difference than Single Nucleotide Variations (SNVs). However, because of the low prevalence in gene coding regions and the ubiquity in the intergenic regions, determining SV association for defect phenotypes remains challenging. We developed a framework that classifies five categories of SVs using defect associated genes along with cell-specific histone modifications (HM) concurrently with transcription factor binding sites (TFBS). We completed a comprehensive structural variation analysis of 17 family trios consisting of healthy paternal and maternal genomes along with the non-syndromic Orofacial cleft (OFC) child genomes which are publicly available from the Kids First Data Resource Portal. For the integrative analysis, we used ChIP-seq data from Neural Crest and Mesenchymal Stem Cell types from ENCODE along with TFBS taken from JASPAR. We found that the OFC children had elevated regulatory SVs when classified with our framework, aligning with prior studies regarding the complex non-syndromic phenotype. This result supports the use of a defect-specific gene paralog lists integrated with HMs and TFBS to identify plausible SV regions connected to a defect phenotype.