Brydon P. G. Wall, My Nguyen, J. Chuck Harrell, Mikhail G. Dozmorov
{"title":"Machine and deep learning methods for predicting 3D genome organization","authors":"Brydon P. G. Wall, My Nguyen, J. Chuck Harrell, Mikhail G. Dozmorov","doi":"arxiv-2403.03231","DOIUrl":null,"url":null,"abstract":"Three-Dimensional (3D) chromatin interactions, such as enhancer-promoter\ninteractions (EPIs), loops, Topologically Associating Domains (TADs), and A/B\ncompartments play critical roles in a wide range of cellular processes by\nregulating gene expression. Recent development of chromatin conformation\ncapture technologies has enabled genome-wide profiling of various 3D\nstructures, even with single cells. However, current catalogs of 3D structures\nremain incomplete and unreliable due to differences in technology, tools, and\nlow data resolution. Machine learning methods have emerged as an alternative to\nobtain missing 3D interactions and/or improve resolution. Such methods\nfrequently use genome annotation data (ChIP-seq, DNAse-seq, etc.), DNA\nsequencing information (k-mers, Transcription Factor Binding Site (TFBS)\nmotifs), and other genomic properties to learn the associations between genomic\nfeatures and chromatin interactions. In this review, we discuss computational\ntools for predicting three types of 3D interactions (EPIs, chromatin\ninteractions, TAD boundaries) and analyze their pros and cons. We also point\nout obstacles of computational prediction of 3D interactions and suggest future\nresearch directions.","PeriodicalId":501070,"journal":{"name":"arXiv - QuanBio - Genomics","volume":"19 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2403.03231","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Three-Dimensional (3D) chromatin interactions, such as enhancer-promoter
interactions (EPIs), loops, Topologically Associating Domains (TADs), and A/B
compartments play critical roles in a wide range of cellular processes by
regulating gene expression. Recent development of chromatin conformation
capture technologies has enabled genome-wide profiling of various 3D
structures, even with single cells. However, current catalogs of 3D structures
remain incomplete and unreliable due to differences in technology, tools, and
low data resolution. Machine learning methods have emerged as an alternative to
obtain missing 3D interactions and/or improve resolution. Such methods
frequently use genome annotation data (ChIP-seq, DNAse-seq, etc.), DNA
sequencing information (k-mers, Transcription Factor Binding Site (TFBS)
motifs), and other genomic properties to learn the associations between genomic
features and chromatin interactions. In this review, we discuss computational
tools for predicting three types of 3D interactions (EPIs, chromatin
interactions, TAD boundaries) and analyze their pros and cons. We also point
out obstacles of computational prediction of 3D interactions and suggest future
research directions.