Ignacio L Ibarra, Johanna Schneeberger, Ege Erdogan, Lennart Redl, Laura Martens, Dominik Klein, H. Aliee, Fabian J. Theis
{"title":"Learning sequence-based regulatory dynamics in single-cell genomics","authors":"Ignacio L Ibarra, Johanna Schneeberger, Ege Erdogan, Lennart Redl, Laura Martens, Dominik Klein, H. Aliee, Fabian J. Theis","doi":"10.1101/2024.08.07.605876","DOIUrl":null,"url":null,"abstract":"Epigenomics assays, such as chromatin accessibility, can identify DNA-sequence-specific regulatory factors. Models that predict read counts from sequence features can explain cell-based readouts using specific DNA patterns (genomic motifs) but do not encode the changes in genomic regulation over time, which is crucial for understanding biological events during cell transitions. To bridge this gap, we present muBind, a deep learning model that accurately predicts genomic counts of single-cell datasets based on DNA sequence features, their cell-based activities, and cell relationships (graphs) in a single architecture, enhancing the interpretability of cell transitions due to the possibility of inspecting motif activities weighted by nearest neighbors. MuBind shows competitive performance in bulk and single-cell genomics. When complemented with graphs learned from RNA-based dynamical models used as injected priors in our model, muBind enhances through motif-graph interactions the identification of transcriptional regulators explaining cell transition events, including Sox9 in pancreatic endocrinogenesis scATAC-seq, and Gli3/Prdm16 in mouse neurogenesis and human organoids scRNA-seq, both supported by independent evidence, including associations between chromatin and motif activities over pseudotime, TF-gene expression patterns, and biological knowledge of these regulators. muBind advances our understanding of cell transitions by revealing regulatory motifs and their interactions, providing valuable insights for genomic research and gene regulatory network dynamics. It is available at https://github.com/theislab/mubind.","PeriodicalId":505198,"journal":{"name":"bioRxiv","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.08.07.605876","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Epigenomics assays, such as chromatin accessibility, can identify DNA-sequence-specific regulatory factors. Models that predict read counts from sequence features can explain cell-based readouts using specific DNA patterns (genomic motifs) but do not encode the changes in genomic regulation over time, which is crucial for understanding biological events during cell transitions. To bridge this gap, we present muBind, a deep learning model that accurately predicts genomic counts of single-cell datasets based on DNA sequence features, their cell-based activities, and cell relationships (graphs) in a single architecture, enhancing the interpretability of cell transitions due to the possibility of inspecting motif activities weighted by nearest neighbors. MuBind shows competitive performance in bulk and single-cell genomics. When complemented with graphs learned from RNA-based dynamical models used as injected priors in our model, muBind enhances through motif-graph interactions the identification of transcriptional regulators explaining cell transition events, including Sox9 in pancreatic endocrinogenesis scATAC-seq, and Gli3/Prdm16 in mouse neurogenesis and human organoids scRNA-seq, both supported by independent evidence, including associations between chromatin and motif activities over pseudotime, TF-gene expression patterns, and biological knowledge of these regulators. muBind advances our understanding of cell transitions by revealing regulatory motifs and their interactions, providing valuable insights for genomic research and gene regulatory network dynamics. It is available at https://github.com/theislab/mubind.