Haizhou Zhao, Hill Lam Lau, Kun Zhang, Chun Kit Kwok
RNA Guanine-quadruplexes (rG4s) are important nucleic acid structures that govern vital biological processes. Although numerous tools have been developed to target rG4s, few specific tools are capable of discerning individual rG4 of interest. Herein, we design and synthesize the first L-aptamer-antisense oligonucleotide (ASO) conjugate, L-Apt.4-1c-ASO15nt(APP), with a focus on recognizing the amyloid precursor protein (APP) rG4 region as an example. The L-aptamer module binds with the rG4 structure, whereas ASO hybridizes with flanking sequences. Together, these two modules enhance the precise recognition of APP rG4. We demonstrate that the L-Apt.4-1c-ASO15nt(APP) conjugate can interact with the APP rG4 region with sub-nanomolar binding affinity, and distinguish APP rG4 from other G4s and non-G4s in vitro and in cells. We also show that L-Apt.4-1c-ASO15nt(APP) can inhibit APP protein expression. Notably, we investigate the inhibitory mechanism of this newly developed tool, and reveal that it controls gene expression by hindering DHX36 protein from unraveling the rG4, as well as by promoting translational inhibition and RNase H-mediated mRNA knockdown activity. Our novel L-aptamer-ASO conjugate tool not only enables the specific recognition of rG4 region of interest, but also allows efficient gene control via targeting rG4-containing transcripts in cells.
{"title":"Selective recognition of RNA G-quadruplex in vitro and in cells by L-aptamer-D-oligonucleotide conjugate.","authors":"Haizhou Zhao, Hill Lam Lau, Kun Zhang, Chun Kit Kwok","doi":"10.1093/nar/gkae1034","DOIUrl":"10.1093/nar/gkae1034","url":null,"abstract":"<p><p>RNA Guanine-quadruplexes (rG4s) are important nucleic acid structures that govern vital biological processes. Although numerous tools have been developed to target rG4s, few specific tools are capable of discerning individual rG4 of interest. Herein, we design and synthesize the first L-aptamer-antisense oligonucleotide (ASO) conjugate, L-Apt.4-1c-ASO15nt(APP), with a focus on recognizing the amyloid precursor protein (APP) rG4 region as an example. The L-aptamer module binds with the rG4 structure, whereas ASO hybridizes with flanking sequences. Together, these two modules enhance the precise recognition of APP rG4. We demonstrate that the L-Apt.4-1c-ASO15nt(APP) conjugate can interact with the APP rG4 region with sub-nanomolar binding affinity, and distinguish APP rG4 from other G4s and non-G4s in vitro and in cells. We also show that L-Apt.4-1c-ASO15nt(APP) can inhibit APP protein expression. Notably, we investigate the inhibitory mechanism of this newly developed tool, and reveal that it controls gene expression by hindering DHX36 protein from unraveling the rG4, as well as by promoting translational inhibition and RNase H-mediated mRNA knockdown activity. Our novel L-aptamer-ASO conjugate tool not only enables the specific recognition of rG4 region of interest, but also allows efficient gene control via targeting rG4-containing transcripts in cells.</p>","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":" ","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142668535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Damian Szklarczyk, Katerina Nastou, Mikaela Koutrouli, Rebecca Kirsch, Farrokh Mehryary, Radja Hachilif, Dewei Hu, Matteo E Peluso, Qingyao Huang, Tao Fang, Nadezhda T Doncheva, Sampo Pyysalo, Peer Bork, Lars J Jensen, Christian von Mering
Proteins cooperate, regulate and bind each other to achieve their functions. Understanding the complex network of their interactions is essential for a systems-level description of cellular processes. The STRING database compiles, scores and integrates protein-protein association information drawn from experimental assays, computational predictions and prior knowledge. Its goal is to create comprehensive and objective global networks that encompass both physical and functional interactions. Additionally, STRING provides supplementary tools such as network clustering and pathway enrichment analysis. The latest version, STRING 12.5, introduces a new 'regulatory network', for which it gathers evidence on the type and directionality of interactions using curated pathway databases and a fine-tuned language model parsing the literature. This update enables users to visualize and access three distinct network types-functional, physical and regulatory-separately, each applicable to distinct research needs. In addition, the pathway enrichment detection functionality has been updated, with better false discovery rate corrections, redundancy filtering and improved visual displays. The resource now also offers improved annotations of clustered networks and provides users with downloadable network embeddings, which facilitate the use of STRING networks in machine learning and allow cross-species transfer of protein information. The STRING database is available online at https://string-db.org/.
{"title":"The STRING database in 2025: protein networks with directionality of regulation.","authors":"Damian Szklarczyk, Katerina Nastou, Mikaela Koutrouli, Rebecca Kirsch, Farrokh Mehryary, Radja Hachilif, Dewei Hu, Matteo E Peluso, Qingyao Huang, Tao Fang, Nadezhda T Doncheva, Sampo Pyysalo, Peer Bork, Lars J Jensen, Christian von Mering","doi":"10.1093/nar/gkae1113","DOIUrl":"10.1093/nar/gkae1113","url":null,"abstract":"<p><p>Proteins cooperate, regulate and bind each other to achieve their functions. Understanding the complex network of their interactions is essential for a systems-level description of cellular processes. The STRING database compiles, scores and integrates protein-protein association information drawn from experimental assays, computational predictions and prior knowledge. Its goal is to create comprehensive and objective global networks that encompass both physical and functional interactions. Additionally, STRING provides supplementary tools such as network clustering and pathway enrichment analysis. The latest version, STRING 12.5, introduces a new 'regulatory network', for which it gathers evidence on the type and directionality of interactions using curated pathway databases and a fine-tuned language model parsing the literature. This update enables users to visualize and access three distinct network types-functional, physical and regulatory-separately, each applicable to distinct research needs. In addition, the pathway enrichment detection functionality has been updated, with better false discovery rate corrections, redundancy filtering and improved visual displays. The resource now also offers improved annotations of clustered networks and provides users with downloadable network embeddings, which facilitate the use of STRING networks in machine learning and allow cross-species transfer of protein information. The STRING database is available online at https://string-db.org/.</p>","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":" ","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142668548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Joshua Straka, Jude B Khatib, Lindsey Pale, Claudia M Nicolae, George-Lucian Moldovan
Suppression of single-stranded DNA (ssDNA) gap accumulation at replication forks has emerged as a potential determinant of chemosensitivity in homologous recombination (HR)-deficient tumors, as ssDNA gaps are transformed into cytotoxic double-stranded DNA breaks. We have previously shown that the histone chaperone CAF-1's nucleosome deposition function is vital to preventing degradation of stalled replication forks correlating with HR-deficient cells' response to genotoxic drugs. Here we report that the CAF-1-ASF1 pathway promotes ssDNA gap accumulation at replication forks in both wild-type and breast cancer (BRCA)-deficient backgrounds. We show that this is independent of CAF-1's nucleosome deposition function but instead may rely on its proper localization to replication forks. Moreover, we show that the efficient localization to nascent DNA of PrimPol, the enzyme responsible for repriming upon replication stress, is dependent on CAF-1. As PrimPol has been shown to be responsible for generating ssDNA gaps as a byproduct of its repriming function, CAF-1's role in its recruitment could directly impact ssDNA gap formation. We also show that chemoresistance observed in HR-deficient cells when CAF-1 or ASF1A are lost correlates with suppression of ssDNA gaps rather than protection of stalled replication forks. Overall, this work identifies an unexpected role of CAF-1 in regulating PrimPol recruitment and ssDNA gap generation.
{"title":"CAF-1 promotes efficient PrimPol recruitment to nascent DNA for single-stranded DNA gap formation.","authors":"Joshua Straka, Jude B Khatib, Lindsey Pale, Claudia M Nicolae, George-Lucian Moldovan","doi":"10.1093/nar/gkae1068","DOIUrl":"10.1093/nar/gkae1068","url":null,"abstract":"<p><p>Suppression of single-stranded DNA (ssDNA) gap accumulation at replication forks has emerged as a potential determinant of chemosensitivity in homologous recombination (HR)-deficient tumors, as ssDNA gaps are transformed into cytotoxic double-stranded DNA breaks. We have previously shown that the histone chaperone CAF-1's nucleosome deposition function is vital to preventing degradation of stalled replication forks correlating with HR-deficient cells' response to genotoxic drugs. Here we report that the CAF-1-ASF1 pathway promotes ssDNA gap accumulation at replication forks in both wild-type and breast cancer (BRCA)-deficient backgrounds. We show that this is independent of CAF-1's nucleosome deposition function but instead may rely on its proper localization to replication forks. Moreover, we show that the efficient localization to nascent DNA of PrimPol, the enzyme responsible for repriming upon replication stress, is dependent on CAF-1. As PrimPol has been shown to be responsible for generating ssDNA gaps as a byproduct of its repriming function, CAF-1's role in its recruitment could directly impact ssDNA gap formation. We also show that chemoresistance observed in HR-deficient cells when CAF-1 or ASF1A are lost correlates with suppression of ssDNA gaps rather than protection of stalled replication forks. Overall, this work identifies an unexpected role of CAF-1 in regulating PrimPol recruitment and ssDNA gap generation.</p>","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":" ","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142668357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genomic, epigenomic and transcriptomic alterations are hallmarks of cancer cells, and are closely connected. Especially, epigenetic regulation plays a critical role in tumorigenesis and progression. The growing single-cell epigenome data in cancer research provide new opportunities for data mining from a more comprehensive perspective. However, there is still a lack of databases designed for interactively exploring the single-cell multi-omics data of human pan-cancer, especially for the single-cell epigenome data. To fill in the gap, we developed scCancerExplorer, a comprehensive and user-friendly database to facilitate the exploration of the single-cell genome, epigenome (chromatin accessibility and DNA methylation), and transcriptome data of 50 cancer types. Five major modules were provided to explore those data interactively, including 'Integrated multi-omics analysis', 'Single-cell transcriptome', 'Single-cell epigenome', 'Single-cell genome' and 'TCGA analysis'. By simple clicking, users can easily investigate gene expression features, chromatin accessibility patterns, transcription factor activities, DNA methylation states, copy number variations and TCGA survival analysis results. Taken together, scCancerExplorer is distinguished from previous databases with rich and interactive functions for exploring the single-cell multi-omics data of human pan-cancer. It bridges the gap between single-cell multi-omics data and the end-users, and will facilitate progress in the field of cancer research. scCancerExplorer is freely accessible via https://bianlab.cn/scCancerExplorer.
{"title":"scCancerExplorer: a comprehensive database for interactively exploring single-cell multi-omics data of human pan-cancer.","authors":"Changzhi Huang, Zekai Liu, Yunlei Guo, Wanchu Wang, Zhen Yuan, Yusheng Guan, Deng Pan, Zhibin Hu, Linhua Sun, Zan Fu, Shuhui Bian","doi":"10.1093/nar/gkae1100","DOIUrl":"10.1093/nar/gkae1100","url":null,"abstract":"<p><p>Genomic, epigenomic and transcriptomic alterations are hallmarks of cancer cells, and are closely connected. Especially, epigenetic regulation plays a critical role in tumorigenesis and progression. The growing single-cell epigenome data in cancer research provide new opportunities for data mining from a more comprehensive perspective. However, there is still a lack of databases designed for interactively exploring the single-cell multi-omics data of human pan-cancer, especially for the single-cell epigenome data. To fill in the gap, we developed scCancerExplorer, a comprehensive and user-friendly database to facilitate the exploration of the single-cell genome, epigenome (chromatin accessibility and DNA methylation), and transcriptome data of 50 cancer types. Five major modules were provided to explore those data interactively, including 'Integrated multi-omics analysis', 'Single-cell transcriptome', 'Single-cell epigenome', 'Single-cell genome' and 'TCGA analysis'. By simple clicking, users can easily investigate gene expression features, chromatin accessibility patterns, transcription factor activities, DNA methylation states, copy number variations and TCGA survival analysis results. Taken together, scCancerExplorer is distinguished from previous databases with rich and interactive functions for exploring the single-cell multi-omics data of human pan-cancer. It bridges the gap between single-cell multi-omics data and the end-users, and will facilitate progress in the field of cancer research. scCancerExplorer is freely accessible via https://bianlab.cn/scCancerExplorer.</p>","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":" ","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142668533","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Unraveling the causal variants from genome wide association studies (GWASs) is pivotal for understanding genetic underpinnings of complex traits and diseases. Despite continuous efforts, tools to refine and prioritize GWAS signals need enhancement to address the direct causal implications of genetic variations. To overcome challenges related to statistical fine-mapping in identifying causal variants, CAUSALdb has been updated with novel features and comprehensive datasets, morphing into CAUSALdb2. This expanded repository integrates 15 057 updated GWAS summary statistics across 10 839 unique traits and implements both LD-based and LD-free fine-mapping approaches, including innovative applications of approximate Bayes Factor and SuSiE. Additionally, by incorporating larger LD reference panels such as TOPMED and UK Biobank, and integrating functional annotations via PolyFun, CAUSALdb2 enhances the accuracy and context of fine-mapping results. The database now supports interrogation of additional causal signals and offers sophisticated visualizations to aid researchers in deciphering complex genetic architectures. By facilitating a deeper and more precise characterisation of causal variants, CAUSALdb2 serves as a crucial tool for advancing the genetic analysis of complex diseases. Available freely, CAUSALdb2 continues to set benchmarks in the post-GWAS era, fostering the development of targeted diagnostics and therapeutics derived from responsible genetic research. Explore these advancements at http://mulinlab.org/causaldb.
{"title":"CAUSALdb2: an updated database for causal variants of complex traits.","authors":"Jianhua Wang, Liao Ouyang, Tianyi You, Nianling Yang, Xinran Xu, Wenwen Zhang, Hongxi Yang, Xianfu Yi, Dandan Huang, Wenhao Zhou, Mulin Jun Li","doi":"10.1093/nar/gkae1096","DOIUrl":"10.1093/nar/gkae1096","url":null,"abstract":"<p><p>Unraveling the causal variants from genome wide association studies (GWASs) is pivotal for understanding genetic underpinnings of complex traits and diseases. Despite continuous efforts, tools to refine and prioritize GWAS signals need enhancement to address the direct causal implications of genetic variations. To overcome challenges related to statistical fine-mapping in identifying causal variants, CAUSALdb has been updated with novel features and comprehensive datasets, morphing into CAUSALdb2. This expanded repository integrates 15 057 updated GWAS summary statistics across 10 839 unique traits and implements both LD-based and LD-free fine-mapping approaches, including innovative applications of approximate Bayes Factor and SuSiE. Additionally, by incorporating larger LD reference panels such as TOPMED and UK Biobank, and integrating functional annotations via PolyFun, CAUSALdb2 enhances the accuracy and context of fine-mapping results. The database now supports interrogation of additional causal signals and offers sophisticated visualizations to aid researchers in deciphering complex genetic architectures. By facilitating a deeper and more precise characterisation of causal variants, CAUSALdb2 serves as a crucial tool for advancing the genetic analysis of complex diseases. Available freely, CAUSALdb2 continues to set benchmarks in the post-GWAS era, fostering the development of targeted diagnostics and therapeutics derived from responsible genetic research. Explore these advancements at http://mulinlab.org/causaldb.</p>","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":" ","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142668261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kasun W Samarasinghe, Max Kotlyar, Sylvain D Vallet, Catherine Hayes, Alexandra Naba, Igor Jurisica, Frédérique Lisacek, Sylvie Ricard-Blum
MatrixDB, a member of the International Molecular Exchange consortium (IMEx), is a curated interaction database focused on interactions established by extracellular matrix (ECM) constituents including proteins, proteoglycans, glycosaminoglycans and ECM bioactive fragments. The architecture of MatrixDB was upgraded to ease interaction data export, allow versioning and programmatic access and ensure sustainability. The new version of the database includes more than twice the number of manually curated and experimentally-supported interactions. High-confidence predicted interactions were imported from the Integrated Interactions Database to increase the coverage of the ECM interactome. ECM and ECM-associated proteins of five species (human, murine, bovine, avian and zebrafish) were annotated with matrisome divisions and categories, which are used for computational analyses of ECM -omic datasets. Biological pathways from the Reactome Pathway Knowledgebase were also added to the biomolecule description. New transcriptomic and expanded proteomic datasets were imported in MatrixDB to generate cell- and tissue-specific ECM networks using the newly developed in-house Network Explorer integrated in the database. MatrixDB is freely available at https://matrixdb.univ-lyon1.fr.
{"title":"MatrixDB 2024: an increased coverage of extracellular matrix interactions, a new Network Explorer and a new web interface.","authors":"Kasun W Samarasinghe, Max Kotlyar, Sylvain D Vallet, Catherine Hayes, Alexandra Naba, Igor Jurisica, Frédérique Lisacek, Sylvie Ricard-Blum","doi":"10.1093/nar/gkae1088","DOIUrl":"10.1093/nar/gkae1088","url":null,"abstract":"<p><p>MatrixDB, a member of the International Molecular Exchange consortium (IMEx), is a curated interaction database focused on interactions established by extracellular matrix (ECM) constituents including proteins, proteoglycans, glycosaminoglycans and ECM bioactive fragments. The architecture of MatrixDB was upgraded to ease interaction data export, allow versioning and programmatic access and ensure sustainability. The new version of the database includes more than twice the number of manually curated and experimentally-supported interactions. High-confidence predicted interactions were imported from the Integrated Interactions Database to increase the coverage of the ECM interactome. ECM and ECM-associated proteins of five species (human, murine, bovine, avian and zebrafish) were annotated with matrisome divisions and categories, which are used for computational analyses of ECM -omic datasets. Biological pathways from the Reactome Pathway Knowledgebase were also added to the biomolecule description. New transcriptomic and expanded proteomic datasets were imported in MatrixDB to generate cell- and tissue-specific ECM networks using the newly developed in-house Network Explorer integrated in the database. MatrixDB is freely available at https://matrixdb.univ-lyon1.fr.</p>","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":" ","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142668400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Correction to 'Click display: a rapid and efficient in vitro protein display method for directed evolution'.","authors":"","doi":"10.1093/nar/gkae1172","DOIUrl":"10.1093/nar/gkae1172","url":null,"abstract":"","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":" ","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142668267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Spatial transcriptomics sequencing technology deepens our understanding of the diversity of cell behaviors, fates and states within complex tissue, which is often determined by the fine-tuning of regulatory network functional activities. Therefore, characterizing the functional activity within tissue space is helpful for revealing the functional features that drive spatial heterogeneity, and understanding complex biological processes. Here, we describe a database, SPathDB (http://bio-bigdata.hrbmu.edu.cn/SPathDB/), which aims to dissect the pathway-mediated multidimensional spatial heterogeneity in the context of functional activity. We manually curated spatial transcriptomics datasets and biological pathways from public data resources. SPathDB consists of 1689 868 spatial spots of 695 slices from 84 spatial transcriptome datasets of human and mouse, which involves 36 tissues, and also diseases such as cancer, and provides interactive analysis and visualization of the functional activities of 114 998 pathways across these spatial spots. SPathDB provides five flexible interfaces to retrieve and analyze pathways with highly variable functional activity across spatial spots, the distribution of pathway functional activities along pseudo-space axis, pathway-mediated spatial intercellular communications and the associations between spatial pathway functional activity and the occurrence of cell types. SPathDB will serve as a foundational resource for identifying functional features and elucidating underlying mechanisms of spatial heterogeneity.
{"title":"SPathDB: a comprehensive database of spatial pathway activity atlas.","authors":"Feng Li, Xinyu Song, Wenli Fan, Liying Pei, Jiaqi Liu, Rui Zhao, Yifang Zhang, Mengyue Li, Kaiyue Song, Yu Sun, Chunlong Zhang, Yunpeng Zhang, Yanjun Xu","doi":"10.1093/nar/gkae1041","DOIUrl":"https://doi.org/10.1093/nar/gkae1041","url":null,"abstract":"<p><p>Spatial transcriptomics sequencing technology deepens our understanding of the diversity of cell behaviors, fates and states within complex tissue, which is often determined by the fine-tuning of regulatory network functional activities. Therefore, characterizing the functional activity within tissue space is helpful for revealing the functional features that drive spatial heterogeneity, and understanding complex biological processes. Here, we describe a database, SPathDB (http://bio-bigdata.hrbmu.edu.cn/SPathDB/), which aims to dissect the pathway-mediated multidimensional spatial heterogeneity in the context of functional activity. We manually curated spatial transcriptomics datasets and biological pathways from public data resources. SPathDB consists of 1689 868 spatial spots of 695 slices from 84 spatial transcriptome datasets of human and mouse, which involves 36 tissues, and also diseases such as cancer, and provides interactive analysis and visualization of the functional activities of 114 998 pathways across these spatial spots. SPathDB provides five flexible interfaces to retrieve and analyze pathways with highly variable functional activity across spatial spots, the distribution of pathway functional activities along pseudo-space axis, pathway-mediated spatial intercellular communications and the associations between spatial pathway functional activity and the occurrence of cell types. SPathDB will serve as a foundational resource for identifying functional features and elucidating underlying mechanisms of spatial heterogeneity.</p>","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":" ","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142639452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
RNA molecules function in numerous biological processes by folding into intricate structures. Here we present RASP v2.0, an updated database for RNA structure probing data featuring a substantially expanded collection of datasets along with enhanced online structural analysis functionalities. Compared to the previous version, RASP v2.0 includes the following improvements: (i) the number of RNA structure datasets has increased from 156 to 438, comprising 216 transcriptome-wide RNA structure datasets, 141 target-specific RNA structure datasets, and 81 RNA-RNA interaction datasets, thereby broadening species coverage from 18 to 24, (ii) a deep learning-based model has been implemented to impute missing structural signals for 59 transcriptome-wide RNA structure datasets with low structure score coverage, significantly enhancing data quality, particularly for low-abundance RNAs, (iii) three new online analysis modules have been deployed to assist RNA structure studies, including missing structure score imputation, RNA secondary and tertiary structure prediction, and RNA binding protein (RBP) binding prediction. By providing a resource of much more comprehensive RNA structure data, RASP v2.0 is poised to facilitate the exploration of RNA structure-function relationships across diverse biological processes. RASP v2.0 is freely accessible at http://rasp2.zhanglab.net/.
{"title":"RASP v2.0: an updated atlas for RNA structure probing data.","authors":"Kunting Mu, Yuhan Fei, Yiran Xu, Qiangfeng Cliff Zhang","doi":"10.1093/nar/gkae1117","DOIUrl":"https://doi.org/10.1093/nar/gkae1117","url":null,"abstract":"<p><p>RNA molecules function in numerous biological processes by folding into intricate structures. Here we present RASP v2.0, an updated database for RNA structure probing data featuring a substantially expanded collection of datasets along with enhanced online structural analysis functionalities. Compared to the previous version, RASP v2.0 includes the following improvements: (i) the number of RNA structure datasets has increased from 156 to 438, comprising 216 transcriptome-wide RNA structure datasets, 141 target-specific RNA structure datasets, and 81 RNA-RNA interaction datasets, thereby broadening species coverage from 18 to 24, (ii) a deep learning-based model has been implemented to impute missing structural signals for 59 transcriptome-wide RNA structure datasets with low structure score coverage, significantly enhancing data quality, particularly for low-abundance RNAs, (iii) three new online analysis modules have been deployed to assist RNA structure studies, including missing structure score imputation, RNA secondary and tertiary structure prediction, and RNA binding protein (RBP) binding prediction. By providing a resource of much more comprehensive RNA structure data, RASP v2.0 is poised to facilitate the exploration of RNA structure-function relationships across diverse biological processes. RASP v2.0 is freely accessible at http://rasp2.zhanglab.net/.</p>","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":" ","pages":""},"PeriodicalIF":16.6,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142639399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}