首页 > 最新文献

Briefings in Functional Genomics最新文献

英文 中文
Recent advances in differential expression analysis for single-cell RNA-seq and spatially resolved transcriptomic studies. 用于单细胞 RNA-seq 和空间解析转录组研究的差异表达分析的最新进展。
IF 4 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-03-20 DOI: 10.1093/bfgp/elad011
Xiya Guo, Jin Ning, Yuanze Chen, Guoliang Liu, Liyan Zhao, Yue Fan, Shiquan Sun

Differential expression (DE) analysis is a necessary step in the analysis of single-cell RNA sequencing (scRNA-seq) and spatially resolved transcriptomics (SRT) data. Unlike traditional bulk RNA-seq, DE analysis for scRNA-seq or SRT data has unique characteristics that may contribute to the difficulty of detecting DE genes. However, the plethora of DE tools that work with various assumptions makes it difficult to choose an appropriate one. Furthermore, a comprehensive review on detecting DE genes for scRNA-seq data or SRT data from multi-condition, multi-sample experimental designs is lacking. To bridge such a gap, here, we first focus on the challenges of DE detection, then highlight potential opportunities that facilitate further progress in scRNA-seq or SRT analysis, and finally provide insights and guidance in selecting appropriate DE tools or developing new computational DE methods.

差异表达(DE)分析是分析单细胞 RNA 测序(scRNA-seq)和空间分辨转录组学(SRT)数据的必要步骤。与传统的大容量 RNA-seq 不同,scRNA-seq 或 SRT 数据的差异表达分析具有独特的特点,可能导致难以检测到差异表达基因。然而,由于有大量的 DE 工具可在各种假设条件下工作,因此很难选择合适的工具。此外,关于从多条件、多样本实验设计中检测scRNA-seq数据或SRT数据中的DE基因,目前还缺乏全面的综述。为了弥补这一空白,我们在此首先关注 DE 检测所面临的挑战,然后强调促进 scRNA-seq 或 SRT 分析进一步发展的潜在机遇,最后为选择合适的 DE 工具或开发新的计算 DE 方法提供见解和指导。
{"title":"Recent advances in differential expression analysis for single-cell RNA-seq and spatially resolved transcriptomic studies.","authors":"Xiya Guo, Jin Ning, Yuanze Chen, Guoliang Liu, Liyan Zhao, Yue Fan, Shiquan Sun","doi":"10.1093/bfgp/elad011","DOIUrl":"10.1093/bfgp/elad011","url":null,"abstract":"<p><p>Differential expression (DE) analysis is a necessary step in the analysis of single-cell RNA sequencing (scRNA-seq) and spatially resolved transcriptomics (SRT) data. Unlike traditional bulk RNA-seq, DE analysis for scRNA-seq or SRT data has unique characteristics that may contribute to the difficulty of detecting DE genes. However, the plethora of DE tools that work with various assumptions makes it difficult to choose an appropriate one. Furthermore, a comprehensive review on detecting DE genes for scRNA-seq data or SRT data from multi-condition, multi-sample experimental designs is lacking. To bridge such a gap, here, we first focus on the challenges of DE detection, then highlight potential opportunities that facilitate further progress in scRNA-seq or SRT analysis, and finally provide insights and guidance in selecting appropriate DE tools or developing new computational DE methods.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"95-109"},"PeriodicalIF":4.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9258877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating functional scoring and regulatory data to predict the effect of non-coding SNPs in a complex neurological disease. 整合功能评分和调控数据,预测非编码 SNPs 对一种复杂神经疾病的影响。
IF 4 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-03-20 DOI: 10.1093/bfgp/elad020
Daniela Felício, Miguel Alves-Ferreira, Mariana Santos, Marlene Quintas, Alexandra M Lopes, Carolina Lemos, Nádia Pinto, Sandra Martins

Most SNPs associated with complex diseases seem to lie in non-coding regions of the genome; however, their contribution to gene expression and disease phenotype remains poorly understood. Here, we established a workflow to provide assistance in prioritising the functional relevance of non-coding SNPs of candidate genes as susceptibility loci in polygenic neurological disorders. To illustrate the applicability of our workflow, we considered the multifactorial disorder migraine as a model to follow our step-by-step approach. We annotated the overlap of selected SNPs with regulatory elements and assessed their potential impact on gene expression based on publicly available prediction algorithms and functional genomics information. Some migraine risk loci have been hypothesised to reside in non-coding regions and to be implicated in the neurotransmission pathway. In this study, we used a set of 22 non-coding SNPs from neurotransmission and synaptic machinery-related genes previously suggested to be involved in migraine susceptibility based on our candidate gene association studies. After prioritising these SNPs, we focused on non-reported ones that demonstrated high regulatory potential: (1) VAMP2_rs1150 (3' UTR) was predicted as a target of hsa-mir-5010-3p miRNA, possibly disrupting its own gene expression; (2) STX1A_rs6951030 (proximal enhancer) may affect the binding affinity of zinc-finger transcription factors (namely ZNF423) and disturb TBL2 gene expression; and (3) SNAP25_rs2327264 (distal enhancer) expected to be in a binding site of ONECUT2 transcription factor. This study demonstrated the applicability of our practical workflow to facilitate the prioritisation of potentially relevant non-coding SNPs and predict their functional impact in multifactorial neurological diseases.

与复杂疾病相关的大多数 SNP 似乎都位于基因组的非编码区;然而,人们对这些 SNP 对基因表达和疾病表型的贡献仍然知之甚少。在此,我们建立了一个工作流程,以帮助确定候选基因的非编码 SNPs 作为多基因神经系统疾病易感位点的功能相关性。为了说明工作流程的适用性,我们将多因素疾病偏头痛作为一个模型,按照我们的方法逐步进行研究。我们注释了所选 SNP 与调控元件的重叠,并根据公开可用的预测算法和功能基因组学信息评估了它们对基因表达的潜在影响。一些偏头痛风险基因位点被假定位于非编码区,并与神经传递途径有关。在本研究中,我们使用了22个非编码SNPs,这些SNPs来自神经传递和突触机械相关基因,之前根据候选基因关联研究,这些基因被认为与偏头痛易感性有关。在对这些 SNP 进行优先排序后,我们重点研究了那些未报告的、具有高调控潜力的 SNP:(1)VAMP2_rs1150(3' UTR)被预测为 hsa-mir-5010-3p miRNA 的靶点,可能会干扰其自身基因的表达;(2)STX1A_rs6951030(近端增强子)可能会影响锌指转录因子(即 ZNF423)的结合亲和力,干扰 TBL2 基因的表达;(3)SNAP25_rs2327264(远端增强子)预计位于 ONECUT2 转录因子的结合位点。这项研究证明了我们的实用工作流程的适用性,它有助于对潜在相关的非编码 SNP 进行优先排序,并预测它们在多因素神经系统疾病中的功能影响。
{"title":"Integrating functional scoring and regulatory data to predict the effect of non-coding SNPs in a complex neurological disease.","authors":"Daniela Felício, Miguel Alves-Ferreira, Mariana Santos, Marlene Quintas, Alexandra M Lopes, Carolina Lemos, Nádia Pinto, Sandra Martins","doi":"10.1093/bfgp/elad020","DOIUrl":"10.1093/bfgp/elad020","url":null,"abstract":"<p><p>Most SNPs associated with complex diseases seem to lie in non-coding regions of the genome; however, their contribution to gene expression and disease phenotype remains poorly understood. Here, we established a workflow to provide assistance in prioritising the functional relevance of non-coding SNPs of candidate genes as susceptibility loci in polygenic neurological disorders. To illustrate the applicability of our workflow, we considered the multifactorial disorder migraine as a model to follow our step-by-step approach. We annotated the overlap of selected SNPs with regulatory elements and assessed their potential impact on gene expression based on publicly available prediction algorithms and functional genomics information. Some migraine risk loci have been hypothesised to reside in non-coding regions and to be implicated in the neurotransmission pathway. In this study, we used a set of 22 non-coding SNPs from neurotransmission and synaptic machinery-related genes previously suggested to be involved in migraine susceptibility based on our candidate gene association studies. After prioritising these SNPs, we focused on non-reported ones that demonstrated high regulatory potential: (1) VAMP2_rs1150 (3' UTR) was predicted as a target of hsa-mir-5010-3p miRNA, possibly disrupting its own gene expression; (2) STX1A_rs6951030 (proximal enhancer) may affect the binding affinity of zinc-finger transcription factors (namely ZNF423) and disturb TBL2 gene expression; and (3) SNAP25_rs2327264 (distal enhancer) expected to be in a binding site of ONECUT2 transcription factor. This study demonstrated the applicability of our practical workflow to facilitate the prioritisation of potentially relevant non-coding SNPs and predict their functional impact in multifactorial neurological diseases.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"138-149"},"PeriodicalIF":4.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9918600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Functional characteristics of DNA N6-methyladenine modification based on long-read sequencing in pancreatic cancer. 基于长线程测序的胰腺癌 DNA N6-甲基腺嘌呤修饰的功能特征
IF 4 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-03-20 DOI: 10.1093/bfgp/elad021
Dianshuang Zhou, Shiwei Guo, Yangyang Wang, Jiyun Zhao, Honghao Liu, Feiyang Zhou, Yan Huang, Yue Gu, Gang Jin, Yan Zhang

Abnormalities of DNA modifications are closely related to the pathogenesis and prognosis of pancreatic cancer. The development of third-generation sequencing technology has brought opportunities for the study of new epigenetic modification in cancer. Here, we screened the N6-methyladenine (6mA) and 5-methylcytosine (5mC) modification in pancreatic cancer based on Oxford Nanopore Technologies sequencing. The 6mA levels were lower compared with 5mC and upregulated in pancreatic cancer. We developed a novel method to define differentially methylated deficient region (DMDR), which overlapped 1319 protein-coding genes in pancreatic cancer. Genes screened by DMDRs were more significantly enriched in the cancer genes compared with the traditional differential methylation method (P < 0.001 versus P = 0.21, hypergeometric test). We then identified a survival-related signature based on DMDRs (DMDRSig) that stratified patients into high- and low-risk groups. Functional enrichment analysis indicated that 891 genes were closely related to alternative splicing. Multi-omics data from the cancer genome atlas showed that these genes were frequently altered in cancer samples. Survival analysis indicated that seven genes with high expression (ADAM9, ADAM10, EPS8, FAM83A, FAM111B, LAMA3 and TES) were significantly associated with poor prognosis. In addition, the distinction for pancreatic cancer subtypes was determined using 46 subtype-specific genes and unsupervised clustering. Overall, our study is the first to explore the molecular characteristics of 6mA modifications in pancreatic cancer, indicating that 6mA has the potential to be a target for future clinical treatment.

DNA 修饰异常与胰腺癌的发病机制和预后密切相关。第三代测序技术的发展为研究癌症中新的表观遗传修饰带来了机遇。在此,我们基于牛津纳米孔技术测序筛选了胰腺癌中的N6-甲基腺嘌呤(6mA)和5-甲基胞嘧啶(5mC)修饰。与 5mC 相比,6mA 水平较低,并且在胰腺癌中上调。我们开发了一种界定差异甲基化缺陷区(DMDR)的新方法,该方法与胰腺癌中的 1319 个蛋白编码基因重叠。与传统的差异甲基化方法相比,通过DMDR筛选出的基因在癌症基因中的富集程度更高(P
{"title":"Functional characteristics of DNA N6-methyladenine modification based on long-read sequencing in pancreatic cancer.","authors":"Dianshuang Zhou, Shiwei Guo, Yangyang Wang, Jiyun Zhao, Honghao Liu, Feiyang Zhou, Yan Huang, Yue Gu, Gang Jin, Yan Zhang","doi":"10.1093/bfgp/elad021","DOIUrl":"10.1093/bfgp/elad021","url":null,"abstract":"<p><p>Abnormalities of DNA modifications are closely related to the pathogenesis and prognosis of pancreatic cancer. The development of third-generation sequencing technology has brought opportunities for the study of new epigenetic modification in cancer. Here, we screened the N6-methyladenine (6mA) and 5-methylcytosine (5mC) modification in pancreatic cancer based on Oxford Nanopore Technologies sequencing. The 6mA levels were lower compared with 5mC and upregulated in pancreatic cancer. We developed a novel method to define differentially methylated deficient region (DMDR), which overlapped 1319 protein-coding genes in pancreatic cancer. Genes screened by DMDRs were more significantly enriched in the cancer genes compared with the traditional differential methylation method (P < 0.001 versus P = 0.21, hypergeometric test). We then identified a survival-related signature based on DMDRs (DMDRSig) that stratified patients into high- and low-risk groups. Functional enrichment analysis indicated that 891 genes were closely related to alternative splicing. Multi-omics data from the cancer genome atlas showed that these genes were frequently altered in cancer samples. Survival analysis indicated that seven genes with high expression (ADAM9, ADAM10, EPS8, FAM83A, FAM111B, LAMA3 and TES) were significantly associated with poor prognosis. In addition, the distinction for pancreatic cancer subtypes was determined using 46 subtype-specific genes and unsupervised clustering. Overall, our study is the first to explore the molecular characteristics of 6mA modifications in pancreatic cancer, indicating that 6mA has the potential to be a target for future clinical treatment.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"150-162"},"PeriodicalIF":4.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9588453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantifying transcriptome diversity: a review. 量化转录组多样性:综述。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-03-20 DOI: 10.1093/bfgp/elad019
Emma F Jones, Anisha Haldar, Vishal H Oza, Brittany N Lasseigne

Following the central dogma of molecular biology, gene expression heterogeneity can aid in predicting and explaining the wide variety of protein products, functions and, ultimately, heterogeneity in phenotypes. There is currently overlapping terminology used to describe the types of diversity in gene expression profiles, and overlooking these nuances can misrepresent important biological information. Here, we describe transcriptome diversity as a measure of the heterogeneity in (1) the expression of all genes within a sample or a single gene across samples in a population (gene-level diversity) or (2) the isoform-specific expression of a given gene (isoform-level diversity). We first overview modulators and quantification of transcriptome diversity at the gene level. Then, we discuss the role alternative splicing plays in driving transcript isoform-level diversity and how it can be quantified. Additionally, we overview computational resources for calculating gene-level and isoform-level diversity for high-throughput sequencing data. Finally, we discuss future applications of transcriptome diversity. This review provides a comprehensive overview of how gene expression diversity arises, and how measuring it determines a more complete picture of heterogeneity across proteins, cells, tissues, organisms and species.

根据分子生物学的核心教条,基因表达异质性有助于预测和解释各种蛋白质产物、功能以及最终的表型异质性。目前用于描述基因表达谱多样性类型的术语存在重叠,忽略这些细微差别可能会错误地反映重要的生物学信息。在此,我们将转录组多样性描述为衡量以下方面异质性的一种方法:(1)一个样本中所有基因的表达,或一个群体中不同样本中单个基因的表达(基因水平多样性),或(2)给定基因的同工酶特异性表达(同工酶水平多样性)。我们首先概述了基因水平转录组多样性的调节因子和量化方法。然后,我们将讨论替代剪接在推动转录本同工酶水平多样性方面所起的作用,以及如何对其进行量化。此外,我们还概述了计算高通量测序数据的基因水平和同工酶水平多样性的计算资源。最后,我们讨论了转录组多样性的未来应用。本综述全面概述了基因表达多样性是如何产生的,以及如何通过测量基因表达多样性来更全面地了解蛋白质、细胞、组织、生物体和物种之间的异质性。
{"title":"Quantifying transcriptome diversity: a review.","authors":"Emma F Jones, Anisha Haldar, Vishal H Oza, Brittany N Lasseigne","doi":"10.1093/bfgp/elad019","DOIUrl":"10.1093/bfgp/elad019","url":null,"abstract":"<p><p>Following the central dogma of molecular biology, gene expression heterogeneity can aid in predicting and explaining the wide variety of protein products, functions and, ultimately, heterogeneity in phenotypes. There is currently overlapping terminology used to describe the types of diversity in gene expression profiles, and overlooking these nuances can misrepresent important biological information. Here, we describe transcriptome diversity as a measure of the heterogeneity in (1) the expression of all genes within a sample or a single gene across samples in a population (gene-level diversity) or (2) the isoform-specific expression of a given gene (isoform-level diversity). We first overview modulators and quantification of transcriptome diversity at the gene level. Then, we discuss the role alternative splicing plays in driving transcript isoform-level diversity and how it can be quantified. Additionally, we overview computational resources for calculating gene-level and isoform-level diversity for high-throughput sequencing data. Finally, we discuss future applications of transcriptome diversity. This review provides a comprehensive overview of how gene expression diversity arises, and how measuring it determines a more complete picture of heterogeneity across proteins, cells, tissues, organisms and species.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"83-94"},"PeriodicalIF":2.5,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11484519/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10195229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NTpred: a robust and precise machine learning framework for in silico identification of Tyrosine nitration sites in protein sequences. NTpred:用于蛋白质序列中酪氨酸硝化位点硅学鉴定的强大而精确的机器学习框架。
IF 4 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-03-20 DOI: 10.1093/bfgp/elad018
Sourajyoti Datta, Muhammad Nabeel Asim, Andreas Dengel, Sheraz Ahmed

Post-translational modifications (PTMs) either enhance a protein's activity in various sub-cellular processes, or degrade their activity which leads toward failure of intracellular processes. Tyrosine nitration (NT) modification degrades protein's activity that initiates and propagates various diseases including neurodegenerative, cardiovascular, autoimmune diseases and carcinogenesis. Identification of NT modification supports development of novel therapies and drug discoveries for associated diseases. Identification of NT modification in biochemical labs is expensive, time consuming and error-prone. To supplement this process, several computational approaches have been proposed. However these approaches fail to precisely identify NT modification, due to the extraction of irrelevant, redundant and less discriminative features from protein sequences. This paper presents the NTpred framework that is competent in extracting comprehensive features from raw protein sequences using four different sequence encoders. To reap the benefits of different encoders, it generates four additional feature spaces by fusing different combinations of individual encodings. Furthermore, it eradicates irrelevant and redundant features from eight different feature spaces through a Recursive Feature Elimination process. Selected features of four individual encodings and four feature fusion vectors are used to train eight different Gradient Boosted Tree classifiers. The probability scores from the trained classifiers are utilized to generate a new probabilistic feature space, which is used to train a Logistic Regression classifier. On the BD1 benchmark dataset, the proposed framework outperforms the existing best-performing predictor in 5-fold cross validation and independent test evaluation with combined improvement of 13.7% in MCC and 20.1% in AUC. Similarly, on the BD2 benchmark dataset, the proposed framework outperforms the existing best-performing predictor with combined improvement of 5.3% in MCC and 1.0% in AUC. NTpred is publicly available for further experimentation and predictive use at: https://sds_genetic_analysis.opendfki.de/PredNTS/.

翻译后修饰(PTMs)可以增强蛋白质在各种亚细胞过程中的活性,也可以降低其活性,导致细胞内过程失效。酪氨酸硝化(NT)修饰会降低蛋白质的活性,从而引发和传播各种疾病,包括神经退行性疾病、心血管疾病、自身免疫性疾病和致癌疾病。NT修饰的鉴定有助于针对相关疾病开发新型疗法和药物。在生化实验室鉴定 NT 修饰既昂贵又耗时,而且容易出错。为了补充这一过程,人们提出了几种计算方法。然而,由于从蛋白质序列中提取了不相关的、冗余的和辨别力较低的特征,这些方法无法精确地识别 NT 修饰。本文介绍了 NTpred 框架,该框架能利用四种不同的序列编码器从原始蛋白质序列中提取综合特征。为了充分利用不同编码器的优势,它通过融合不同的编码组合生成了四个额外的特征空间。此外,它还通过递归特征消除过程,从八个不同的特征空间中消除无关和冗余特征。从四个单独编码和四个特征融合向量中选取的特征用于训练八个不同的梯度提升树分类器。训练好的分类器的概率分数被用来生成新的概率特征空间,并用于训练逻辑回归分类器。在 BD1 基准数据集上,所提出的框架在 5 倍交叉验证和独立测试评估中的表现优于现有表现最好的预测器,MCC 和 AUC 分别提高了 13.7% 和 20.1%。同样,在 BD2 基准数据集上,拟议框架的 MCC 和 AUC 分别提高了 5.3% 和 1.0%,优于现有表现最佳的预测器。NTpred 可在以下网址公开获取,供进一步实验和预测使用:https://sds_genetic_analysis.opendfki.de/PredNTS/。
{"title":"NTpred: a robust and precise machine learning framework for in silico identification of Tyrosine nitration sites in protein sequences.","authors":"Sourajyoti Datta, Muhammad Nabeel Asim, Andreas Dengel, Sheraz Ahmed","doi":"10.1093/bfgp/elad018","DOIUrl":"10.1093/bfgp/elad018","url":null,"abstract":"<p><p>Post-translational modifications (PTMs) either enhance a protein's activity in various sub-cellular processes, or degrade their activity which leads toward failure of intracellular processes. Tyrosine nitration (NT) modification degrades protein's activity that initiates and propagates various diseases including neurodegenerative, cardiovascular, autoimmune diseases and carcinogenesis. Identification of NT modification supports development of novel therapies and drug discoveries for associated diseases. Identification of NT modification in biochemical labs is expensive, time consuming and error-prone. To supplement this process, several computational approaches have been proposed. However these approaches fail to precisely identify NT modification, due to the extraction of irrelevant, redundant and less discriminative features from protein sequences. This paper presents the NTpred framework that is competent in extracting comprehensive features from raw protein sequences using four different sequence encoders. To reap the benefits of different encoders, it generates four additional feature spaces by fusing different combinations of individual encodings. Furthermore, it eradicates irrelevant and redundant features from eight different feature spaces through a Recursive Feature Elimination process. Selected features of four individual encodings and four feature fusion vectors are used to train eight different Gradient Boosted Tree classifiers. The probability scores from the trained classifiers are utilized to generate a new probabilistic feature space, which is used to train a Logistic Regression classifier. On the BD1 benchmark dataset, the proposed framework outperforms the existing best-performing predictor in 5-fold cross validation and independent test evaluation with combined improvement of 13.7% in MCC and 20.1% in AUC. Similarly, on the BD2 benchmark dataset, the proposed framework outperforms the existing best-performing predictor with combined improvement of 5.3% in MCC and 1.0% in AUC. NTpred is publicly available for further experimentation and predictive use at: https://sds_genetic_analysis.opendfki.de/PredNTS/.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"163-179"},"PeriodicalIF":4.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9544857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating single-cell RNA sequencing data to genome-wide association analysis data identifies significant cell types in influenza A virus infection and COVID-19. 将单细胞 RNA 测序数据与全基因组关联分析数据相结合,确定了甲型流感病毒感染和 COVID-19 的重要细胞类型。
IF 4 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-03-20 DOI: 10.1093/bfgp/elad025
Yixin Zou, Xifang Sun, Yifan Wang, Yidi Wang, Xiangyu Ye, Junlan Tu, Rongbin Yu, Peng Huang

With the global pandemic of COVID-19, the research on influenza virus has entered a new stage, but it is difficult to elucidate the pathogenesis of influenza disease. Genome-wide association studies (GWASs) have greatly shed light on the role of host genetic background in influenza pathogenesis and prognosis, whereas single-cell RNA sequencing (scRNA-seq) has enabled unprecedented resolution of cellular diversity and in vivo following influenza disease. Here, we performed a comprehensive analysis of influenza GWAS and scRNA-seq data to reveal cell types associated with influenza disease and provide clues to understanding pathogenesis. We downloaded two GWAS summary data, two scRNA-seq data on influenza disease. After defining cell types for each scRNA-seq data, we used RolyPoly and LDSC-cts to integrate GWAS and scRNA-seq. Furthermore, we analyzed scRNA-seq data from the peripheral blood mononuclear cells (PBMCs) of a healthy population to validate and compare our results. After processing the scRNA-seq data, we obtained approximately 70 000 cells and identified up to 13 cell types. For the European population analysis, we determined an association between neutrophils and influenza disease. For the East Asian population analysis, we identified an association between monocytes and influenza disease. In addition, we also identified monocytes as a significantly related cell type in a dataset of healthy human PBMCs. In this comprehensive analysis, we identified neutrophils and monocytes as influenza disease-associated cell types. More attention and validation should be given in future studies.

随着 COVID-19 在全球的大流行,流感病毒的研究进入了一个新的阶段,但流感发病机制的阐明却困难重重。全基因组关联研究(GWAS)极大地揭示了宿主遗传背景在流感发病和预后中的作用,而单细胞 RNA 测序(scRNA-seq)则实现了对流感发病后细胞多样性和体内情况的前所未有的解析。在此,我们对流感 GWAS 和 scRNA-seq 数据进行了全面分析,以揭示与流感疾病相关的细胞类型,为了解发病机制提供线索。我们下载了两份 GWAS 总结数据和两份有关流感疾病的 scRNA-seq 数据。为每个 scRNA-seq 数据定义细胞类型后,我们使用 RolyPoly 和 LDSC-cts 整合了 GWAS 和 scRNA-seq 数据。此外,我们还分析了健康人群外周血单核细胞(PBMC)的 scRNA-seq 数据,以验证和比较我们的结果。在处理了 scRNA-seq 数据后,我们获得了约 7 万个细胞,并确定了多达 13 种细胞类型。在欧洲人群分析中,我们确定了中性粒细胞与流感疾病之间的关联。在东亚人群分析中,我们确定了单核细胞与流感疾病之间的关联。此外,我们还在一个健康人类 PBMC 数据集中发现单核细胞是一种明显相关的细胞类型。在这项综合分析中,我们发现中性粒细胞和单核细胞是与流感疾病相关的细胞类型。今后的研究应给予更多关注和验证。
{"title":"Integrating single-cell RNA sequencing data to genome-wide association analysis data identifies significant cell types in influenza A virus infection and COVID-19.","authors":"Yixin Zou, Xifang Sun, Yifan Wang, Yidi Wang, Xiangyu Ye, Junlan Tu, Rongbin Yu, Peng Huang","doi":"10.1093/bfgp/elad025","DOIUrl":"10.1093/bfgp/elad025","url":null,"abstract":"<p><p>With the global pandemic of COVID-19, the research on influenza virus has entered a new stage, but it is difficult to elucidate the pathogenesis of influenza disease. Genome-wide association studies (GWASs) have greatly shed light on the role of host genetic background in influenza pathogenesis and prognosis, whereas single-cell RNA sequencing (scRNA-seq) has enabled unprecedented resolution of cellular diversity and in vivo following influenza disease. Here, we performed a comprehensive analysis of influenza GWAS and scRNA-seq data to reveal cell types associated with influenza disease and provide clues to understanding pathogenesis. We downloaded two GWAS summary data, two scRNA-seq data on influenza disease. After defining cell types for each scRNA-seq data, we used RolyPoly and LDSC-cts to integrate GWAS and scRNA-seq. Furthermore, we analyzed scRNA-seq data from the peripheral blood mononuclear cells (PBMCs) of a healthy population to validate and compare our results. After processing the scRNA-seq data, we obtained approximately 70 000 cells and identified up to 13 cell types. For the European population analysis, we determined an association between neutrophils and influenza disease. For the East Asian population analysis, we identified an association between monocytes and influenza disease. In addition, we also identified monocytes as a significantly related cell type in a dataset of healthy human PBMCs. In this comprehensive analysis, we identified neutrophils and monocytes as influenza disease-associated cell types. More attention and validation should be given in future studies.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"110-117"},"PeriodicalIF":4.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9669193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genomic islands and their role in fitness traits of two key sepsis-causing bacterial pathogens. 两种主要败血症致病细菌的基因组岛及其在适应性特征中的作用。
IF 4 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-01-18 DOI: 10.1093/bfgp/elac051
Mohd Ilyas, Dyuti Purkait, Krishnamohan Atmakuri

To survive and establish a niche for themselves, bacteria constantly evolve. Toward that, they not only insert point mutations and promote illegitimate recombinations within their genomes but also insert pieces of 'foreign' deoxyribonucleic acid, which are commonly referred to as 'genomic islands' (GEIs). The GEIs come in several forms, structures and types, often providing a fitness advantage to the harboring bacterium. In pathogenic bacteria, some GEIs may enhance virulence, thus altering disease burden, morbidity and mortality. Hence, delineating (i) the GEIs framework, (ii) their encoded functions, (iii) the triggers that help them move, (iv) the mechanisms they exploit to move among bacteria and (v) identification of their natural reservoirs will aid in superior tackling of several bacterial diseases, including sepsis. Given the vast array of comparative genomics data, in this short review, we provide an overview of the GEIs, their types and the compositions therein, especially highlighting GEIs harbored by two important pathogens, viz. Acinetobacter baumannii and Klebsiella pneumoniae, which prominently trigger sepsis in low- and middle-income countries. Our efforts help shed some light on the challenges these pathogens pose when equipped with GEIs. We hope that this review will provoke intense research into understanding GEIs, the cues that drive their mobility across bacteria and the ways and means to prevent their transfer, especially across pathogenic bacteria.

为了生存和建立自己的生存空间,细菌不断进化。为此,它们不仅在基因组中插入点突变和促进非法重组,而且还插入 "外来 "脱氧核糖核酸片段,即通常所说的 "基因组岛"(GEIs)。基因组岛有多种形式、结构和类型,通常会给携带细菌带来健康优势。在致病细菌中,一些 GEIs 可能会增强毒性,从而改变疾病负担、发病率和死亡率。因此,划定 (i) GEIs 框架,(ii) 它们的编码功能,(iii) 帮助它们移动的触发器,(iv) 它们在细菌间移动的机制,以及 (v) 识别它们的天然储库,将有助于更好地应对包括败血症在内的多种细菌性疾病。鉴于比较基因组学数据数量庞大,在这篇简短的综述中,我们将概述 GEIs 及其类型和组成,特别强调两种重要病原体(即鲍曼不动杆菌和肺炎克雷伯菌)所携带的 GEIs,这两种病原体在低收入和中等收入国家是败血症的主要诱因。我们的努力有助于揭示这些病原体在配备 GEIs 时所带来的挑战。我们希望这篇综述能引发人们深入研究,了解 GEIs、驱动它们在细菌间移动的线索以及防止它们转移(尤其是在致病菌间转移)的方式方法。
{"title":"Genomic islands and their role in fitness traits of two key sepsis-causing bacterial pathogens.","authors":"Mohd Ilyas, Dyuti Purkait, Krishnamohan Atmakuri","doi":"10.1093/bfgp/elac051","DOIUrl":"10.1093/bfgp/elac051","url":null,"abstract":"<p><p>To survive and establish a niche for themselves, bacteria constantly evolve. Toward that, they not only insert point mutations and promote illegitimate recombinations within their genomes but also insert pieces of 'foreign' deoxyribonucleic acid, which are commonly referred to as 'genomic islands' (GEIs). The GEIs come in several forms, structures and types, often providing a fitness advantage to the harboring bacterium. In pathogenic bacteria, some GEIs may enhance virulence, thus altering disease burden, morbidity and mortality. Hence, delineating (i) the GEIs framework, (ii) their encoded functions, (iii) the triggers that help them move, (iv) the mechanisms they exploit to move among bacteria and (v) identification of their natural reservoirs will aid in superior tackling of several bacterial diseases, including sepsis. Given the vast array of comparative genomics data, in this short review, we provide an overview of the GEIs, their types and the compositions therein, especially highlighting GEIs harbored by two important pathogens, viz. Acinetobacter baumannii and Klebsiella pneumoniae, which prominently trigger sepsis in low- and middle-income countries. Our efforts help shed some light on the challenges these pathogens pose when equipped with GEIs. We hope that this review will provoke intense research into understanding GEIs, the cues that drive their mobility across bacteria and the ways and means to prevent their transfer, especially across pathogenic bacteria.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"55-68"},"PeriodicalIF":4.0,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10364958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Significance of understanding the genomics of host-pathogen interaction in limiting antibiotic resistance development: lessons from COVID-19 pandemic. 了解宿主与病原体相互作用的基因组学对限制抗生素耐药性发展的意义:从 COVID-19 大流行中汲取的教训。
IF 4 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-01-18 DOI: 10.1093/bfgp/elad001
Vikas Yadav, Srividhya Ravichandran

The entire world is facing the stiff challenge of COVID-19 pandemic. To overcome the spread of this highly infectious disease, several short-sighted strategies were adopted such as the use of broad-spectrum antibiotics and antifungals. However, the misuse and/or overuse of antibiotics have accentuated the emergence of the next pandemic: antimicrobial resistance (AMR). It is believed that pathogens while transferring between humans and the environment carry virulence and antibiotic-resistant factors from varied species. It is presumed that all such genetic factors are quantifiable and predictable, a better understanding of which could be a limiting step for the progression of AMR. Herein, we have reviewed how genomics-based understanding of host-pathogen interactions during COVID-19 could reduce the non-judicial use of antibiotics and prevent the eruption of an AMR-based pandemic in future.

全世界都面临着 COVID-19 大流行的严峻挑战。为了遏制这种高度传染性疾病的传播,人们采取了一些短视的策略,如使用广谱抗生素和抗真菌药物。然而,抗生素的滥用和/或过度使用加剧了下一个流行病的出现:抗菌药耐药性(AMR)。据认为,病原体在人类和环境之间传播时,会携带来自不同物种的毒性和抗生素耐药性因子。据推测,所有这些遗传因子都是可以量化和预测的,更好地了解这些遗传因子可能会限制 AMR 的发展。在此,我们回顾了基于基因组学对 COVID-19 期间宿主与病原体相互作用的理解如何减少抗生素的非司法使用并防止未来爆发基于 AMR 的大流行。
{"title":"Significance of understanding the genomics of host-pathogen interaction in limiting antibiotic resistance development: lessons from COVID-19 pandemic.","authors":"Vikas Yadav, Srividhya Ravichandran","doi":"10.1093/bfgp/elad001","DOIUrl":"10.1093/bfgp/elad001","url":null,"abstract":"<p><p>The entire world is facing the stiff challenge of COVID-19 pandemic. To overcome the spread of this highly infectious disease, several short-sighted strategies were adopted such as the use of broad-spectrum antibiotics and antifungals. However, the misuse and/or overuse of antibiotics have accentuated the emergence of the next pandemic: antimicrobial resistance (AMR). It is believed that pathogens while transferring between humans and the environment carry virulence and antibiotic-resistant factors from varied species. It is presumed that all such genetic factors are quantifiable and predictable, a better understanding of which could be a limiting step for the progression of AMR. Herein, we have reviewed how genomics-based understanding of host-pathogen interactions during COVID-19 could reduce the non-judicial use of antibiotics and prevent the eruption of an AMR-based pandemic in future.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"69-74"},"PeriodicalIF":4.0,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10593381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Respiratory tract infection: an unfamiliar risk factor in high-altitude pulmonary edema. 呼吸道感染:高海拔肺水肿的一个陌生风险因素。
IF 4 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-01-18 DOI: 10.1093/bfgp/elac048
Raushni Choudhary, Swati Kumari, Manzoor Ali, Tashi Thinlas, Stanzen Rabyang, Aastha Mishra

The dramatic changes in physiology at high altitude (HA) as a result of the characteristic hypobaric hypoxia condition can modify innate and adaptive defense mechanisms of the body. As a consequence, few sojourners visiting HA with mild or asymptomatic infection may have an enhanced susceptibility to high-altitude pulmonary edema (HAPE), an acute but severe altitude sickness. It develops upon rapid ascent to altitudes above 2500 m, in otherwise healthy individuals. Though HAPE has been studied extensively, an elaborate exploration of the HA disease burden and the potential risk factors associated with its manifestation are poorly described. The present review discusses respiratory tract infection (RTI) as an unfamiliar but important risk factor in enhancing HAPE susceptibility in sojourners for two primary reasons. First, the symptoms of RTI s resemble those of HAPE. Secondly, the imbalanced pathways contributing to vascular dysfunction in HAPE also participate in the pathogenesis of the infectious processes. These pathways have a crucial role in shaping host response against viral and bacterial infections and may further worsen the clinical outcomes at HA. Respiratory tract pathogenic agents, if screened in HAPE patients, can help in ascertaining their role in disease risk and also point toward their association with the disease severity. The microbial screenings and identifications of pathogens with diseases are the foundation for describing potential molecular mechanisms underlying host response to the microbial challenge. The prior knowledge of such infections may predict the manifestation of disease etiology and provide better therapeutic options.

高海拔地区(HA)特有的低压缺氧条件会导致生理机能发生巨大变化,从而改变人体的先天和适应性防御机制。因此,少数轻度感染或无症状的高海拔地区旅行者可能更容易患上高海拔肺水肿(HAPE),这是一种急性但严重的高原病。高海拔肺水肿是一种急性但严重的高原反应,当快速上升到海拔 2500 米以上时,原本健康的人也会出现高海拔肺水肿。尽管对高山肺水肿的研究已经非常广泛,但对高山肺水肿的疾病负担以及与其表现相关的潜在风险因素的详细探讨却很少。本综述将呼吸道感染(RTI)作为一个陌生但重要的风险因素来讨论,认为它可增加旅居者对高原反应性肺炎的易感性,主要原因有两个。首先,RTI 的症状与 HAPE 相似。其次,导致 HAPE 中血管功能障碍的失衡途径也参与了感染过程的发病机制。这些途径在形成宿主对病毒和细菌感染的反应方面起着至关重要的作用,并可能进一步恶化 HAPE 的临床结果。如果对 HAPE 患者的呼吸道致病因子进行筛查,将有助于确定它们在疾病风险中的作用,并指出它们与疾病严重程度的关联。微生物筛查和与疾病有关的病原体鉴定是描述宿主应对微生物挑战的潜在分子机制的基础。对此类感染的预先了解可以预测疾病的病因表现,并提供更好的治疗方案。
{"title":"Respiratory tract infection: an unfamiliar risk factor in high-altitude pulmonary edema.","authors":"Raushni Choudhary, Swati Kumari, Manzoor Ali, Tashi Thinlas, Stanzen Rabyang, Aastha Mishra","doi":"10.1093/bfgp/elac048","DOIUrl":"10.1093/bfgp/elac048","url":null,"abstract":"<p><p>The dramatic changes in physiology at high altitude (HA) as a result of the characteristic hypobaric hypoxia condition can modify innate and adaptive defense mechanisms of the body. As a consequence, few sojourners visiting HA with mild or asymptomatic infection may have an enhanced susceptibility to high-altitude pulmonary edema (HAPE), an acute but severe altitude sickness. It develops upon rapid ascent to altitudes above 2500 m, in otherwise healthy individuals. Though HAPE has been studied extensively, an elaborate exploration of the HA disease burden and the potential risk factors associated with its manifestation are poorly described. The present review discusses respiratory tract infection (RTI) as an unfamiliar but important risk factor in enhancing HAPE susceptibility in sojourners for two primary reasons. First, the symptoms of RTI s resemble those of HAPE. Secondly, the imbalanced pathways contributing to vascular dysfunction in HAPE also participate in the pathogenesis of the infectious processes. These pathways have a crucial role in shaping host response against viral and bacterial infections and may further worsen the clinical outcomes at HA. Respiratory tract pathogenic agents, if screened in HAPE patients, can help in ascertaining their role in disease risk and also point toward their association with the disease severity. The microbial screenings and identifications of pathogens with diseases are the foundation for describing potential molecular mechanisms underlying host response to the microbial challenge. The prior knowledge of such infections may predict the manifestation of disease etiology and provide better therapeutic options.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"38-45"},"PeriodicalIF":4.0,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10364149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Experimental and computational methods for studying the dynamics of RNA-RNA interactions in SARS-COV2 genomes. 研究 SARS-COV2 基因组中 RNA-RNA 相互作用动态的实验和计算方法。
IF 4 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-01-18 DOI: 10.1093/bfgp/elac050
Mansi Srivastava, Matthew R Dukeshire, Quoseena Mir, Okiemute Beatrice Omoru, Amirhossein Manzourolajdad, Sarath Chandra Janga

Long-range ribonucleic acid (RNA)-RNA interactions (RRI) are prevalent in positive-strand RNA viruses, including Beta-coronaviruses, and these take part in regulatory roles, including the regulation of sub-genomic RNA production rates. Crosslinking of interacting RNAs and short read-based deep sequencing of resulting RNA-RNA hybrids have shown that these long-range structures exist in severe acute respiratory syndrome coronavirus (SARS-CoV)-2 on both genomic and sub-genomic levels and in dynamic topologies. Furthermore, co-evolution of coronaviruses with their hosts is navigated by genetic variations made possible by its large genome, high recombination frequency and a high mutation rate. SARS-CoV-2's mutations are known to occur spontaneously during replication, and thousands of aggregate mutations have been reported since the emergence of the virus. Although many long-range RRIs have been experimentally identified using high-throughput methods for the wild-type SARS-CoV-2 strain, evolutionary trajectory of these RRIs across variants, impact of mutations on RRIs and interaction of SARS-CoV-2 RNAs with the host have been largely open questions in the field. In this review, we summarize recent computational tools and experimental methods that have been enabling the mapping of RRIs in viral genomes, with a specific focus on SARS-CoV-2. We also present available informatics resources to navigate the RRI maps and shed light on the impact of mutations on the RRI space in viral genomes. Investigating the evolution of long-range RNA interactions and that of virus-host interactions can contribute to the understanding of new and emerging variants as well as aid in developing improved RNA therapeutics critical for combating future outbreaks.

长程核糖核酸(RNA)-RNA 相互作用(RRI)在正链 RNA 病毒(包括 Beta-冠状病毒)中非常普遍,这些作用参与了调控作用,包括对亚基因组 RNA 生成率的调控。对相互作用的 RNA 进行交联以及对由此产生的 RNA-RNA 杂交体进行基于短读的深度测序表明,严重急性呼吸系统综合征冠状病毒(SARS-CoV)-2 在基因组和亚基因组水平上以及在动态拓扑结构中都存在这些长程结构。此外,冠状病毒与宿主的共同进化是由其庞大的基因组、高重组频率和高突变率所带来的遗传变异所引导的。据了解,SARS-CoV-2 的变异是在复制过程中自发发生的,自病毒出现以来,已报道了数千次聚合变异。尽管已经利用高通量方法在野生型 SARS-CoV-2 株系中实验性地发现了许多长程 RRI,但这些 RRI 在不同变异株中的进化轨迹、突变对 RRI 的影响以及 SARS-CoV-2 RNA 与宿主的相互作用在很大程度上仍是该领域的未决问题。在这篇综述中,我们总结了最近在病毒基因组中绘制 RRIs 图谱的计算工具和实验方法,并特别关注 SARS-CoV-2 病毒。我们还介绍了可用来浏览 RRI 图谱的信息学资源,并阐明了突变对病毒基因组中 RRI 空间的影响。研究长程 RNA 相互作用和病毒与宿主相互作用的演变,有助于了解新出现的变种,并有助于开发对抗击未来疫情至关重要的改良 RNA 疗法。
{"title":"Experimental and computational methods for studying the dynamics of RNA-RNA interactions in SARS-COV2 genomes.","authors":"Mansi Srivastava, Matthew R Dukeshire, Quoseena Mir, Okiemute Beatrice Omoru, Amirhossein Manzourolajdad, Sarath Chandra Janga","doi":"10.1093/bfgp/elac050","DOIUrl":"10.1093/bfgp/elac050","url":null,"abstract":"<p><p>Long-range ribonucleic acid (RNA)-RNA interactions (RRI) are prevalent in positive-strand RNA viruses, including Beta-coronaviruses, and these take part in regulatory roles, including the regulation of sub-genomic RNA production rates. Crosslinking of interacting RNAs and short read-based deep sequencing of resulting RNA-RNA hybrids have shown that these long-range structures exist in severe acute respiratory syndrome coronavirus (SARS-CoV)-2 on both genomic and sub-genomic levels and in dynamic topologies. Furthermore, co-evolution of coronaviruses with their hosts is navigated by genetic variations made possible by its large genome, high recombination frequency and a high mutation rate. SARS-CoV-2's mutations are known to occur spontaneously during replication, and thousands of aggregate mutations have been reported since the emergence of the virus. Although many long-range RRIs have been experimentally identified using high-throughput methods for the wild-type SARS-CoV-2 strain, evolutionary trajectory of these RRIs across variants, impact of mutations on RRIs and interaction of SARS-CoV-2 RNAs with the host have been largely open questions in the field. In this review, we summarize recent computational tools and experimental methods that have been enabling the mapping of RRIs in viral genomes, with a specific focus on SARS-CoV-2. We also present available informatics resources to navigate the RRI maps and shed light on the impact of mutations on the RRI space in viral genomes. Investigating the evolution of long-range RNA interactions and that of virus-host interactions can contribute to the understanding of new and emerging variants as well as aid in developing improved RNA therapeutics critical for combating future outbreaks.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"46-54"},"PeriodicalIF":4.0,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10799312/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10666297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Briefings in Functional Genomics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1