Xiya Guo, Jin Ning, Yuanze Chen, Guoliang Liu, Liyan Zhao, Yue Fan, Shiquan Sun
Differential expression (DE) analysis is a necessary step in the analysis of single-cell RNA sequencing (scRNA-seq) and spatially resolved transcriptomics (SRT) data. Unlike traditional bulk RNA-seq, DE analysis for scRNA-seq or SRT data has unique characteristics that may contribute to the difficulty of detecting DE genes. However, the plethora of DE tools that work with various assumptions makes it difficult to choose an appropriate one. Furthermore, a comprehensive review on detecting DE genes for scRNA-seq data or SRT data from multi-condition, multi-sample experimental designs is lacking. To bridge such a gap, here, we first focus on the challenges of DE detection, then highlight potential opportunities that facilitate further progress in scRNA-seq or SRT analysis, and finally provide insights and guidance in selecting appropriate DE tools or developing new computational DE methods.
差异表达(DE)分析是分析单细胞 RNA 测序(scRNA-seq)和空间分辨转录组学(SRT)数据的必要步骤。与传统的大容量 RNA-seq 不同,scRNA-seq 或 SRT 数据的差异表达分析具有独特的特点,可能导致难以检测到差异表达基因。然而,由于有大量的 DE 工具可在各种假设条件下工作,因此很难选择合适的工具。此外,关于从多条件、多样本实验设计中检测scRNA-seq数据或SRT数据中的DE基因,目前还缺乏全面的综述。为了弥补这一空白,我们在此首先关注 DE 检测所面临的挑战,然后强调促进 scRNA-seq 或 SRT 分析进一步发展的潜在机遇,最后为选择合适的 DE 工具或开发新的计算 DE 方法提供见解和指导。
{"title":"Recent advances in differential expression analysis for single-cell RNA-seq and spatially resolved transcriptomic studies.","authors":"Xiya Guo, Jin Ning, Yuanze Chen, Guoliang Liu, Liyan Zhao, Yue Fan, Shiquan Sun","doi":"10.1093/bfgp/elad011","DOIUrl":"10.1093/bfgp/elad011","url":null,"abstract":"<p><p>Differential expression (DE) analysis is a necessary step in the analysis of single-cell RNA sequencing (scRNA-seq) and spatially resolved transcriptomics (SRT) data. Unlike traditional bulk RNA-seq, DE analysis for scRNA-seq or SRT data has unique characteristics that may contribute to the difficulty of detecting DE genes. However, the plethora of DE tools that work with various assumptions makes it difficult to choose an appropriate one. Furthermore, a comprehensive review on detecting DE genes for scRNA-seq data or SRT data from multi-condition, multi-sample experimental designs is lacking. To bridge such a gap, here, we first focus on the challenges of DE detection, then highlight potential opportunities that facilitate further progress in scRNA-seq or SRT analysis, and finally provide insights and guidance in selecting appropriate DE tools or developing new computational DE methods.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"95-109"},"PeriodicalIF":4.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9258877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Daniela Felício, Miguel Alves-Ferreira, Mariana Santos, Marlene Quintas, Alexandra M Lopes, Carolina Lemos, Nádia Pinto, Sandra Martins
Most SNPs associated with complex diseases seem to lie in non-coding regions of the genome; however, their contribution to gene expression and disease phenotype remains poorly understood. Here, we established a workflow to provide assistance in prioritising the functional relevance of non-coding SNPs of candidate genes as susceptibility loci in polygenic neurological disorders. To illustrate the applicability of our workflow, we considered the multifactorial disorder migraine as a model to follow our step-by-step approach. We annotated the overlap of selected SNPs with regulatory elements and assessed their potential impact on gene expression based on publicly available prediction algorithms and functional genomics information. Some migraine risk loci have been hypothesised to reside in non-coding regions and to be implicated in the neurotransmission pathway. In this study, we used a set of 22 non-coding SNPs from neurotransmission and synaptic machinery-related genes previously suggested to be involved in migraine susceptibility based on our candidate gene association studies. After prioritising these SNPs, we focused on non-reported ones that demonstrated high regulatory potential: (1) VAMP2_rs1150 (3' UTR) was predicted as a target of hsa-mir-5010-3p miRNA, possibly disrupting its own gene expression; (2) STX1A_rs6951030 (proximal enhancer) may affect the binding affinity of zinc-finger transcription factors (namely ZNF423) and disturb TBL2 gene expression; and (3) SNAP25_rs2327264 (distal enhancer) expected to be in a binding site of ONECUT2 transcription factor. This study demonstrated the applicability of our practical workflow to facilitate the prioritisation of potentially relevant non-coding SNPs and predict their functional impact in multifactorial neurological diseases.
与复杂疾病相关的大多数 SNP 似乎都位于基因组的非编码区;然而,人们对这些 SNP 对基因表达和疾病表型的贡献仍然知之甚少。在此,我们建立了一个工作流程,以帮助确定候选基因的非编码 SNPs 作为多基因神经系统疾病易感位点的功能相关性。为了说明工作流程的适用性,我们将多因素疾病偏头痛作为一个模型,按照我们的方法逐步进行研究。我们注释了所选 SNP 与调控元件的重叠,并根据公开可用的预测算法和功能基因组学信息评估了它们对基因表达的潜在影响。一些偏头痛风险基因位点被假定位于非编码区,并与神经传递途径有关。在本研究中,我们使用了22个非编码SNPs,这些SNPs来自神经传递和突触机械相关基因,之前根据候选基因关联研究,这些基因被认为与偏头痛易感性有关。在对这些 SNP 进行优先排序后,我们重点研究了那些未报告的、具有高调控潜力的 SNP:(1)VAMP2_rs1150(3' UTR)被预测为 hsa-mir-5010-3p miRNA 的靶点,可能会干扰其自身基因的表达;(2)STX1A_rs6951030(近端增强子)可能会影响锌指转录因子(即 ZNF423)的结合亲和力,干扰 TBL2 基因的表达;(3)SNAP25_rs2327264(远端增强子)预计位于 ONECUT2 转录因子的结合位点。这项研究证明了我们的实用工作流程的适用性,它有助于对潜在相关的非编码 SNP 进行优先排序,并预测它们在多因素神经系统疾病中的功能影响。
{"title":"Integrating functional scoring and regulatory data to predict the effect of non-coding SNPs in a complex neurological disease.","authors":"Daniela Felício, Miguel Alves-Ferreira, Mariana Santos, Marlene Quintas, Alexandra M Lopes, Carolina Lemos, Nádia Pinto, Sandra Martins","doi":"10.1093/bfgp/elad020","DOIUrl":"10.1093/bfgp/elad020","url":null,"abstract":"<p><p>Most SNPs associated with complex diseases seem to lie in non-coding regions of the genome; however, their contribution to gene expression and disease phenotype remains poorly understood. Here, we established a workflow to provide assistance in prioritising the functional relevance of non-coding SNPs of candidate genes as susceptibility loci in polygenic neurological disorders. To illustrate the applicability of our workflow, we considered the multifactorial disorder migraine as a model to follow our step-by-step approach. We annotated the overlap of selected SNPs with regulatory elements and assessed their potential impact on gene expression based on publicly available prediction algorithms and functional genomics information. Some migraine risk loci have been hypothesised to reside in non-coding regions and to be implicated in the neurotransmission pathway. In this study, we used a set of 22 non-coding SNPs from neurotransmission and synaptic machinery-related genes previously suggested to be involved in migraine susceptibility based on our candidate gene association studies. After prioritising these SNPs, we focused on non-reported ones that demonstrated high regulatory potential: (1) VAMP2_rs1150 (3' UTR) was predicted as a target of hsa-mir-5010-3p miRNA, possibly disrupting its own gene expression; (2) STX1A_rs6951030 (proximal enhancer) may affect the binding affinity of zinc-finger transcription factors (namely ZNF423) and disturb TBL2 gene expression; and (3) SNAP25_rs2327264 (distal enhancer) expected to be in a binding site of ONECUT2 transcription factor. This study demonstrated the applicability of our practical workflow to facilitate the prioritisation of potentially relevant non-coding SNPs and predict their functional impact in multifactorial neurological diseases.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"138-149"},"PeriodicalIF":4.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9918600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dianshuang Zhou, Shiwei Guo, Yangyang Wang, Jiyun Zhao, Honghao Liu, Feiyang Zhou, Yan Huang, Yue Gu, Gang Jin, Yan Zhang
Abnormalities of DNA modifications are closely related to the pathogenesis and prognosis of pancreatic cancer. The development of third-generation sequencing technology has brought opportunities for the study of new epigenetic modification in cancer. Here, we screened the N6-methyladenine (6mA) and 5-methylcytosine (5mC) modification in pancreatic cancer based on Oxford Nanopore Technologies sequencing. The 6mA levels were lower compared with 5mC and upregulated in pancreatic cancer. We developed a novel method to define differentially methylated deficient region (DMDR), which overlapped 1319 protein-coding genes in pancreatic cancer. Genes screened by DMDRs were more significantly enriched in the cancer genes compared with the traditional differential methylation method (P < 0.001 versus P = 0.21, hypergeometric test). We then identified a survival-related signature based on DMDRs (DMDRSig) that stratified patients into high- and low-risk groups. Functional enrichment analysis indicated that 891 genes were closely related to alternative splicing. Multi-omics data from the cancer genome atlas showed that these genes were frequently altered in cancer samples. Survival analysis indicated that seven genes with high expression (ADAM9, ADAM10, EPS8, FAM83A, FAM111B, LAMA3 and TES) were significantly associated with poor prognosis. In addition, the distinction for pancreatic cancer subtypes was determined using 46 subtype-specific genes and unsupervised clustering. Overall, our study is the first to explore the molecular characteristics of 6mA modifications in pancreatic cancer, indicating that 6mA has the potential to be a target for future clinical treatment.
DNA 修饰异常与胰腺癌的发病机制和预后密切相关。第三代测序技术的发展为研究癌症中新的表观遗传修饰带来了机遇。在此,我们基于牛津纳米孔技术测序筛选了胰腺癌中的N6-甲基腺嘌呤(6mA)和5-甲基胞嘧啶(5mC)修饰。与 5mC 相比,6mA 水平较低,并且在胰腺癌中上调。我们开发了一种界定差异甲基化缺陷区(DMDR)的新方法,该方法与胰腺癌中的 1319 个蛋白编码基因重叠。与传统的差异甲基化方法相比,通过DMDR筛选出的基因在癌症基因中的富集程度更高(P
{"title":"Functional characteristics of DNA N6-methyladenine modification based on long-read sequencing in pancreatic cancer.","authors":"Dianshuang Zhou, Shiwei Guo, Yangyang Wang, Jiyun Zhao, Honghao Liu, Feiyang Zhou, Yan Huang, Yue Gu, Gang Jin, Yan Zhang","doi":"10.1093/bfgp/elad021","DOIUrl":"10.1093/bfgp/elad021","url":null,"abstract":"<p><p>Abnormalities of DNA modifications are closely related to the pathogenesis and prognosis of pancreatic cancer. The development of third-generation sequencing technology has brought opportunities for the study of new epigenetic modification in cancer. Here, we screened the N6-methyladenine (6mA) and 5-methylcytosine (5mC) modification in pancreatic cancer based on Oxford Nanopore Technologies sequencing. The 6mA levels were lower compared with 5mC and upregulated in pancreatic cancer. We developed a novel method to define differentially methylated deficient region (DMDR), which overlapped 1319 protein-coding genes in pancreatic cancer. Genes screened by DMDRs were more significantly enriched in the cancer genes compared with the traditional differential methylation method (P < 0.001 versus P = 0.21, hypergeometric test). We then identified a survival-related signature based on DMDRs (DMDRSig) that stratified patients into high- and low-risk groups. Functional enrichment analysis indicated that 891 genes were closely related to alternative splicing. Multi-omics data from the cancer genome atlas showed that these genes were frequently altered in cancer samples. Survival analysis indicated that seven genes with high expression (ADAM9, ADAM10, EPS8, FAM83A, FAM111B, LAMA3 and TES) were significantly associated with poor prognosis. In addition, the distinction for pancreatic cancer subtypes was determined using 46 subtype-specific genes and unsupervised clustering. Overall, our study is the first to explore the molecular characteristics of 6mA modifications in pancreatic cancer, indicating that 6mA has the potential to be a target for future clinical treatment.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"150-162"},"PeriodicalIF":4.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9588453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Emma F Jones, Anisha Haldar, Vishal H Oza, Brittany N Lasseigne
Following the central dogma of molecular biology, gene expression heterogeneity can aid in predicting and explaining the wide variety of protein products, functions and, ultimately, heterogeneity in phenotypes. There is currently overlapping terminology used to describe the types of diversity in gene expression profiles, and overlooking these nuances can misrepresent important biological information. Here, we describe transcriptome diversity as a measure of the heterogeneity in (1) the expression of all genes within a sample or a single gene across samples in a population (gene-level diversity) or (2) the isoform-specific expression of a given gene (isoform-level diversity). We first overview modulators and quantification of transcriptome diversity at the gene level. Then, we discuss the role alternative splicing plays in driving transcript isoform-level diversity and how it can be quantified. Additionally, we overview computational resources for calculating gene-level and isoform-level diversity for high-throughput sequencing data. Finally, we discuss future applications of transcriptome diversity. This review provides a comprehensive overview of how gene expression diversity arises, and how measuring it determines a more complete picture of heterogeneity across proteins, cells, tissues, organisms and species.
{"title":"Quantifying transcriptome diversity: a review.","authors":"Emma F Jones, Anisha Haldar, Vishal H Oza, Brittany N Lasseigne","doi":"10.1093/bfgp/elad019","DOIUrl":"10.1093/bfgp/elad019","url":null,"abstract":"<p><p>Following the central dogma of molecular biology, gene expression heterogeneity can aid in predicting and explaining the wide variety of protein products, functions and, ultimately, heterogeneity in phenotypes. There is currently overlapping terminology used to describe the types of diversity in gene expression profiles, and overlooking these nuances can misrepresent important biological information. Here, we describe transcriptome diversity as a measure of the heterogeneity in (1) the expression of all genes within a sample or a single gene across samples in a population (gene-level diversity) or (2) the isoform-specific expression of a given gene (isoform-level diversity). We first overview modulators and quantification of transcriptome diversity at the gene level. Then, we discuss the role alternative splicing plays in driving transcript isoform-level diversity and how it can be quantified. Additionally, we overview computational resources for calculating gene-level and isoform-level diversity for high-throughput sequencing data. Finally, we discuss future applications of transcriptome diversity. This review provides a comprehensive overview of how gene expression diversity arises, and how measuring it determines a more complete picture of heterogeneity across proteins, cells, tissues, organisms and species.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"83-94"},"PeriodicalIF":2.5,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11484519/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10195229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sourajyoti Datta, Muhammad Nabeel Asim, Andreas Dengel, Sheraz Ahmed
Post-translational modifications (PTMs) either enhance a protein's activity in various sub-cellular processes, or degrade their activity which leads toward failure of intracellular processes. Tyrosine nitration (NT) modification degrades protein's activity that initiates and propagates various diseases including neurodegenerative, cardiovascular, autoimmune diseases and carcinogenesis. Identification of NT modification supports development of novel therapies and drug discoveries for associated diseases. Identification of NT modification in biochemical labs is expensive, time consuming and error-prone. To supplement this process, several computational approaches have been proposed. However these approaches fail to precisely identify NT modification, due to the extraction of irrelevant, redundant and less discriminative features from protein sequences. This paper presents the NTpred framework that is competent in extracting comprehensive features from raw protein sequences using four different sequence encoders. To reap the benefits of different encoders, it generates four additional feature spaces by fusing different combinations of individual encodings. Furthermore, it eradicates irrelevant and redundant features from eight different feature spaces through a Recursive Feature Elimination process. Selected features of four individual encodings and four feature fusion vectors are used to train eight different Gradient Boosted Tree classifiers. The probability scores from the trained classifiers are utilized to generate a new probabilistic feature space, which is used to train a Logistic Regression classifier. On the BD1 benchmark dataset, the proposed framework outperforms the existing best-performing predictor in 5-fold cross validation and independent test evaluation with combined improvement of 13.7% in MCC and 20.1% in AUC. Similarly, on the BD2 benchmark dataset, the proposed framework outperforms the existing best-performing predictor with combined improvement of 5.3% in MCC and 1.0% in AUC. NTpred is publicly available for further experimentation and predictive use at: https://sds_genetic_analysis.opendfki.de/PredNTS/.
{"title":"NTpred: a robust and precise machine learning framework for in silico identification of Tyrosine nitration sites in protein sequences.","authors":"Sourajyoti Datta, Muhammad Nabeel Asim, Andreas Dengel, Sheraz Ahmed","doi":"10.1093/bfgp/elad018","DOIUrl":"10.1093/bfgp/elad018","url":null,"abstract":"<p><p>Post-translational modifications (PTMs) either enhance a protein's activity in various sub-cellular processes, or degrade their activity which leads toward failure of intracellular processes. Tyrosine nitration (NT) modification degrades protein's activity that initiates and propagates various diseases including neurodegenerative, cardiovascular, autoimmune diseases and carcinogenesis. Identification of NT modification supports development of novel therapies and drug discoveries for associated diseases. Identification of NT modification in biochemical labs is expensive, time consuming and error-prone. To supplement this process, several computational approaches have been proposed. However these approaches fail to precisely identify NT modification, due to the extraction of irrelevant, redundant and less discriminative features from protein sequences. This paper presents the NTpred framework that is competent in extracting comprehensive features from raw protein sequences using four different sequence encoders. To reap the benefits of different encoders, it generates four additional feature spaces by fusing different combinations of individual encodings. Furthermore, it eradicates irrelevant and redundant features from eight different feature spaces through a Recursive Feature Elimination process. Selected features of four individual encodings and four feature fusion vectors are used to train eight different Gradient Boosted Tree classifiers. The probability scores from the trained classifiers are utilized to generate a new probabilistic feature space, which is used to train a Logistic Regression classifier. On the BD1 benchmark dataset, the proposed framework outperforms the existing best-performing predictor in 5-fold cross validation and independent test evaluation with combined improvement of 13.7% in MCC and 20.1% in AUC. Similarly, on the BD2 benchmark dataset, the proposed framework outperforms the existing best-performing predictor with combined improvement of 5.3% in MCC and 1.0% in AUC. NTpred is publicly available for further experimentation and predictive use at: https://sds_genetic_analysis.opendfki.de/PredNTS/.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"163-179"},"PeriodicalIF":4.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9544857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
With the global pandemic of COVID-19, the research on influenza virus has entered a new stage, but it is difficult to elucidate the pathogenesis of influenza disease. Genome-wide association studies (GWASs) have greatly shed light on the role of host genetic background in influenza pathogenesis and prognosis, whereas single-cell RNA sequencing (scRNA-seq) has enabled unprecedented resolution of cellular diversity and in vivo following influenza disease. Here, we performed a comprehensive analysis of influenza GWAS and scRNA-seq data to reveal cell types associated with influenza disease and provide clues to understanding pathogenesis. We downloaded two GWAS summary data, two scRNA-seq data on influenza disease. After defining cell types for each scRNA-seq data, we used RolyPoly and LDSC-cts to integrate GWAS and scRNA-seq. Furthermore, we analyzed scRNA-seq data from the peripheral blood mononuclear cells (PBMCs) of a healthy population to validate and compare our results. After processing the scRNA-seq data, we obtained approximately 70 000 cells and identified up to 13 cell types. For the European population analysis, we determined an association between neutrophils and influenza disease. For the East Asian population analysis, we identified an association between monocytes and influenza disease. In addition, we also identified monocytes as a significantly related cell type in a dataset of healthy human PBMCs. In this comprehensive analysis, we identified neutrophils and monocytes as influenza disease-associated cell types. More attention and validation should be given in future studies.
{"title":"Integrating single-cell RNA sequencing data to genome-wide association analysis data identifies significant cell types in influenza A virus infection and COVID-19.","authors":"Yixin Zou, Xifang Sun, Yifan Wang, Yidi Wang, Xiangyu Ye, Junlan Tu, Rongbin Yu, Peng Huang","doi":"10.1093/bfgp/elad025","DOIUrl":"10.1093/bfgp/elad025","url":null,"abstract":"<p><p>With the global pandemic of COVID-19, the research on influenza virus has entered a new stage, but it is difficult to elucidate the pathogenesis of influenza disease. Genome-wide association studies (GWASs) have greatly shed light on the role of host genetic background in influenza pathogenesis and prognosis, whereas single-cell RNA sequencing (scRNA-seq) has enabled unprecedented resolution of cellular diversity and in vivo following influenza disease. Here, we performed a comprehensive analysis of influenza GWAS and scRNA-seq data to reveal cell types associated with influenza disease and provide clues to understanding pathogenesis. We downloaded two GWAS summary data, two scRNA-seq data on influenza disease. After defining cell types for each scRNA-seq data, we used RolyPoly and LDSC-cts to integrate GWAS and scRNA-seq. Furthermore, we analyzed scRNA-seq data from the peripheral blood mononuclear cells (PBMCs) of a healthy population to validate and compare our results. After processing the scRNA-seq data, we obtained approximately 70 000 cells and identified up to 13 cell types. For the European population analysis, we determined an association between neutrophils and influenza disease. For the East Asian population analysis, we identified an association between monocytes and influenza disease. In addition, we also identified monocytes as a significantly related cell type in a dataset of healthy human PBMCs. In this comprehensive analysis, we identified neutrophils and monocytes as influenza disease-associated cell types. More attention and validation should be given in future studies.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"110-117"},"PeriodicalIF":4.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9669193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
To survive and establish a niche for themselves, bacteria constantly evolve. Toward that, they not only insert point mutations and promote illegitimate recombinations within their genomes but also insert pieces of 'foreign' deoxyribonucleic acid, which are commonly referred to as 'genomic islands' (GEIs). The GEIs come in several forms, structures and types, often providing a fitness advantage to the harboring bacterium. In pathogenic bacteria, some GEIs may enhance virulence, thus altering disease burden, morbidity and mortality. Hence, delineating (i) the GEIs framework, (ii) their encoded functions, (iii) the triggers that help them move, (iv) the mechanisms they exploit to move among bacteria and (v) identification of their natural reservoirs will aid in superior tackling of several bacterial diseases, including sepsis. Given the vast array of comparative genomics data, in this short review, we provide an overview of the GEIs, their types and the compositions therein, especially highlighting GEIs harbored by two important pathogens, viz. Acinetobacter baumannii and Klebsiella pneumoniae, which prominently trigger sepsis in low- and middle-income countries. Our efforts help shed some light on the challenges these pathogens pose when equipped with GEIs. We hope that this review will provoke intense research into understanding GEIs, the cues that drive their mobility across bacteria and the ways and means to prevent their transfer, especially across pathogenic bacteria.
{"title":"Genomic islands and their role in fitness traits of two key sepsis-causing bacterial pathogens.","authors":"Mohd Ilyas, Dyuti Purkait, Krishnamohan Atmakuri","doi":"10.1093/bfgp/elac051","DOIUrl":"10.1093/bfgp/elac051","url":null,"abstract":"<p><p>To survive and establish a niche for themselves, bacteria constantly evolve. Toward that, they not only insert point mutations and promote illegitimate recombinations within their genomes but also insert pieces of 'foreign' deoxyribonucleic acid, which are commonly referred to as 'genomic islands' (GEIs). The GEIs come in several forms, structures and types, often providing a fitness advantage to the harboring bacterium. In pathogenic bacteria, some GEIs may enhance virulence, thus altering disease burden, morbidity and mortality. Hence, delineating (i) the GEIs framework, (ii) their encoded functions, (iii) the triggers that help them move, (iv) the mechanisms they exploit to move among bacteria and (v) identification of their natural reservoirs will aid in superior tackling of several bacterial diseases, including sepsis. Given the vast array of comparative genomics data, in this short review, we provide an overview of the GEIs, their types and the compositions therein, especially highlighting GEIs harbored by two important pathogens, viz. Acinetobacter baumannii and Klebsiella pneumoniae, which prominently trigger sepsis in low- and middle-income countries. Our efforts help shed some light on the challenges these pathogens pose when equipped with GEIs. We hope that this review will provoke intense research into understanding GEIs, the cues that drive their mobility across bacteria and the ways and means to prevent their transfer, especially across pathogenic bacteria.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"55-68"},"PeriodicalIF":4.0,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10364958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The entire world is facing the stiff challenge of COVID-19 pandemic. To overcome the spread of this highly infectious disease, several short-sighted strategies were adopted such as the use of broad-spectrum antibiotics and antifungals. However, the misuse and/or overuse of antibiotics have accentuated the emergence of the next pandemic: antimicrobial resistance (AMR). It is believed that pathogens while transferring between humans and the environment carry virulence and antibiotic-resistant factors from varied species. It is presumed that all such genetic factors are quantifiable and predictable, a better understanding of which could be a limiting step for the progression of AMR. Herein, we have reviewed how genomics-based understanding of host-pathogen interactions during COVID-19 could reduce the non-judicial use of antibiotics and prevent the eruption of an AMR-based pandemic in future.
全世界都面临着 COVID-19 大流行的严峻挑战。为了遏制这种高度传染性疾病的传播,人们采取了一些短视的策略,如使用广谱抗生素和抗真菌药物。然而,抗生素的滥用和/或过度使用加剧了下一个流行病的出现:抗菌药耐药性(AMR)。据认为,病原体在人类和环境之间传播时,会携带来自不同物种的毒性和抗生素耐药性因子。据推测,所有这些遗传因子都是可以量化和预测的,更好地了解这些遗传因子可能会限制 AMR 的发展。在此,我们回顾了基于基因组学对 COVID-19 期间宿主与病原体相互作用的理解如何减少抗生素的非司法使用并防止未来爆发基于 AMR 的大流行。
{"title":"Significance of understanding the genomics of host-pathogen interaction in limiting antibiotic resistance development: lessons from COVID-19 pandemic.","authors":"Vikas Yadav, Srividhya Ravichandran","doi":"10.1093/bfgp/elad001","DOIUrl":"10.1093/bfgp/elad001","url":null,"abstract":"<p><p>The entire world is facing the stiff challenge of COVID-19 pandemic. To overcome the spread of this highly infectious disease, several short-sighted strategies were adopted such as the use of broad-spectrum antibiotics and antifungals. However, the misuse and/or overuse of antibiotics have accentuated the emergence of the next pandemic: antimicrobial resistance (AMR). It is believed that pathogens while transferring between humans and the environment carry virulence and antibiotic-resistant factors from varied species. It is presumed that all such genetic factors are quantifiable and predictable, a better understanding of which could be a limiting step for the progression of AMR. Herein, we have reviewed how genomics-based understanding of host-pathogen interactions during COVID-19 could reduce the non-judicial use of antibiotics and prevent the eruption of an AMR-based pandemic in future.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"69-74"},"PeriodicalIF":4.0,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10593381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The dramatic changes in physiology at high altitude (HA) as a result of the characteristic hypobaric hypoxia condition can modify innate and adaptive defense mechanisms of the body. As a consequence, few sojourners visiting HA with mild or asymptomatic infection may have an enhanced susceptibility to high-altitude pulmonary edema (HAPE), an acute but severe altitude sickness. It develops upon rapid ascent to altitudes above 2500 m, in otherwise healthy individuals. Though HAPE has been studied extensively, an elaborate exploration of the HA disease burden and the potential risk factors associated with its manifestation are poorly described. The present review discusses respiratory tract infection (RTI) as an unfamiliar but important risk factor in enhancing HAPE susceptibility in sojourners for two primary reasons. First, the symptoms of RTI s resemble those of HAPE. Secondly, the imbalanced pathways contributing to vascular dysfunction in HAPE also participate in the pathogenesis of the infectious processes. These pathways have a crucial role in shaping host response against viral and bacterial infections and may further worsen the clinical outcomes at HA. Respiratory tract pathogenic agents, if screened in HAPE patients, can help in ascertaining their role in disease risk and also point toward their association with the disease severity. The microbial screenings and identifications of pathogens with diseases are the foundation for describing potential molecular mechanisms underlying host response to the microbial challenge. The prior knowledge of such infections may predict the manifestation of disease etiology and provide better therapeutic options.
{"title":"Respiratory tract infection: an unfamiliar risk factor in high-altitude pulmonary edema.","authors":"Raushni Choudhary, Swati Kumari, Manzoor Ali, Tashi Thinlas, Stanzen Rabyang, Aastha Mishra","doi":"10.1093/bfgp/elac048","DOIUrl":"10.1093/bfgp/elac048","url":null,"abstract":"<p><p>The dramatic changes in physiology at high altitude (HA) as a result of the characteristic hypobaric hypoxia condition can modify innate and adaptive defense mechanisms of the body. As a consequence, few sojourners visiting HA with mild or asymptomatic infection may have an enhanced susceptibility to high-altitude pulmonary edema (HAPE), an acute but severe altitude sickness. It develops upon rapid ascent to altitudes above 2500 m, in otherwise healthy individuals. Though HAPE has been studied extensively, an elaborate exploration of the HA disease burden and the potential risk factors associated with its manifestation are poorly described. The present review discusses respiratory tract infection (RTI) as an unfamiliar but important risk factor in enhancing HAPE susceptibility in sojourners for two primary reasons. First, the symptoms of RTI s resemble those of HAPE. Secondly, the imbalanced pathways contributing to vascular dysfunction in HAPE also participate in the pathogenesis of the infectious processes. These pathways have a crucial role in shaping host response against viral and bacterial infections and may further worsen the clinical outcomes at HA. Respiratory tract pathogenic agents, if screened in HAPE patients, can help in ascertaining their role in disease risk and also point toward their association with the disease severity. The microbial screenings and identifications of pathogens with diseases are the foundation for describing potential molecular mechanisms underlying host response to the microbial challenge. The prior knowledge of such infections may predict the manifestation of disease etiology and provide better therapeutic options.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"38-45"},"PeriodicalIF":4.0,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10364149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mansi Srivastava, Matthew R Dukeshire, Quoseena Mir, Okiemute Beatrice Omoru, Amirhossein Manzourolajdad, Sarath Chandra Janga
Long-range ribonucleic acid (RNA)-RNA interactions (RRI) are prevalent in positive-strand RNA viruses, including Beta-coronaviruses, and these take part in regulatory roles, including the regulation of sub-genomic RNA production rates. Crosslinking of interacting RNAs and short read-based deep sequencing of resulting RNA-RNA hybrids have shown that these long-range structures exist in severe acute respiratory syndrome coronavirus (SARS-CoV)-2 on both genomic and sub-genomic levels and in dynamic topologies. Furthermore, co-evolution of coronaviruses with their hosts is navigated by genetic variations made possible by its large genome, high recombination frequency and a high mutation rate. SARS-CoV-2's mutations are known to occur spontaneously during replication, and thousands of aggregate mutations have been reported since the emergence of the virus. Although many long-range RRIs have been experimentally identified using high-throughput methods for the wild-type SARS-CoV-2 strain, evolutionary trajectory of these RRIs across variants, impact of mutations on RRIs and interaction of SARS-CoV-2 RNAs with the host have been largely open questions in the field. In this review, we summarize recent computational tools and experimental methods that have been enabling the mapping of RRIs in viral genomes, with a specific focus on SARS-CoV-2. We also present available informatics resources to navigate the RRI maps and shed light on the impact of mutations on the RRI space in viral genomes. Investigating the evolution of long-range RNA interactions and that of virus-host interactions can contribute to the understanding of new and emerging variants as well as aid in developing improved RNA therapeutics critical for combating future outbreaks.
{"title":"Experimental and computational methods for studying the dynamics of RNA-RNA interactions in SARS-COV2 genomes.","authors":"Mansi Srivastava, Matthew R Dukeshire, Quoseena Mir, Okiemute Beatrice Omoru, Amirhossein Manzourolajdad, Sarath Chandra Janga","doi":"10.1093/bfgp/elac050","DOIUrl":"10.1093/bfgp/elac050","url":null,"abstract":"<p><p>Long-range ribonucleic acid (RNA)-RNA interactions (RRI) are prevalent in positive-strand RNA viruses, including Beta-coronaviruses, and these take part in regulatory roles, including the regulation of sub-genomic RNA production rates. Crosslinking of interacting RNAs and short read-based deep sequencing of resulting RNA-RNA hybrids have shown that these long-range structures exist in severe acute respiratory syndrome coronavirus (SARS-CoV)-2 on both genomic and sub-genomic levels and in dynamic topologies. Furthermore, co-evolution of coronaviruses with their hosts is navigated by genetic variations made possible by its large genome, high recombination frequency and a high mutation rate. SARS-CoV-2's mutations are known to occur spontaneously during replication, and thousands of aggregate mutations have been reported since the emergence of the virus. Although many long-range RRIs have been experimentally identified using high-throughput methods for the wild-type SARS-CoV-2 strain, evolutionary trajectory of these RRIs across variants, impact of mutations on RRIs and interaction of SARS-CoV-2 RNAs with the host have been largely open questions in the field. In this review, we summarize recent computational tools and experimental methods that have been enabling the mapping of RRIs in viral genomes, with a specific focus on SARS-CoV-2. We also present available informatics resources to navigate the RRI maps and shed light on the impact of mutations on the RRI space in viral genomes. Investigating the evolution of long-range RNA interactions and that of virus-host interactions can contribute to the understanding of new and emerging variants as well as aid in developing improved RNA therapeutics critical for combating future outbreaks.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"46-54"},"PeriodicalIF":4.0,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10799312/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10666297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}