Evolutionary Bioinformatics: Latest Articles (Page 2)

Recombination Events Among SARS-CoV-2 Omicron Subvariants: Impact on Spike Interaction With ACE2 Receptor and Neutralizing Antibodies.

IF 1.7 | Zone 4 (Biology) | Q4 Evolutionary Biology

Evolutionary Bioinformatics

Pub Date : 2024-08-14 eCollection Date: 2024-01-01 DOI: 10.1177/11769343241272415

Marwa Arbi, Marwa Khedhiri, Kaouther Ayouni, Oussema Souiai, Samar Dhouib, Nidhal Ghanmi, Alia Benkahla, Henda Triki, Sondes Haddad-Boubaker

Recombination plays a key role in promoting the evolution of RNA viruses and the emergence of potentially epidemic variants. Some studies have investigated the occurrence of recombination in SARS-CoV-2 without exploring its impact on virus-host interaction. To investigate the burden of recombination in terms of frequency and distribution, the occurrence of recombination was first explored in 44 230 Omicron sequences among BQ subvariants and the under-investigation sequences denoted "ML" (Multiple Lineages), using 3seq software. Second, the impact of recombination on the interaction between the Spike protein and the ACE2 receptor, as well as neutralizing antibodies (nAbs), was analyzed using docking tools. Recombination was detected in 56.91% and 82.20% of BQ and ML strains, respectively. It took place mainly in the spike and ORF1a genes. For BQ recombinant strains, the docking analysis showed that the spike interacted strongly with ACE2 and weakly with nAbs. The mutations S373P, S375F and T376A constitute a residue network that enhances the RBD interaction with ACE2. Thirteen mutations in the RBD (S373P, S375F, T376A, D405N, R408S, K417N, N440K, S477N, P494S, Q498R, N501Y, and Y505H) and NTD (Y240H) seem to be implicated in immune evasion of recombinants by altering spike interaction with nAbs. In conclusion, this "in silico" study demonstrated that recombination is frequent among Omicron BQ and ML variants. It highlights new key mutations that are potentially implicated in the enhancement of spike binding to ACE2 (F376A) and escape from nAbs (RBD: F376A, D405N, R408S, N440K, S477N, P494S, and Y505H; NTD: Y240H). Our findings offer considerable insights for the development of effective prophylactic and therapeutic strategies against future SARS-CoV-2 waves.
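
The mutation labels in this abstract follow the usual amino-acid substitution shorthand (wild-type residue, position, mutant residue). As a reading aid, the sketch below parses that shorthand and tallies the RBD and NTD lists quoted above. The helper names (`Substitution`, `parse_substitution`, `parse_list`) are hypothetical and are not part of the study's pipeline.

```python
import re
from typing import NamedTuple


class Substitution(NamedTuple):
    wild_type: str  # one-letter code of the original residue
    position: int   # residue position in the protein
    mutant: str     # one-letter code of the replacing residue


# One uppercase letter, a run of digits, one uppercase letter (e.g. "N501Y").
_PATTERN = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_substitution(label: str) -> Substitution:
    """Split a label such as 'N501Y' into residue, position, residue."""
    match = _PATTERN.match(label.strip())
    if match is None:
        raise ValueError(f"not a substitution label: {label!r}")
    wild_type, position, mutant = match.groups()
    return Substitution(wild_type, int(position), mutant)


def parse_list(text: str) -> list[Substitution]:
    """Parse a comma-separated list of substitution labels."""
    return [parse_substitution(item) for item in text.split(",")]


# Lists copied from the abstract above.
RBD = "S373P, S375F, T376A, D405N, R408S, K417N, N440K, S477N, P494S, Q498R, N501Y, Y505H"
NTD = "Y240H"

rbd = parse_list(RBD)
ntd = parse_list(NTD)
print(len(rbd), len(ntd), len(rbd) + len(ntd))  # 12 1 13
```

The tally of 12 RBD substitutions plus 1 NTD substitution matches the "thirteen mutations" stated in the abstract.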



Single-cell RNA Sequencing Identifies Natural Kill Cell-Related Transcription Factors Associated With Age-Related Macular Degeneration.

IF 1.7 | Zone 4 (Biology) | Q4 Evolutionary Biology

Evolutionary Bioinformatics

Pub Date : 2024-08-14 eCollection Date: 2024-01-01 DOI: 10.1177/11769343241272413

Yili Luo, Jianpeng Liu, Wangqiang Feng, Da Lin, Mengji Chen, Haihua Zheng

Background: Age-related Macular Degeneration (AMD) poses a growing global health concern as the leading cause of central vision loss in elderly people.

Objective: This study focuses on unraveling the intricate involvement of Natural Killer (NK) cells in AMD, shedding light on their immune responses and cytokine regulatory roles.

Methods: Transcriptomic data from the Gene Expression Omnibus database were utilized, employing single-cell RNA-seq analysis. High-dimensional weighted gene co-expression network analysis (hdWGCNA) and single-cell regulatory network inference and clustering (SCENIC) analysis were applied to reveal the regulatory mechanisms of NK cells in early-stage AMD patients. Machine learning models, such as random forests and decision trees, were employed to screen hub genes and key transcription factors (TFs) associated with AMD.

Results: Distinct cell clusters were identified in the present study, especially the T/NK cluster, with a notable increase in NK cell abundance observed in AMD. Cell-cell communication analyses revealed altered interactions, particularly in NK cells, indicating their potential role in AMD pathogenesis. HdWGCNA highlighted the turquoise module, enriched in inflammation-related pathways, as significantly associated with AMD in NK cells. The SCENIC analysis identified key TFs in NK cell regulatory networks. The integration of hub genes and TFs identified CREM, FOXP1, IRF1, NFKB2, and USF2 as potential predictors for AMD through machine learning.

Conclusion: This comprehensive approach enhances our understanding of NK cell dynamics, signaling alterations, and potential predictive models for AMD. The identified TFs provide new avenues for molecular interventions and highlight the intricate relationship between NK cells and AMD pathogenesis. Overall, this study contributes valuable insights for advancing our understanding and management of AMD.



MicroRNA Transcriptomes Reveal Prevalence of Rare and Species-Specific Arm Switching Events During Zebrafish Ontogenesis.

IF 1.7 | Zone 4 (Biology) | Q4 Evolutionary Biology

Evolutionary Bioinformatics

Pub Date : 2024-07-24 eCollection Date: 2024-01-01 DOI: 10.1177/11769343241263230

Arthur Casulli de Oliveira, Luiz Augusto Bovolenta, Lucas Figueiredo, Amanda De Oliveira Ribeiro, Beatriz Jacinto Alves Pereira, Talita Roberto Aleixo de Almeida, Vinicius Farias Campos, James G Patton, Danillo Pinhal

In metazoans, microRNAs (miRNAs) are essential regulators of gene expression, affecting critical cellular processes from differentiation and proliferation to homeostasis. During miRNA biogenesis, the miRNA strand that loads onto the RNA-induced Silencing Complex (RISC) can vary, leading to changes in gene targeting and modulation of biological pathways. To investigate the impact of these "arm switching" events on gene regulation, we analyzed a diverse range of tissues and developmental stages in zebrafish by comparing the accumulation dynamics of the 5p and 3p arms between embryonic developmental stages, adult tissues, and sexes. We also compared the variable arm usage patterns observed in zebrafish with those of other vertebrates, including arm switching data from fish, birds, and mammals. Our comprehensive analysis revealed that variable arm usage events predominantly take place during embryonic development. It is also noteworthy that isomiR occurrence correlates with changes in arm selection, evidencing an important role for distinct microRNA isoforms in reinforcing and modifying gene regulation by promoting dynamic switches in miRNA 5p and 3p arm accumulation. Our results shed new light on the emergence and coordination of gene expression regulation and pave the way for future investigations in this field.
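
To make the idea of "arm switching" concrete, the sketch below compares 5p and 3p read counts across an ordered series of samples and reports where the dominant arm changes. This is an illustration only: the abstract does not give the authors' criteria, so the log2-ratio threshold, the pseudocount, the function names and the read counts are all assumptions.

```python
from math import log2


def arm_log_ratio(count_5p: float, count_3p: float, pseudocount: float = 1.0) -> float:
    """log2 of 5p over 3p reads; the pseudocount avoids division by zero."""
    return log2((count_5p + pseudocount) / (count_3p + pseudocount))


def dominant_arm(count_5p: float, count_3p: float, min_abs_log_ratio: float = 1.0):
    """Return '5p' or '3p' when one arm clearly dominates, else None.

    The default threshold (a two-fold difference) is an assumed cutoff.
    """
    ratio = arm_log_ratio(count_5p, count_3p)
    if ratio >= min_abs_log_ratio:
        return "5p"
    if ratio <= -min_abs_log_ratio:
        return "3p"
    return None


def find_arm_switches(profile):
    """List (earlier_sample, later_sample) pairs where the dominant arm flips.

    `profile` is an ordered list of (sample_name, (count_5p, count_3p)).
    Samples without a clearly dominant arm are skipped.
    """
    switches = []
    last_sample, last_arm = None, None
    for sample, (count_5p, count_3p) in profile:
        arm = dominant_arm(count_5p, count_3p)
        if arm is None:
            continue
        if last_arm is not None and arm != last_arm:
            switches.append((last_sample, sample))
        last_sample, last_arm = sample, arm
    return switches


# Made-up counts for one hypothetical miRNA across three samples.
profile = [
    ("embryo_24hpf", (900, 100)),  # 5p dominant
    ("embryo_48hpf", (400, 350)),  # no clear dominant arm
    ("adult_brain", (80, 720)),    # 3p dominant
]
print(find_arm_switches(profile))  # [('embryo_24hpf', 'adult_brain')]
```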



The Spatio-Temporal Expression Profiles of Silkworm Pseudogenes Provide Valuable Insights into Their Biological Roles.

IF 2.6 | Zone 4 (Biology) | Q4 Evolutionary Biology

Evolutionary Bioinformatics

Pub Date : 2024-06-14 eCollection Date: 2024-01-01 DOI: 10.1177/11769343241261814

Linrong Wan, Siyuan Su, Jinyun Liu, Bangxing Zou, Yaming Jiang, Beibei Jiao, Shaokuan Tang, Youhong Zhang, Cao Deng, Wenfu Xiao

Background: Pseudogenes are sequences that have lost the ability to transcribe RNA molecules or encode truncated but possibly functional proteins. While they were once considered to be meaningless remnants of evolution, recent research has shown that pseudogenes play important roles in various biological processes. However, studies of pseudogenes in the silkworm, an important model organism, are limited and have focused on single or only a few specific genes.

Objective: To fill these gaps, we present a systematic genome-wide study of pseudogenes in the silkworm.

Methods: We identified the pseudogenes in the silkworm using the silkworm genome assemblies, transcriptome, and protein sequences from the silkworm and its related species. We then used transcriptome datasets from 832 RNA-seq analyses to construct spatio-temporal expression profiles for these pseudogenes. Additionally, we identified tissue-specifically expressed and differentially expressed pseudogenes to further understand their characteristics. Finally, the functional roles of pseudogenes as lncRNAs were systematically analyzed.

Results: We identified a total of 4410 pseudogenes, which were grouped into 4 groups: duplications (DUPs), unitary pseudogenes (Unitary), processed pseudogenes (retropseudogenes, RETs), and fragments (FRAGs). Most pseudogenes in the domestic silkworm were generated before the divergence of the wild and domestic silkworm; however, domestication may also be involved in the accumulation of pseudogenes. These pseudogenes were clearly divided into 2 clusters, one highly expressed and one lowly expressed, and the posterior silk gland was the tissue with the most tissue-specific pseudogenes (199), implying that these pseudogenes may be involved in the development and function of the silk gland. We identified 3299 lncRNAs in these pseudogenes, and the target genes of these lncRNAs in silkworm pseudogenes were enriched in egg formation and olfactory function.

Conclusions: This study replenishes the genome annotations for the silkworm and provides valuable insights into the biological roles of pseudogenes. It will also contribute to our understanding of the complex gene regulatory networks in the silkworm and will potentially have implications for other organisms as well.
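
The abstract does not say how tissue specificity was scored. One widely used measure is the tau index, which is 0 for uniform expression across tissues and 1 for expression confined to a single tissue. The sketch below shows that measure under the assumption that expression values are non-negative and already normalized; `tau_index` and the example values are hypothetical and may not match the authors' method.

```python
def tau_index(expression):
    """Tissue-specificity index: tau = sum(1 - x_i / x_max) / (n - 1)."""
    values = list(expression)
    if len(values) < 2:
        raise ValueError("tau needs expression values from at least two tissues")
    if any(value < 0 for value in values):
        raise ValueError("expression values must be non-negative")
    peak = max(values)
    if peak == 0:
        # Not expressed anywhere: treat as non-specific by convention here.
        return 0.0
    return sum(1 - value / peak for value in values) / (len(values) - 1)


# Made-up expression values across four tissues.
print(tau_index([5, 5, 5, 5]))             # 0.0 (uniform expression)
print(tau_index([0, 0, 0, 10]))            # 1.0 (single-tissue expression)
print(round(tau_index([1, 1, 1, 9]), 3))   # 0.889
```

A pseudogene would then be called tissue-specific when its tau exceeds a chosen cutoff; the cutoff itself is a design choice that the abstract does not report.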



Study on Allele Specific Expression of Long-Term Residents in High Altitude Areas

IF 2.6 | Zone 4 (Biology) | Q4 Evolutionary Biology

Evolutionary Bioinformatics

Pub Date : 2024-05-30 DOI: 10.1177/11769343241257344

Chao He, Bin Zhu, Wenwen Gao, Qianjin Wu, Changshui Zhang

In diploid organisms, half of the chromosomes in each cell come from the father and half from the mother. Previous studies have found that the paternal and maternal chromosomes can be regulated and expressed independently, giving rise to allele-specific expression (ASE). In this study, we analyzed the differential expression of alleles between a high-altitude population and a normal population based on RNA sequencing data. Through gene cluster analysis and protein interaction network analysis, we found that some changes occurred at the gene level, along with some negative effects. During the study, we found that the calmodulin homology domain may be correlated with long-term survival at high altitude. The plateau environment is characterized by hypoxia, low air pressure, strong ultraviolet radiation, and low temperature. Accordingly, the genetic changes in the process of adaptation mainly reflect these characteristics. Living at high altitude over generations is also highly related to cancer, immune disease, cardiovascular disease, neurological disease, endocrine disease, and other diseases. Therefore, the medical system in high altitude areas should pay more attention to these diseases.
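
Allele-specific expression at a heterozygous site is commonly assessed by asking whether the reads split evenly between the two alleles. The sketch below computes the allelic ratio and an exact two-sided binomial p-value against a 50:50 expectation. The abstract does not state which test the authors used, so this is an assumed, simplified approach; the function names and read counts are hypothetical, and it ignores mapping bias and overdispersion.

```python
from math import comb


def allelic_ratio(ref_reads: int, alt_reads: int) -> float:
    """Fraction of reads carrying the reference allele."""
    total = ref_reads + alt_reads
    if total == 0:
        raise ValueError("no reads cover this site")
    return ref_reads / total


def binomial_two_sided_p(ref_reads: int, alt_reads: int) -> float:
    """Exact two-sided binomial p-value for H0: both alleles equally expressed.

    Under p = 0.5 the distribution is symmetric, so the two-sided p-value is
    twice the lower tail up to the smaller count, capped at 1.
    """
    total = ref_reads + alt_reads
    if total == 0:
        raise ValueError("no reads cover this site")
    smaller = min(ref_reads, alt_reads)
    tail = sum(comb(total, i) for i in range(smaller + 1))
    return min(1.0, 2 * tail / 2 ** total)


# Made-up read counts at one heterozygous site.
print(allelic_ratio(18, 2))         # 0.9
print(binomial_two_sided_p(10, 10))  # 1.0 (perfectly balanced)
print(binomial_two_sided_p(18, 2) < 0.001)  # True (strong imbalance)
```

In a genome-wide analysis the resulting p-values would still need multiple-testing correction before any site is called allele-specific.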



An Integrated Framework for Analysis and Prediction of Impact of Single Nucleotide Polymorphism Associated with Human Diseases.

IF 2.6 | Zone 4 (Biology) | Q4 Evolutionary Biology

Evolutionary Bioinformatics

Pub Date : 2024-05-10 eCollection Date: 2024-01-01 DOI: 10.1177/11769343241249916

Syed Shah Muhammad, Muhammad Shoaib, Muhammad Tariq Pervez

Single nucleotide polymorphisms (SNPs) are the most common type of genetic variation in the human genome. Analyzing genetic variants can help us better understand the genetic basis of diseases and develop predictive models that are useful for identifying individuals who are at increased risk for certain diseases. Several SNP analysis tools have already been developed. To run these tools, the user needs to collect data from various databases. In addition, researchers often have to use multiple variant analysis tools to cross-validate their results and increase confidence in their findings. Extracting data from multiple databases and running multiple tools at a time increases the complexity and time required for analysis. There are some web-based tools that integrate multiple genetic variant databases and provide variant annotations for a few tools. These approaches have some limitations, such as in retrieving annotation information and filtering common pathogenic variants. The proposed web-based tool, IPSNP: An Integrated Platform for Predicting Impact of SNPs, is written in Django, a Python-based framework. It uses the RESTful API of MyVariant.info to extract annotation information for variants associated with a given gene, rsID, or HGVS-format variants specified in a VCF file, for 29 tools. The results are provided as (1) a CSV file of predictions derived from the consensus decision, (2) a file containing annotations for the variants associated with the given gene, (3) a file listing variants declared pathogenic in common by the selected tools, and (4) a CSV file containing chromosome coordinates based on the GRCh37 and GRCh38 genome assemblies, rsIDs, and proteomic data, so that users may use tools of their choice and avoid manual parameter collection for each tool. IPSNP is a valuable resource for researchers and clinicians, and it can help save time and effort in discovering novel disease-associated variants and developing personalized treatments.
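
The "consensus decision" and "commonly pathogenic" outputs described above can be pictured as simple aggregation over per-tool predictions. The sketch below is an assumption about how such aggregation might work, not IPSNP's actual code: the majority-vote rule, the labels, the tool names (`tool_a`, `tool_b`, `tool_c`) and the variant identifiers are all hypothetical, and no MyVariant.info request is made.

```python
from collections import Counter


def consensus_call(predictions, min_fraction=0.5):
    """Majority vote over tool predictions for a single variant.

    `predictions` maps tool name -> label (or None when the tool has no call).
    A label wins only if it holds more than `min_fraction` of the votes cast.
    """
    votes = Counter(label for label in predictions.values() if label is not None)
    total = sum(votes.values())
    if total == 0:
        return "no_call"
    label, count = votes.most_common(1)[0]
    return label if count / total > min_fraction else "no_consensus"


def commonly_pathogenic(variants, tools):
    """Variants that every selected tool labels 'pathogenic'."""
    return [
        variant_id
        for variant_id, predictions in variants.items()
        if all(predictions.get(tool) == "pathogenic" for tool in tools)
    ]


# Made-up predictions for three placeholder variants.
variants = {
    "variant_1": {"tool_a": "pathogenic", "tool_b": "pathogenic", "tool_c": "pathogenic"},
    "variant_2": {"tool_a": "pathogenic", "tool_b": "benign", "tool_c": "benign"},
    "variant_3": {"tool_a": "pathogenic", "tool_b": "benign", "tool_c": None},
}

for variant_id, predictions in variants.items():
    print(variant_id, consensus_call(predictions))
# variant_1 pathogenic
# variant_2 benign
# variant_3 no_consensus

print(commonly_pathogenic(variants, ["tool_a", "tool_b", "tool_c"]))  # ['variant_1']
```

Requiring a strict majority means a tie is reported as "no_consensus" rather than being resolved arbitrarily.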


{"title":"An Integrated Framework for Analysis and Prediction of Impact of Single Nucleotide Polymorphism Associated with Human Diseases.","authors":"Syed Shah Muhammad, Muhammad Shoaib, Muhammad Tariq Pervez","doi":"10.1177/11769343241249916","DOIUrl":"10.1177/11769343241249916","url":null,"abstract":"Single nucleotide polymorphisms are most common type of genetic variation in human genome. Analyzing genetic variants can help us better understand the genetic basis of diseases and develop predictive models which are useful to identify individuals who are at increased risk for certain diseases. Several SNP analysis tools have already been developed. For running these tools, the user needs to collect data from various databases. Secondly, often researchers have to use multiple variant analysis tools for cross validating their results and increase confidence in their findings. Extracting data from multiple databases and running multiple tools at a time, increases complexity and time required for analysis. There are some web-based tools that integrate multiple genetic variant databases and provide variant annotations for a few tools. These approaches have some limitations such as retrieving annotation information, filtering common pathogenic variants. The proposed web-based tool, namely IPSNP: An Integrated Platform for Predicting Impact of SNPs is written in Django which is a python-based framework. It uses RESTful API of MyVariant.info to extract annotation information of variants associated with a given gene, rsID, HGVS format variants specified in a VCF file for 29 tools. 
The results are in the form of a CSV file of predictions (1) derived from the consensus decision, (2) a file having annotations for the variants associated with the given gene, (3) a file showing variants declared as pathogenic commonly by the selected tools, and (4) a CSV file containing chromosome coordinates based on GRCh37 and GRCh38 genome assemblies, rsIDs and proteomic data, so that users may use tools of their choice and avoiding manual parameter collection for each tool. IPSNP is a valuable resource for researchers and clinicians and it can help to save time and effort in discovering the novel disease-associated variants and the development of personalized treatments.","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"20 ","pages":"11769343241249916"},"PeriodicalIF":2.6,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11088291/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140913243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

Citations: 0

HNF4A-Bridging the Gap Between Intestinal Metaplasia and Gastric Cancer

IF 2.6 Zone 4 Biology Q4 EVOLUTIONARY BIOLOGY

Evolutionary Bioinformatics

Pub Date : 2024-04-26 DOI: 10.1177/11769343241249017

Yihang Zhao, Hong Tang, Jianhua Xu, Feifei Sun, Yuanyuan Zhao, Yang Li

Background: Intestinal metaplasia (IM) of the gastric epithelium has traditionally been regarded as an irreversible stage in the Correa cascade. Exploring the potential molecular mechanism of IM is important for effective gastric cancer prevention. Methods: The GSE78523 dataset, obtained from the Gene Expression Omnibus (GEO) database, was analyzed using RStudio software to identify the differentially expressed genes (DEGs) between IM tissues and normal gastric epithelial tissues. Subsequently, gene ontology (GO) analysis, Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis, Gene Set Enrichment Analysis (GSEA), and protein-protein interaction (PPI) analysis were used to find candidate genes. Additionally, the screened genes were analyzed for clinical, immunological, and genetic correlations using single gene clinical correlation analysis (UALCAN) and the Tumor-Immune System Interactions Database (TISIDB), and were validated through western blot experiments. Results: Enrichment analysis showed that the lipid metabolic pathway was significantly associated with IM tissues, and the apolipoprotein B (APOB) gene was identified in the subsequent analysis. Experimental results and correlation analysis showed that the expression of APOB was higher in IM tissues than in normal tissues. This elevated expression of APOB was also found to be associated with the expression level of the hepatocyte nuclear factor 4A (HNF4A) gene. HNF4A was also found to be associated with immune cell infiltration in gastric cancer and was linked to the prognosis of gastric cancer patients. Moreover, HNF4A was highly expressed in both IM tissues and gastric cancer cells. Conclusion: Our findings indicate that HNF4A regulates the microenvironment of lipid metabolism in IM tissues by targeting APOB. Higher expression of HNF4A tends to lead to a worse prognosis in gastric cancer patients, implying that it may serve as a predictive indicator for the progression from IM to gastric cancer.


Citations: 0

Genomic Characterization of IS6110 Insertions in Mycobacterium orygis

IF 2.6 Zone 4 Biology Q4 EVOLUTIONARY BIOLOGY

Evolutionary Bioinformatics

Pub Date : 2024-04-05 DOI: 10.1177/11769343241240558

Ahmed Kabir Refaya, Umashankar Vetrivel, Kannan Palaniyandi

Mycobacterium orygis, a subspecies of the Mycobacterium tuberculosis complex (MTBC), has emerged as a significant concern in the context of One Health, with implications for zoonosis or zooanthroponosis or both. MTBC strains are characterized by the unique insertion element IS6110, which is widely used as a diagnostic marker. IS6110 transposition drives genetic modifications in MTBC, imparting genome plasticity and profound biological consequences. While IS6110 insertions are customarily found in the MTBC genomes, the evolutionary trajectory of strains seems to correlate with the number of IS6110 copies, indicating enhanced adaptability with increasing copy numbers. Here, we present a comprehensive analysis of IS6110 insertions in the M. orygis genome, utilizing ISMapper, and elucidate their genetic consequences in promoting successful host adaptation. Our study encompasses a panel of 67 paired-end reads, comprising 11 isolates from our laboratory and 56 sequences downloaded from public databases. Among these sequences, 91% exhibited high-copy, 4.5% low-copy, and 4.5% lacked IS6110 insertions. We identified 255 insertion loci, including 141 intragenic and 114 intergenic insertions. Most of these loci were either unique or shared among a limited number of isolates, potentially influencing strain behavior. Furthermore, we conducted gene ontology and pathway analysis, using eggNOG-mapper 5.0, on the protein sequences disrupted by IS6110 insertions, revealing 63 genes involved in diverse functions of Gene Ontology and 45 genes participating in various KEGG pathways. Our findings offer novel insights into IS6110 insertions, their preferential insertion regions, and their impact on metabolic processes and pathways, providing valuable knowledge on the genetic changes underpinning IS6110 transposition in M. orygis.
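The split of insertion loci into intragenic and intergenic classes can be illustrated with a minimal Python sketch. It is not the ISMapper workflow used by the authors; the function name `classify_insertions`, the gene coordinates, and the insertion positions are all hypothetical, and genes are assumed not to overlap.

```python
from bisect import bisect_right


def classify_insertions(genes, positions):
    """Label each insertion position as intragenic or intergenic.

    genes: list of (start, end, name) tuples, 1-based inclusive,
           assumed non-overlapping (a simplifying assumption).
    positions: list of integer insertion coordinates.
    Returns a dict mapping position -> (label, gene_name_or_None).
    """
    genes = sorted(genes)
    starts = [g[0] for g in genes]
    result = {}
    for pos in positions:
        # Index of the last gene whose start is <= pos, or -1 if none.
        i = bisect_right(starts, pos) - 1
        if i >= 0 and genes[i][0] <= pos <= genes[i][1]:
            result[pos] = ("intragenic", genes[i][2])
        else:
            result[pos] = ("intergenic", None)
    return result


# Hypothetical annotation and insertion sites, for illustration only.
example_genes = [(100, 200, "geneA"), (300, 400, "geneB")]
example_calls = classify_insertions(example_genes, [150, 250, 400, 50])
```

Counting the labels over all loci would give the kind of intragenic and intergenic totals reported in the abstract (141 and 114, summing to 255).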


Citations: 0

Large-scale Pan Genomic Analysis of Mycobacterium tuberculosis Reveals Key Insights Into Molecular Evolutionary Rate of Specific Processes and Functions.

IF 1.7 Zone 4 Biology Q4 EVOLUTIONARY BIOLOGY

Evolutionary Bioinformatics

Pub Date : 2024-03-25 eCollection Date: 2024-01-01 DOI: 10.1177/11769343241239463

Eshan Bundhoo, Anisah W Ghoorah, Yasmina Jaufeerally-Fakim

Mycobacterium tuberculosis (Mtb) is the causative agent of tuberculosis (TB), an infectious disease that is a major killer worldwide. Due to selection pressure caused by the use of antibacterial drugs, Mtb is characterised by mutational events that have given rise to multidrug-resistant (MDR) and extensively drug-resistant (XDR) phenotypes. The rate at which mutations occur is an important factor in the study of molecular evolution, and it helps explain gene evolution. Within the same species, different protein-coding genes evolve at different rates. To estimate the rates of molecular evolution of protein-coding genes, a commonly used parameter is the ratio dN/dS, where dN is the rate of non-synonymous substitutions and dS is the rate of synonymous substitutions. Here, we determined the estimated rates of molecular evolution of select biological processes and molecular functions across 264 strains of Mtb. We also investigated the molecular evolutionary rates of core genes of Mtb by computing the dN/dS values, and estimated the pan genome of the 264 strains of Mtb. Our results show that the cellular amino acid metabolic process and the kinase activity function evolve at a significantly higher rate, while the carbohydrate metabolic process evolves at a significantly lower rate for M. tuberculosis. These high rates of evolution correlate well with Mtb physiology and pathogenicity. We further propose that the core genome of M. tuberculosis likely experiences varying rates of molecular evolution, which may drive an interplay between core genome and accessory genome during M. tuberculosis evolution.
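The dN/dS ratio defined above can be made concrete with a small Python sketch. The abstract does not say which estimator the authors used; the version below assumes a counting approach in the style of Nei and Gojobori with a Jukes-Cantor correction, and the function names and the substitution and site counts are hypothetical.

```python
import math


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor corrected distance for a proportion of differences p."""
    if not 0.0 <= p < 0.75:
        raise ValueError("proportion must be in [0, 0.75)")
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * p)


def dn_ds(nd: float, n_sites: float, sd: float, s_sites: float) -> float:
    """Estimate dN/dS from substitution counts and site counts.

    nd, sd: observed non-synonymous and synonymous differences.
    n_sites, s_sites: numbers of non-synonymous and synonymous sites.
    """
    dn = jukes_cantor(nd / n_sites)  # rate of non-synonymous substitutions
    ds = jukes_cantor(sd / s_sites)  # rate of synonymous substitutions
    if ds == 0.0:
        raise ZeroDivisionError("dS is zero; the ratio is undefined")
    return dn / ds


# Hypothetical counts for a single gene comparison.
omega = dn_ds(nd=5, n_sites=300, sd=10, s_sites=100)
```

Under the usual reading, a ratio below 1 suggests purifying selection, a ratio near 1 suggests neutral evolution, and a ratio above 1 suggests positive selection; the hypothetical counts here give a value below 1.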


Citations: 0

Draft Genome Sequence of Pantoea sp. Strain MHSD4, a Bacterial Endophyte With Bioremediation Potential.

IF 2.6 Zone 4 Biology Q4 EVOLUTIONARY BIOLOGY

Evolutionary Bioinformatics

Pub Date : 2024-03-13 eCollection Date: 2024-01-01 DOI: 10.1177/11769343231217908

Dimpho Michelle Morobane, Khuthadzo Tshishonga, Mahloro Hope Serepa-Dlamini

Pantoea sp. strain MHSD4 is a bacterial endophyte isolated from the leaves of the medicinal plant Pellaea calomelanos. Here, we report the draft whole-genome sequence and annotation of strain MHSD4. The draft genome of Pantoea sp. strain MHSD4 is 4 647 677 bp in size, with a G+C content of 54.2% and 41 contigs. The National Center for Biotechnology Information Prokaryotic Genome Annotation Pipeline tool predicted a total of 4395 genes, inclusive of 4235 protein-coding genes, 87 total RNA genes, 14 non-coding (nc) RNAs and 70 tRNAs, and 73 pseudogenes. Biosynthesis pathways for naphthalene and anthracene degradation were identified. Putative genes involved in bioremediation, such as copA, copD, cueO, cueR, glnGm, and trxC, were identified. Putative genes involved in copper homeostasis and tolerance were also identified, which may suggest that Pantoea sp. strain MHSD4 has biotechnological potential for bioremediation of heavy metals.
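Assembly summary figures such as total size, contig count, and G+C content can be derived directly from the contig sequences. The Python sketch below shows one way to do so; the function name `assembly_stats` and the toy contigs are hypothetical and are not taken from the MHSD4 assembly.

```python
def assembly_stats(contigs):
    """Return (total_bp, n_contigs, gc_percent) for a list of contig strings."""
    total_bp = sum(len(c) for c in contigs)
    if total_bp == 0:
        raise ValueError("no sequence data")
    # Count G and C bases case-insensitively across all contigs.
    gc = sum(c.upper().count("G") + c.upper().count("C") for c in contigs)
    return total_bp, len(contigs), 100.0 * gc / total_bp


# Toy contigs for illustration only.
stats = assembly_stats(["GGCC", "aatt"])
```

Applied to the real contigs of a draft genome, the same calculation would yield figures of the kind reported in the abstract (4 647 677 bp, 41 contigs, 54.2% G+C).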


Citations: 0