Pub Date : 2024-11-11eCollection Date: 2024-01-01DOI: 10.1177/11769343241298307
Israt Jahan Asha, Shipan Das Gupta, Md Murad Hossain, Md Nur Islam, Nurun Nahar Akter, Mohammed Mafizul Islam, Shuvo Chandra Das, Dhirendra Nath Barman
Objective: Neisseria meningitidis is an encapsulated, diplococcus, kidney bean-shaped bacteria that causes bacterial meningitis. Our study hopes to advance our understanding of disease progression, the spread frequency of the bacteria in people, and the interactions between the bacteria and human body by identifying a functional protein, potentially serving as a target for meningococcal medicine in the future.
Methods: A hypothetical protein HP (PBJ89160.1) from N.meningitidis was employed in this study for extensive structural and functional characterization. In the predictive functional role of HP, several constitutive bioinformatics approaches are applied, such as prediction of physiological properties, domain and motif family function, secondary and tertiary structure prediction, energy minimization, quality validation, docking, and ADMET analysis. To create the protein's three-dimensional (3D) structure, a template protein (PDB_ID: 3GXA) is used with 99% sequence identity by homology modeling technique with the HHpred server. To mitigate the pathogenicity associated with the HP function, it was docked with the natural ligand methionine and five other drug compounds like Verapamil, Loperamide, Thioridazine, Chlorpromazine, and Auranofine.
Results: The protein is predicted to be acidic, soluble and hydrophilic by physicochemical properties analysis. Subcellular localization analysis demonstrated the protein to be periplasmic. The HP has an ATP-binding cassette transporter (also known as ABC transporter) involved in uptake of methionine (MetQ) that creates nutritional virulence in host. Energy minimization, multiple quality assessments, and validation value determination led to the conclusion that the HP model had a workable and acceptable quality. Following ADMET analysis and binding affinity assessments from the docking studies, Loperamide emerged as the most promising therapeutic compound, effectively inhibiting the ATP transporter activity of the HP.
Conclusion: Comparative genomic analysis revealed that this protein is specific to N. meningitidis and has no homologs in human proteins, thereby identifying it as a potential target for therapeutic intervention.
目的:脑膜炎奈瑟菌(Neisseria meningitidis)是一种包囊双球菌,呈芸豆状,可引起细菌性脑膜炎。我们的研究希望通过鉴定一种功能性蛋白质,促进我们对疾病进展、细菌在人体内的传播频率以及细菌与人体之间相互作用的了解,从而有可能成为未来脑膜炎球菌药物的靶点:方法:本研究利用脑膜炎球菌的假定蛋白 HP(PBJ89160.1)进行了广泛的结构和功能表征。在预测 HP 的功能作用时,应用了多种构成性生物信息学方法,如生理特性预测、结构域和主题族功能预测、二级和三级结构预测、能量最小化、质量验证、对接和 ADMET 分析。为了创建蛋白质的三维(3D)结构,利用 HHpred 服务器的同源建模技术,使用了序列同一性为 99% 的模板蛋白质(PDB_ID:3GXA)。为了减轻与 HP 功能相关的致病性,该蛋白与天然配体蛋氨酸和其他五种药物化合物(如维拉帕米、洛哌丁胺、硫利达嗪、氯丙嗪和奥拉诺芬)进行了对接:通过理化性质分析,预测该蛋白质呈酸性、可溶性和亲水性。亚细胞定位分析表明该蛋白质具有围质粒性。HP 有一个 ATP 结合盒转运体(又称 ABC 转运体),参与摄取蛋氨酸(MetQ),从而在宿主体内产生营养毒力。通过能量最小化、多重质量评估和验证值确定,最终得出结论:HP 模型具有可行且可接受的质量。经过 ADMET 分析和对接研究的结合亲和力评估,洛哌丁胺成为最有希望的治疗化合物,它能有效抑制 HP 的 ATP 转运活性:比较基因组分析表明,这种蛋白质是脑膜炎奈瑟氏菌特有的,在人类蛋白质中没有同源物,因此被确定为潜在的治疗靶点。
{"title":"<i>In silico</i> Characterization of a Hypothetical Protein (PBJ89160.1) from <i>Neisseria meningitidis</i> Exhibits a New Insight on Nutritional Virulence and Molecular Docking to Uncover a Therapeutic Target.","authors":"Israt Jahan Asha, Shipan Das Gupta, Md Murad Hossain, Md Nur Islam, Nurun Nahar Akter, Mohammed Mafizul Islam, Shuvo Chandra Das, Dhirendra Nath Barman","doi":"10.1177/11769343241298307","DOIUrl":"https://doi.org/10.1177/11769343241298307","url":null,"abstract":"<p><strong>Objective: </strong><i>Neisseria meningitidis</i> is an encapsulated, diplococcus, kidney bean-shaped bacteria that causes bacterial meningitis. Our study hopes to advance our understanding of disease progression, the spread frequency of the bacteria in people, and the interactions between the bacteria and human body by identifying a functional protein, potentially serving as a target for meningococcal medicine in the future.</p><p><strong>Methods: </strong>A hypothetical protein HP (PBJ89160.1) from <i>N.</i> <i>meningitidis</i> was employed in this study for extensive structural and functional characterization. In the predictive functional role of HP, several constitutive bioinformatics approaches are applied, such as prediction of physiological properties, domain and motif family function, secondary and tertiary structure prediction, energy minimization, quality validation, docking, and ADMET analysis. To create the protein's three-dimensional (3D) structure, a template protein (PDB_ID: 3GXA) is used with 99% sequence identity by homology modeling technique with the HHpred server. To mitigate the pathogenicity associated with the HP function, it was docked with the natural ligand methionine and five other drug compounds like Verapamil, Loperamide, Thioridazine, Chlorpromazine, and Auranofine.</p><p><strong>Results: </strong>The protein is predicted to be acidic, soluble and hydrophilic by physicochemical properties analysis. Subcellular localization analysis demonstrated the protein to be periplasmic. The HP has an ATP-binding cassette transporter (also known as ABC transporter) involved in uptake of methionine (MetQ) that creates nutritional virulence in host. Energy minimization, multiple quality assessments, and validation value determination led to the conclusion that the HP model had a workable and acceptable quality. Following ADMET analysis and binding affinity assessments from the docking studies, Loperamide emerged as the most promising therapeutic compound, effectively inhibiting the ATP transporter activity of the HP.</p><p><strong>Conclusion: </strong>Comparative genomic analysis revealed that this protein is specific to <i>N. meningitidis</i> and has no homologs in human proteins, thereby identifying it as a potential target for therapeutic intervention.</p>","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"20 ","pages":"11769343241298307"},"PeriodicalIF":1.7,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11555745/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142631739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-04eCollection Date: 2024-01-01DOI: 10.1177/11769343241294153
Ming Zhang, Zhenzhen Yin
Background: Clostridium botulinum and Clostridium perfringens, 2 major foodborne pathogenic fusobacteria, have a variety of virulent protein types with nervous and enterotoxic pathogenic potential, respectively.
Objective: The relationship between the molecular evolution of the 2 Clostridium genomes and virulence proteins was studied via a bioinformatics prediction method. The genetic stability, main features of gene coding and structural characteristics of virulence proteins were compared and analyzed to reveal the phylogenetic characteristics, diversity, and distribution of virulence factors of foodborne Clostridium strains.
Methods: The phylogenetic analysis was performed via composition vector and average nucleotide identity based methods. Evolutionary distances of virulence genes relative to those of housekeeping genes were calculated via multilocus sequence analysis. Bioinformatics software and tools were used to predict and compare the main functional features of genes encoding virulence proteins, and the structures of virulence proteins were predicted and analyzed through homology modeling and a deep learning algorithm.
Results: According to the diversity of toxins, genome evolution tended to cluster based on the protein-coding virulence genes. The evolutionary transfer distances of virulence genes relative to those of housekeeping genes in C. botulinum strains were greater than those in C. perfringens strains, and BoNTs and alpha toxin proteins were located extracellularly. The BoNTs have highly similar structures, but BoNT/A/B and BoNT/E/F have significantly different conformations. The beta2 toxin monomer structure is similar to but simpler than the alpha toxin monomer structure, which has 2 mobile loops in the N-terminal domain. The C-terminal domain of the CPE trimer forms a "claudin-binding pocket" shape, which suggests biological relevance, such as in pore formation.
Conclusions: According to the genotype of protein-coding virulence genes, the evolution of Clostridium showed a clustering trend. The genetic stability, functional and structural characteristics of foodborne Clostridium virulence proteins reveal the taxonomy and diverse distribution of virulence factors.
{"title":"Comparative Phylogenetic Analysis and Protein Prediction Reveal the Taxonomy and Diverse Distribution of Virulence Factors in Foodborne <i>Clostridium</i> Strains.","authors":"Ming Zhang, Zhenzhen Yin","doi":"10.1177/11769343241294153","DOIUrl":"10.1177/11769343241294153","url":null,"abstract":"<p><strong>Background: </strong><i>Clostridium botulinum</i> and <i>Clostridium perfringens</i>, 2 major foodborne pathogenic fusobacteria, have a variety of virulent protein types with nervous and enterotoxic pathogenic potential, respectively.</p><p><strong>Objective: </strong>The relationship between the molecular evolution of the 2 <i>Clostridium</i> genomes and virulence proteins was studied via a bioinformatics prediction method. The genetic stability, main features of gene coding and structural characteristics of virulence proteins were compared and analyzed to reveal the phylogenetic characteristics, diversity, and distribution of virulence factors of foodborne <i>Clostridium</i> strains.</p><p><strong>Methods: </strong>The phylogenetic analysis was performed via composition vector and average nucleotide identity based methods. Evolutionary distances of virulence genes relative to those of housekeeping genes were calculated via multilocus sequence analysis. Bioinformatics software and tools were used to predict and compare the main functional features of genes encoding virulence proteins, and the structures of virulence proteins were predicted and analyzed through homology modeling and a deep learning algorithm.</p><p><strong>Results: </strong>According to the diversity of toxins, genome evolution tended to cluster based on the protein-coding virulence genes. The evolutionary transfer distances of virulence genes relative to those of housekeeping genes in <i>C. botulinum</i> strains were greater than those in <i>C. perfringens</i> strains, and BoNTs and alpha toxin proteins were located extracellularly. The BoNTs have highly similar structures, but BoNT/A/B and BoNT/E/F have significantly different conformations. The beta2 toxin monomer structure is similar to but simpler than the alpha toxin monomer structure, which has 2 mobile loops in the N-terminal domain. The C-terminal domain of the CPE trimer forms a \"claudin-binding pocket\" shape, which suggests biological relevance, such as in pore formation.</p><p><strong>Conclusions: </strong>According to the genotype of protein-coding virulence genes, the evolution of <i>Clostridium</i> showed a clustering trend. The genetic stability, functional and structural characteristics of foodborne <i>Clostridium</i> virulence proteins reveal the taxonomy and diverse distribution of virulence factors.</p>","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"20 ","pages":"11769343241294153"},"PeriodicalIF":1.7,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11536399/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142583424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-21eCollection Date: 2024-01-01DOI: 10.1177/11769343241292224
Dan-Hua Chu, Ji-Yong An, Xiao-Mei Nie
Introduction: Predicting Self-interacting proteins (SIPs) is a crucial area of research in predicting protein functions, as well as in understanding gene-disease and disease-drug associations. These interactions are integral to numerous cellular processes and play pivotal roles within cells. However, traditional methods for identifying SIPs through biological experiments are often expensive, time-consuming, and have long cycles. Therefore, the development of effective computational methods for accurately predicting SIPs is not only necessary but also presents a significant challenge.
Results: In this research, we introduce a novel computational prediction technique, VGGNGLCM, which leverages protein sequence data. This method integrates the VGGNet deep convolutional neural network (VGGN) with the Gray-Level Co-occurrence Matrix (GLCM) to detect Self-interacting proteins associations. Specifically, we initially utilized Position Specific Scoring Matrix (PSSM) to capture protein evolutionary information and integrated key features from PSSM using GLCM. We then employed VGGNet as a predictive classifier, leveraging its capabilities for powerful learning and classification prediction. Subsequently, the extracted features were input into the VGGNet deep convolutional neural network to identify Self-interacting proteins. To evaluate the performance of the VGGNGLCM model, we conducted experiments using yeast and human datasets, achieving average accuracies of 95.68% and 97.72% respectively. Additionally, we compared the prediction performance of the VGGNet classifier with that of the Convolutional Neural Network (CNN) and the state-of-the-art Support Vector Machine (SVM) using the same feature extraction method. We also compared the prediction ability of VGGNGLCM with other existing approaches. The comparison results further demonstrate the superior performance of VGGNGLCM over other prediction models in this domain.
Conclusion: The experimental verification further strengthens the evidence that VGGNGLCM is effective and robust compared to existing methods. It also highlights the high accuracy and robustness of the VGGNGLCM model in predicting Self-interacting proteins (SIPs). Consequently, we believe that the VGGNGLCM method serves as a valuable computational tool and can catalyze extensive bioinformatics research related to SIPs prediction.
{"title":"An Effective Computational Method for Predicting Self-Interacting Proteins Based on VGGNet Convolutional Neural Network and Gray-Level Co-occurrence Matrix.","authors":"Dan-Hua Chu, Ji-Yong An, Xiao-Mei Nie","doi":"10.1177/11769343241292224","DOIUrl":"10.1177/11769343241292224","url":null,"abstract":"<p><strong>Introduction: </strong>Predicting Self-interacting proteins (SIPs) is a crucial area of research in predicting protein functions, as well as in understanding gene-disease and disease-drug associations. These interactions are integral to numerous cellular processes and play pivotal roles within cells. However, traditional methods for identifying SIPs through biological experiments are often expensive, time-consuming, and have long cycles. Therefore, the development of effective computational methods for accurately predicting SIPs is not only necessary but also presents a significant challenge.</p><p><strong>Results: </strong>In this research, we introduce a novel computational prediction technique, VGGNGLCM, which leverages protein sequence data. This method integrates the VGGNet deep convolutional neural network (VGGN) with the Gray-Level Co-occurrence Matrix (GLCM) to detect Self-interacting proteins associations. Specifically, we initially utilized Position Specific Scoring Matrix (PSSM) to capture protein evolutionary information and integrated key features from PSSM using GLCM. We then employed VGGNet as a predictive classifier, leveraging its capabilities for powerful learning and classification prediction. Subsequently, the extracted features were input into the VGGNet deep convolutional neural network to identify Self-interacting proteins. To evaluate the performance of the VGGNGLCM model, we conducted experiments using yeast and human datasets, achieving average accuracies of 95.68% and 97.72% respectively. Additionally, we compared the prediction performance of the VGGNet classifier with that of the Convolutional Neural Network (CNN) and the state-of-the-art Support Vector Machine (SVM) using the same feature extraction method. We also compared the prediction ability of VGGNGLCM with other existing approaches. The comparison results further demonstrate the superior performance of VGGNGLCM over other prediction models in this domain.</p><p><strong>Conclusion: </strong>The experimental verification further strengthens the evidence that VGGNGLCM is effective and robust compared to existing methods. It also highlights the high accuracy and robustness of the VGGNGLCM model in predicting Self-interacting proteins (SIPs). Consequently, we believe that the VGGNGLCM method serves as a valuable computational tool and can catalyze extensive bioinformatics research related to SIPs prediction.</p>","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"20 ","pages":"11769343241292224"},"PeriodicalIF":1.7,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11503870/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142512188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-12eCollection Date: 2024-01-01DOI: 10.1177/11769343241290461
Beian Lin, Jian Zhang, Mengting Chen, Xinyue Gao, Jiaxin Wen, Kun Tian, Yajiao Wu, Zekai Chen, Qiaomei Yang, An Zhu, Chunhong Du
Objective: To explore different mRNA transcriptome patterns and RNA N6-methyladenosine (m6A) alteration in yunaconitine (YA)-treated HT22 mouse hippocampal neuron, and uncover the role of abnormal mRNA expression and RNA m6A modification in YA-induced neurotoxicity.
Methods: HT22 cells were treated with 0, 5, 10, and 50 μM of YA for 72 h to evaluate their viability and GSH content. Subsequently, mRNA-seq and MeRIP-seq analyses were performed on HT22 cells treated with 0 and 10 μM YA for 72 h, and molecular docking was used to simulate interactions between YA and differentially expressed m6A regulators. The mitochondrial membrane potential was examined using the JC-10 probe, and RT-qPCR was conducted to verify the expression levels of differentially expressed m6A regulatory factors, as well as to assess alterations in the mRNA expression levels of antioxidant genes.
Results: YA treatment significantly reduced the viability of HT22 cells and decreased GSH content. The mRNA-seq analysis obtained 1018 differentially expressed genes, KEGG and GO enrichment results of differentially expressed genes mainly comprise the nervous system development, cholinergic synapse, response to oxidative stress, and mitochondrial inner membrane. A total of 7 differentially expressed m6A regulators were identified by MeRIP-seq. Notably, molecular docking results suggested a stable interaction between YA and most of the differentially expressed m6A regulators.
Conclusion: This study showed that YA-induced HT22 cell damage was associated with the increased methylation modification level of target gene m6A and abnormal expression of m6A regulators.
{"title":"Comprehensive Profiling of Transcriptome and m6A Epitranscriptome Uncovers the Neurotoxic Effects of Yunaconitine on HT22 Cells.","authors":"Beian Lin, Jian Zhang, Mengting Chen, Xinyue Gao, Jiaxin Wen, Kun Tian, Yajiao Wu, Zekai Chen, Qiaomei Yang, An Zhu, Chunhong Du","doi":"10.1177/11769343241290461","DOIUrl":"10.1177/11769343241290461","url":null,"abstract":"<p><strong>Objective: </strong>To explore different mRNA transcriptome patterns and RNA N6-methyladenosine (m6A) alteration in yunaconitine (YA)-treated HT22 mouse hippocampal neuron, and uncover the role of abnormal mRNA expression and RNA m6A modification in YA-induced neurotoxicity.</p><p><strong>Methods: </strong>HT22 cells were treated with 0, 5, 10, and 50 μM of YA for 72 h to evaluate their viability and GSH content. Subsequently, mRNA-seq and MeRIP-seq analyses were performed on HT22 cells treated with 0 and 10 μM YA for 72 h, and molecular docking was used to simulate interactions between YA and differentially expressed m6A regulators. The mitochondrial membrane potential was examined using the JC-10 probe, and RT-qPCR was conducted to verify the expression levels of differentially expressed m6A regulatory factors, as well as to assess alterations in the mRNA expression levels of antioxidant genes.</p><p><strong>Results: </strong>YA treatment significantly reduced the viability of HT22 cells and decreased GSH content. The mRNA-seq analysis obtained 1018 differentially expressed genes, KEGG and GO enrichment results of differentially expressed genes mainly comprise the nervous system development, cholinergic synapse, response to oxidative stress, and mitochondrial inner membrane. A total of 7 differentially expressed m6A regulators were identified by MeRIP-seq. Notably, molecular docking results suggested a stable interaction between YA and most of the differentially expressed m6A regulators.</p><p><strong>Conclusion: </strong>This study showed that YA-induced HT22 cell damage was associated with the increased methylation modification level of target gene m6A and abnormal expression of m6A regulators.</p>","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"20 ","pages":"11769343241290461"},"PeriodicalIF":1.7,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11526304/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142559292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-14DOI: 10.1177/11769343241272414
Nam Anh Dao, Manh Hung Le, Xuan Tho Dang
The identification of potential interactions and relationships between diseases and drugs is significant in public health care and drug discovery. As we all know, experimenting to determine the drug-disease interactions is very expensive in both time and money. However, there are still many drug-disease associations that are still undiscovered and potential. Therefore, the development of computational methods to explore the relationship between drugs and diseases is very important and essential. Many computational methods for predicting drug-disease associations have been developed based on known interactions to learn potential interactions of unknown drug-disease pairs. In this paper, we propose 3 new main groups of meta-paths based on the heterogeneous biological network of drug-protein-disease objects. For each meta-path, we design a machine learning model, then an integrated learning method is formed by these models. We evaluated our approach on 3 standard datasets which are DrugBank, OMIM, and Gottlieb’s dataset. Experimental results demonstrate that the proposed method is better than some recent methods such as EMP-SVD, LRSSL, MBiRW, MPG-DDA, SCMFDD,. . . in some measures such as AUC, AUPR, and F1-score.
确定疾病与药物之间潜在的相互作用和关系对于公共医疗保健和药物研发意义重大。众所周知,通过实验来确定药物与疾病之间的相互作用在时间和金钱上都非常昂贵。然而,仍有许多药物与疾病之间的关联尚未被发现,而且潜力巨大。因此,开发计算方法来探索药物与疾病之间的关系是非常重要和必要的。许多预测药物-疾病关联的计算方法都是基于已知的相互作用来学习未知药物-疾病配对的潜在相互作用。在本文中,我们基于药物-蛋白质-疾病对象的异构生物网络,提出了 3 组新的元路径。我们为每个元路径设计了一个机器学习模型,然后由这些模型组成了一个集成学习方法。我们在 DrugBank、OMIM 和 Gottlieb 数据集这三个标准数据集上评估了我们的方法。实验结果表明,所提出的方法在一些指标(如 AUC 值)上优于最近的一些方法,如 EMP-SVD、LRSSL、MBiRW、MPG-DDA、SCMFDD......。.在 AUC、AUPR 和 F1 分数等一些指标上更胜一筹。
{"title":"Label Transfer for Drug Disease Association in Three Meta-Paths","authors":"Nam Anh Dao, Manh Hung Le, Xuan Tho Dang","doi":"10.1177/11769343241272414","DOIUrl":"https://doi.org/10.1177/11769343241272414","url":null,"abstract":"The identification of potential interactions and relationships between diseases and drugs is significant in public health care and drug discovery. As we all know, experimenting to determine the drug-disease interactions is very expensive in both time and money. However, there are still many drug-disease associations that are still undiscovered and potential. Therefore, the development of computational methods to explore the relationship between drugs and diseases is very important and essential. Many computational methods for predicting drug-disease associations have been developed based on known interactions to learn potential interactions of unknown drug-disease pairs. In this paper, we propose 3 new main groups of meta-paths based on the heterogeneous biological network of drug-protein-disease objects. For each meta-path, we design a machine learning model, then an integrated learning method is formed by these models. We evaluated our approach on 3 standard datasets which are DrugBank, OMIM, and Gottlieb’s dataset. Experimental results demonstrate that the proposed method is better than some recent methods such as EMP-SVD, LRSSL, MBiRW, MPG-DDA, SCMFDD,. . . in some measures such as AUC, AUPR, and F1-score.","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"23 1","pages":""},"PeriodicalIF":2.6,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142254010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The recombination plays a key role in promoting evolution of RNA viruses and emergence of potentially epidemic variants. Some studies investigated the recombination occurrence among SARS-CoV-2, without exploring its impact on virus-host interaction. In the aim to investigate the burden of recombination in terms of frequency and distribution, the occurrence of recombination was first explored in 44 230 Omicron sequences among BQ subvariants and the under investigation "ML" (Multiple Lineages) denoted sequences, using 3seq software. Second, the recombination impact on interaction between the Spike protein and ACE2 receptor as well as neutralizing antibodies (nAbs), was analyzed using docking tools. Recombination was detected in 56.91% and 82.20% of BQ and ML strains, respectively. It took place mainly in spike and ORF1a genes. For BQ recombinant strains, the docking analysis showed that the spike interacted strongly with ACE2 and weakly with nAbs. The mutations S373P, S375F and T376A constitute a residue network that enhances the RBD interaction with ACE2. Thirteen mutations in RBD (S373P, S375F, T376A, D405N, R408S, K417N, N440K, S477N, P494S, Q498R, N501Y, and Y505H) and NTD (Y240H) seem to be implicated in immune evasion of recombinants by altering spike interaction with nAbs. In conclusion, this "in silico" study demonstrated that the recombination mechanism is frequent among Omicron BQ and ML variants. It highlights new key mutations, that potentially implicated in enhancement of spike binding to ACE2 (F376A) and escape from nAbs (RBD: F376A, D405N, R408S, N440K, S477N, P494S, and Y505H; NTD: Y240H). Our findings present considerable insights for the elaboration of effective prophylaxis and therapeutic strategies against future SARS-CoV-2 waves.
{"title":"Recombination Events Among SARS-CoV-2 Omicron Subvariants: Impact on Spike Interaction With ACE2 Receptor and Neutralizing Antibodies.","authors":"Marwa Arbi, Marwa Khedhiri, Kaouther Ayouni, Oussema Souiai, Samar Dhouib, Nidhal Ghanmi, Alia Benkahla, Henda Triki, Sondes Haddad-Boubaker","doi":"10.1177/11769343241272415","DOIUrl":"10.1177/11769343241272415","url":null,"abstract":"<p><p>The recombination plays a key role in promoting evolution of RNA viruses and emergence of potentially epidemic variants. Some studies investigated the recombination occurrence among SARS-CoV-2, without exploring its impact on virus-host interaction. In the aim to investigate the burden of recombination in terms of frequency and distribution, the occurrence of recombination was first explored in 44 230 Omicron sequences among BQ subvariants and the under investigation \"ML\" (Multiple Lineages) denoted sequences, using 3seq software. Second, the recombination impact on interaction between the Spike protein and ACE2 receptor as well as neutralizing antibodies (nAbs), was analyzed using docking tools. Recombination was detected in 56.91% and 82.20% of BQ and ML strains, respectively. It took place mainly in spike and ORF1a genes. For BQ recombinant strains, the docking analysis showed that the spike interacted strongly with ACE2 and weakly with nAbs. The mutations S373P, S375F and T376A constitute a residue network that enhances the RBD interaction with ACE2. Thirteen mutations in RBD (S373P, S375F, T376A, D405N, R408S, K417N, N440K, S477N, P494S, Q498R, N501Y, and Y505H) and NTD (Y240H) seem to be implicated in immune evasion of recombinants by altering spike interaction with nAbs. In conclusion, this \"in silico\" study demonstrated that the recombination mechanism is frequent among Omicron BQ and ML variants. It highlights new key mutations, that potentially implicated in enhancement of spike binding to ACE2 (F376A) and escape from nAbs (RBD: F376A, D405N, R408S, N440K, S477N, P494S, and Y505H; NTD: Y240H). Our findings present considerable insights for the elaboration of effective prophylaxis and therapeutic strategies against future SARS-CoV-2 waves.</p>","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"20 ","pages":"11769343241272415"},"PeriodicalIF":1.7,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11325312/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141989369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-14eCollection Date: 2024-01-01DOI: 10.1177/11769343241272413
Yili Luo, Jianpeng Liu, Wangqiang Feng, Da Lin, Mengji Chen, Haihua Zheng
Background: Age-related Macular Degeneration (AMD) poses a growing global health concern as the leading cause of central vision loss in elderly people.
Objection: This study focuses on unraveling the intricate involvement of Natural Killer (NK) cells in AMD, shedding light on their immune responses and cytokine regulatory roles.
Methods: Transcriptomic data from the Gene Expression Omnibus database were utilized, employing single-cell RNA-seq analysis. High-dimensional weighted gene co-expression network analysis (hdWGCNA) and single-cell regulatory network inference and clustering (SCENIC) analysis were applied to reveal the regulatory mechanisms of NK cells in early-stage AMD patients. Machine learning models, such as random forests and decision trees, were employed to screen hub genes and key transcription factors (TFs) associated with AMD.
Results: Distinct cell clusters were identified in the present study, especially the T/NK cluster, with a notable increase in NK cell abundance observed in AMD. Cell-cell communication analyses revealed altered interactions, particularly in NK cells, indicating their potential role in AMD pathogenesis. HdWGCNA highlighted the turquoise module, enriched in inflammation-related pathways, as significantly associated with AMD in NK cells. The SCENIC analysis identified key TFs in NK cell regulatory networks. The integration of hub genes and TFs identified CREM, FOXP1, IRF1, NFKB2, and USF2 as potential predictors for AMD through machine learning.
Conclusion: This comprehensive approach enhances our understanding of NK cell dynamics, signaling alterations, and potential predictive models for AMD. The identified TFs provide new avenues for molecular interventions and highlight the intricate relationship between NK cells and AMD pathogenesis. Overall, this study contributes valuable insights for advancing our understanding and management of AMD.
背景:年龄相关性黄斑变性(AMD)是导致老年人中心视力丧失的主要原因,已成为全球日益关注的健康问题:本研究的重点是揭示自然杀伤细胞(NK)在AMD中的复杂参与,阐明其免疫反应和细胞因子的调控作用:方法:利用单细胞RNA-seq分析基因表达总库(Gene Expression Omnibus)的转录组数据。应用高维加权基因共表达网络分析(hdWGCNA)和单细胞调控网络推断与聚类分析(SCENIC)揭示早期AMD患者NK细胞的调控机制。采用随机森林和决策树等机器学习模型筛选与AMD相关的枢纽基因和关键转录因子(TFs):结果:本研究发现了不同的细胞群,尤其是T/NK细胞群,观察到AMD患者的NK细胞数量明显增加。细胞-细胞通讯分析表明,细胞间的相互作用发生了改变,特别是在NK细胞中,这表明它们在AMD发病机制中的潜在作用。HdWGCNA突出显示了绿松石模块,该模块富含炎症相关通路,与NK细胞中的AMD显著相关。SCENIC 分析确定了 NK 细胞调控网络中的关键 TFs。通过机器学习,整合枢纽基因和TFs确定了CREM、FOXP1、IRF1、NFKB2和USF2是AMD的潜在预测因子:这一综合方法增强了我们对 NK 细胞动态、信号改变和 AMD 潜在预测模型的了解。鉴定出的TFs为分子干预提供了新途径,并凸显了NK细胞与AMD发病机制之间错综复杂的关系。总之,这项研究为促进我们对 AMD 的了解和管理提供了宝贵的见解。
{"title":"Single-cell RNA Sequencing Identifies Natural Kill Cell-Related Transcription Factors Associated With Age-Related Macular Degeneration.","authors":"Yili Luo, Jianpeng Liu, Wangqiang Feng, Da Lin, Mengji Chen, Haihua Zheng","doi":"10.1177/11769343241272413","DOIUrl":"10.1177/11769343241272413","url":null,"abstract":"<p><strong>Background: </strong>Age-related Macular Degeneration (AMD) poses a growing global health concern as the leading cause of central vision loss in elderly people.</p><p><strong>Objection: </strong>This study focuses on unraveling the intricate involvement of Natural Killer (NK) cells in AMD, shedding light on their immune responses and cytokine regulatory roles.</p><p><strong>Methods: </strong>Transcriptomic data from the Gene Expression Omnibus database were utilized, employing single-cell RNA-seq analysis. High-dimensional weighted gene co-expression network analysis (hdWGCNA) and single-cell regulatory network inference and clustering (SCENIC) analysis were applied to reveal the regulatory mechanisms of NK cells in early-stage AMD patients. Machine learning models, such as random forests and decision trees, were employed to screen hub genes and key transcription factors (TFs) associated with AMD.</p><p><strong>Results: </strong>Distinct cell clusters were identified in the present study, especially the T/NK cluster, with a notable increase in NK cell abundance observed in AMD. Cell-cell communication analyses revealed altered interactions, particularly in NK cells, indicating their potential role in AMD pathogenesis. HdWGCNA highlighted the turquoise module, enriched in inflammation-related pathways, as significantly associated with AMD in NK cells. The SCENIC analysis identified key TFs in NK cell regulatory networks. The integration of hub genes and TFs identified <i>CREM, FOXP1, IRF1, NFKB2</i>, and <i>USF2</i> as potential predictors for AMD through machine learning.</p><p><strong>Conclusion: </strong>This comprehensive approach enhances our understanding of NK cell dynamics, signaling alterations, and potential predictive models for AMD. The identified TFs provide new avenues for molecular interventions and highlight the intricate relationship between NK cells and AMD pathogenesis. Overall, this study contributes valuable insights for advancing our understanding and management of AMD.</p>","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"20 ","pages":"11769343241272413"},"PeriodicalIF":1.7,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11325330/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141989370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-24eCollection Date: 2024-01-01DOI: 10.1177/11769343241263230
Arthur Casulli de Oliveira, Luiz Augusto Bovolenta, Lucas Figueiredo, Amanda De Oliveira Ribeiro, Beatriz Jacinto Alves Pereira, Talita Roberto Aleixo de Almeida, Vinicius Farias Campos, James G Patton, Danillo Pinhal
In metazoans, microRNAs (miRNAs) are essential regulators of gene expression, affecting critical cellular processes from differentiation and proliferation, to homeostasis. During miRNA biogenesis, the miRNA strand that loads onto the RNA-induced Silencing Complex (RISC) can vary, leading to changes in gene targeting and modulation of biological pathways. To investigate the impact of these "arm switching" events on gene regulation, we analyzed a diverse range of tissues and developmental stages in zebrafish by comparing 5p and 3p arms accumulation dynamics between embryonic developmental stages, adult tissues, and sexes. We also compared variable arm usage patterns observed in zebrafish to other vertebrates including arm switching data from fish, birds, and mammals. Our comprehensive analysis revealed that variable arm usage events predominantly take place during embryonic development. It is also noteworthy that isomiR occurrence correlates to changes in arm selection evidencing an important role of microRNA distinct isoforms in reinforcing and modifying gene regulation by promoting dynamics switches on miRNA 5p and 3p arms accumulation. Our results shed new light on the emergence and coordination of gene expression regulation and pave the way for future investigations in this field.
{"title":"MicroRNA Transcriptomes Reveal Prevalence of Rare and Species-Specific Arm Switching Events During Zebrafish Ontogenesis.","authors":"Arthur Casulli de Oliveira, Luiz Augusto Bovolenta, Lucas Figueiredo, Amanda De Oliveira Ribeiro, Beatriz Jacinto Alves Pereira, Talita Roberto Aleixo de Almeida, Vinicius Farias Campos, James G Patton, Danillo Pinhal","doi":"10.1177/11769343241263230","DOIUrl":"10.1177/11769343241263230","url":null,"abstract":"<p><p>In metazoans, microRNAs (miRNAs) are essential regulators of gene expression, affecting critical cellular processes from differentiation and proliferation, to homeostasis. During miRNA biogenesis, the miRNA strand that loads onto the RNA-induced Silencing Complex (RISC) can vary, leading to changes in gene targeting and modulation of biological pathways. To investigate the impact of these \"arm switching\" events on gene regulation, we analyzed a diverse range of tissues and developmental stages in zebrafish by comparing 5p and 3p arms accumulation dynamics between embryonic developmental stages, adult tissues, and sexes. We also compared variable arm usage patterns observed in zebrafish to other vertebrates including arm switching data from fish, birds, and mammals. Our comprehensive analysis revealed that variable arm usage events predominantly take place during embryonic development. It is also noteworthy that isomiR occurrence correlates to changes in arm selection evidencing an important role of microRNA distinct isoforms in reinforcing and modifying gene regulation by promoting dynamics switches on miRNA 5p and 3p arms accumulation. Our results shed new light on the emergence and coordination of gene expression regulation and pave the way for future investigations in this field.</p>","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"20 ","pages":"11769343241263230"},"PeriodicalIF":1.7,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11271096/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141762332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-14eCollection Date: 2024-01-01DOI: 10.1177/11769343241261814
Linrong Wan, Siyuan Su, Jinyun Liu, Bangxing Zou, Yaming Jiang, Beibei Jiao, Shaokuan Tang, Youhong Zhang, Cao Deng, Wenfu Xiao
Background: Pseudogenes are sequences that have lost the ability to transcribe RNA molecules or encode truncated but possibly functional proteins. While they were once considered to be meaningless remnants of evolution, recent researches have shown that pseudogenes play important roles in various biological processes. However, the studies of pseudogenes in the silkworm, an important model organism, are limited and have focused on single or only a few specific genes.
Objective: To fill these gaps, we present a systematic genome-wide studies of pseudogenes in the silkworm.
Methods: We identified the pseudogenes in the silkworm using the silkworm genome assemblies, transcriptome, protein sequences from silkworm and its related species. Then we used transcriptome datasets from 832 RNA-seq analyses to construct spatio-temporal expression profiles for these pseudogenes. Additionally, we identified tissue-specifically expressed and differentially expressed pseudogenes to further understand their characteristics. Finally, the functional roles of pseudogenes as lncRNAs were systematically analyzed.
Results: We identified a total of 4410 pseudogenes, which were grouped into 4 groups, including duplications (DUPs), unitary pseudogenes (Unitary), processed pseudogenes (retropseudogenes, RETs), and fragments (FRAGs). The most of pseudogenes in the domestic silkworm were generated before the divergence of wild and domestic silkworm, however, the domestication may also involve in the accumulation of pseudogenes. These pseudogenes were clearly divided into 2 cluster, a highly expressed and a lowly expressed, and the posterior silk gland was the tissue with the most tissue-specific pseudogenes (199), implying these pseudogenes may be involved in the development and function of silkgland. We identified 3299 lncRNAs in these pseudogenes, and the target genes of these lncRNAs in silkworm pseudogenes were enriched in the egg formation and olfactory function.
Conclusions: This study replenishes the genome annotations for silkworm, provide valuable insights into the biological roles of pseudogenes. It will also contribute to our understanding of the complex gene regulatory networks in the silkworm and will potentially have implications for other organisms as well.
{"title":"The Spatio-Temporal Expression Profiles of Silkworm Pseudogenes Provide Valuable Insights into Their Biological Roles.","authors":"Linrong Wan, Siyuan Su, Jinyun Liu, Bangxing Zou, Yaming Jiang, Beibei Jiao, Shaokuan Tang, Youhong Zhang, Cao Deng, Wenfu Xiao","doi":"10.1177/11769343241261814","DOIUrl":"10.1177/11769343241261814","url":null,"abstract":"<p><strong>Background: </strong>Pseudogenes are sequences that have lost the ability to transcribe RNA molecules or encode truncated but possibly functional proteins. While they were once considered to be meaningless remnants of evolution, recent researches have shown that pseudogenes play important roles in various biological processes. However, the studies of pseudogenes in the silkworm, an important model organism, are limited and have focused on single or only a few specific genes.</p><p><strong>Objective: </strong>To fill these gaps, we present a systematic genome-wide studies of pseudogenes in the silkworm.</p><p><strong>Methods: </strong>We identified the pseudogenes in the silkworm using the silkworm genome assemblies, transcriptome, protein sequences from silkworm and its related species. Then we used transcriptome datasets from 832 RNA-seq analyses to construct spatio-temporal expression profiles for these pseudogenes. Additionally, we identified tissue-specifically expressed and differentially expressed pseudogenes to further understand their characteristics. Finally, the functional roles of pseudogenes as lncRNAs were systematically analyzed.</p><p><strong>Results: </strong>We identified a total of 4410 pseudogenes, which were grouped into 4 groups, including duplications (DUPs), unitary pseudogenes (Unitary), processed pseudogenes (retropseudogenes, RETs), and fragments (FRAGs). The most of pseudogenes in the domestic silkworm were generated before the divergence of wild and domestic silkworm, however, the domestication may also involve in the accumulation of pseudogenes. These pseudogenes were clearly divided into 2 cluster, a highly expressed and a lowly expressed, and the posterior silk gland was the tissue with the most tissue-specific pseudogenes (199), implying these pseudogenes may be involved in the development and function of silkgland. We identified 3299 lncRNAs in these pseudogenes, and the target genes of these lncRNAs in silkworm pseudogenes were enriched in the egg formation and olfactory function.</p><p><strong>Conclusions: </strong>This study replenishes the genome annotations for silkworm, provide valuable insights into the biological roles of pseudogenes. It will also contribute to our understanding of the complex gene regulatory networks in the silkworm and will potentially have implications for other organisms as well.</p>","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"20 ","pages":"11769343241261814"},"PeriodicalIF":2.6,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11179516/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141332419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-30DOI: 10.1177/11769343241257344
Chao He, Bin Zhu, Wenwen Gao, Qianjin Wu, Changshui Zhang
In diploid organisms, half of the chromosomes in each cell come from the father and half from the mother. Through previous studies, it was found that the paternal chromosome and the maternal chromosome can be regulated and expressed independently, leading to the emergence of allele specific expression (ASE). In this study, we analyzed the differential expression of alleles in the high-altitude population and the normal population based on the RNA sequencing data. Through gene cluster analysis and protein interaction network analysis, we found some changes occurred at the gene level, and some negative effects. During the study, we realized that the calmodulin homology domain may have a certain correlation with long-term survival at high altitude. The plateau environment is characterized by hypoxia, low air pressure, strong ultraviolet radiation, and low temperature. Accordingly, the genetic changes in the process of adaptation are mainly reflected in these characteristics. High altitude generation living is also highly related to cancer, immune disease, cardiovascular disease, neurological disease, endocrine disease, and other diseases. Therefore, the medical system in high altitude areas should pay more attention to these diseases.
{"title":"Study on Allele Specific Expression of Long-Term Residents in High Altitude Areas","authors":"Chao He, Bin Zhu, Wenwen Gao, Qianjin Wu, Changshui Zhang","doi":"10.1177/11769343241257344","DOIUrl":"https://doi.org/10.1177/11769343241257344","url":null,"abstract":"In diploid organisms, half of the chromosomes in each cell come from the father and half from the mother. Through previous studies, it was found that the paternal chromosome and the maternal chromosome can be regulated and expressed independently, leading to the emergence of allele specific expression (ASE). In this study, we analyzed the differential expression of alleles in the high-altitude population and the normal population based on the RNA sequencing data. Through gene cluster analysis and protein interaction network analysis, we found some changes occurred at the gene level, and some negative effects. During the study, we realized that the calmodulin homology domain may have a certain correlation with long-term survival at high altitude. The plateau environment is characterized by hypoxia, low air pressure, strong ultraviolet radiation, and low temperature. Accordingly, the genetic changes in the process of adaptation are mainly reflected in these characteristics. High altitude generation living is also highly related to cancer, immune disease, cardiovascular disease, neurological disease, endocrine disease, and other diseases. Therefore, the medical system in high altitude areas should pay more attention to these diseases.","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"80 1","pages":""},"PeriodicalIF":2.6,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141190451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}