Ayan Mukherjee , Premanshu Dandapat , Molla Zakirul Haque , Sangeeta Mandal , Partha Sarathi Jana , Sukhen Samanta , Santanu Pal , Arun Kumar Das , Pramod Kumar Nanda , Samiran Bandyopadhyay , Chanchal Guha
{"title":"Computational analysis of hypothetical proteins from Mycobacterium orygis identifies proteins with therapeutic and diagnostic potentials","authors":"Ayan Mukherjee , Premanshu Dandapat , Molla Zakirul Haque , Sangeeta Mandal , Partha Sarathi Jana , Sukhen Samanta , Santanu Pal , Arun Kumar Das , Pramod Kumar Nanda , Samiran Bandyopadhyay , Chanchal Guha","doi":"10.1016/j.angen.2023.200154","DOIUrl":null,"url":null,"abstract":"<div><p>The genome of <span><em>Mycobacterium</em><em> orygis</em></span> encodes 3767 proteins that includes 442 functionally uncharacterized proteins, known as hypothetical proteins (HPs). These functionally unknown proteins may play significant biological role in bacterium's survivability, virulence <em>etc.</em>, hence functional annotation of these HPs with bioinformatics techniques might shed light on these aspects. So, the present study aimed to assign functions to those HPs in <em>M. orygis</em> genome. With the help of an extensive <em>in silico</em><span><span> workflow involving several bioinformatic tools, physicochemical characteristics, protein-protein interaction, sub-cellular localization, functional classification, and antigenicity of the HPs were analyzed. Out of 4039 genes and 3767 encoded proteins, 422 were annotated as “hypothetical protein, unknown function” in NCBI database. Out of 422 HPs, the FASTA sequences of 400 proteins, available in the NCBI database, were extracted. Preliminarily 400 proteins were screened for presence of conserved domain by different bioinformatic tools and 32 shortlisted proteins were further analyzed. The analysis revealed that these proteins may be involved in enzymatic, toxin-antitoxin system, cellular and other processes. Virulence and antigenicity of the complete set of HPs were also predicted. Based on antigenicity scores and sub-cellular localization, four proteins were selected for secondary and </span>tertiary structure<span> modelling. Molecular docking and MD-simulation of four proteins with TLR1 showed stable binding. B-cell and T-cell epitopes were identified for two proteins (QOY47331.1 and QOY49361.1) with highest antigenicity scores. The result of the present study may facilitate better understanding of virulence, pathogenesis and therapeutic target. The information about the structures of the HPs with highest antigenicity score could aid in structure-based inhibitor design and devising novel diagnostic assays.</span></span></p></div>","PeriodicalId":7893,"journal":{"name":"Animal Gene","volume":"29 ","pages":"Article 200154"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Animal Gene","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352406523000106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0
Abstract
The genome of Mycobacterium orygis encodes 3767 proteins that includes 442 functionally uncharacterized proteins, known as hypothetical proteins (HPs). These functionally unknown proteins may play significant biological role in bacterium's survivability, virulence etc., hence functional annotation of these HPs with bioinformatics techniques might shed light on these aspects. So, the present study aimed to assign functions to those HPs in M. orygis genome. With the help of an extensive in silico workflow involving several bioinformatic tools, physicochemical characteristics, protein-protein interaction, sub-cellular localization, functional classification, and antigenicity of the HPs were analyzed. Out of 4039 genes and 3767 encoded proteins, 422 were annotated as “hypothetical protein, unknown function” in NCBI database. Out of 422 HPs, the FASTA sequences of 400 proteins, available in the NCBI database, were extracted. Preliminarily 400 proteins were screened for presence of conserved domain by different bioinformatic tools and 32 shortlisted proteins were further analyzed. The analysis revealed that these proteins may be involved in enzymatic, toxin-antitoxin system, cellular and other processes. Virulence and antigenicity of the complete set of HPs were also predicted. Based on antigenicity scores and sub-cellular localization, four proteins were selected for secondary and tertiary structure modelling. Molecular docking and MD-simulation of four proteins with TLR1 showed stable binding. B-cell and T-cell epitopes were identified for two proteins (QOY47331.1 and QOY49361.1) with highest antigenicity scores. The result of the present study may facilitate better understanding of virulence, pathogenesis and therapeutic target. The information about the structures of the HPs with highest antigenicity score could aid in structure-based inhibitor design and devising novel diagnostic assays.
Animal GeneAgricultural and Biological Sciences-Insect Science
自引率
0.00%
发文量
16
期刊介绍:
Gene Reports publishes papers that focus on the regulation, expression, function and evolution of genes in all biological contexts, including all prokaryotic and eukaryotic organisms, as well as viruses. Gene Reports strives to be a very diverse journal and topics in all fields will be considered for publication. Although not limited to the following, some general topics include: DNA Organization, Replication & Evolution -Focus on genomic DNA (chromosomal organization, comparative genomics, DNA replication, DNA repair, mobile DNA, mitochondrial DNA, chloroplast DNA). Expression & Function - Focus on functional RNAs (microRNAs, tRNAs, rRNAs, mRNA splicing, alternative polyadenylation) Regulation - Focus on processes that mediate gene-read out (epigenetics, chromatin, histone code, transcription, translation, protein degradation). Cell Signaling - Focus on mechanisms that control information flow into the nucleus to control gene expression (kinase and phosphatase pathways controlled by extra-cellular ligands, Wnt, Notch, TGFbeta/BMPs, FGFs, IGFs etc.) Profiling of gene expression and genetic variation - Focus on high throughput approaches (e.g., DeepSeq, ChIP-Seq, Affymetrix microarrays, proteomics) that define gene regulatory circuitry, molecular pathways and protein/protein networks. Genetics - Focus on development in model organisms (e.g., mouse, frog, fruit fly, worm), human genetic variation, population genetics, as well as agricultural and veterinary genetics. Molecular Pathology & Regenerative Medicine - Focus on the deregulation of molecular processes in human diseases and mechanisms supporting regeneration of tissues through pluripotent or multipotent stem cells.