Single nucleotide polymorphisms are most common type of genetic variation in human genome. Analyzing genetic variants can help us better understand the genetic basis of diseases and develop predictive models which are useful to identify individuals who are at increased risk for certain diseases. Several SNP analysis tools have already been developed. For running these tools, the user needs to collect data from various databases. Secondly, often researchers have to use multiple variant analysis tools for cross validating their results and increase confidence in their findings. Extracting data from multiple databases and running multiple tools at a time, increases complexity and time required for analysis. There are some web-based tools that integrate multiple genetic variant databases and provide variant annotations for a few tools. These approaches have some limitations such as retrieving annotation information, filtering common pathogenic variants. The proposed web-based tool, namely IPSNP: An Integrated Platform for Predicting Impact of SNPs is written in Django which is a python-based framework. It uses RESTful API of MyVariant.info to extract annotation information of variants associated with a given gene, rsID, HGVS format variants specified in a VCF file for 29 tools. The results are in the form of a CSV file of predictions (1) derived from the consensus decision, (2) a file having annotations for the variants associated with the given gene, (3) a file showing variants declared as pathogenic commonly by the selected tools, and (4) a CSV file containing chromosome coordinates based on GRCh37 and GRCh38 genome assemblies, rsIDs and proteomic data, so that users may use tools of their choice and avoiding manual parameter collection for each tool. IPSNP is a valuable resource for researchers and clinicians and it can help to save time and effort in discovering the novel disease-associated variants and the development of personalized treatments.
Mycobacterium tuberculosis (Mtb) is the causative agent of tuberculosis (TB), an infectious disease that is a major killer worldwide. Due to selection pressure caused by the use of antibacterial drugs, Mtb is characterised by mutational events that have given rise to multi drug resistant (MDR) and extensively drug resistant (XDR) phenotypes. The rate at which mutations occur is an important factor in the study of molecular evolution, and it helps understand gene evolution. Within the same species, different protein-coding genes evolve at different rates. To estimate the rates of molecular evolution of protein-coding genes, a commonly used parameter is the ratio dN/dS, where dN is the rate of non-synonymous substitutions and dS is the rate of synonymous substitutions. Here, we determined the estimated rates of molecular evolution of select biological processes and molecular functions across 264 strains of Mtb. We also investigated the molecular evolutionary rates of core genes of Mtb by computing the dN/dS values, and estimated the pan genome of the 264 strains of Mtb. Our results show that the cellular amino acid metabolic process and the kinase activity function evolve at a significantly higher rate, while the carbohydrate metabolic process evolves at a significantly lower rate for M. tuberculosis. These high rates of evolution correlate well with Mtb physiology and pathogenicity. We further propose that the core genome of M. tuberculosis likely experiences varying rates of molecular evolution which may drive an interplay between core genome and accessory genome during M. tuberculosis evolution.
Pantoea sp. strain MHSD4 is a bacterial endophyte isolated from the leaves of the medicinal plant Pellaea calomelanos. Here, we report on strain MHSD4 draft whole genome sequence and annotation. The draft genome size of Pantoea sp. strain MHSD4 is 4 647 677 bp with a G+C content of 54.2% and 41 contigs. The National Center for Biotechnology Information Prokaryotic Genome Annotation Pipeline tool predicted a total of 4395 genes inclusive of 4235 protein-coding genes, 87 total RNA genes, 14 non-coding (nc) RNAs and 70 tRNAs, and 73 pseudogenes. Biosynthesis pathways for naphthalene and anthracene degradation were identified. Putative genes involved in bioremediation such as copA, copD, cueO, cueR, glnGm, and trxC were identified. Putative genes involved in copper homeostasis and tolerance were identified which may suggest that Pantoea sp. strain MHSD4 has biotechnological potential for bioremediation of heavy metals.
Genetic variations in the human genome represent the differences in DNA sequence within individuals. This highlights the important role of whole human genome sequencing which has become the keystone for precision medicine and disease prediction. Morocco is an important hub for studying human population migration and mixing history. This study presents the analysis of 3 Moroccan genomes; the variant analysis revealed 6 379 606 single nucleotide variants (SNVs) and 1 050 577 small InDels. Of those identified SNVs, 219 152 were novel, with 1233 occurring in coding regions, and 5580 non-synonymous single nucleotide variants (nsSNP) variants were predicted to affect protein functions. The InDels produced 1055 coding variants and 454 non-3n length variants, and their size ranged from -49 and 49 bp. We further analysed the gene pathways of 8 novel coding variants found in the 3 genomes and revealed 5 genes involved in various diseases and biological pathways. We found that the Moroccan genomes share 92.78% of African ancestry, and 92.86% of Non-Finnish European ancestry, according to the gnomAD database. Then, population structure inference, by admixture analysis and network-based approach, revealed that the studied genomes form a mixed population structure, highlighting the increased genetic diversity in Morocco.
Aims: Autophagy plays a significant role in the development of acute myocardial infarction (AMI), and cardiomyocyte autophagy is of major importance in maintaining cardiac function. We aimed to identify key genes associated with autophagy in AMI through bioinformatics analysis and verify them through clinical validation.
Materials and methods: We downloaded an AMI expression profile dataset GSE166780 from Gene Expression Omnibus (GEO). Autophagy-associated genes potentially differentially expressed in AMI were screened using R software. Then, to identify key autophagy-related genes, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis, protein-protein interaction (PPI) analysis, Receiver Operating Characteristic (ROC) curve analysis, and correlation analysis were performed on the differentially expressed autophagy-related genes in AMI. Finally, we used quantificational real-time polymerase chain reaction (qRT-PCR) to verify the RNA expression of the screened key genes.
Results: TSC2, HSPA8, and HIF1A were screened out as key autophagy-related genes. qRT-PCR results showed that the expression levels of HSPA8 and TSC2 in AMI blood samples were lower, while the expression level of HIF1A was higher than that in the healthy controls.
Conclusions: TSC2, HSPA8, and HIF1A were identified as key autophagy-related genes in this study. They may influence the development of AMI through autophagy. These findings may help deepen our understanding of AMI and may be useful for the treatment of AMI.
Nociception and pain sensation are important neural processes in humans to avoid injury. Many proteins are involved in nociception and pain sensation in humans; however, the evolution of these proteins in animals is unknown. Here, we chose nociception- and pain-related proteins, including G protein-coupled receptors (GPCRs), ion channels (ICs), and neuropeptides (NPs), which are reportedly associated with nociception and pain in humans, and identified their homologs in various animals by BLAST, phylogenetic analysis and protein architecture comparison to reveal their evolution from protozoans to humans. We found that the homologs of transient receptor potential channel A 1 (TRPA1), TRAPM, acid-sensing IC (ASIC), and voltage-dependent calcium channel (VDCC) first appear in Porifera. Substance-P receptor 1 (TACR1) emerged from Coelenterata. Somatostatin receptor type 2 (SSTR2), TRPV1 and voltage-dependent sodium channels (VDSC) appear in Platyhelminthes. Calcitonin gene-related peptide receptor (CGRPR) was first identified in Nematoda. However, opioid receptors (OPRs) and most NPs were discovered only in vertebrates and exist from agnatha to humans. The results demonstrated that homologs of nociception and pain-related ICs exist from lower animal phyla to high animal phyla, and that most of the GPCRs originate from low to high phyla sequentially, whereas OPRs and NPs are newly evolved in vertebrates, which provides hints of the evolution of nociception and pain-related proteins in animals and humans.
The overexpression of the Epidermal Growth Factor Receptor (EGFR) marks it as a pivotal target in cancer treatment, with the aim of reducing its proliferation and inducing apoptosis. This study aimed at the CADD of a new apoptotic EGFR inhibitor. The natural alkaloid, theobromine, was used as a starting point to obtain a new semisynthetic (di-ortho-chloro acetamide) derivative (T-1-DOCA). Firstly, T-1-DOCA's total electron density, energy gap, reactivity indices, and electrostatic surface potential were determined by DFT calculations, Then, molecular docking studies were carried out to predict the potential of T-1-DOCA against wild and mutant EGFR proteins. T-1-DOCA's correct binding was further confirmed by molecular dynamics (MD) over 100 ns, MM-GPSA, and PLIP experiments. In vitro, T-1-DOCA showed noticeable efficacy compared to erlotinib by suppressing EGFRWT and EGFRT790M with IC50 values of 56.94 and 269.01 nM, respectively. T-1-DOCA inhibited also the proliferation of H1975 and HCT-116 malignant cell lines, exhibiting IC50 values of 14.12 and 23.39 µM, with selectivity indices of 6.8 and 4.1, respectively, indicating its anticancer potential and general safety. The apoptotic effects of T-1-DOCA were indicated by flow cytometric analysis and were further confirmed through its potential to increase the levels of BAX, Casp3, and Casp9, and decrease Bcl-2 levels. In conclusion, T-1-DOCA, a new apoptotic EGFR inhibitor, was designed and evaluated both computationally and experimentally. The results suggest that T-1-DOCA is a promising candidate for further development as an anti-cancer drug.