Pub Date : 2023-09-01Epub Date: 2023-09-27DOI: 10.5808/gi.23011
Rahmat Dani Satria, Lalu Muhammad Irham, Wirawan Adikusuma, Anisa Nova Puspitaningrum, Arief Rahman Afief, Riat El Khair, Abdi Wira Septama
Multiple myeloma (MM) is a hematological malignancy. It is widely believed that genetic factors play a significant role in the development of MM, as investigated in numerous studies. However, the application of genomic information for clinical purposes, including diagnostic and prognostic biomarkers, remains largely confined to research. In this study, we utilized genetic information from the Genomic-Driven Clinical Implementation for Multiple Myeloma database, which is dedicated to clinical trial studies on MM. This genetic information was sourced from the genome-wide association studies catalog database. We prioritized genes with the potential to cause MM based on established annotations, as well as biological risk genes for MM, as potential drug target candidates. The DrugBank database was employed to identify drug candidates targeting these genes. Our research led to the discovery of 14 MM biological risk genes and the identification of 10 drugs that target three of these genes. Notably, only one of these 10 drugs, panobinostat, has been approved for use in MM. The two most promising genes, calcium signal-modulating cyclophilin ligand (CAMLG) and histone deacetylase 2 (HDAC2), were targeted by four drugs (cyclosporine, belinostat, vorinostat, and romidepsin), all of which have clinical evidence supporting their use in the treatment of MM. Interestingly, five of the 10 drugs have been approved for other indications than MM, but they may also be effective in treating MM. Therefore, this study aimed to clarify the genomic variants involved in the pathogenesis of MM and highlight the potential benefits of these genomic variants in drug discovery.
{"title":"Identification of druggable genes for multiple myeloma based on genomic information.","authors":"Rahmat Dani Satria, Lalu Muhammad Irham, Wirawan Adikusuma, Anisa Nova Puspitaningrum, Arief Rahman Afief, Riat El Khair, Abdi Wira Septama","doi":"10.5808/gi.23011","DOIUrl":"10.5808/gi.23011","url":null,"abstract":"<p><p>Multiple myeloma (MM) is a hematological malignancy. It is widely believed that genetic factors play a significant role in the development of MM, as investigated in numerous studies. However, the application of genomic information for clinical purposes, including diagnostic and prognostic biomarkers, remains largely confined to research. In this study, we utilized genetic information from the Genomic-Driven Clinical Implementation for Multiple Myeloma database, which is dedicated to clinical trial studies on MM. This genetic information was sourced from the genome-wide association studies catalog database. We prioritized genes with the potential to cause MM based on established annotations, as well as biological risk genes for MM, as potential drug target candidates. The DrugBank database was employed to identify drug candidates targeting these genes. Our research led to the discovery of 14 MM biological risk genes and the identification of 10 drugs that target three of these genes. Notably, only one of these 10 drugs, panobinostat, has been approved for use in MM. The two most promising genes, calcium signal-modulating cyclophilin ligand (CAMLG) and histone deacetylase 2 (HDAC2), were targeted by four drugs (cyclosporine, belinostat, vorinostat, and romidepsin), all of which have clinical evidence supporting their use in the treatment of MM. Interestingly, five of the 10 drugs have been approved for other indications than MM, but they may also be effective in treating MM. Therefore, this study aimed to clarify the genomic variants involved in the pathogenesis of MM and highlight the potential benefits of these genomic variants in drug discovery.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"21 3","pages":"e31"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10584652/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41184728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01Epub Date: 2023-07-31DOI: 10.5808/gi.23044
Hyeonwoo Kim, Jiwon Kim, Ji Won Choi, Kwang-Sung Ahn, Dong-Il Park, Sangsoo Kim
Microbial community profiling using 16S rRNA amplicon sequencing allows for taxonomic characterization of diverse microorganisms. While amplicon sequence variant (ASV) methods are increasingly favored for their fine-grained resolution of sequence variants, they often discard substantial portions of sequencing reads during quality control, particularly in datasets with large number samples. We present a streamlined pipeline that integrates FastP for read trimming, HmmUFOtu for operational taxonomic units (OTU) clustering, Vsearch for chimera checking, and Kraken2 for taxonomic assignment. To assess the pipeline's performance, we reprocessed two published stool datasets of normal Korean populations: one with 890 and the other with 1,462 independent samples. In the first dataset, HmmUFOtu retained 93.2% of over 104 million read pairs after quality trimming, discarding chimeric or unclassifiable reads, while DADA2, a commonly used ASV method, retained only 44.6% of the reads. Nonetheless, both methods yielded qualitatively similar β-diversity plots. For the second dataset, HmmUFOtu retained 89.2% of read pairs, while DADA2 retained a mere 18.4% of the reads. HmmUFOtu, being a closed-reference clustering method, facilitates merging separately processed datasets, with shared OTUs between the two datasets exhibiting a correlation coefficient of 0.92 in total abundance (log scale). While the first two dimensions of the β-diversity plot exhibited a cohesive mixture of the two datasets, the third dimension revealed the presence of a batch effect. Our comparative evaluation of ASV and OTU methods within this streamlined pipeline provides valuable insights into their performance when processing large-scale microbial 16S rRNA amplicon sequencing data. The strengths of HmmUFOtu and its potential for dataset merging are highlighted.
{"title":"A streamlined pipeline based on HmmUFOtu for microbial community profiling using 16S rRNA amplicon sequencing.","authors":"Hyeonwoo Kim, Jiwon Kim, Ji Won Choi, Kwang-Sung Ahn, Dong-Il Park, Sangsoo Kim","doi":"10.5808/gi.23044","DOIUrl":"10.5808/gi.23044","url":null,"abstract":"<p><p>Microbial community profiling using 16S rRNA amplicon sequencing allows for taxonomic characterization of diverse microorganisms. While amplicon sequence variant (ASV) methods are increasingly favored for their fine-grained resolution of sequence variants, they often discard substantial portions of sequencing reads during quality control, particularly in datasets with large number samples. We present a streamlined pipeline that integrates FastP for read trimming, HmmUFOtu for operational taxonomic units (OTU) clustering, Vsearch for chimera checking, and Kraken2 for taxonomic assignment. To assess the pipeline's performance, we reprocessed two published stool datasets of normal Korean populations: one with 890 and the other with 1,462 independent samples. In the first dataset, HmmUFOtu retained 93.2% of over 104 million read pairs after quality trimming, discarding chimeric or unclassifiable reads, while DADA2, a commonly used ASV method, retained only 44.6% of the reads. Nonetheless, both methods yielded qualitatively similar β-diversity plots. For the second dataset, HmmUFOtu retained 89.2% of read pairs, while DADA2 retained a mere 18.4% of the reads. HmmUFOtu, being a closed-reference clustering method, facilitates merging separately processed datasets, with shared OTUs between the two datasets exhibiting a correlation coefficient of 0.92 in total abundance (log scale). While the first two dimensions of the β-diversity plot exhibited a cohesive mixture of the two datasets, the third dimension revealed the presence of a batch effect. Our comparative evaluation of ASV and OTU methods within this streamlined pipeline provides valuable insights into their performance when processing large-scale microbial 16S rRNA amplicon sequencing data. The strengths of HmmUFOtu and its potential for dataset merging are highlighted.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"21 3","pages":"e40"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10584646/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41184725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01Epub Date: 2023-09-27DOI: 10.5808/gi.22064
Yue Shi, Hyung Jun Kim, Seong Yong Kim, Ga Eun Kim, Han Jun Jin
Preterm birth (PTB), a pregnancy-related disease, is defined as a birth before 37 weeks of gestation. It is a major cause of maternal mortality and morbidity worldwide, and its incidence rate is steadily increasing. Various genetic factors can contribute to the etiology of PTB. Vascular endothelial growth factor A (VEGFA) gene is an important angiogenic gene and its polymorphisms have been reported to be associated with PTB development. Therefore, we conducted a case-control study to evaluate the association between VEGFA rs699947, rs2010963, and rs3025039 polymorphisms and PTB in Korean women. A total of 271 subjects (116 patients with PTB and 155 women at ≥38 weeks of gestation) were analyzed in this study. The genotyping of VEGFA gene polymorphisms was performed using polymerase chain reaction- restriction fragment length polymorphism. No significant association between the patients with PTB and the control groups was confirmed. In the combination analysis, we found a significant association between PTB and VEGFA rs699947 CC-rs2010963 GG-rs3025039 CC combination (odds ratio, 3.77; 95% confidence interval, 1.091 to 13.032; p = 0.031). The VEGFA rs699947, rs2010963, and rs3025039 polymorphisms might have no genetic association with the pathogenesis of PTB in Korean women. However, the combination analysis indicates the possibility that VEGFA acts in PTB pathophysiology. Therefore, larger sample sets and replication studies are required to further elucidate our findings.
{"title":"Lack of association between the VEGFA gene polymorphisms and preterm birth in Korean women.","authors":"Yue Shi, Hyung Jun Kim, Seong Yong Kim, Ga Eun Kim, Han Jun Jin","doi":"10.5808/gi.22064","DOIUrl":"10.5808/gi.22064","url":null,"abstract":"<p><p>Preterm birth (PTB), a pregnancy-related disease, is defined as a birth before 37 weeks of gestation. It is a major cause of maternal mortality and morbidity worldwide, and its incidence rate is steadily increasing. Various genetic factors can contribute to the etiology of PTB. Vascular endothelial growth factor A (VEGFA) gene is an important angiogenic gene and its polymorphisms have been reported to be associated with PTB development. Therefore, we conducted a case-control study to evaluate the association between VEGFA rs699947, rs2010963, and rs3025039 polymorphisms and PTB in Korean women. A total of 271 subjects (116 patients with PTB and 155 women at ≥38 weeks of gestation) were analyzed in this study. The genotyping of VEGFA gene polymorphisms was performed using polymerase chain reaction- restriction fragment length polymorphism. No significant association between the patients with PTB and the control groups was confirmed. In the combination analysis, we found a significant association between PTB and VEGFA rs699947 CC-rs2010963 GG-rs3025039 CC combination (odds ratio, 3.77; 95% confidence interval, 1.091 to 13.032; p = 0.031). The VEGFA rs699947, rs2010963, and rs3025039 polymorphisms might have no genetic association with the pathogenesis of PTB in Korean women. However, the combination analysis indicates the possibility that VEGFA acts in PTB pathophysiology. Therefore, larger sample sets and replication studies are required to further elucidate our findings.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"21 3","pages":"e29"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10584649/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41184732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Bacillus cereus group, also known as B. cereus sensu lato (B. cereus s.l.), is composed of various Bacillus species, some of which can cause diarrheal or emetic food poisoning. Several emerging highly heat-resistant Bacillus species have been identified, these include B. thermoamylovorans, B. sporothermodurans, and B. cytotoxicus NVH 391-98. Herein, we performed whole genome analysis of two thermotolerant Bacillus sp. isolates, Bacillus sp. B48 and Bacillus sp. B140, from an omelet with acacia leaves and fried rice, respectively. Phylogenomic analysis suggested that Bacillus sp. B48 and Bacillus sp. B140 are closely related to B. cereus and B. thuringiensis, respectively. Whole genome alignment of Bacillus sp. B48, Bacillus sp. B140, mesophilic strain B. cereus ATCC14579, and thermophilic strain B. cytotoxicus NVH 391-98 using the Mauve program revealed the presence of numerous homologous regions including genes responsible for heat shock in the dnaK gene cluster. However, the presence of a DUF4253 domain-containing protein was observed only in the genome of B. cereus ATCC14579 while the intracellular protease PfpI family was present only in the chromosome of B. cytotoxicus NVH 391-98. In addition, prophage Clp protease-like proteins were found in the genomes of both Bacillus sp. B48 and Bacillus sp. B140 but not in the genome of B. cereus ATCC14579. The genomic profiles of Bacillus sp. isolates were identified by using whole genome analysis especially those relating to heat-responsive gene clusters. The findings presented in this study lay the foundations for subsequent studies to reveal further insights into the molecular mechanisms of Bacillus species in terms of heat resistance mechanisms.
{"title":"Whole genome sequence analyses of thermotolerant Bacillus sp. isolates from food.","authors":"Phornphan Sornchuer, Kritsakorn Saninjuk, Pholawat Tingpej","doi":"10.5808/gi.23030","DOIUrl":"10.5808/gi.23030","url":null,"abstract":"<p><p>The Bacillus cereus group, also known as B. cereus sensu lato (B. cereus s.l.), is composed of various Bacillus species, some of which can cause diarrheal or emetic food poisoning. Several emerging highly heat-resistant Bacillus species have been identified, these include B. thermoamylovorans, B. sporothermodurans, and B. cytotoxicus NVH 391-98. Herein, we performed whole genome analysis of two thermotolerant Bacillus sp. isolates, Bacillus sp. B48 and Bacillus sp. B140, from an omelet with acacia leaves and fried rice, respectively. Phylogenomic analysis suggested that Bacillus sp. B48 and Bacillus sp. B140 are closely related to B. cereus and B. thuringiensis, respectively. Whole genome alignment of Bacillus sp. B48, Bacillus sp. B140, mesophilic strain B. cereus ATCC14579, and thermophilic strain B. cytotoxicus NVH 391-98 using the Mauve program revealed the presence of numerous homologous regions including genes responsible for heat shock in the dnaK gene cluster. However, the presence of a DUF4253 domain-containing protein was observed only in the genome of B. cereus ATCC14579 while the intracellular protease PfpI family was present only in the chromosome of B. cytotoxicus NVH 391-98. In addition, prophage Clp protease-like proteins were found in the genomes of both Bacillus sp. B48 and Bacillus sp. B140 but not in the genome of B. cereus ATCC14579. The genomic profiles of Bacillus sp. isolates were identified by using whole genome analysis especially those relating to heat-responsive gene clusters. The findings presented in this study lay the foundations for subsequent studies to reveal further insights into the molecular mechanisms of Bacillus species in terms of heat resistance mechanisms.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"21 3","pages":"e35"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10584648/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41184738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Melanin is synthesized by tyrosinase to protect the skin from ultraviolet light. However, overproduction and accumulation of melanin can result in hyperpigmentation and skin melanoma. Tyrosinase inhibitors are commonly used in the treatment of hyperpigmentation. Natural tyrosinase inhibitors are often favored over synthetic ones due to the potential side effects of the latter, which can include skin irritation, allergies, and other adverse reactions. Nuciferine, an alkaloid derived from Nelumbo nucifera, exhibits potent antioxidant and anti-proliferative properties. This study focused on the in silico screening of nuciferine for anti-tyrosinase activity, using kojic acid, ascorbic acid, and resorcinol as standards. The tyrosinase protein target was selected through homology modeling. The residues of the substrate binding pocket and active site pockets were identified for the purposes of grid box optimization and docking. Nuciferine demonstrated a binding energy of -7.0 kcal/mol and a Ki of 5 µM, both of which were comparatively higher than the corresponding values of kojic acid, which showed -5.3 kcal/mol and 122 µM respectively. Therefore, nuciferine is a potent natural tyrosinase inhibitor and shows promising potential for application in the treatment of hyperpigmentation and skin melanoma.
{"title":"Molecular docking Study of Nuciferine as a Tyrosinase Inhibitor and Its Therapeutic Potential for Hyperpigmentation.","authors":"Veerabhuvaneshwari Veerichetty, Iswaryalakshmi Saravanabavan","doi":"10.5808/gi.23054","DOIUrl":"10.5808/gi.23054","url":null,"abstract":"<p><p>Melanin is synthesized by tyrosinase to protect the skin from ultraviolet light. However, overproduction and accumulation of melanin can result in hyperpigmentation and skin melanoma. Tyrosinase inhibitors are commonly used in the treatment of hyperpigmentation. Natural tyrosinase inhibitors are often favored over synthetic ones due to the potential side effects of the latter, which can include skin irritation, allergies, and other adverse reactions. Nuciferine, an alkaloid derived from Nelumbo nucifera, exhibits potent antioxidant and anti-proliferative properties. This study focused on the in silico screening of nuciferine for anti-tyrosinase activity, using kojic acid, ascorbic acid, and resorcinol as standards. The tyrosinase protein target was selected through homology modeling. The residues of the substrate binding pocket and active site pockets were identified for the purposes of grid box optimization and docking. Nuciferine demonstrated a binding energy of -7.0 kcal/mol and a Ki of 5 µM, both of which were comparatively higher than the corresponding values of kojic acid, which showed -5.3 kcal/mol and 122 µM respectively. Therefore, nuciferine is a potent natural tyrosinase inhibitor and shows promising potential for application in the treatment of hyperpigmentation and skin melanoma.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"21 3","pages":"e43"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10584639/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41184734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01Epub Date: 2023-09-27DOI: 10.5808/gi.22069
Shubhashish Chakraborty, Reshita Baruah, Neha Mishra, Ashok K Varma
Ephs belong to the largest family of receptor tyrosine kinase and are highly conserved both sequentially and structurally. The structural organization of Eph is similar to other receptor tyrosine kinases; constituting the extracellular ligand binding domain, a fibronectin domain followed by intracellular juxtamembrane kinase, and SAM domain. Eph binds to respective ephrin ligand, through the ligand binding domain and forms a tetrameric complex to activate the kinase domain. Eph-ephrin regulates many downstream pathways that lead to physiological events such as cell migration, proliferation, and growth. Therefore, considering the importance of Eph-ephrin class of protein in tumorigenesis, 7,620 clinically reported missense mutations belonging to the class of variables of unknown significance were retrieved from cBioPortal and evaluated for pathogenicity. Thirty-two mutations predicted to be pathogenic using SIFT, Polyphen-2, PROVEAN, SNPs&GO, PMut, iSTABLE, and PremPS in-silico tools were found located either in critical functional regions or encompassing interactions at the binding interface of Eph-ephrin. However, seven were reported in nonsmall cell lung cancer (NSCLC). Considering the relevance of receptor tyrosine kinases and Eph in NSCLC, these seven mutations were assessed for change in the folding pattern using molecular dynamic simulation. Structural alterations, stability, flexibility, compactness, and solvent-exposed area was observed in EphA3 Trp790Cys, EphA7 Leu749Phe, EphB1 Gly685Cys, EphB4 Val748Ala, and Ephrin A2 Trp112Cys. Hence, it can be concluded that the evaluated mutations have potential to alter the folding pattern and thus can be further validated by in-vitro, structural and in-vivo studies for clinical management.
{"title":"In-silico and structure-based assessment to evaluate pathogenicity of missense mutations associated with non-small cell lung cancer identified in the Eph-ephrin class of proteins.","authors":"Shubhashish Chakraborty, Reshita Baruah, Neha Mishra, Ashok K Varma","doi":"10.5808/gi.22069","DOIUrl":"10.5808/gi.22069","url":null,"abstract":"<p><p>Ephs belong to the largest family of receptor tyrosine kinase and are highly conserved both sequentially and structurally. The structural organization of Eph is similar to other receptor tyrosine kinases; constituting the extracellular ligand binding domain, a fibronectin domain followed by intracellular juxtamembrane kinase, and SAM domain. Eph binds to respective ephrin ligand, through the ligand binding domain and forms a tetrameric complex to activate the kinase domain. Eph-ephrin regulates many downstream pathways that lead to physiological events such as cell migration, proliferation, and growth. Therefore, considering the importance of Eph-ephrin class of protein in tumorigenesis, 7,620 clinically reported missense mutations belonging to the class of variables of unknown significance were retrieved from cBioPortal and evaluated for pathogenicity. Thirty-two mutations predicted to be pathogenic using SIFT, Polyphen-2, PROVEAN, SNPs&GO, PMut, iSTABLE, and PremPS in-silico tools were found located either in critical functional regions or encompassing interactions at the binding interface of Eph-ephrin. However, seven were reported in nonsmall cell lung cancer (NSCLC). Considering the relevance of receptor tyrosine kinases and Eph in NSCLC, these seven mutations were assessed for change in the folding pattern using molecular dynamic simulation. Structural alterations, stability, flexibility, compactness, and solvent-exposed area was observed in EphA3 Trp790Cys, EphA7 Leu749Phe, EphB1 Gly685Cys, EphB4 Val748Ala, and Ephrin A2 Trp112Cys. Hence, it can be concluded that the evaluated mutations have potential to alter the folding pattern and thus can be further validated by in-vitro, structural and in-vivo studies for clinical management.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"21 3","pages":"e30"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10584653/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41184731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01Epub Date: 2023-09-27DOI: 10.5808/gi.23035
Ayesha Zeba, Kanagaraj Sekar, Anjali Ganjiwale
The Dengue virus M protein is a 75 amino acid polypeptide with two helical transmembranes (TM). The TM domain oligomerizes to form an ion channel, facilitating viral release from the host cells. The M protein has a critical role in the virus entry and life cycle, making it a potent drug target. The oligomerization of the monomeric protein was studied using ab initio modeling and molecular dynamics (MD) simulation in an implicit membrane environment. The representative structures obtained showed pentamer as the most stable oligomeric state, resembling an ion channel. Glutamic acid, threonine, serine, tryptophan, alanine, isoleucine form the pore-lining residues of the pentameric channel, conferring an overall negative charge to the channel with approximate length of 51.9 Å. Residue interaction analysis (RIN) for M protein shows that Ala94, Leu95, Ser112, Glu124, and Phe155 are the central hub residues representing the physicochemical interactions between domains. The virtual screening with 165 different ion channel inhibitors from the ion channel library shows monovalent ion channel blockers, namely lumacaftor, glipizide, gliquidone, glisoxepide, and azelnidipine to be the inhibitors with high docking scores. Understanding the three-dimensional structure of M protein will help design therapeutics and vaccines for Dengue infection.
{"title":"M Protein from Dengue virus oligomerizes to pentameric channel protein: in silico analysis study.","authors":"Ayesha Zeba, Kanagaraj Sekar, Anjali Ganjiwale","doi":"10.5808/gi.23035","DOIUrl":"10.5808/gi.23035","url":null,"abstract":"<p><p>The Dengue virus M protein is a 75 amino acid polypeptide with two helical transmembranes (TM). The TM domain oligomerizes to form an ion channel, facilitating viral release from the host cells. The M protein has a critical role in the virus entry and life cycle, making it a potent drug target. The oligomerization of the monomeric protein was studied using ab initio modeling and molecular dynamics (MD) simulation in an implicit membrane environment. The representative structures obtained showed pentamer as the most stable oligomeric state, resembling an ion channel. Glutamic acid, threonine, serine, tryptophan, alanine, isoleucine form the pore-lining residues of the pentameric channel, conferring an overall negative charge to the channel with approximate length of 51.9 Å. Residue interaction analysis (RIN) for M protein shows that Ala94, Leu95, Ser112, Glu124, and Phe155 are the central hub residues representing the physicochemical interactions between domains. The virtual screening with 165 different ion channel inhibitors from the ion channel library shows monovalent ion channel blockers, namely lumacaftor, glipizide, gliquidone, glisoxepide, and azelnidipine to be the inhibitors with high docking scores. Understanding the three-dimensional structure of M protein will help design therapeutics and vaccines for Dengue infection.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"21 3","pages":"e41"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10584644/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41184733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The LIM domain-containing proteins are dominantly found in plants and play a significant role in various biological processes such as gene transcription as well as actin cytoskeletal organization. Nevertheless, genome-wide identification as well as functional analysis of the LIM gene family have not yet been reported in the economically important plant sorghum (Sorghum bicolor L.). Therefore, we conducted an in silico identification and characterization of LIM genes in S. bicolor genome using integrated bioinformatics approaches. Based on phylogenetic tree analysis and conserved domain, we identified five LIM genes in S. bicolor (SbLIM) genome corresponding to Arabidopsis LIM (AtLIM) genes. The conserved domain, motif as well as gene structure analyses of the SbLIM gene family showed the similarity within the SbLIM and AtLIM members. The gene ontology (GO) enrichment study revealed that the candidate LIM genes are directly involved in cytoskeletal organization and various other important biological as well as molecular pathways. Some important families of regulating transcription factors such as ERF, MYB, WRKY, NAC, bZIP, C2H2, Dof, and G2-like were detected by analyzing their interaction network with identified SbLIM genes. The cis-acting regulatory elements related to predicted SbLIM genes were identified as responsive to light, hormones, stress, and other functions. The present study will provide valuable useful information about LIM genes in sorghum which would pave the way for the future study of functional pathways of candidate SbLIM genes as well as their regulatory factors in wet-lab experiments.
{"title":"A genome‑wide approach to the systematic and comprehensive analysis of LIM gene family in sorghum (Sorghum bicolor L.).","authors":"Md Abdur Rauf Sarkar, Salim Sarkar, Md Shohelul Islam, Fatema Tuz Zohra, Shaikh Mizanur Rahman","doi":"10.5808/gi.23007","DOIUrl":"10.5808/gi.23007","url":null,"abstract":"<p><p>The LIM domain-containing proteins are dominantly found in plants and play a significant role in various biological processes such as gene transcription as well as actin cytoskeletal organization. Nevertheless, genome-wide identification as well as functional analysis of the LIM gene family have not yet been reported in the economically important plant sorghum (Sorghum bicolor L.). Therefore, we conducted an in silico identification and characterization of LIM genes in S. bicolor genome using integrated bioinformatics approaches. Based on phylogenetic tree analysis and conserved domain, we identified five LIM genes in S. bicolor (SbLIM) genome corresponding to Arabidopsis LIM (AtLIM) genes. The conserved domain, motif as well as gene structure analyses of the SbLIM gene family showed the similarity within the SbLIM and AtLIM members. The gene ontology (GO) enrichment study revealed that the candidate LIM genes are directly involved in cytoskeletal organization and various other important biological as well as molecular pathways. Some important families of regulating transcription factors such as ERF, MYB, WRKY, NAC, bZIP, C2H2, Dof, and G2-like were detected by analyzing their interaction network with identified SbLIM genes. The cis-acting regulatory elements related to predicted SbLIM genes were identified as responsive to light, hormones, stress, and other functions. The present study will provide valuable useful information about LIM genes in sorghum which would pave the way for the future study of functional pathways of candidate SbLIM genes as well as their regulatory factors in wet-lab experiments.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"21 3","pages":"e36"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10584642/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41184724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Beom-Soon Choi, Seon Kang Choi, Nam-Soo Kim, I. Choi
BLAST, a basic bioinformatics tool for searching local sequence similarity, has been one of the most widely used bioinformatics programs since its introduction in 1990. Users generally use the web-based NCBI-BLAST program for BLAST analysis. However, users with large sequence data are often faced with a problem of upload size limitation while using the web-based BLAST program. This proves inconvenient as scientists often want to run BLAST on their own data, such as transcriptome or whole genome sequences. To overcome this issue, we developed NBLAST, a graphical user interface-based BLAST program that employs a two-way system, allowing the use of input sequences either as “query” or “target” in the BLAST analysis. NBLAST is also equipped with a dot plot viewer, thus allowing researchers to create custom database for BLAST and run a dot plot similarity analysis within a single program. It is available to access to the NBLAST with http://nbitglobal.com/nblast.
{"title":"NBLAST: a graphical user interface-based two-way BLAST software with a dot plot viewer","authors":"Beom-Soon Choi, Seon Kang Choi, Nam-Soo Kim, I. Choi","doi":"10.5808/gi.22053","DOIUrl":"https://doi.org/10.5808/gi.22053","url":null,"abstract":"BLAST, a basic bioinformatics tool for searching local sequence similarity, has been one of the most widely used bioinformatics programs since its introduction in 1990. Users generally use the web-based NCBI-BLAST program for BLAST analysis. However, users with large sequence data are often faced with a problem of upload size limitation while using the web-based BLAST program. This proves inconvenient as scientists often want to run BLAST on their own data, such as transcriptome or whole genome sequences. To overcome this issue, we developed NBLAST, a graphical user interface-based BLAST program that employs a two-way system, allowing the use of input sequences either as “query” or “target” in the BLAST analysis. NBLAST is also equipped with a dot plot viewer, thus allowing researchers to create custom database for BLAST and run a dot plot similarity analysis within a single program. It is available to access to the NBLAST with http://nbitglobal.com/nblast.","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43666790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Permutation testing is a robust and popular approach for significance testing in genomic research that has the advantage of reducing inflated type 1 error rates; however, its computational cost is notorious in genome-wide association studies (GWAS). Here, we developed a supercomputing-aided approach to accelerate the permutation testing for GWAS, based on the message-passing interface (MPI) on parallel computing architecture. Our application, called MPI-GWAS, conducts MPI-based permutation testing using a parallel computing approach with our supercomputing system, Nurion (8,305 compute nodes, and 563,740 central processing units [CPUs]). For 107 permutations of one locus in MPI-GWAS, it was calculated in 600 s using 2,720 CPU cores. For 107 permutations of ~30,000–50,000 loci in over 7,000 subjects, the total elapsed time was ~4 days in the Nurion supercomputer. Thus, MPI-GWAS enables us to feasibly compute the permutation-based GWAS within a reason-able time by harnessing the power of parallel computing resources.
{"title":"MPI-GWAS: a supercomputing-aided permutation approach for genome-wide association studies","authors":"H. Paik, Yongseong Cho, S. Cho, Oh-Kyoung Kwon","doi":"10.5808/gi.22001","DOIUrl":"https://doi.org/10.5808/gi.22001","url":null,"abstract":"Permutation testing is a robust and popular approach for significance testing in genomic research that has the advantage of reducing inflated type 1 error rates; however, its computational cost is notorious in genome-wide association studies (GWAS). Here, we developed a supercomputing-aided approach to accelerate the permutation testing for GWAS, based on the message-passing interface (MPI) on parallel computing architecture. Our application, called MPI-GWAS, conducts MPI-based permutation testing using a parallel computing approach with our supercomputing system, Nurion (8,305 compute nodes, and 563,740 central processing units [CPUs]). For 107 permutations of one locus in MPI-GWAS, it was calculated in 600 s using 2,720 CPU cores. For 107 permutations of ~30,000–50,000 loci in over 7,000 subjects, the total elapsed time was ~4 days in the Nurion supercomputer. Thus, MPI-GWAS enables us to feasibly compute the permutation-based GWAS within a reason-able time by harnessing the power of parallel computing resources.","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49372531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}