Pub Date : 2024-07-15DOI: 10.1186/s44342-024-00009-0
Joseph Hawadak, Aditi Arya, Shewta Chaudhry, Vineeta Singh
Variable surface antigens (VSAs) encoded by var and vir genes in Plasmodium falciparum and Plasmodium vivax, respectively, are known to be involved in malaria pathogenesis and host immune escape through antigenic variations. Knowledge of the genetic diversity of these antigens is essential for malaria control and effective vaccine development. In this study, we analysed the genetic diversity and evolutionary patterns of two fragments (DBL2X and DBL3X) of VAR2CSA gene and four vir genes (vir 4, vir 12, vir 21 and vir 27) from different endemic regions, including Southeast Asia and sub-Saharan Africa. High levels of segregating sites (S) and haplotype diversity (Hd) were observed in both var and vir genes. Among vir genes, vir 12 (S = 131, Hd = 0.996) and vir 21 (S = 171, Hd = 892) were found to be more diverse as compared to vir 4 (S = 11, Hd = 0.748) and vir 27 (S = 23, Hd = 0.814). DBL2X (S = 99, Hd = 0.996) and DBL3X (S = 307, Hd = 0.999) fragments showed higher genetic diversity. Our analysis indicates that var and vir genes are highly diverse and follow the similar evolutionary pattern globally. Some codons showed signatures of positive or negative selection pressure, but vir and var genes are likely to be under balancing selection. This study highlights the high variability of var and vir genes and underlines the need of functional experimental studies to determine the most relevant allelic forms for effective progress towards vaccine formulation and testing.
众所周知,恶性疟原虫和间日疟原虫的变异表面抗原(VSAs)分别由 var 和 vir 基因编码,它们通过抗原变异参与疟疾发病机制和宿主免疫逃逸。了解这些抗原的遗传多样性对于疟疾控制和有效疫苗开发至关重要。在这项研究中,我们分析了来自东南亚和撒哈拉以南非洲等不同疟疾流行地区的 VAR2CSA 基因的两个片段(DBL2X 和 DBL3X)和四个 vir 基因(vir 4、vir 12、vir 21 和 vir 27)的遗传多样性和进化模式。在var和vir基因中都观察到了高水平的分离位点(S)和单体型多样性(Hd)。在 vir 基因中,vir 12(S = 131,Hd = 0.996)和 vir 21(S = 171,Hd = 892)的多样性高于 vir 4(S = 11,Hd = 0.748)和 vir 27(S = 23,Hd = 0.814)。DBL2X(S = 99,Hd = 0.996)和 DBL3X(S = 307,Hd = 0.999)片段显示出更高的遗传多样性。我们的分析表明,var 和 vir 基因具有高度的多样性,并在全球范围内遵循相似的进化模式。一些密码子显示出正向或负向选择压力的特征,但 vir 和 var 基因很可能处于平衡选择下。这项研究凸显了var和vir基因的高度变异性,并强调有必要进行功能实验研究,以确定最相关的等位基因形式,从而有效推进疫苗的研发和测试。
{"title":"Genetic diversity and natural selection analysis of VAR2CSA and vir genes: implication for vaccine development.","authors":"Joseph Hawadak, Aditi Arya, Shewta Chaudhry, Vineeta Singh","doi":"10.1186/s44342-024-00009-0","DOIUrl":"10.1186/s44342-024-00009-0","url":null,"abstract":"<p><p>Variable surface antigens (VSAs) encoded by var and vir genes in Plasmodium falciparum and Plasmodium vivax, respectively, are known to be involved in malaria pathogenesis and host immune escape through antigenic variations. Knowledge of the genetic diversity of these antigens is essential for malaria control and effective vaccine development. In this study, we analysed the genetic diversity and evolutionary patterns of two fragments (DBL2X and DBL3X) of VAR2CSA gene and four vir genes (vir 4, vir 12, vir 21 and vir 27) from different endemic regions, including Southeast Asia and sub-Saharan Africa. High levels of segregating sites (S) and haplotype diversity (Hd) were observed in both var and vir genes. Among vir genes, vir 12 (S = 131, Hd = 0.996) and vir 21 (S = 171, Hd = 892) were found to be more diverse as compared to vir 4 (S = 11, Hd = 0.748) and vir 27 (S = 23, Hd = 0.814). DBL2X (S = 99, Hd = 0.996) and DBL3X (S = 307, Hd = 0.999) fragments showed higher genetic diversity. Our analysis indicates that var and vir genes are highly diverse and follow the similar evolutionary pattern globally. Some codons showed signatures of positive or negative selection pressure, but vir and var genes are likely to be under balancing selection. This study highlights the high variability of var and vir genes and underlines the need of functional experimental studies to determine the most relevant allelic forms for effective progress towards vaccine formulation and testing.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"11"},"PeriodicalIF":0.0,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11247734/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141622092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-02DOI: 10.1186/s44342-024-00004-5
Aruna Rajalingam, Anjali Ganjiwale
Autoimmune disorders (ADs) are chronic conditions resulting from failure or breakdown of immunological tolerance, resulting in the host immune system attacking its cells or tissues. Recent studies report shared effects, mechanisms, and evolutionary origins among ADs; however, the possible factors connecting them are unknown. This study attempts to identify gene signatures commonly shared between different autoimmune disorders and elucidate their molecular pathways linking the pathogenesis of these ADs using an integrated gene expression approach. We employed differential gene expression analysis across 19 datasets of whole blood/peripheral blood cell samples with five different autoimmune disorders (rheumatoid arthritis, multiple sclerosis, systemic lupus erythematosus, Crohn's disease, and type 1 diabetes) to get nine key genes-EGR1, RUNX3, SMAD7, NAMPT, S100A9, S100A8, CYBB, GATA2, and MCEMP1 that were primarily involved in cell and leukocyte activation, leukocyte mediated immunity, IL-17, AGE-RAGE signaling in diabetic complications, prion disease, and NOD-like receptor signaling confirming its role in immune-related pathways. Combined with biological interpretations such as gene ontology (GO), pathway enrichment, and protein-protein interaction (PPI) network, our current study sheds light on the in-depth research on early detection, diagnosis, and prognosis of different ADs.
{"title":"Identification of common genetic factors and immune-related pathways associating more than two autoimmune disorders: implications on risk, diagnosis, and treatment.","authors":"Aruna Rajalingam, Anjali Ganjiwale","doi":"10.1186/s44342-024-00004-5","DOIUrl":"10.1186/s44342-024-00004-5","url":null,"abstract":"<p><p>Autoimmune disorders (ADs) are chronic conditions resulting from failure or breakdown of immunological tolerance, resulting in the host immune system attacking its cells or tissues. Recent studies report shared effects, mechanisms, and evolutionary origins among ADs; however, the possible factors connecting them are unknown. This study attempts to identify gene signatures commonly shared between different autoimmune disorders and elucidate their molecular pathways linking the pathogenesis of these ADs using an integrated gene expression approach. We employed differential gene expression analysis across 19 datasets of whole blood/peripheral blood cell samples with five different autoimmune disorders (rheumatoid arthritis, multiple sclerosis, systemic lupus erythematosus, Crohn's disease, and type 1 diabetes) to get nine key genes-EGR1, RUNX3, SMAD7, NAMPT, S100A9, S100A8, CYBB, GATA2, and MCEMP1 that were primarily involved in cell and leukocyte activation, leukocyte mediated immunity, IL-17, AGE-RAGE signaling in diabetic complications, prion disease, and NOD-like receptor signaling confirming its role in immune-related pathways. Combined with biological interpretations such as gene ontology (GO), pathway enrichment, and protein-protein interaction (PPI) network, our current study sheds light on the in-depth research on early detection, diagnosis, and prognosis of different ADs.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"10"},"PeriodicalIF":0.0,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11221123/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141494729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-26DOI: 10.1186/s44342-024-00008-1
Maryam Ahmadi, Razieh Abdollahi, Marzieh Otogara, Amir Taherkhani
Objective: Cervical cancer (CxCa) stands as a significant global health challenge, ranking fourth in cancer-related mortality among the female population. While chemotherapy regimens have demonstrated incremental progress in extending overall survival, the outlook for recurrent CxCa patients remains disheartening. An imperative necessity arises to delve into innovative therapeutic avenues, with molecular targeted therapy emerging as a promising candidate. Previous investigations have shed light on the therapeutic effectiveness of five distinct herbal compounds, epicatechin, curcumin, myricetin, jatrorrhizine, and arborinine, within the context of CxCa.
Methods: A systems biology approach was employed to discern differentially expressed genes (DEGs) in CxCa tissues relative to healthy cervical epithelial tissues. A protein-protein interaction network (PPIN) was constructed, anchored in the genes related to CxCa. The central genes were discerned within the PPIN, and Kaplan-Meier survival curves explored their prognostic significance. An assessment of the binding affinity of the selected herbal compounds to the master regulator of prognostic markers in CxCa was conducted.
Results: A significant correlation between the overexpression of MYC, IL6, JUN, RRM2, and VEGFA and an adverse prognosis in CxCa was indicated. The regulation of these markers is notably influenced by the transcription factor CEBPD. Molecular docking analysis indicated that the binding affinity between myricetin and the CEBPD DNA binding site was robust.
Conclusion: The findings presented herein have unveiled pivotal genes and pathways that play a central role in the malignant transformation of CxCa. CEBPD has emerged as a potential target for harnessing the therapeutic potential of myricetin in this context.
{"title":"Exploring molecular targets: herbal isolates in cervical cancer therapy.","authors":"Maryam Ahmadi, Razieh Abdollahi, Marzieh Otogara, Amir Taherkhani","doi":"10.1186/s44342-024-00008-1","DOIUrl":"10.1186/s44342-024-00008-1","url":null,"abstract":"<p><strong>Objective: </strong>Cervical cancer (CxCa) stands as a significant global health challenge, ranking fourth in cancer-related mortality among the female population. While chemotherapy regimens have demonstrated incremental progress in extending overall survival, the outlook for recurrent CxCa patients remains disheartening. An imperative necessity arises to delve into innovative therapeutic avenues, with molecular targeted therapy emerging as a promising candidate. Previous investigations have shed light on the therapeutic effectiveness of five distinct herbal compounds, epicatechin, curcumin, myricetin, jatrorrhizine, and arborinine, within the context of CxCa.</p><p><strong>Methods: </strong>A systems biology approach was employed to discern differentially expressed genes (DEGs) in CxCa tissues relative to healthy cervical epithelial tissues. A protein-protein interaction network (PPIN) was constructed, anchored in the genes related to CxCa. The central genes were discerned within the PPIN, and Kaplan-Meier survival curves explored their prognostic significance. An assessment of the binding affinity of the selected herbal compounds to the master regulator of prognostic markers in CxCa was conducted.</p><p><strong>Results: </strong>A significant correlation between the overexpression of MYC, IL6, JUN, RRM2, and VEGFA and an adverse prognosis in CxCa was indicated. The regulation of these markers is notably influenced by the transcription factor CEBPD. Molecular docking analysis indicated that the binding affinity between myricetin and the CEBPD DNA binding site was robust.</p><p><strong>Conclusion: </strong>The findings presented herein have unveiled pivotal genes and pathways that play a central role in the malignant transformation of CxCa. CEBPD has emerged as a potential target for harnessing the therapeutic potential of myricetin in this context.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"9"},"PeriodicalIF":0.0,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11201312/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141461510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-26DOI: 10.1186/s44342-024-00005-4
Dyonisa Nasirochmi Pakha, Ratih Dewi Yudhani, Lalu Muhammad Irham
Though genes are already known to be responsible for type 1 diabetes mellitus (T1DM), the knowledge of missense mutation of that disease gene has still to be under covered. A genomic database and a bioinformatics-based approach are integrated in the present study in order to address this issue. Initially, nine variants associated with T1DM were retrieved from the GWAS catalogue. Different genomic algorithms such as PolyPhen2.0, SNPs and GTEx analyser programs were used to study the structural and functional effects of these mutations. Subsequently, SNPnexus was also employed to understand the effect of these mutations on the function of the expressed protein. Nine missense variants of T1DM were identified using the GWAS catalogue database. Among these nine SNPs, three were predicted to be related to the progression of T1DM disease by affecting the protein level. TYK2 gene variants with SNP rs34536443 were thought to have a probably damaging effect. Meanwhile, both COL4A3 and IFIH1 genes with SNPs rs55703767 and rs35667974, respectively, might alter protein function through a possibly damaging prediction. Among the variants of the three genes, the TYK2 gene with SNP rs34536443 had the strongest contribution in affecting the development of T1DM, with a score of 0.999. We sincerely hope that the results could be of immense importance in understanding the genetic basis of T1DM.
{"title":"Investigation of missense mutation-related type 1 diabetes mellitus through integrating genomic databases and bioinformatic approach.","authors":"Dyonisa Nasirochmi Pakha, Ratih Dewi Yudhani, Lalu Muhammad Irham","doi":"10.1186/s44342-024-00005-4","DOIUrl":"10.1186/s44342-024-00005-4","url":null,"abstract":"<p><p>Though genes are already known to be responsible for type 1 diabetes mellitus (T1DM), the knowledge of missense mutation of that disease gene has still to be under covered. A genomic database and a bioinformatics-based approach are integrated in the present study in order to address this issue. Initially, nine variants associated with T1DM were retrieved from the GWAS catalogue. Different genomic algorithms such as PolyPhen2.0, SNPs and GTEx analyser programs were used to study the structural and functional effects of these mutations. Subsequently, SNPnexus was also employed to understand the effect of these mutations on the function of the expressed protein. Nine missense variants of T1DM were identified using the GWAS catalogue database. Among these nine SNPs, three were predicted to be related to the progression of T1DM disease by affecting the protein level. TYK2 gene variants with SNP rs34536443 were thought to have a probably damaging effect. Meanwhile, both COL4A3 and IFIH1 genes with SNPs rs55703767 and rs35667974, respectively, might alter protein function through a possibly damaging prediction. Among the variants of the three genes, the TYK2 gene with SNP rs34536443 had the strongest contribution in affecting the development of T1DM, with a score of 0.999. We sincerely hope that the results could be of immense importance in understanding the genetic basis of T1DM.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"8"},"PeriodicalIF":0.0,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11201337/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141461511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-17DOI: 10.1186/s44342-024-00011-6
Yuki Yamagata, Ryota Yamada
This study evaluated large language models (LLMs), particularly the GPT-4 with vision (GPT-4 V) and GPT-4 Turbo, for annotating biomedical figures, focusing on cellular senescence. We assessed the ability of LLMs to categorize and annotate complex biomedical images to enhance their accuracy and efficiency. Our experiments employed prompt engineering with figures from review articles, achieving more than 70% accuracy for label extraction and approximately 80% accuracy for node-type classification. Challenges were noted in the correct annotation of the relationship between directionality and inhibitory processes, which were exacerbated as the number of nodes increased. Using figure legends was a more precise identification of sources and targets than using captions, but sometimes lacked pathway details. This study underscores the potential of LLMs in decoding biological mechanisms from text and outlines avenues for improving inhibitory relationship representations in biomedical informatics.
{"title":"Survey on large language model annotation of cellular senescence from figures in review articles.","authors":"Yuki Yamagata, Ryota Yamada","doi":"10.1186/s44342-024-00011-6","DOIUrl":"https://doi.org/10.1186/s44342-024-00011-6","url":null,"abstract":"<p><p>This study evaluated large language models (LLMs), particularly the GPT-4 with vision (GPT-4 V) and GPT-4 Turbo, for annotating biomedical figures, focusing on cellular senescence. We assessed the ability of LLMs to categorize and annotate complex biomedical images to enhance their accuracy and efficiency. Our experiments employed prompt engineering with figures from review articles, achieving more than 70% accuracy for label extraction and approximately 80% accuracy for node-type classification. Challenges were noted in the correct annotation of the relationship between directionality and inhibitory processes, which were exacerbated as the number of nodes increased. Using figure legends was a more precise identification of sources and targets than using captions, but sometimes lacked pathway details. This study underscores the potential of LLMs in decoding biological mechanisms from text and outlines avenues for improving inhibitory relationship representations in biomedical informatics.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"7"},"PeriodicalIF":0.0,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141437956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-17DOI: 10.1186/s44342-024-00010-7
Jieun Lee, Jung-Ah Hwang, Seung-Hyun Hong, Seon-Young Kim, Donghyeok Seol, Il Ju Choi, Yeon-Su Lee
Here, we investigated that the heat shock protein 47 (HSP47) plays a crucial role in the progression of gastric cancer (GC). We analyzed HSP47 gene expression in GC cell lines and patient tissues. The HSP47 mRNA and protein expression levels were significantly higher in GC cell lines and tumor tissues compared to normal gastric mucosa. Using siRNA to silence the expression of HSP47 in GC cells resulted in a significant reduction in their proliferation, wound healing, migration, and invasion capacities. Additionally, we also showed that the mRNA expression of matrix metallopeptidase-7 (MMP-7), a metastasis-promoting gene, was significantly reduced in HSP47 siRNA-transfected GC cells. We confirmed that the HSP47 promoter region was methylated in the SNU-216 GC cell line expressing low levels of HSP47 and in most non-cancerous gastric tissues. It means that the expression of HSP47 is regulated by epigenetic regulatory mechanisms. These findings suggest that targeting HSP47, potentially through its promoter methylation, could be a useful new therapeutic strategy for treating GC.
{"title":"Overexpression of heat shock protein 47 is associated with increased proliferation and metastasis in gastric cancer.","authors":"Jieun Lee, Jung-Ah Hwang, Seung-Hyun Hong, Seon-Young Kim, Donghyeok Seol, Il Ju Choi, Yeon-Su Lee","doi":"10.1186/s44342-024-00010-7","DOIUrl":"10.1186/s44342-024-00010-7","url":null,"abstract":"<p><p>Here, we investigated that the heat shock protein 47 (HSP47) plays a crucial role in the progression of gastric cancer (GC). We analyzed HSP47 gene expression in GC cell lines and patient tissues. The HSP47 mRNA and protein expression levels were significantly higher in GC cell lines and tumor tissues compared to normal gastric mucosa. Using siRNA to silence the expression of HSP47 in GC cells resulted in a significant reduction in their proliferation, wound healing, migration, and invasion capacities. Additionally, we also showed that the mRNA expression of matrix metallopeptidase-7 (MMP-7), a metastasis-promoting gene, was significantly reduced in HSP47 siRNA-transfected GC cells. We confirmed that the HSP47 promoter region was methylated in the SNU-216 GC cell line expressing low levels of HSP47 and in most non-cancerous gastric tissues. It means that the expression of HSP47 is regulated by epigenetic regulatory mechanisms. These findings suggest that targeting HSP47, potentially through its promoter methylation, could be a useful new therapeutic strategy for treating GC.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"6"},"PeriodicalIF":0.0,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11184955/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141437955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-31DOI: 10.1186/s44342-024-00006-3
Victor Jimenez-Vasquez, Natalia Vargas-Herrera, Luis Bárcena-Flores, Verónica Hurtado, Carlos Padilla-Rojas, Roger V Araujo-Castillo
During the third year of the pandemic in Peru, the persistent transmission of SARS-CoV-2 led to the appearance of more transmissible and immune-evasive Omicron sublineages; in that context, the National Genomic Surveillance of SARS-CoV-2 performed by the Peruvian National Institute of Health detected spike mutations in the circulating Omicron BA.5.1.25 sublineage which was later designated as DJ.1 and increased during the fourth COVID-19 wave, this eventually branched into new sublineages. The introduction, emergence, and timing of the most recent common ancestor (tMRCA) of BA.5.1.25 and its descendants (DJ.1, DJ.1.1, DJ.1.2, and DJ.1.3) were investigated in this paper as well as the time lags between their emergence and identification by the Peruvian National Institute of Health. Our findings show that ongoing genomic surveillance of SARS-CoV-2 is critical for understanding its phylogenetic evolution and the emergence of novel variations.
{"title":"Dispersion of SARS-CoV-2 lineage BA.5.1.25 and its descendants in Peru during two COVID-19 waves in 2022.","authors":"Victor Jimenez-Vasquez, Natalia Vargas-Herrera, Luis Bárcena-Flores, Verónica Hurtado, Carlos Padilla-Rojas, Roger V Araujo-Castillo","doi":"10.1186/s44342-024-00006-3","DOIUrl":"10.1186/s44342-024-00006-3","url":null,"abstract":"<p><p>During the third year of the pandemic in Peru, the persistent transmission of SARS-CoV-2 led to the appearance of more transmissible and immune-evasive Omicron sublineages; in that context, the National Genomic Surveillance of SARS-CoV-2 performed by the Peruvian National Institute of Health detected spike mutations in the circulating Omicron BA.5.1.25 sublineage which was later designated as DJ.1 and increased during the fourth COVID-19 wave, this eventually branched into new sublineages. The introduction, emergence, and timing of the most recent common ancestor (tMRCA) of BA.5.1.25 and its descendants (DJ.1, DJ.1.1, DJ.1.2, and DJ.1.3) were investigated in this paper as well as the time lags between their emergence and identification by the Peruvian National Institute of Health. Our findings show that ongoing genomic surveillance of SARS-CoV-2 is critical for understanding its phylogenetic evolution and the emergence of novel variations.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"5"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11184951/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141437994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-31DOI: 10.1186/s44342-024-00007-2
Arpita Singha Roy, Tasmiah Feroz, Md Kobirul Islam, Md Adnan Munim, Dilara Akhter Supti, Nusrat Jahan Antora, Hasan Al Reza, Supriya Gosh, Newaz Mohammed Bahadur, Mohammad Rahanur Alam, Md Shahadat Hossain
Tumor suppressor cylindromatosis protein (CYLD) regulates NF-κB and JNK signaling pathways by cleaving K63-linked poly-ubiquitin chain from its substrate molecules and thus preventing the progression of tumorigenesis and metastasis of the cancer cells. Mutations in CYLD can cause aberrant structure and abnormal functionality leading to tumor formation. In this study, we utilized several computational tools such as PANTHER, PROVEAN, PredictSNP, PolyPhen-2, PhD-SNP, PON-P2, and SIFT to find out deleterious nsSNPs. We also highlighted the damaging impact of those deleterious nsSNPs on the structure and function of the CYLD utilizing ConSurf, I-Mutant, SDM, Phyre2, HOPE, Swiss-PdbViewer, and Mutation 3D. We shortlisted 18 high-risk nsSNPs from a total of 446 nsSNPs recorded in the NCBI database. Based on the conservation profile, stability status, and structural impact analysis, we finalized 13 nsSNPs. Molecular docking analysis and molecular dynamic simulation concluded the study with the findings of two significant nsSNPs (R830K, H827R) which have a remarkable impact on binding affinity, RMSD, RMSF, radius of gyration, and hydrogen bond formation during CYLD-ubiquitin interaction. The principal component analysis compared native and two mutants R830K and H827R of CYLD that signify structural and energy profile fluctuations during molecular dynamic (MD) simulation. Finally, the protein-protein interaction network showed CYLD interacts with 20 proteins involved in several biological pathways that mutations can impair. Considering all these in silico analyses, our study recommended conducting large-scale association studies of nsSNPs of CYLD with cancer as well as designing precise medications against diseases associated with these polymorphisms.
{"title":"A computational approach for structural and functional analyses of disease-associated mutations in the human CYLD gene.","authors":"Arpita Singha Roy, Tasmiah Feroz, Md Kobirul Islam, Md Adnan Munim, Dilara Akhter Supti, Nusrat Jahan Antora, Hasan Al Reza, Supriya Gosh, Newaz Mohammed Bahadur, Mohammad Rahanur Alam, Md Shahadat Hossain","doi":"10.1186/s44342-024-00007-2","DOIUrl":"10.1186/s44342-024-00007-2","url":null,"abstract":"<p><p>Tumor suppressor cylindromatosis protein (CYLD) regulates NF-κB and JNK signaling pathways by cleaving K63-linked poly-ubiquitin chain from its substrate molecules and thus preventing the progression of tumorigenesis and metastasis of the cancer cells. Mutations in CYLD can cause aberrant structure and abnormal functionality leading to tumor formation. In this study, we utilized several computational tools such as PANTHER, PROVEAN, PredictSNP, PolyPhen-2, PhD-SNP, PON-P2, and SIFT to find out deleterious nsSNPs. We also highlighted the damaging impact of those deleterious nsSNPs on the structure and function of the CYLD utilizing ConSurf, I-Mutant, SDM, Phyre2, HOPE, Swiss-PdbViewer, and Mutation 3D. We shortlisted 18 high-risk nsSNPs from a total of 446 nsSNPs recorded in the NCBI database. Based on the conservation profile, stability status, and structural impact analysis, we finalized 13 nsSNPs. Molecular docking analysis and molecular dynamic simulation concluded the study with the findings of two significant nsSNPs (R830K, H827R) which have a remarkable impact on binding affinity, RMSD, RMSF, radius of gyration, and hydrogen bond formation during CYLD-ubiquitin interaction. The principal component analysis compared native and two mutants R830K and H827R of CYLD that signify structural and energy profile fluctuations during molecular dynamic (MD) simulation. Finally, the protein-protein interaction network showed CYLD interacts with 20 proteins involved in several biological pathways that mutations can impair. Considering all these in silico analyses, our study recommended conducting large-scale association studies of nsSNPs of CYLD with cancer as well as designing precise medications against diseases associated with these polymorphisms.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"4"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11184958/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141437993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-31DOI: 10.1186/s44342-024-00002-7
Şeyda Berk, Serkan Özdemir, Ayşe Nur Pektaş
Caenorhabditis elegans (C. elegans) is a nematode and model organism whose entire genome has been mapped, which allows for easy observation of the organism's development due to its transparent structure, and which is appealing due to its ease of crossover, ease of culture, and low cost. Despite being separated by nearly a billion years of evolution, C. elegans homologs have been identified for the vast majority of human genes and are associated with C. elegans for many biological processes such as apoptosis, cell signaling, cell cycle, cell polarity, metabolism, and aging. A detailed bibliometric study is performed here to examine publication trends in this field. Data were taken from the Web of Science database and analyzed using the bibliometric application Biblioshiny (RStudio). In terms of publication, the results indicated a gradual increase each year between 1980 and 2023. A total of 20,322 records were issued in 96 countries, the majority of which were in the USA, China, and Japan. The most prolific writers, the journals most engaged in the area, the nations, institutions, and keywords used by authors were all determined using the Web of Science database and bibliometric rules. The number of papers in the C. elegans research field is increasing exponentially, and Genetics is the journal with the highest number of articles. This study presents how research patterns have evolved throughout time. As a result, worldwide cooperation and a potential field can be developed.
秀丽隐杆线虫(Caenorhabditis elegans)是一种线虫和模式生物,其整个基因组已经绘制完成,由于其透明的结构,可以方便地观察生物的发育过程,并且由于其易于杂交、易于培养和成本低廉而具有吸引力。尽管相隔了近十亿年的进化过程,但绝大多数人类基因的同源物都已被确定,并且在许多生物过程中,如细胞凋亡、细胞信号传导、细胞周期、细胞极性、新陈代谢和衰老等,都与秀丽隐杆线虫有关。本文进行了详细的文献计量学研究,以考察该领域的出版趋势。数据来自科学网数据库,并使用文献计量应用软件 Biblioshiny (RStudio) 进行了分析。结果表明,从 1980 年到 2023 年,论文发表量逐年递增。96个国家共发布了20322条记录,其中大部分在美国、中国和日本。利用 Web of Science 数据库和文献计量学规则,确定了最多产的作者、该领域最活跃的期刊、国家、机构以及作者使用的关键词。蛇尾藻研究领域的论文数量呈指数级增长,而《遗传学》是论文数量最多的期刊。本研究介绍了研究模式的演变过程。因此,世界范围内的合作和一个潜在的领域可以得到发展。
{"title":"Visualization of scientific production in Caenorhabditis elegans: a bibliometric analysis (1980-2023).","authors":"Şeyda Berk, Serkan Özdemir, Ayşe Nur Pektaş","doi":"10.1186/s44342-024-00002-7","DOIUrl":"10.1186/s44342-024-00002-7","url":null,"abstract":"<p><p>Caenorhabditis elegans (C. elegans) is a nematode and model organism whose entire genome has been mapped, which allows for easy observation of the organism's development due to its transparent structure, and which is appealing due to its ease of crossover, ease of culture, and low cost. Despite being separated by nearly a billion years of evolution, C. elegans homologs have been identified for the vast majority of human genes and are associated with C. elegans for many biological processes such as apoptosis, cell signaling, cell cycle, cell polarity, metabolism, and aging. A detailed bibliometric study is performed here to examine publication trends in this field. Data were taken from the Web of Science database and analyzed using the bibliometric application Biblioshiny (RStudio). In terms of publication, the results indicated a gradual increase each year between 1980 and 2023. A total of 20,322 records were issued in 96 countries, the majority of which were in the USA, China, and Japan. The most prolific writers, the journals most engaged in the area, the nations, institutions, and keywords used by authors were all determined using the Web of Science database and bibliometric rules. The number of papers in the C. elegans research field is increasing exponentially, and Genetics is the journal with the highest number of articles. This study presents how research patterns have evolved throughout time. As a result, worldwide cooperation and a potential field can be developed.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"3"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11184956/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141437957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-28DOI: 10.1186/s44342-024-00003-6
Ngoc Thi Bich Chu, Man Thi Le, Hong Viet La, Quynh Thi Ngoc Le, Thao Duc Le, Huyen Thi Thanh Tran, Lan Thi Mai Tran, Chi Toan Le, Dung Viet Nguyen, Phi Bang Cao, Ha Duc Chu
Small auxin-up RNA (SAUR) proteins were known as a large family that supposedly participated in various biological processes in higher plant species. However, the SAUR family has been still not explored in cacao (Theobroma cacao L.), one of the most important industrial trees. The present work, as an in silico study, revealed comprehensive aspects of the structure, phylogeny, and expression of TcSAUR gene family in cacao. A total of 90 members of the TcSAUR gene family have been identified and annotated in the cacao genome. According to the physic-chemical features analysis, all TcSAUR proteins exhibited slightly similar characteristics. Phylogenetic analysis showed that these TcSAUR proteins could be categorized into seven distinct groups, with 10 sub-groups. Our results suggested that tandemly duplication events, segmental duplication events, and whole genome duplication events might be important in the growth of the TcSAUR gene family in cacao. By re-analyzing the available transcriptome databases, we found that a number of TcSAUR genes were exclusively expressed during the zygotic embryogenesis and somatic embryogenesis. Taken together, our study will be valuable to further functional characterizations of candidate TcSAUR genes for the genetic engineering of cacao.
{"title":"Genome-wide identification, characterization, and expression analysis of the small auxin-up RNA gene family during zygotic and somatic embryo maturation of the cacao tree (Theobroma cacao).","authors":"Ngoc Thi Bich Chu, Man Thi Le, Hong Viet La, Quynh Thi Ngoc Le, Thao Duc Le, Huyen Thi Thanh Tran, Lan Thi Mai Tran, Chi Toan Le, Dung Viet Nguyen, Phi Bang Cao, Ha Duc Chu","doi":"10.1186/s44342-024-00003-6","DOIUrl":"10.1186/s44342-024-00003-6","url":null,"abstract":"<p><p>Small auxin-up RNA (SAUR) proteins were known as a large family that supposedly participated in various biological processes in higher plant species. However, the SAUR family has been still not explored in cacao (Theobroma cacao L.), one of the most important industrial trees. The present work, as an in silico study, revealed comprehensive aspects of the structure, phylogeny, and expression of TcSAUR gene family in cacao. A total of 90 members of the TcSAUR gene family have been identified and annotated in the cacao genome. According to the physic-chemical features analysis, all TcSAUR proteins exhibited slightly similar characteristics. Phylogenetic analysis showed that these TcSAUR proteins could be categorized into seven distinct groups, with 10 sub-groups. Our results suggested that tandemly duplication events, segmental duplication events, and whole genome duplication events might be important in the growth of the TcSAUR gene family in cacao. By re-analyzing the available transcriptome databases, we found that a number of TcSAUR genes were exclusively expressed during the zygotic embryogenesis and somatic embryogenesis. Taken together, our study will be valuable to further functional characterizations of candidate TcSAUR genes for the genetic engineering of cacao.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"2"},"PeriodicalIF":0.0,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11184954/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141437954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}