Pub Date : 2024-11-29DOI: 10.1186/s12863-024-01284-0
Klaus Schughart, Annika Möller-Kerutt, Verena Höffken, Pavel Nedvetsky, Ann-Christin Groh, Daniela Anne Braun, Hermann Pavenstädt, Thomas Weide
In epithelia, apicobasal cell polarization is closely linked to cell-cell contact formation, both controlled by the conserved Crumbs (CRB) complex, which includes the transmembrane protein Crumbs (CRB3a) and adapter proteins PALS1, PATJ, and LIN7c. In MDCK II cells, a model for cell polarization, depletion of PALS1 - which binds to all CRB components - leads to defective cell polarization and improper distribution of tight junction proteins, resulting in severe epithelial barrier defects in 3D cyst models. This study investigated whether this phenotype is associated with transcriptional changes by analyzing wildtype (WT) and PALS1 knockout (KO) MDCK II cell lines grown under non-confluent conditions and in 3D cyst cultures. Our results indicate that the transition from non-confluent cells to 3D cysts involves numerous differentially expressed genes (DEGs) in both WT and KO cells. Importantly, the analyses revealed significant overlaps between WT and KO cells in their maturation processes, suggesting that most identified DEGs are linked to differentiation from non-confluent to polarized MDCK cells and likely not a result of PALS1 deficiency. Gene Ontology (GO) enrichment and over-representation analyses using REACTOME and KEGG databases confirmed these similarities. In contrast, the direct comparison of WT and KO cells at the two stages showed fewer DEGs and overlaps in associated biological processes and signaling pathways. DEGs associated with the 3D stage, in which the phenotype manifests, contain DEGs and pathways that were predominantly linked to cell cycle linked processes, centromere assembly, or DNA replication. Furthermore, the transcription of genes encoding key junction proteins, additional polarity proteins, and cell-substrate interaction proteins is less affected by the loss of PALS1, indicating that PALS1 influences the transcriptional profiles in epithelial cells as a modulating factor.
{"title":"PALS1-dependent modulations of mRNA profiles in MDCK II cells grown in non-confluent monolayers and three-dimensional cysts.","authors":"Klaus Schughart, Annika Möller-Kerutt, Verena Höffken, Pavel Nedvetsky, Ann-Christin Groh, Daniela Anne Braun, Hermann Pavenstädt, Thomas Weide","doi":"10.1186/s12863-024-01284-0","DOIUrl":"10.1186/s12863-024-01284-0","url":null,"abstract":"<p><p>In epithelia, apicobasal cell polarization is closely linked to cell-cell contact formation, both controlled by the conserved Crumbs (CRB) complex, which includes the transmembrane protein Crumbs (CRB3a) and adapter proteins PALS1, PATJ, and LIN7c. In MDCK II cells, a model for cell polarization, depletion of PALS1 - which binds to all CRB components - leads to defective cell polarization and improper distribution of tight junction proteins, resulting in severe epithelial barrier defects in 3D cyst models. This study investigated whether this phenotype is associated with transcriptional changes by analyzing wildtype (WT) and PALS1 knockout (KO) MDCK II cell lines grown under non-confluent conditions and in 3D cyst cultures. Our results indicate that the transition from non-confluent cells to 3D cysts involves numerous differentially expressed genes (DEGs) in both WT and KO cells. Importantly, the analyses revealed significant overlaps between WT and KO cells in their maturation processes, suggesting that most identified DEGs are linked to differentiation from non-confluent to polarized MDCK cells and likely not a result of PALS1 deficiency. Gene Ontology (GO) enrichment and over-representation analyses using REACTOME and KEGG databases confirmed these similarities. In contrast, the direct comparison of WT and KO cells at the two stages showed fewer DEGs and overlaps in associated biological processes and signaling pathways. DEGs associated with the 3D stage, in which the phenotype manifests, contain DEGs and pathways that were predominantly linked to cell cycle linked processes, centromere assembly, or DNA replication. Furthermore, the transcription of genes encoding key junction proteins, additional polarity proteins, and cell-substrate interaction proteins is less affected by the loss of PALS1, indicating that PALS1 influences the transcriptional profiles in epithelial cells as a modulating factor.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"25 1","pages":"102"},"PeriodicalIF":1.9,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11607895/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142755641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-27DOI: 10.1186/s12863-024-01283-1
Hailing Huang, Ruiqun Lu, Shenni Peng, Shi Huang, Yinyin Mo, Genliang Li
<p><p>This multi-omics study delves into the expression patterns of PIWIL genes and their correlation with hepatocellular carcinoma (HCC) progression, utilizing whole transcriptome sequencing, bioinformatics, and reverse transcription quantitative polymerase chain reaction (RT-qPCR) in mice. We identified differential expression levels of PIWIL genes between HCC and control tissues and analyzed their roles within the competing endogenous RNA (ceRNA) network related to regulatory non-coding RNA-mediated gene silencing (RNGS). Our findings showed that Piwil1 and Piwil4 were overexpressed while Piwil2 is underexpressed. As ceRNAs, specific lncRNAs, including Pvt1, Gas5, and BGIGI10090_38749, might sponge up miR-351-5p and miR-31-5p, promoting Piwil1 and Piwil4 expression, while miR-133b-3p, lacking ceRNA sponge absorption, continues to inhibit Piwil2. Through their interactions with PPI proteins encoded by RNGS genes, especially Dhx9, Drosha, Mov10, and Tdrd1, PIWI family members might play a multifaceted role in regulating gene expression and metabolic processes, thereby involving the development and progression of HCC. These interactions within the PPI network could influence the stability and activity of PIWIL proteins and contribute to the overall regulation of gene expression and HCC progression. In the RNGS, a diverse array of miRNAs, genes, lncRNAs, circRNAs, and pseudogenes have been observed, which are suggested to intricately interplay, potentially weaving a complex ceRNA regulatory network. Abnormally expressed miRNA-targeted genes in RNGS are associated with key biological processes, such as lipid metabolism and immune responses, crucial for tumor cell survival, and processes supporting tumor growth and invasion, like translation and cytoskeleton organization. This regulation is reflected in distinct KEGG pathways for downregulated and upregulated targets, highlighting the dualistic role of PIWIL genes in modulating HCC progression. The study concludes that PIWI family members have a correlation with HCC progression and play divergent roles in the pathogenesis, with overexpression of the Piwil1 and Piwil4 potentially promoting HCC progression and underexpression of Piwil2 likely suppressing tumor development. The ceRNA mechanism and PPI network are crucial in regulating the expression and function of PIWIL genes, respectively. The intricate ceRNA network potentially regulates the expression of miRNA-targeted genes in RNGS, which might be crucial for tumor survival and promotion, with impacts on immune responses and cell growth based on enriching results of dysregulated miRNA-targeted genes in HCC. By shedding light on the molecular intricacies of HCC, this multi-omics study underscores the pivotal roles of epigenetic regulations, especially the influence of PIWI family genes with other genes and ncRNAs in the RNGS process in HCC pathology. The findings offer valuable insights into the molecular mechanisms underpinning HCC, which may info
{"title":"PIWIL genes in hepatocellular carcinoma: a multi-omics approach uncovering dysregulated expression and ceRNA networks in mice.","authors":"Hailing Huang, Ruiqun Lu, Shenni Peng, Shi Huang, Yinyin Mo, Genliang Li","doi":"10.1186/s12863-024-01283-1","DOIUrl":"10.1186/s12863-024-01283-1","url":null,"abstract":"<p><p>This multi-omics study delves into the expression patterns of PIWIL genes and their correlation with hepatocellular carcinoma (HCC) progression, utilizing whole transcriptome sequencing, bioinformatics, and reverse transcription quantitative polymerase chain reaction (RT-qPCR) in mice. We identified differential expression levels of PIWIL genes between HCC and control tissues and analyzed their roles within the competing endogenous RNA (ceRNA) network related to regulatory non-coding RNA-mediated gene silencing (RNGS). Our findings showed that Piwil1 and Piwil4 were overexpressed while Piwil2 is underexpressed. As ceRNAs, specific lncRNAs, including Pvt1, Gas5, and BGIGI10090_38749, might sponge up miR-351-5p and miR-31-5p, promoting Piwil1 and Piwil4 expression, while miR-133b-3p, lacking ceRNA sponge absorption, continues to inhibit Piwil2. Through their interactions with PPI proteins encoded by RNGS genes, especially Dhx9, Drosha, Mov10, and Tdrd1, PIWI family members might play a multifaceted role in regulating gene expression and metabolic processes, thereby involving the development and progression of HCC. These interactions within the PPI network could influence the stability and activity of PIWIL proteins and contribute to the overall regulation of gene expression and HCC progression. In the RNGS, a diverse array of miRNAs, genes, lncRNAs, circRNAs, and pseudogenes have been observed, which are suggested to intricately interplay, potentially weaving a complex ceRNA regulatory network. Abnormally expressed miRNA-targeted genes in RNGS are associated with key biological processes, such as lipid metabolism and immune responses, crucial for tumor cell survival, and processes supporting tumor growth and invasion, like translation and cytoskeleton organization. This regulation is reflected in distinct KEGG pathways for downregulated and upregulated targets, highlighting the dualistic role of PIWIL genes in modulating HCC progression. The study concludes that PIWI family members have a correlation with HCC progression and play divergent roles in the pathogenesis, with overexpression of the Piwil1 and Piwil4 potentially promoting HCC progression and underexpression of Piwil2 likely suppressing tumor development. The ceRNA mechanism and PPI network are crucial in regulating the expression and function of PIWIL genes, respectively. The intricate ceRNA network potentially regulates the expression of miRNA-targeted genes in RNGS, which might be crucial for tumor survival and promotion, with impacts on immune responses and cell growth based on enriching results of dysregulated miRNA-targeted genes in HCC. By shedding light on the molecular intricacies of HCC, this multi-omics study underscores the pivotal roles of epigenetic regulations, especially the influence of PIWI family genes with other genes and ncRNAs in the RNGS process in HCC pathology. The findings offer valuable insights into the molecular mechanisms underpinning HCC, which may info","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"25 1","pages":"101"},"PeriodicalIF":1.9,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11603867/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142741345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-27DOI: 10.1186/s12863-024-01281-3
Cătălin Vasile Munteanu, Cătălin Marian, Adela Chiriță-Emandi, Maria Puiu, Adrian Pavel Trifa
Lynch syndrome (LS) is one of the most common hereditary cancer syndrome in human populations, associated with germline variants in MLH1, MSH2/EPCAM, MSH6 and PMS2 genes. The advent of next generation sequencing has proven a significant impact in germline variant detection in the causative genes; however, a large proportion of patients with clinical criteria still receive uncertain or negative results. PMS2 is the least frequent reported gene, associated with up to 15% of LS cases with late-onset disease and low penetrance phenotype; however, the proportion of PMS2-LS cases is considered to be highly underestimated. In this context, our analysis aimed to improve the current diagnostic yield by focusing on missense and intronic PMS2 variants available in public clinical databases (ClinVar, LOVD). We performed an in silico assessment of the wild-type DNA sequence and the reported genetic variants, employing splicing bioinformatics tools known for their effectiveness in other genes. Splicing variants were predicted in silico and using GTEx short-read RNA expression data. Out of the 2384 missense variants discovered, 90% were classified with uncertain significance (VUS). 4.9% of missense variants were shown to have a potential splicing consequence (DS > 0.2) using SpliceAI. As described in the original publication, SpliceAI-visual was proven effective in annotation of short intronic variants (< 50 bp). Four short intronic variants were identified using SpliceAI-visual as potentially splicing disturbing, in spite of using a lower threshold (DS > 0.1). Exons 2, 3, 4, 5, 6, 7, 8, 11, 12 and 14 were consistently predicted in at least three out of eight software with weak canonical splice sites. Additionally, we noted that both Exonic Splicing Enhancers (ESEs) and Exonic Splicing Silencers (ESSs) contribute significantly to alternative splicing and exonic selection in PMS2 gene. Specifically, ESE motifs were consistently more abundant in highly expressed exons 5, 11 and 14, while ESS motifs played a fundamental role in exons 6, 7 and 10. Computational analysis performed in our study serves as a valuable filtering step for guiding further RNA experiments. Additional functional data is necessary to validate our findings.
{"title":"In silico splicing analysis of the PMS2 gene: exploring alternative molecular mechanisms in PMS2-associated Lynch syndrome.","authors":"Cătălin Vasile Munteanu, Cătălin Marian, Adela Chiriță-Emandi, Maria Puiu, Adrian Pavel Trifa","doi":"10.1186/s12863-024-01281-3","DOIUrl":"10.1186/s12863-024-01281-3","url":null,"abstract":"<p><p>Lynch syndrome (LS) is one of the most common hereditary cancer syndrome in human populations, associated with germline variants in MLH1, MSH2/EPCAM, MSH6 and PMS2 genes. The advent of next generation sequencing has proven a significant impact in germline variant detection in the causative genes; however, a large proportion of patients with clinical criteria still receive uncertain or negative results. PMS2 is the least frequent reported gene, associated with up to 15% of LS cases with late-onset disease and low penetrance phenotype; however, the proportion of PMS2-LS cases is considered to be highly underestimated. In this context, our analysis aimed to improve the current diagnostic yield by focusing on missense and intronic PMS2 variants available in public clinical databases (ClinVar, LOVD). We performed an in silico assessment of the wild-type DNA sequence and the reported genetic variants, employing splicing bioinformatics tools known for their effectiveness in other genes. Splicing variants were predicted in silico and using GTEx short-read RNA expression data. Out of the 2384 missense variants discovered, 90% were classified with uncertain significance (VUS). 4.9% of missense variants were shown to have a potential splicing consequence (DS > 0.2) using SpliceAI. As described in the original publication, SpliceAI-visual was proven effective in annotation of short intronic variants (< 50 bp). Four short intronic variants were identified using SpliceAI-visual as potentially splicing disturbing, in spite of using a lower threshold (DS > 0.1). Exons 2, 3, 4, 5, 6, 7, 8, 11, 12 and 14 were consistently predicted in at least three out of eight software with weak canonical splice sites. Additionally, we noted that both Exonic Splicing Enhancers (ESEs) and Exonic Splicing Silencers (ESSs) contribute significantly to alternative splicing and exonic selection in PMS2 gene. Specifically, ESE motifs were consistently more abundant in highly expressed exons 5, 11 and 14, while ESS motifs played a fundamental role in exons 6, 7 and 10. Computational analysis performed in our study serves as a valuable filtering step for guiding further RNA experiments. Additional functional data is necessary to validate our findings.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"25 1","pages":"100"},"PeriodicalIF":1.9,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11600730/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142735088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: Peach is a deciduous tree widely cultivated in temperate and subtropical regions that requires a process of bud endodormancy to produce normal flowering and fruiting. This release requires a certain accumulation of cold, named chilling requirement (CR). CR is genotype dependent and with varies levels among different species and accessions. Thus, we collected the bud transcriptomic data of two peaches with different CR levels and conduct a series standard basic analysis. The peach bud transcriptomic data we gathered provides a valuable dataset for exploring the relationships between gene expression and peach CR levels.
Data description: We extracted and sequenced the RNA of different CR peach buds at the same status in three endodormancy stages. Each stages have three biological replicates. A total of 18 RNA-seq libraries were obtained and mapped to the reference genome after quality control. The gene expression level was normalized by two methods (TPM and FPKM). Differentially expressed genes (DEGs) analysis revealed that a total of 2,481 unique genes with an absolute value of log2 fold change (FC) greater than 1.0. Homologous functional annotation of these DEGs were conducted which provided further information for CR potential related genes identified and functional genomics studies.
{"title":"Transcriptomic data of peach varieties with different chilling requirement levels.","authors":"Weihan Zhang, Yayun Sun, Haiyan Li, Yuepeng Han, Baoxiong Wan, Liao Liao","doi":"10.1186/s12863-024-01279-x","DOIUrl":"10.1186/s12863-024-01279-x","url":null,"abstract":"<p><strong>Objectives: </strong>Peach is a deciduous tree widely cultivated in temperate and subtropical regions that requires a process of bud endodormancy to produce normal flowering and fruiting. This release requires a certain accumulation of cold, named chilling requirement (CR). CR is genotype dependent and with varies levels among different species and accessions. Thus, we collected the bud transcriptomic data of two peaches with different CR levels and conduct a series standard basic analysis. The peach bud transcriptomic data we gathered provides a valuable dataset for exploring the relationships between gene expression and peach CR levels.</p><p><strong>Data description: </strong>We extracted and sequenced the RNA of different CR peach buds at the same status in three endodormancy stages. Each stages have three biological replicates. A total of 18 RNA-seq libraries were obtained and mapped to the reference genome after quality control. The gene expression level was normalized by two methods (TPM and FPKM). Differentially expressed genes (DEGs) analysis revealed that a total of 2,481 unique genes with an absolute value of log2 fold change (FC) greater than 1.0. Homologous functional annotation of these DEGs were conducted which provided further information for CR potential related genes identified and functional genomics studies.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"25 1","pages":"99"},"PeriodicalIF":1.9,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11600755/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142735089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
CGGBP1 is a GC-rich DNA-binding protein which is important for genomic integrity, gene expression and epigenome maintenance through regulation of CTCF occupancy and cytosine methylation. It has remained unclear how CGGBP1 integrates multiple diverse functions with its simple architecture of only a DNA-binding domain tethered to a C-terminal tail with low structural rigidity. We have used truncated forms of CGGBP1 with or without the DNA-binding domain (DBD) to assay cytosine methylation and global gene expression. Proximal promoters of CGGBP1-repressed genes, although significantly GC-poor, contain GC-rich transcription factor binding motifs and exhibit base compositions indicative of low C-T transition rates due to prevention of cytosine methylation. Genome-wide analyses of cytosine methylation and binding of CGGBP1 DBD show that CGGBP1 restricts cytosine methylation in a manner that depends on its DBD and its DNA-binding. The CGGBP1-repressed genes show an increase in promoter cytosine methylation alongside a decrease in transcript abundance when the DBD-deficient CGGBP1 is expressed. Our findings suggest that CGGBP1 protects transcription factor binding sites (TFBS) from cytosine methylation-associated loss and thereby regulates gene expression. By analysing orthologous promoter sequences we show that restriction of cytosine methylation is a function of CGGBP1 progressively acquired during vertebrate evolution. A superimposition of our results and evolution of CGGBP1 suggests that mitigation of cytosine methylation is majorly achieved by its N-terminal DBD. Our results position CGGBP1 DNA-binding as a major evolutionarily acquired mechanism through which it keeps cytosine methylation under check and regulates TFBS retention and gene activity.
CGGBP1 是一种富含 GC 的 DNA 结合蛋白,通过调控 CTCF 占有率和胞嘧啶甲基化,对基因组完整性、基因表达和表观基因组的维护具有重要作用。CGGBP1 的结构简单,只有一个 DNA 结合域与结构刚度较低的 C 端尾部相连,但它是如何整合多种不同功能的,目前仍不清楚。我们利用含有或不含 DNA 结合域(DBD)的 CGGBP1 截短形式来检测胞嘧啶甲基化和全局基因表达。CGGBP1抑制基因的近端启动子虽然明显缺乏GC,但含有富含GC的转录因子结合基团,并且由于胞嘧啶甲基化的阻止,其碱基组成显示出较低的C-T转换率。对胞嘧啶甲基化和 CGGBP1 DBD 结合的全基因组分析表明,CGGBP1 限制胞嘧啶甲基化的方式取决于其 DBD 及其 DNA 结合。当 DBD 缺失的 CGGBP1 表达时,CGGBP1 抑制基因的启动子胞嘧啶甲基化增加,同时转录本丰度下降。我们的研究结果表明,CGGBP1 保护转录因子结合位点(TFBS)免受胞嘧啶甲基化相关损失,从而调节基因表达。通过分析同源启动子序列,我们发现限制胞嘧啶甲基化是 CGGBP1 在脊椎动物进化过程中逐渐获得的一种功能。我们的研究结果与 CGGBP1 进化过程的叠加表明,胞嘧啶甲基化的缓解主要是通过其 N 端 DBD 实现的。我们的研究结果将 CGGBP1 的 DNA 结合定位为一种主要的进化机制,通过这种机制,CGGBP1 可控制胞嘧啶甲基化并调节 TFBS 的保留和基因活性。
{"title":"The amniote-conserved DNA-binding domain of CGGBP1 restricts cytosine methylation of transcription factor binding sites in proximal promoters to regulate gene expression.","authors":"Ishani Morbia, Praveen Kumar, Aditi Lakshmi Satish, Akanksha Mudgal, Subhamoy Datta, Umashankar Singh","doi":"10.1186/s12863-024-01282-2","DOIUrl":"10.1186/s12863-024-01282-2","url":null,"abstract":"<p><p>CGGBP1 is a GC-rich DNA-binding protein which is important for genomic integrity, gene expression and epigenome maintenance through regulation of CTCF occupancy and cytosine methylation. It has remained unclear how CGGBP1 integrates multiple diverse functions with its simple architecture of only a DNA-binding domain tethered to a C-terminal tail with low structural rigidity. We have used truncated forms of CGGBP1 with or without the DNA-binding domain (DBD) to assay cytosine methylation and global gene expression. Proximal promoters of CGGBP1-repressed genes, although significantly GC-poor, contain GC-rich transcription factor binding motifs and exhibit base compositions indicative of low C-T transition rates due to prevention of cytosine methylation. Genome-wide analyses of cytosine methylation and binding of CGGBP1 DBD show that CGGBP1 restricts cytosine methylation in a manner that depends on its DBD and its DNA-binding. The CGGBP1-repressed genes show an increase in promoter cytosine methylation alongside a decrease in transcript abundance when the DBD-deficient CGGBP1 is expressed. Our findings suggest that CGGBP1 protects transcription factor binding sites (TFBS) from cytosine methylation-associated loss and thereby regulates gene expression. By analysing orthologous promoter sequences we show that restriction of cytosine methylation is a function of CGGBP1 progressively acquired during vertebrate evolution. A superimposition of our results and evolution of CGGBP1 suggests that mitigation of cytosine methylation is majorly achieved by its N-terminal DBD. Our results position CGGBP1 DNA-binding as a major evolutionarily acquired mechanism through which it keeps cytosine methylation under check and regulates TFBS retention and gene activity.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"25 1","pages":"98"},"PeriodicalIF":1.9,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11575156/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142670087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: Wild soybean (Glycine soja), the ancestor of domesticated soybean, retains a higher level of genetic diversity and adaptability to harsh environments, making it highly valuable for breeding. Here, we re-sequenced 69 wild soybean individuals collected by the Shandong Academy of Agricultural Sciences and identified 1,613,162 high-quality SNPs which not only enriches our understanding of the genetic structure of wild soybean, but also provides valuable resources for further genomic research and genetic improvement of soybean.
Data description: In this study, we collected 69 wild soybean accessions from Shandong Province, China, and performed re-sequencing on the DNBSEQ platform, followed by SNPs identification. We then integrated ADMIXTURE, neighbor-joining tree, and principal component analysis to illustrate population characteristics. The results showed that these wild soybean accessions could be divided into three distinct subpopulations, exhibiting significant genetic differences.
{"title":"Comprehensive analysis of the genetic variation dataset among wild soybean (Glycine soja) in Shandong Province, China.","authors":"Lei-Lei Li, Ru-Mei Tian, Yan-Yan Pu, Yun-Zhe Cong, Xue Chen, Kai-Hua Jia, Na-Na Li","doi":"10.1186/s12863-024-01280-4","DOIUrl":"10.1186/s12863-024-01280-4","url":null,"abstract":"<p><strong>Objectives: </strong>Wild soybean (Glycine soja), the ancestor of domesticated soybean, retains a higher level of genetic diversity and adaptability to harsh environments, making it highly valuable for breeding. Here, we re-sequenced 69 wild soybean individuals collected by the Shandong Academy of Agricultural Sciences and identified 1,613,162 high-quality SNPs which not only enriches our understanding of the genetic structure of wild soybean, but also provides valuable resources for further genomic research and genetic improvement of soybean.</p><p><strong>Data description: </strong>In this study, we collected 69 wild soybean accessions from Shandong Province, China, and performed re-sequencing on the DNBSEQ platform, followed by SNPs identification. We then integrated ADMIXTURE, neighbor-joining tree, and principal component analysis to illustrate population characteristics. The results showed that these wild soybean accessions could be divided into three distinct subpopulations, exhibiting significant genetic differences.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"25 1","pages":"97"},"PeriodicalIF":1.9,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11552345/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142634014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-08DOI: 10.1186/s12863-024-01278-y
Alana Firl, Meric C Lieberman, Nestor Kippes, Helen Tsai, Eric Dowd, Luca Comai, Isabelle M Henry
Objectives: Mint oil is used in various commercial applications world-wide. Mint oil is typically harvested from commercial clones of peppermint or spearmints. Spearmints are the product of a cross between two diploid species: Mentha longifolia (horse mint) and Mentha suaveolens (apple mint). Peppermints are the product of an additional hybridization step between spearmint and an octoploid Mentha aquatica (water mint). Here, we present a chromosome-scale assembly of the genome of a clone of M. suaveolens. Together with the previously assembled genome of M. longifolia, these assemblies are instrumental in addressing questions regarding the origins of spearmint and peppermint oil and the genomic composition of commercial spearmints, and to start elaborating strategies for mint cultivar improvement.
Data description: A Falcon assembly of the genome of M. suaveolens was generated from 103X coverage of PacBio long reads. Additional scaffolding was conducted by Dovetail Genomics, using a Chicago library, and a HiC library. The resulting assembly had an N50 of 44.7 Mb, and 98.45% of the 536 Mb of the assembly were contained within 12 large superscaffolds. Finally, a genetic map was applied to correct persistent misjoins. Illumina RNA-Seq libraries from a variety of tissues were used to annotate the genome.
{"title":"Chromosome-scale assembly of apple mint (Mentha suaveolens).","authors":"Alana Firl, Meric C Lieberman, Nestor Kippes, Helen Tsai, Eric Dowd, Luca Comai, Isabelle M Henry","doi":"10.1186/s12863-024-01278-y","DOIUrl":"10.1186/s12863-024-01278-y","url":null,"abstract":"<p><strong>Objectives: </strong>Mint oil is used in various commercial applications world-wide. Mint oil is typically harvested from commercial clones of peppermint or spearmints. Spearmints are the product of a cross between two diploid species: Mentha longifolia (horse mint) and Mentha suaveolens (apple mint). Peppermints are the product of an additional hybridization step between spearmint and an octoploid Mentha aquatica (water mint). Here, we present a chromosome-scale assembly of the genome of a clone of M. suaveolens. Together with the previously assembled genome of M. longifolia, these assemblies are instrumental in addressing questions regarding the origins of spearmint and peppermint oil and the genomic composition of commercial spearmints, and to start elaborating strategies for mint cultivar improvement.</p><p><strong>Data description: </strong>A Falcon assembly of the genome of M. suaveolens was generated from 103X coverage of PacBio long reads. Additional scaffolding was conducted by Dovetail Genomics, using a Chicago library, and a HiC library. The resulting assembly had an N50 of 44.7 Mb, and 98.45% of the 536 Mb of the assembly were contained within 12 large superscaffolds. Finally, a genetic map was applied to correct persistent misjoins. Illumina RNA-Seq libraries from a variety of tissues were used to annotate the genome.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"25 1","pages":"96"},"PeriodicalIF":1.9,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11549836/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-05DOI: 10.1186/s12863-024-01277-z
Ikwhan Kim, Da-Ryung Jung, Ryeong-Hui Kim, Dokyung Lee, YeonGyun Jung, Ji Hoon Ha, Eun Kyung Lee, Jin Mo Kim, Jin Young Kim, Jun-Hwan Jang, Jun-Tae Bae, Yoon Soo Cho, Jae-Ho Shin
Objectives: Cutibacterium acnes is a Gram-positive bacterium commonly found on human skin, particularly in sebaceous areas. While it is typically considered a commensal, specific strain types based on single locus sequence typing (SLST) have been associated with pathogenic conditions or healthy skin. Recently, SLST D1 strains, part of phylotype IA1, have received attention for their potential benefits related to skin health. However, their genetic characteristics remain underexplored. Therefore, the whole genome of C. acnes CN6, an SLST D1 strain isolated from the facial skin of a healthy individual, was sequenced to expand the understanding of SLST D1 strains and identify genomic features that may support skin health.
Data description: The whole genome sequencing of C. acnes CN6 was conducted using MinION reads based on de novo assembly, revealing a single circular complete chromosome. With the length of 2,550,458 bp and G + C content of 60.04%, the genome contains 2,492 genes, including 2,433 CDSs, 9 rRNAs, 46 tRNAs, 4 ncRNAs, and 134 pseudo genes. Previously predicted virulence proteins of C. ances were detected in the genome. Genome comparation with 200 C. acnes strains isolated from healthy facial skin revealed SLST D1 strain-specific genes and a unique variant of the znuC gene in D1 strains.
{"title":"Complete genome of single locus sequence typing D1 strain Cutibacterium acnes CN6 isolated from healthy facial skin.","authors":"Ikwhan Kim, Da-Ryung Jung, Ryeong-Hui Kim, Dokyung Lee, YeonGyun Jung, Ji Hoon Ha, Eun Kyung Lee, Jin Mo Kim, Jin Young Kim, Jun-Hwan Jang, Jun-Tae Bae, Yoon Soo Cho, Jae-Ho Shin","doi":"10.1186/s12863-024-01277-z","DOIUrl":"10.1186/s12863-024-01277-z","url":null,"abstract":"<p><strong>Objectives: </strong>Cutibacterium acnes is a Gram-positive bacterium commonly found on human skin, particularly in sebaceous areas. While it is typically considered a commensal, specific strain types based on single locus sequence typing (SLST) have been associated with pathogenic conditions or healthy skin. Recently, SLST D1 strains, part of phylotype IA1, have received attention for their potential benefits related to skin health. However, their genetic characteristics remain underexplored. Therefore, the whole genome of C. acnes CN6, an SLST D1 strain isolated from the facial skin of a healthy individual, was sequenced to expand the understanding of SLST D1 strains and identify genomic features that may support skin health.</p><p><strong>Data description: </strong>The whole genome sequencing of C. acnes CN6 was conducted using MinION reads based on de novo assembly, revealing a single circular complete chromosome. With the length of 2,550,458 bp and G + C content of 60.04%, the genome contains 2,492 genes, including 2,433 CDSs, 9 rRNAs, 46 tRNAs, 4 ncRNAs, and 134 pseudo genes. Previously predicted virulence proteins of C. ances were detected in the genome. Genome comparation with 200 C. acnes strains isolated from healthy facial skin revealed SLST D1 strain-specific genes and a unique variant of the znuC gene in D1 strains.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"25 1","pages":"94"},"PeriodicalIF":1.9,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11539642/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142584515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-05DOI: 10.1186/s12863-024-01276-0
Richa Sharma, Rukmini Mishra, Raj Kumar Joshi
Objectives: Purple blotch, caused by the necrotrophic pathogen Alternaria porri, is one of the most economically significant diseases of onion and allied crops. While the virulent nature of many Alternaria spp. has been identified, the pathogenic repertoire of A. porri is still unknown. The objective of this work was to sequence the genome of A. porri using the PacBio SMRT sequencing strategy and analyse the repertoire of CAZymes, secondary metabolites, secretome and effectors in A. porri. Our research group is working to identify onion germplasm with purple blotch resistance and to understand the genetics of the pathogen. The reported de-novo assembly will contribute to the analysis of potential variants and the gene repertoire contributing to the virulence and pathogenicity of the purple blotch pathogen.
Data description: Long-read sequencing on a PacBio Sequel II system resulted in a 32.98 Mb (20 contigs) assembly with an N50 of 2, 657, 264 bp, the longest contig length of 5.05 Mb, and a GC content of 51.06%. The Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis resulted in 99.7% genome completeness at the Dothideomycetes lineage, representing a high-quality genome assembly. AUGUSTUS ab initio analysis resulted in 9875 protein-coding genes. Of the 6776 pathogenicity-related genes, 537 genes with effector functions were identified. Likewise, the glycoside hydrolases (434) were the most dominant group of the total 837 predicted CAZymes. The assembled genome of A. porri showed distinctive similarities to the genomes of A. alternata and A. brassicicola, the causal agents of leaf blight of onion and leaf spot of Brassica crops, respectively.
目的:由坏死性病原菌猪链孢属(Alternaria porri)引起的紫斑病是洋葱及其相关作物中最具经济意义的病害之一。虽然已经确定了许多 Alternaria 属的致病性,但 A. porri 的致病性仍然未知。这项工作的目的是利用 PacBio SMRT 测序策略对 A. porri 的基因组进行测序,并分析 A. porri 的 CAZymes、次生代谢物、分泌物组和效应器。我们的研究小组正致力于鉴定具有紫斑病抗性的洋葱种质,并了解病原体的遗传学。所报告的全新组装将有助于分析潜在的变体以及有助于紫斑病病原体毒力和致病性的基因库:在 PacBio Sequel II 系统上进行长线程测序后,得到了 32.98 Mb(20 个等位基因)的组装结果,N50 为 2,657,264 bp,最长等位基因长度为 5.05 Mb,GC 含量为 51.06%。通用单拷贝同源物基准(BUSCO)分析结果表明,Dothideomycetes 品系的基因组完整度为 99.7%,代表了高质量的基因组组装。AUGUSTUS 自证分析得出了 9875 个蛋白质编码基因。在 6776 个致病性相关基因中,发现了 537 个具有效应功能的基因。同样,在总共 837 个预测的 CAZymes 中,糖苷水解酶(434 个)是最主要的一组。所组装的 A. porri 基因组与 A. alternata 和 A. brassicicola(分别是洋葱叶枯病和芸苔属作物叶斑病的病原菌)的基因组显示出明显的相似性。
{"title":"A highly contiguous genome sequence of Alternaria porri isolate Apn-Nashik causing purple blotch disease in onion.","authors":"Richa Sharma, Rukmini Mishra, Raj Kumar Joshi","doi":"10.1186/s12863-024-01276-0","DOIUrl":"10.1186/s12863-024-01276-0","url":null,"abstract":"<p><strong>Objectives: </strong>Purple blotch, caused by the necrotrophic pathogen Alternaria porri, is one of the most economically significant diseases of onion and allied crops. While the virulent nature of many Alternaria spp. has been identified, the pathogenic repertoire of A. porri is still unknown. The objective of this work was to sequence the genome of A. porri using the PacBio SMRT sequencing strategy and analyse the repertoire of CAZymes, secondary metabolites, secretome and effectors in A. porri. Our research group is working to identify onion germplasm with purple blotch resistance and to understand the genetics of the pathogen. The reported de-novo assembly will contribute to the analysis of potential variants and the gene repertoire contributing to the virulence and pathogenicity of the purple blotch pathogen.</p><p><strong>Data description: </strong>Long-read sequencing on a PacBio Sequel II system resulted in a 32.98 Mb (20 contigs) assembly with an N50 of 2, 657, 264 bp, the longest contig length of 5.05 Mb, and a GC content of 51.06%. The Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis resulted in 99.7% genome completeness at the Dothideomycetes lineage, representing a high-quality genome assembly. AUGUSTUS ab initio analysis resulted in 9875 protein-coding genes. Of the 6776 pathogenicity-related genes, 537 genes with effector functions were identified. Likewise, the glycoside hydrolases (434) were the most dominant group of the total 837 predicted CAZymes. The assembled genome of A. porri showed distinctive similarities to the genomes of A. alternata and A. brassicicola, the causal agents of leaf blight of onion and leaf spot of Brassica crops, respectively.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"25 1","pages":"95"},"PeriodicalIF":1.9,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11539676/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142584432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Coronary artery disease (CAD) significantly contributes to global fatalities. Recent studies have demonstrated the crucial roles of sortilin1 (SORT1) and sestrin1 (SESN1) in lipid metabolism, as well as their involvement in the development of CAD. The aberrant expression or activity of SORT1 can consequently lead to metabolic and vascular diseases. Sestrins, including SESN1, play a crucial role in helping cells survive by maintaining metabolic balance while also reducing oxidative stress (OS). OS contributes to the progression of atherosclerosis-related diseases, such as CAD. The study aimed to compare the gene expression of SORT1 and SESN1 in peripheral blood mononuclear cells (PBMCs), alongside serum OS markers, in CAD patients and controls.
Materials: The case-control study included 49 CAD patients and 40 controls. The expression of the SORT1 and SESN1 genes was quantified using qRT-PCR, and the expression of the SORT1 protein was evaluated by western blotting. OS markers, including total oxidation status (TOS), total antioxidant capacity (TAC), and malondialdehyde (MDA), were measured using spectrophotometric and fluorometric methods.
Results: SORT1 gene and protein expressions were similar between groups. CAD patients had a non-significant decrease in SESN1 gene expression. MDA levels were significantly higher in CAD patients, whereas TOS and TAC levels did not differ significantly.
Conclusion: For atherosclerosis-related disorders like CAD, MDA shows potential as a non-invasive, easy-to-use, affordable, and stable biomarker. Further research is needed to elucidate the precise roles of SORT1 and SESN1 in CAD pathogenesis.
{"title":"Evaluating SORT1 and SESN1 genes expression in peripheral blood mononuclear cells and oxidative stress status in patients with coronary artery disease.","authors":"Tayebe Ghiasvand, Jamshid Karimi, Iraj Khodadadi, Amirhossein Yazdi, Salman Khazaei, Zahra Abedi Kichi, Seyed Kianoosh Hosseini","doi":"10.1186/s12863-024-01275-1","DOIUrl":"10.1186/s12863-024-01275-1","url":null,"abstract":"<p><strong>Background: </strong>Coronary artery disease (CAD) significantly contributes to global fatalities. Recent studies have demonstrated the crucial roles of sortilin1 (SORT1) and sestrin1 (SESN1) in lipid metabolism, as well as their involvement in the development of CAD. The aberrant expression or activity of SORT1 can consequently lead to metabolic and vascular diseases. Sestrins, including SESN1, play a crucial role in helping cells survive by maintaining metabolic balance while also reducing oxidative stress (OS). OS contributes to the progression of atherosclerosis-related diseases, such as CAD. The study aimed to compare the gene expression of SORT1 and SESN1 in peripheral blood mononuclear cells (PBMCs), alongside serum OS markers, in CAD patients and controls.</p><p><strong>Materials: </strong>The case-control study included 49 CAD patients and 40 controls. The expression of the SORT1 and SESN1 genes was quantified using qRT-PCR, and the expression of the SORT1 protein was evaluated by western blotting. OS markers, including total oxidation status (TOS), total antioxidant capacity (TAC), and malondialdehyde (MDA), were measured using spectrophotometric and fluorometric methods.</p><p><strong>Results: </strong>SORT1 gene and protein expressions were similar between groups. CAD patients had a non-significant decrease in SESN1 gene expression. MDA levels were significantly higher in CAD patients, whereas TOS and TAC levels did not differ significantly.</p><p><strong>Conclusion: </strong>For atherosclerosis-related disorders like CAD, MDA shows potential as a non-invasive, easy-to-use, affordable, and stable biomarker. Further research is needed to elucidate the precise roles of SORT1 and SESN1 in CAD pathogenesis.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"25 1","pages":"93"},"PeriodicalIF":1.9,"publicationDate":"2024-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11531137/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142565440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}