Pub Date : 2025-03-17DOI: 10.1186/s12863-025-01310-9
Shanwen Ding, Zijun Ma, Yafei Tang, Lin Yu, Zifu He, Xiaoman She
Objectives: The genus Paenibacillus encompasses a diverse group of Gram-positive bacteria with significant biotechnological potential. However, the research data and application cases of Paenibacillus taichungensis were still poorly understood. In this study, we isolated a P. taichungensis strain BB507, which demonstrated antibacterial effect on Ralstonia solanacearum species complex, and provided data and analysis of its complete genome.
Data description: Strain BB507 was isolated from a pine rhizosphere in Meizhou city, Guangdong province of China, and showed antibacterial activity against Ralstonia solanacearum species complex. Complete genome was sequenced using Illumina NovaSeq (second-generation) and Oxford Nanopore (third-generation) platforms. The genome of BB507 comprised of a 6,974,531 bp circular chromosome and a 352,197 bp circular plasmid, encoding a total of 6,649 gene with an average gene length of 950 bp, 103 transfer RNAs, 2 sRNAs, and 36 rRNAs. Three candidate CRISPRs, 6 genomic islands and 14 prophages were predicted. The bacterial orthologous average nucleotide identity (OAT) and the type genome server (TYGS) analysis highlighted the strain BB507 was clustered into a subgroup with P. taichungensis. antiSMASH v7.0 predicted the presence of 10 secondary metabolite gene clusters in the genome. These findings will serve as a useful resource for further research in industrial and agricultural biotechnology.
{"title":"Unveiling the complete genome sequence of Paenibacillus taichungensis: genomic features and biocontrol potential.","authors":"Shanwen Ding, Zijun Ma, Yafei Tang, Lin Yu, Zifu He, Xiaoman She","doi":"10.1186/s12863-025-01310-9","DOIUrl":"https://doi.org/10.1186/s12863-025-01310-9","url":null,"abstract":"<p><strong>Objectives: </strong>The genus Paenibacillus encompasses a diverse group of Gram-positive bacteria with significant biotechnological potential. However, the research data and application cases of Paenibacillus taichungensis were still poorly understood. In this study, we isolated a P. taichungensis strain BB507, which demonstrated antibacterial effect on Ralstonia solanacearum species complex, and provided data and analysis of its complete genome.</p><p><strong>Data description: </strong>Strain BB507 was isolated from a pine rhizosphere in Meizhou city, Guangdong province of China, and showed antibacterial activity against Ralstonia solanacearum species complex. Complete genome was sequenced using Illumina NovaSeq (second-generation) and Oxford Nanopore (third-generation) platforms. The genome of BB507 comprised of a 6,974,531 bp circular chromosome and a 352,197 bp circular plasmid, encoding a total of 6,649 gene with an average gene length of 950 bp, 103 transfer RNAs, 2 sRNAs, and 36 rRNAs. Three candidate CRISPRs, 6 genomic islands and 14 prophages were predicted. The bacterial orthologous average nucleotide identity (OAT) and the type genome server (TYGS) analysis highlighted the strain BB507 was clustered into a subgroup with P. taichungensis. antiSMASH v7.0 predicted the presence of 10 secondary metabolite gene clusters in the genome. These findings will serve as a useful resource for further research in industrial and agricultural biotechnology.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"18"},"PeriodicalIF":1.9,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143652416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: Polyporus umbellatus is a well-known medicinal fungus in Asia. Due to its long growth cycle, wild resources of P. umbellatus are rapidly declining. Research on P. umbellatus growth is scarce, thereby impeding the investigation into the mechanism of sclerotium formation. In this study, the whole genome sequence of P. umbellatus was assembled and annotated.
Data description: The generated genome was 79.74 Mb with an N50 of 969.73 Kbp, a GC content of 50.63%, encoding 10,864 protein-coding genes. Additionally, 82 ribosomal RNAs, 488 transfer RNAs, 25 small nuclear RNAs (snRNAs) and 741 micro RNA (miRNAs) were identified. This genomic resource will unveil molecular mechanisms of sclerotium formation and provides insight into the underlying molecular processes in medicinal fungi.
{"title":"Genome sequencing highlights the fungal sclerotium formation of medicinal fungi Polyporus umbellatus.","authors":"Li Chi, Tianrui Liu, Zhongyi Hua, Pengjie Han, Honghong Jiao, Yuan Yuan","doi":"10.1186/s12863-025-01311-8","DOIUrl":"https://doi.org/10.1186/s12863-025-01311-8","url":null,"abstract":"<p><strong>Objectives: </strong>Polyporus umbellatus is a well-known medicinal fungus in Asia. Due to its long growth cycle, wild resources of P. umbellatus are rapidly declining. Research on P. umbellatus growth is scarce, thereby impeding the investigation into the mechanism of sclerotium formation. In this study, the whole genome sequence of P. umbellatus was assembled and annotated.</p><p><strong>Data description: </strong>The generated genome was 79.74 Mb with an N50 of 969.73 Kbp, a GC content of 50.63%, encoding 10,864 protein-coding genes. Additionally, 82 ribosomal RNAs, 488 transfer RNAs, 25 small nuclear RNAs (snRNAs) and 741 micro RNA (miRNAs) were identified. This genomic resource will unveil molecular mechanisms of sclerotium formation and provides insight into the underlying molecular processes in medicinal fungi.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"17"},"PeriodicalIF":1.9,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143652398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-12DOI: 10.1186/s12863-025-01308-3
Long Chen, Xingyan Tong, Yujie Wu, Can Liu, Chuang Tang, Xu Qi, Fanli Kong, Mingzhou Li, Long Jin, Bo Zeng
The use of single-cell sequencing technology for single-cell transcriptomics studies in pigs is expanding progressively. However, the comprehensive classification of cell types across different anatomical tissues and organs of pig in multiple datasets remains relatively limited. This study employs single-cell and single-nucleus sequencing technologies in Bama pig to identify unique marker genes and their corresponding transcriptomic profiles across diverse cell types in various anatomical tissues and organs, including subcutaneous fat, visceral fat, psoas major muscle, liver, spleen, lung, and kidney. Through detailed data analyses, we observed widespread cellular diversity across various anatomical tissues and organs of Bama pig. This work contributes a comprehensive dataset that supports physiological studies and aids in the identification and prediction of potential marker genes through single-cell transcriptomics of these tissues. The methodologies and data employed in this study are designed to improve the accuracy of cell type identification and ensure consistent cell type allocation.
{"title":"A dataset of single-cell transcriptomic atlas of Bama pig and potential marker genes across seven tissues.","authors":"Long Chen, Xingyan Tong, Yujie Wu, Can Liu, Chuang Tang, Xu Qi, Fanli Kong, Mingzhou Li, Long Jin, Bo Zeng","doi":"10.1186/s12863-025-01308-3","DOIUrl":"10.1186/s12863-025-01308-3","url":null,"abstract":"<p><p>The use of single-cell sequencing technology for single-cell transcriptomics studies in pigs is expanding progressively. However, the comprehensive classification of cell types across different anatomical tissues and organs of pig in multiple datasets remains relatively limited. This study employs single-cell and single-nucleus sequencing technologies in Bama pig to identify unique marker genes and their corresponding transcriptomic profiles across diverse cell types in various anatomical tissues and organs, including subcutaneous fat, visceral fat, psoas major muscle, liver, spleen, lung, and kidney. Through detailed data analyses, we observed widespread cellular diversity across various anatomical tissues and organs of Bama pig. This work contributes a comprehensive dataset that supports physiological studies and aids in the identification and prediction of potential marker genes through single-cell transcriptomics of these tissues. The methodologies and data employed in this study are designed to improve the accuracy of cell type identification and ensure consistent cell type allocation.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"16"},"PeriodicalIF":1.9,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11899051/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143617683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-28DOI: 10.1186/s12863-025-01304-7
Yan Luo, Jae Hee Jang, Maria Balkey, Maria Hoffmann
Objectives: Whole Genome Sequencing (WGS) is widely used in food safety for the detection, investigation, and control of foodborne bacterial pathogens. However, the WGS data in most public databases, such as the National Center for Biotechnology Information (NCBI), primarily consist of Illumina short reads which lack some important information for repetitive regions, structural variations, and mobile genetic elements, and the genomic location of certain important genes like antimicrobial resistance genes (AMR) and virulence genes. To address this limitation, we have contributed 217 closed circular Salmonella enterica genomes that were generated using PacBio sequencing to the NCBI Pathogen Detection (PD) database and GenBank. This dataset provides a higher level of accuracy to genome representations in the database.
Data description: High-quality complete reference genomes generated from PacBio long reads can provide essential details that are not available in draft genomes from short reads. A complete reference genome allows for more accurate data analysis and researchers to establish connections between genome variations and known genes, regulatory elements, and other genomic features. The addition of 217 complete genomes from 78 different Salmonella serovars, each representing either a distinct SNP cluster within the NCBI PD database or a unique strain, significantly enriches the diversity of the reference genome database.
{"title":"217 closed Salmonella reference genomes using PacBio sequencing.","authors":"Yan Luo, Jae Hee Jang, Maria Balkey, Maria Hoffmann","doi":"10.1186/s12863-025-01304-7","DOIUrl":"10.1186/s12863-025-01304-7","url":null,"abstract":"<p><strong>Objectives: </strong>Whole Genome Sequencing (WGS) is widely used in food safety for the detection, investigation, and control of foodborne bacterial pathogens. However, the WGS data in most public databases, such as the National Center for Biotechnology Information (NCBI), primarily consist of Illumina short reads which lack some important information for repetitive regions, structural variations, and mobile genetic elements, and the genomic location of certain important genes like antimicrobial resistance genes (AMR) and virulence genes. To address this limitation, we have contributed 217 closed circular Salmonella enterica genomes that were generated using PacBio sequencing to the NCBI Pathogen Detection (PD) database and GenBank. This dataset provides a higher level of accuracy to genome representations in the database.</p><p><strong>Data description: </strong>High-quality complete reference genomes generated from PacBio long reads can provide essential details that are not available in draft genomes from short reads. A complete reference genome allows for more accurate data analysis and researchers to establish connections between genome variations and known genes, regulatory elements, and other genomic features. The addition of 217 complete genomes from 78 different Salmonella serovars, each representing either a distinct SNP cluster within the NCBI PD database or a unique strain, significantly enriches the diversity of the reference genome database.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"15"},"PeriodicalIF":1.9,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11871702/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143531473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: Penicillium expansum, a necrotrophic plant pathogen with a wide range of fruit hosts, is an important causal agent of blue mold rot. The fruit business suffers significant financial losses because of blue mold rot that occurs during fruit storage, transportation, and sale. The objective of this work was to generate a high-quality draft genome assembly of P. expansum, which will contribute to enhancing the management of blue mold, especially in stone and pome fruits, including grapes, by improving our understanding of the epidemiology of the pathogen and its interactions with the host.
Data description: Here, we describe the genome sequence of Penicillium expansum isolate PPRI25879, one of the most virulent strains isolated from grapes in South Africa. Sequencing reads from P. expansum produced approximately 3.5 Gb. The assembly generated a draft genome of size 32.1 Mb, consisting of 1648 contigs, with an N50 of 508 KB. Examination of the genome completeness with Benchmarking Universal Single-Copy Orthologs (BUSCO) showed the draft genome carries 97.9% of the 746 genes in the Eurotiomycetes_odb10 database. The draft genome sequence will allow improved genomic comparisons among the most important pathogens belonging to the Penicillium genus, with the aim of improving our knowledge of their plant-pathogen interactions, population biology, and control.
{"title":"Draft genome sequence of the fungal pathogen Penicillium expansum (PPRI25879) isolated in South Africa.","authors":"Molemi Rauwane, Sandiswa Figlan, Wai Yin Chan, Khayalethu Ntushelo","doi":"10.1186/s12863-025-01299-1","DOIUrl":"10.1186/s12863-025-01299-1","url":null,"abstract":"<p><strong>Objectives: </strong>Penicillium expansum, a necrotrophic plant pathogen with a wide range of fruit hosts, is an important causal agent of blue mold rot. The fruit business suffers significant financial losses because of blue mold rot that occurs during fruit storage, transportation, and sale. The objective of this work was to generate a high-quality draft genome assembly of P. expansum, which will contribute to enhancing the management of blue mold, especially in stone and pome fruits, including grapes, by improving our understanding of the epidemiology of the pathogen and its interactions with the host.</p><p><strong>Data description: </strong>Here, we describe the genome sequence of Penicillium expansum isolate PPRI25879, one of the most virulent strains isolated from grapes in South Africa. Sequencing reads from P. expansum produced approximately 3.5 Gb. The assembly generated a draft genome of size 32.1 Mb, consisting of 1648 contigs, with an N50 of 508 KB. Examination of the genome completeness with Benchmarking Universal Single-Copy Orthologs (BUSCO) showed the draft genome carries 97.9% of the 746 genes in the Eurotiomycetes_odb10 database. The draft genome sequence will allow improved genomic comparisons among the most important pathogens belonging to the Penicillium genus, with the aim of improving our knowledge of their plant-pathogen interactions, population biology, and control.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"14"},"PeriodicalIF":1.9,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11846280/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143473366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: Peas (Pisum sativum) are vital for human nutrition and have significantly contributed to the understanding of Mendelian inheritance laws. In this study, we performed transcriptome sequencing on 11 distinct pea tissues, offering an extensive gene expression dataset. This study not only provides valuable insights into the gene expression patterns across various pea tissues but also lays the foundation for future research aimed at understanding the molecular mechanisms underlying pea growth, development, and response to environmental stimuli.
Data description: In this study, we conducted transcriptome sequencing on tissue samples from 11 distinct pea plants, each with three biological replicates. This approach yielded a comprehensive RNA-seq dataset, abundant in transcriptomic information. Through principal component analysis (PCA) and gene ontology (GO) enrichment analysis, we identified significant variations in gene expression across different samples. This valuable transcriptomic resource enhances our understanding of gene expression in diverse pea tissues and provides new strategies, along with potential candidate genes, for the genetic improvement of peas.
{"title":"Transcriptome profiling across 11 different tissues in Pisum sativum.","authors":"Ru-Zhi Li, Guan Li, Xiao-Yan Zhang, Lei-Lei Li, Zhi-Wei Wang, Feng-Jing Song, Kai-Hua Jia, Na-Na Li, Peng-Fei Chu","doi":"10.1186/s12863-024-01287-x","DOIUrl":"10.1186/s12863-024-01287-x","url":null,"abstract":"<p><strong>Objectives: </strong>Peas (Pisum sativum) are vital for human nutrition and have significantly contributed to the understanding of Mendelian inheritance laws. In this study, we performed transcriptome sequencing on 11 distinct pea tissues, offering an extensive gene expression dataset. This study not only provides valuable insights into the gene expression patterns across various pea tissues but also lays the foundation for future research aimed at understanding the molecular mechanisms underlying pea growth, development, and response to environmental stimuli.</p><p><strong>Data description: </strong>In this study, we conducted transcriptome sequencing on tissue samples from 11 distinct pea plants, each with three biological replicates. This approach yielded a comprehensive RNA-seq dataset, abundant in transcriptomic information. Through principal component analysis (PCA) and gene ontology (GO) enrichment analysis, we identified significant variations in gene expression across different samples. This valuable transcriptomic resource enhances our understanding of gene expression in diverse pea tissues and provides new strategies, along with potential candidate genes, for the genetic improvement of peas.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"13"},"PeriodicalIF":1.9,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11841238/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143469249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-15DOI: 10.1186/s12863-025-01303-8
Yu Wan, Rachel Pike, Alessandra Harley, Zaynab Mumin, Isabelle Potterill, Danièle Meunier, Mark Ganner, Maria Getino, Juliana Coelho, Elita Jauneikaite, Kartyk Moganeradj, Colin S Brown, Alison H Holmes, Alicia Demirjian, Katie L Hopkins, Bruno Pichon
Objective: Staphylococcus capitis is part of the human microbiome and an opportunistic pathogen known to cause catheter-associated bacteraemia, prosthetic joint infections, skin and wound infections, among others. Detection of S. capitis in normally sterile body sites saw an increase over the last decade in England, where a multidrug-resistant clone, NRCS-A, was widely identified in blood samples from infants in neonatal intensive care units. To address a lack of complete genomes and antibiograms of S. capitis in public databases, we performed long- and short-read whole-genome sequencing, hybrid genome assembly, and antimicrobial susceptibility testing of 22 diverse isolates.
Data description: We present complete genome assemblies of two S. capitis type strains (subspecies capitis: DSM 20326; subspecies urealyticus: DSM 6717) and 20 clinical isolates (NRCS-A: 10) from England. Each genome is accompanied by minimum inhibitory concentrations of 13 antimicrobials including vancomycin, teicoplanin, daptomycin, linezolid, and clindamycin. These 22 genomes were 2.4-2.7 Mbp in length and had a GC content of 33%. Plasmids were identified in 20 isolates. Resistance to teicoplanin, daptomycin, gentamicin, fusidic acid, rifampicin, ciprofloxacin, clindamycin, and erythromycin was seen in 1-10 isolates. Our data are a resource for future studies on genomics, evolution, and antimicrobial resistance of S. capitis.
{"title":"Complete genome assemblies and antibiograms of 22 Staphylococcus capitis isolates.","authors":"Yu Wan, Rachel Pike, Alessandra Harley, Zaynab Mumin, Isabelle Potterill, Danièle Meunier, Mark Ganner, Maria Getino, Juliana Coelho, Elita Jauneikaite, Kartyk Moganeradj, Colin S Brown, Alison H Holmes, Alicia Demirjian, Katie L Hopkins, Bruno Pichon","doi":"10.1186/s12863-025-01303-8","DOIUrl":"10.1186/s12863-025-01303-8","url":null,"abstract":"<p><strong>Objective: </strong>Staphylococcus capitis is part of the human microbiome and an opportunistic pathogen known to cause catheter-associated bacteraemia, prosthetic joint infections, skin and wound infections, among others. Detection of S. capitis in normally sterile body sites saw an increase over the last decade in England, where a multidrug-resistant clone, NRCS-A, was widely identified in blood samples from infants in neonatal intensive care units. To address a lack of complete genomes and antibiograms of S. capitis in public databases, we performed long- and short-read whole-genome sequencing, hybrid genome assembly, and antimicrobial susceptibility testing of 22 diverse isolates.</p><p><strong>Data description: </strong>We present complete genome assemblies of two S. capitis type strains (subspecies capitis: DSM 20326; subspecies urealyticus: DSM 6717) and 20 clinical isolates (NRCS-A: 10) from England. Each genome is accompanied by minimum inhibitory concentrations of 13 antimicrobials including vancomycin, teicoplanin, daptomycin, linezolid, and clindamycin. These 22 genomes were 2.4-2.7 Mbp in length and had a GC content of 33%. Plasmids were identified in 20 isolates. Resistance to teicoplanin, daptomycin, gentamicin, fusidic acid, rifampicin, ciprofloxacin, clindamycin, and erythromycin was seen in 1-10 isolates. Our data are a resource for future studies on genomics, evolution, and antimicrobial resistance of S. capitis.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"12"},"PeriodicalIF":1.9,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11830182/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143426727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: Coffea liberica is one of the species within the Coffea genus known for its distinctive flavor and resistance to leaf rust disease. Through breeding approaches, two superior varieties of C. liberica, designated as Liberoid Meranti 1 (Lim 1) and Liberoid Meranti 2 (Lim 2), were introduced in 2015. These varieties are known for their high adaptability in peatlands. The genetic basis of plant adaptability to peatlands remains largely unknown. It is therefore essential to identify genome-wide DNA polymorphisms in Lim 1 and 2 in order to gain insights into its capacity for adaptation in peatlands.
Data description: Whole genome sequencing was performed on three plants from each variety (Lim 1 and 2), resulting in 430 million sequencing reads. The mean depth of sequencing for each sample was 36.90x. The reads were mapped to the Coffea canephora genome, with an average mapping rate of 96.34%. The sequencing data revealed the presence of 3,766,805 single-nucleotide polymorphisms (SNPs) and 1,123,683 insertion-deletions (indels) in all six plants. Among the SNPs, there was a notable prevalence of transitions, with a ratio of approximately twofold compared to transversions. The generated data offers invaluable genomic resources for marker development, with significant implications for understanding peatlands adaptability.
{"title":"Genome-wide DNA polymorphisms in two peatland adapted Coffea liberica varieties.","authors":"Tisha Melia, Fatayat, Ninik Nihayatul Wahibah, Siti Fatonah, Dewi Indriyani Roslim, Arisman Adnan","doi":"10.1186/s12863-025-01305-6","DOIUrl":"10.1186/s12863-025-01305-6","url":null,"abstract":"<p><strong>Objectives: </strong>Coffea liberica is one of the species within the Coffea genus known for its distinctive flavor and resistance to leaf rust disease. Through breeding approaches, two superior varieties of C. liberica, designated as Liberoid Meranti 1 (Lim 1) and Liberoid Meranti 2 (Lim 2), were introduced in 2015. These varieties are known for their high adaptability in peatlands. The genetic basis of plant adaptability to peatlands remains largely unknown. It is therefore essential to identify genome-wide DNA polymorphisms in Lim 1 and 2 in order to gain insights into its capacity for adaptation in peatlands.</p><p><strong>Data description: </strong>Whole genome sequencing was performed on three plants from each variety (Lim 1 and 2), resulting in 430 million sequencing reads. The mean depth of sequencing for each sample was 36.90x. The reads were mapped to the Coffea canephora genome, with an average mapping rate of 96.34%. The sequencing data revealed the presence of 3,766,805 single-nucleotide polymorphisms (SNPs) and 1,123,683 insertion-deletions (indels) in all six plants. Among the SNPs, there was a notable prevalence of transitions, with a ratio of approximately twofold compared to transversions. The generated data offers invaluable genomic resources for marker development, with significant implications for understanding peatlands adaptability.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"11"},"PeriodicalIF":1.9,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11829567/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143426733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-29DOI: 10.1186/s12863-025-01301-w
Rebecca Lelievre, Mohan Rakesh, Pirro G Hysi, Julian Little, Ellen E Freeman, Marie-Hélène Roy-Gagnon
High intraocular pressure (IOP) is an important risk factor for glaucoma, which is influenced by genetic and environmental factors. However, the etiology of high IOP remains uncertain. Metabolites are compounds involved in metabolism which provide a link between the internal (genetic) and external environments. O-methylascorbate has been reported to be associated with IOP. In addition, researchers have identified several genetic variants which are associated with metabolite concentrations, including O-methylascorbate and another vitamin C related metabolite, ascorbic acid 2-sulfate. We aimed to understand how O-methylascorbate and ascorbic acid 2-sulfate, or genetic variants associated with these metabolites, modify the associations between dietary environmental variables and IOP. We used data from 8060 participants of the Canadian Longitudinal Study on Aging. Using linear models adjusted for relevant covariates, we tested for interactions between six genetic variants previously found to be associated with O-methylascorbate and ascorbic acid 2-sulfate and four environmental variables related to diet (alcohol consumption frequency, smoking status, fruit consumption, and vegetable consumption). We also tested for interactions between serum concentrations of O-methylascorbate and ascorbic acid 2-sulfate and these environmental factors. We used a False Discovery Rate approach to correct for the 32 interaction tests performed. One interaction was suggestively significant after multiple testing correction (adjusted P-value < 0.1): rs8050812 and alcohol consumption frequency. Understanding how genetic variants and metabolites interact with the environment could shed light on biological pathways controlling IOP and lead to improved prevention and treatment of glaucoma.
{"title":"Evaluating vitamin C-related gene-environment and metabolite-environment interaction effects on intraocular pressure in the Canadian Longitudinal Study on Aging.","authors":"Rebecca Lelievre, Mohan Rakesh, Pirro G Hysi, Julian Little, Ellen E Freeman, Marie-Hélène Roy-Gagnon","doi":"10.1186/s12863-025-01301-w","DOIUrl":"https://doi.org/10.1186/s12863-025-01301-w","url":null,"abstract":"<p><p>High intraocular pressure (IOP) is an important risk factor for glaucoma, which is influenced by genetic and environmental factors. However, the etiology of high IOP remains uncertain. Metabolites are compounds involved in metabolism which provide a link between the internal (genetic) and external environments. O-methylascorbate has been reported to be associated with IOP. In addition, researchers have identified several genetic variants which are associated with metabolite concentrations, including O-methylascorbate and another vitamin C related metabolite, ascorbic acid 2-sulfate. We aimed to understand how O-methylascorbate and ascorbic acid 2-sulfate, or genetic variants associated with these metabolites, modify the associations between dietary environmental variables and IOP. We used data from 8060 participants of the Canadian Longitudinal Study on Aging. Using linear models adjusted for relevant covariates, we tested for interactions between six genetic variants previously found to be associated with O-methylascorbate and ascorbic acid 2-sulfate and four environmental variables related to diet (alcohol consumption frequency, smoking status, fruit consumption, and vegetable consumption). We also tested for interactions between serum concentrations of O-methylascorbate and ascorbic acid 2-sulfate and these environmental factors. We used a False Discovery Rate approach to correct for the 32 interaction tests performed. One interaction was suggestively significant after multiple testing correction (adjusted P-value < 0.1): rs8050812 and alcohol consumption frequency. Understanding how genetic variants and metabolites interact with the environment could shed light on biological pathways controlling IOP and lead to improved prevention and treatment of glaucoma.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"10"},"PeriodicalIF":1.9,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11776179/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143070010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-27DOI: 10.1186/s12863-025-01295-5
Yahui Gao, George E Liu, Li Ma, Cong-Jun Li, Ransom L Baldwin
Objective: As one of the most important ruminant breeds, Holstein cattle supply a significant portion of milk and dairy for human consumption, playing a crucial role in agribusiness. The goal of our study was to examine the molecular adaptation of gastrointestinal tissues that facilitate milk synthesis in dairy cattle. DATA DESCRIPTION: We performed RNA-seq analysis on epithelial cells from the rumen, duodenum, and colon at eight different time points: Days 3, 14, 28, 45, 120, 220, and 305 in milk, as well as the dry period. Samples were taken from five multiparous dairy cows as biological replicates per tissue per stage, except for Days 14 and 28, for which the sample size was three. These tissues each serve critical and distinct roles in the digestion and absorption of nutrients and are all vital for providing the necessary substrates required for milk production. Understanding the intricate connections between the tissues involved in providing nutrients necessary to support milk synthesis and their role in digestion can deepen the understanding of lactation physiology. This resource aims to deliver in-depth insights into cattle lactation, highlighting the distinct traits of gastrointestinal tissues and illuminating the intricate transcriptomic dynamics throughout the lactation period.
{"title":"A resource of longitudinal RNA-seq data of Holstein cow rumen, duodenum, and colon epithelial cells during the lactation cycle.","authors":"Yahui Gao, George E Liu, Li Ma, Cong-Jun Li, Ransom L Baldwin","doi":"10.1186/s12863-025-01295-5","DOIUrl":"10.1186/s12863-025-01295-5","url":null,"abstract":"<p><strong>Objective: </strong>As one of the most important ruminant breeds, Holstein cattle supply a significant portion of milk and dairy for human consumption, playing a crucial role in agribusiness. The goal of our study was to examine the molecular adaptation of gastrointestinal tissues that facilitate milk synthesis in dairy cattle. DATA DESCRIPTION: We performed RNA-seq analysis on epithelial cells from the rumen, duodenum, and colon at eight different time points: Days 3, 14, 28, 45, 120, 220, and 305 in milk, as well as the dry period. Samples were taken from five multiparous dairy cows as biological replicates per tissue per stage, except for Days 14 and 28, for which the sample size was three. These tissues each serve critical and distinct roles in the digestion and absorption of nutrients and are all vital for providing the necessary substrates required for milk production. Understanding the intricate connections between the tissues involved in providing nutrients necessary to support milk synthesis and their role in digestion can deepen the understanding of lactation physiology. This resource aims to deliver in-depth insights into cattle lactation, highlighting the distinct traits of gastrointestinal tissues and illuminating the intricate transcriptomic dynamics throughout the lactation period.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"9"},"PeriodicalIF":1.9,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11773790/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143054409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}