Pub Date : 2025-07-07DOI: 10.1186/s12863-025-01336-z
Liu Sun, Ke Yao, Hang-Jing Wu
Background: Primary carnitine deficiency (PCD) is an autosomal recessive rare disorder of carnitine cycle and carnitine transport caused by pathogenic variants in the SLC22A5 gene. The prevalence of PCD is unclear. This study aimed to estimate the carrier frequency and genetic prevalence of PCD using Genome Aggregation Database (gnomAD) data.
Methods: The pathogenicity of SLC22A5 variants was interpreted according to the American College of Medical Genetics and Genomics (ACMG) standards and guidelines. The minor allele frequency (MAF) of the variants of the SLC22A5 gene in 807,162 individuals was examined to estimate the global prevalence of PCD in nine ethnicities: African/African American (afr), Admixed American (amr), East Asian (eas), Non-Finnish European (nfe), South Asian (sas), Ashkenazi Jewish (asj), Middle Eastern (mid), Finnish (fin) and Remaining individuals (rmi). The global and population-specific carrier frequency and genetic prevalence of PCD were calculated using the Hardy-Weinberg equation.
Results: Total of 213 pathogenic/likely pathogenic variants (PV/LPV) of the SLC22A5 gene were identified according to the ACMG standards and guidelines. The global carrier frequency and genetic prevalence of PCD were 10.6 per thousand (1/95) and 28.2 per million (1/35427), respectively.
Conclusions: The prevalence of PCD is estimated to be 1/35,000 globally, with a range of between 1/450,000 and 1/20,000 depending on ethnicity.
{"title":"The global prevalence and genetic spectrum of primary carnitine deficiency.","authors":"Liu Sun, Ke Yao, Hang-Jing Wu","doi":"10.1186/s12863-025-01336-z","DOIUrl":"10.1186/s12863-025-01336-z","url":null,"abstract":"<p><strong>Background: </strong>Primary carnitine deficiency (PCD) is an autosomal recessive rare disorder of carnitine cycle and carnitine transport caused by pathogenic variants in the SLC22A5 gene. The prevalence of PCD is unclear. This study aimed to estimate the carrier frequency and genetic prevalence of PCD using Genome Aggregation Database (gnomAD) data.</p><p><strong>Methods: </strong>The pathogenicity of SLC22A5 variants was interpreted according to the American College of Medical Genetics and Genomics (ACMG) standards and guidelines. The minor allele frequency (MAF) of the variants of the SLC22A5 gene in 807,162 individuals was examined to estimate the global prevalence of PCD in nine ethnicities: African/African American (afr), Admixed American (amr), East Asian (eas), Non-Finnish European (nfe), South Asian (sas), Ashkenazi Jewish (asj), Middle Eastern (mid), Finnish (fin) and Remaining individuals (rmi). The global and population-specific carrier frequency and genetic prevalence of PCD were calculated using the Hardy-Weinberg equation.</p><p><strong>Results: </strong>Total of 213 pathogenic/likely pathogenic variants (PV/LPV) of the SLC22A5 gene were identified according to the ACMG standards and guidelines. The global carrier frequency and genetic prevalence of PCD were 10.6 per thousand (1/95) and 28.2 per million (1/35427), respectively.</p><p><strong>Conclusions: </strong>The prevalence of PCD is estimated to be 1/35,000 globally, with a range of between 1/450,000 and 1/20,000 depending on ethnicity.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"44"},"PeriodicalIF":1.9,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12235876/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: The degradation products of chitin exhibit various biological activities, giving them significant application potential. Chitinase-producing bacteria can be isolated from diverse environments such as soil, natural waters, and rhizospheres. However, their chitinolytic activity is often limited, particularly at low temperatures.
Data description: In this study, complete genome sequencing of a cold-adapted chitinolytic Pseudomonas strain, GWSMS-1, revealed a 4,606,781-bp linear chromosome with a G+C content of 59%. The genome encodes 4,599 protein-coding genes, 73 tRNA genes, and 27 rRNA genes. Functional annotation through GO, KEGG, and CAZy databases identified a substantial number of chitinase-encoding genes, which likely contribute to its high chitin-degrading capacity. The genomic insights into GWSMS-1 highlight its potential for applications in chitin degradation and offer valuable gene candidates for further research.
{"title":"Genomic analysis of Pseudomonas sp. GWSMS-1 isolated from Antarctica reveals its potential in Chitin hydrolysis.","authors":"Haiyu Zeng, Zheng Wang, Jianjun Wang, Yong Yu, Wei Luo, Huirong Li, Haitao Ding","doi":"10.1186/s12863-025-01335-0","DOIUrl":"10.1186/s12863-025-01335-0","url":null,"abstract":"<p><strong>Objectives: </strong>The degradation products of chitin exhibit various biological activities, giving them significant application potential. Chitinase-producing bacteria can be isolated from diverse environments such as soil, natural waters, and rhizospheres. However, their chitinolytic activity is often limited, particularly at low temperatures.</p><p><strong>Data description: </strong>In this study, complete genome sequencing of a cold-adapted chitinolytic Pseudomonas strain, GWSMS-1, revealed a 4,606,781-bp linear chromosome with a G+C content of 59%. The genome encodes 4,599 protein-coding genes, 73 tRNA genes, and 27 rRNA genes. Functional annotation through GO, KEGG, and CAZy databases identified a substantial number of chitinase-encoding genes, which likely contribute to its high chitin-degrading capacity. The genomic insights into GWSMS-1 highlight its potential for applications in chitin degradation and offer valuable gene candidates for further research.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"43"},"PeriodicalIF":1.9,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12228359/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144565488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-01DOI: 10.1186/s12863-025-01334-1
Mesfin Tafesse Gemeda, Abu Feyisa Meka, Asefa Niguse Mamo, Gessesse Kebede Bekele, Jemal Ali, Musin Kelel Abas
Antibiotic resistance is a major global health concern that caused by the overuse and misuse of antibiotics. Mobile genetic elements have a roles in the transmission of antibiotic resistance genes. The distribution and diversity of antibiotic resistance genes and mobile genetic elements in the microbiome of Sof Umer Cave have yet to be explored. To map the distribution and diversity of antibiotic resistance genes and mobile genetic elements in the microbiome of Sof Umer Cave using high-throughput shotgun sequencing. High-molecular-weight DNA was extracted from homogenized sample using the GeneAll DNA Soil Mini Kit. Purified environmental DNA was sequenced using a NovaSeq PE150. Analysis of the pathogen host interaction database revealed the predominance of pathogenic organisms such as Xanthomonas oryzae, Acinetobacter baumannii, Erwinia amylovora, and Mycobacterium tuberculosis. Similarly, analysis of the virulence factor database confirmed the presence of Type IV pili (VF1240), lipopolysaccharides, capsules, heme biosynthesis (VF0758), and alginate. More than 800 antibiotic resistance genes were identified, with 50% related to glycopeptide resistance, followed by antibiotic resistance genes associated with multidrug efflux pumps (30%), aminoglycoside resistance genes (10%), and unknown genes. A variety of mobile genetic elements were also identified, highlighting their importance in the genetic diversity and adaptation of the microbiome of Sof Umer Cave. These findings underscore the importance of the Sof Umer Cave habitat as a reservoir for antibiotic resistance, emphasizing the need for ongoing monitoring to enhance the understanding and control of antibiotic resistance genes.
抗生素耐药性是一个主要的全球健康问题,它是由抗生素的过度使用和误用引起的。移动遗传元件在抗生素耐药基因的传播中起着重要作用。sofumer洞穴微生物群中抗生素耐药基因和可移动遗传元件的分布和多样性有待探索。利用高通量霰弹枪测序技术研究sofumer Cave微生物组中抗生素耐药基因和可移动遗传元件的分布和多样性。使用GeneAll DNA Soil Mini Kit从匀浆样品中提取高分子量DNA。使用NovaSeq PE150对纯化的环境DNA进行测序。病原菌宿主相互作用数据库分析显示,米黄单胞菌、鲍曼不动杆菌、淀粉杆菌和结核分枝杆菌等致病菌占主导地位。同样,对毒力因子数据库的分析证实了IV型菌毛(VF1240)、脂多糖、胶囊、血红素生物合成(VF0758)和海藻酸盐的存在。共鉴定出800多个抗生素耐药基因,其中50%与糖肽耐药有关,其次是与多药外排泵相关的抗生素耐药基因(30%)、氨基糖苷耐药基因(10%)和未知基因。研究还发现了多种可移动的遗传元件,强调了它们在sofumer洞穴微生物群遗传多样性和适应性中的重要性。这些发现强调了sofumer洞穴栖息地作为抗生素耐药性储存库的重要性,强调了持续监测以提高对抗生素耐药基因的理解和控制的必要性。
{"title":"Diversity of antibiotic resistance genes and mobile genetic elements of Sof Umer Cave microbiomes, Ethiopia.","authors":"Mesfin Tafesse Gemeda, Abu Feyisa Meka, Asefa Niguse Mamo, Gessesse Kebede Bekele, Jemal Ali, Musin Kelel Abas","doi":"10.1186/s12863-025-01334-1","DOIUrl":"10.1186/s12863-025-01334-1","url":null,"abstract":"<p><p>Antibiotic resistance is a major global health concern that caused by the overuse and misuse of antibiotics. Mobile genetic elements have a roles in the transmission of antibiotic resistance genes. The distribution and diversity of antibiotic resistance genes and mobile genetic elements in the microbiome of Sof Umer Cave have yet to be explored. To map the distribution and diversity of antibiotic resistance genes and mobile genetic elements in the microbiome of Sof Umer Cave using high-throughput shotgun sequencing. High-molecular-weight DNA was extracted from homogenized sample using the GeneAll DNA Soil Mini Kit. Purified environmental DNA was sequenced using a NovaSeq PE150. Analysis of the pathogen host interaction database revealed the predominance of pathogenic organisms such as Xanthomonas oryzae, Acinetobacter baumannii, Erwinia amylovora, and Mycobacterium tuberculosis. Similarly, analysis of the virulence factor database confirmed the presence of Type IV pili (VF1240), lipopolysaccharides, capsules, heme biosynthesis (VF0758), and alginate. More than 800 antibiotic resistance genes were identified, with 50% related to glycopeptide resistance, followed by antibiotic resistance genes associated with multidrug efflux pumps (30%), aminoglycoside resistance genes (10%), and unknown genes. A variety of mobile genetic elements were also identified, highlighting their importance in the genetic diversity and adaptation of the microbiome of Sof Umer Cave. These findings underscore the importance of the Sof Umer Cave habitat as a reservoir for antibiotic resistance, emphasizing the need for ongoing monitoring to enhance the understanding and control of antibiotic resistance genes.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"41"},"PeriodicalIF":1.9,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12220799/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144546298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Genome assembly of the Bos gaurus in Vietnam using nanopore sequencing.","authors":"Thi Dieu Thuy Nguyen, Thi Binh Nguyen Tran, Loan T Nguyen","doi":"10.1186/s12863-025-01331-4","DOIUrl":"10.1186/s12863-025-01331-4","url":null,"abstract":"","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"42"},"PeriodicalIF":1.9,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12219697/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144546299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Amplicon-based metagenomic survey of microbes associated with the organic and inorganic rhizosphere soil of Glycine max L.","authors":"Olubukola Oluranti Babalola, Ijeoma Emelda Osuji, Akinlolu Olalekan Akanmu","doi":"10.1186/s12863-025-01333-2","DOIUrl":"10.1186/s12863-025-01333-2","url":null,"abstract":"","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"40"},"PeriodicalIF":1.9,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12142974/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144251098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-05-28DOI: 10.1186/s12863-025-01330-5
Hayley Goss, Paige Miller, Susan F Zaleski, Robert J Miller, Donna M Schroeder, Henry M Page
{"title":"Draft genome assembly for the purple-hinged rock scallop (Crassadoma gigantea).","authors":"Hayley Goss, Paige Miller, Susan F Zaleski, Robert J Miller, Donna M Schroeder, Henry M Page","doi":"10.1186/s12863-025-01330-5","DOIUrl":"10.1186/s12863-025-01330-5","url":null,"abstract":"","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"39"},"PeriodicalIF":1.9,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12121003/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144176070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-05-27DOI: 10.1186/s12863-025-01323-4
Oluyinka Opoola, Felicien Shumbusho, Innocent Rwamuhizi, Isidore Houaga, David Harvey, David Hambrook, Kellie Watson, Mizeck G G Chagunda, Raphael Mrode, Appolinaire Djikeng
Previous genomic characterisation of Rwanda dairy cattle predominantly focused on the One Cow per Poor Family (locally called "Girinka") programme. However, smallholder farmers in Rwanda have benefited from other livestock initiatives and development programmes. Capturing and documenting the genetic diversity, is critical in part as a key contribution to genomic resource required to support dairy development in Rwanda. A total of 2,229 crossbred animals located in all dairy-producing regions of Rwanda were sampled. For each animal, a hair sample was collected and genotyped by using the Geneseek Genomic Profiler (GGP, Neogen Geneseek®) Bovine 50 K (n = 1,917) and GGP Bovine 100 K arrays (n = 312). The combined dataset was subject to quality control, data curation for use in population genetics and genomic analyses. To assess the genetic structure and diversity of the current population, key analyses for population structure were applied: Principal Component Analysis (PCA), population structure and diversity, admixture analysis, measures of heterozygosity, runs of homozygosity (ROH) and minor allelic frequency (MAF). A dataset of global dairy population of European taurine, African indicus and African taurus (n = 250) was used as reference. Results showed that Rwanda cattle population is highly admixed of diverse pure and crossbred animals with average MAF of 33% (standard error; se = 0.001) with proportion of foreign high yielding (taurine) dairy breeds of Jersey Island (18%); 12% non-Island Jersey and 42% Holstein-Friesian ancestries. Two African Bos taurus and five Bos indicus breeds contributed 28% of their genetics. Genetic distances were highest in Gir and N'dama (0.29); and Nelore and N'dama (0.29). There were 1,331 ROH regions and average heterozygosity were high for Rwanda cattle (0.41 se = 0.001). Asides well-established genes in cattle, we found evidence for a variety of novel and less-known genes under selection to be associated with fertility, milk production, innate immunity and environmental adaptation. This observed diversity offers opportunity to decipher the presence and/or lack of genetic variations to initiate short- and long-term breed improvement programmes for adaptation traits, disease resistance, heat tolerance, productivity and profitability of smallholder dairy systems in Rwanda.
以前对卢旺达奶牛的基因组特征分析主要集中在每个贫困家庭一头奶牛(当地称为“Girinka”)计划上。然而,卢旺达的小农也受益于其他牲畜倡议和发展方案。捕捉和记录遗传多样性至关重要,部分原因是对支持卢旺达乳制品发展所需的基因组资源作出了重要贡献。对位于卢旺达所有奶业产区的2 229只杂交动物进行了抽样。对每只动物收集毛发样本,并使用Geneseek基因组分析器(GGP, Neogen Geneseek®)牛50 K (n = 1,917)和GGP牛100 K阵列(n = 312)进行基因分型。合并的数据集受到质量控制,数据管理用于群体遗传学和基因组分析。采用主成分分析(PCA)、种群结构与多样性分析、外源分析、杂合度测量、纯合度运行数(ROH)和次要等位基因频率(MAF)等关键分析方法对现有群体的遗传结构和多样性进行了评价。以欧洲牛磺酸、非洲籼牛和非洲金牛全球奶牛种群数据集(n = 250)为参考。结果表明,卢旺达牛种群是多种纯种和杂交动物的高度混合种群,平均MAF为33%(标准误差;se = 0.001),泽西岛外国高产(牛磺酸)奶牛品种所占比例为18%;12%非泽西岛血统,42%荷尔斯泰因-弗里斯血统。2个非洲牛牛品种和5个非洲牛品种贡献了28%的遗传基因。遗传距离最高的是吉尔和恩达马(0.29);以及Nelore and N'dama(0.29)。卢旺达牛的平均杂合度较高(0.41 se = 0.001)。除了牛中已建立的基因外,我们还发现了一些新的和不太为人所知的基因在选择过程中与生育力、产奶量、先天免疫和环境适应有关的证据。这种观察到的多样性为破译遗传变异的存在和/或缺乏提供了机会,从而启动短期和长期的品种改进计划,以提高卢旺达小农乳制品系统的适应性状、抗病性、耐热性、生产力和盈利能力。
{"title":"The genetic structure and diversity of smallholder dairy cattle in Rwanda.","authors":"Oluyinka Opoola, Felicien Shumbusho, Innocent Rwamuhizi, Isidore Houaga, David Harvey, David Hambrook, Kellie Watson, Mizeck G G Chagunda, Raphael Mrode, Appolinaire Djikeng","doi":"10.1186/s12863-025-01323-4","DOIUrl":"10.1186/s12863-025-01323-4","url":null,"abstract":"<p><p>Previous genomic characterisation of Rwanda dairy cattle predominantly focused on the One Cow per Poor Family (locally called \"Girinka\") programme. However, smallholder farmers in Rwanda have benefited from other livestock initiatives and development programmes. Capturing and documenting the genetic diversity, is critical in part as a key contribution to genomic resource required to support dairy development in Rwanda. A total of 2,229 crossbred animals located in all dairy-producing regions of Rwanda were sampled. For each animal, a hair sample was collected and genotyped by using the Geneseek Genomic Profiler (GGP, Neogen Geneseek<sup>®</sup>) Bovine 50 K (n = 1,917) and GGP Bovine 100 K arrays (n = 312). The combined dataset was subject to quality control, data curation for use in population genetics and genomic analyses. To assess the genetic structure and diversity of the current population, key analyses for population structure were applied: Principal Component Analysis (PCA), population structure and diversity, admixture analysis, measures of heterozygosity, runs of homozygosity (ROH) and minor allelic frequency (MAF). A dataset of global dairy population of European taurine, African indicus and African taurus (n = 250) was used as reference. Results showed that Rwanda cattle population is highly admixed of diverse pure and crossbred animals with average MAF of 33% (standard error; se = 0.001) with proportion of foreign high yielding (taurine) dairy breeds of Jersey Island (18%); 12% non-Island Jersey and 42% Holstein-Friesian ancestries. Two African Bos taurus and five Bos indicus breeds contributed 28% of their genetics. Genetic distances were highest in Gir and N'dama (0.29); and Nelore and N'dama (0.29). There were 1,331 ROH regions and average heterozygosity were high for Rwanda cattle (0.41 se = 0.001). Asides well-established genes in cattle, we found evidence for a variety of novel and less-known genes under selection to be associated with fertility, milk production, innate immunity and environmental adaptation. This observed diversity offers opportunity to decipher the presence and/or lack of genetic variations to initiate short- and long-term breed improvement programmes for adaptation traits, disease resistance, heat tolerance, productivity and profitability of smallholder dairy systems in Rwanda.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"38"},"PeriodicalIF":1.9,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12107919/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144163855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objective: The pathogenic fungus Diplodia sapinea is of significant importance due to its primary role inducing tip dieback on various Pinus species which are widely distributed throughout the world. The objective of this study is to further provide comprehensive and specific resources for genome assembly and sequence annotation of this important forest pathogen from China, thereby establishing a robust foundation for future studies on its systematics, population genetics, genomics and global movement.
Data description: A high-quality genome of D. sapinea strain ZXD319 was sequenced utilizing the Nanopore PromethION and BGI DNBSEQ-T7 platforms. The assembled genome spans a total length of 36.81 Mb, comprising 14 contigs, with a GC content of 56.80% and an N50 value of 2,972,533 bp. It encompasses 11,200 protein-coding genes and 252 noncoding RNAs. The predicted genes were annotated against multiple public databases, and 1,611 potential virulence genes were identified through the Pathogen Host Interactions (PHI) database. Furthermore, the genome comparative analysis of D. sapinea and related species revealed 11,568 gene clusters and 3,436 single-copy clusters. Phylogenetic analysis indicated a close evolutionary relationship between D. sapinea with D. corticola and D. seriata. The genomic data presented herein serve as a valuable resource for future studies on this globally important pathogen.
{"title":"Whole-genome sequencing of global forest pathogen Diplodia sapinea causing pine shoot blight.","authors":"QuanChao Wang, FeiFei Liu, HuaChao Xu, XuDong Zhou","doi":"10.1186/s12863-025-01328-z","DOIUrl":"10.1186/s12863-025-01328-z","url":null,"abstract":"<p><strong>Objective: </strong>The pathogenic fungus Diplodia sapinea is of significant importance due to its primary role inducing tip dieback on various Pinus species which are widely distributed throughout the world. The objective of this study is to further provide comprehensive and specific resources for genome assembly and sequence annotation of this important forest pathogen from China, thereby establishing a robust foundation for future studies on its systematics, population genetics, genomics and global movement.</p><p><strong>Data description: </strong>A high-quality genome of D. sapinea strain ZXD319 was sequenced utilizing the Nanopore PromethION and BGI DNBSEQ-T7 platforms. The assembled genome spans a total length of 36.81 Mb, comprising 14 contigs, with a GC content of 56.80% and an N50 value of 2,972,533 bp. It encompasses 11,200 protein-coding genes and 252 noncoding RNAs. The predicted genes were annotated against multiple public databases, and 1,611 potential virulence genes were identified through the Pathogen Host Interactions (PHI) database. Furthermore, the genome comparative analysis of D. sapinea and related species revealed 11,568 gene clusters and 3,436 single-copy clusters. Phylogenetic analysis indicated a close evolutionary relationship between D. sapinea with D. corticola and D. seriata. The genomic data presented herein serve as a valuable resource for future studies on this globally important pathogen.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"37"},"PeriodicalIF":1.9,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12107918/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144153001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-05-22DOI: 10.1186/s12863-025-01329-y
Manuel Zúñiga, Cristina Alcántara, Ángela Peirotén, Luis Andrés Ramón-Nuñez, Vicente Monedero, José María Landete
Objectives: Lactobacillales, commonly known as lactic acid bacteria (LAB), is an order of Gram-positive, facultatively anaerobic or microaerophilic bacteria characterized by their ability to ferment carbohydrates and produce lactic acid as a major metabolic byproduct. Many species within this group have significant roles in food fermentation, human health, and industrial applications. Here, we report the complete genome sequence of Fructilactobacillus vespulae Mu01, the first sequenced genome of this species. The complete genome sequence of F. vespulae Mu01 is expected to provide valuable insights into the genetics and metabolism of this little-characterized species.
Data description: A novel strain of Fructilactobacillus vespulae was isolated from nectar of Musa paradisiaca L. during a survey for LAB associated with wild and cultivated plants in the metropolitan area of Valencia, Spain. A complete genome was obtained by sequencing with Nanopore long read technology. The genome consists of a chromosome of 1506092 bp and a plasmid of 42437 bp, presenting a GC content of 36 % and 31 %, respectively. The genome includes 1541 genes, with 1450 CDSs, 7 pseudogenes, 18 rRNA encoding genes, 63 tRNAs and 3 ncRNAs.
{"title":"The first complete genome of Fructilactobacillus vespulae: strain Mu01, isolated from nectar of Musa paradisiaca L.","authors":"Manuel Zúñiga, Cristina Alcántara, Ángela Peirotén, Luis Andrés Ramón-Nuñez, Vicente Monedero, José María Landete","doi":"10.1186/s12863-025-01329-y","DOIUrl":"10.1186/s12863-025-01329-y","url":null,"abstract":"<p><strong>Objectives: </strong>Lactobacillales, commonly known as lactic acid bacteria (LAB), is an order of Gram-positive, facultatively anaerobic or microaerophilic bacteria characterized by their ability to ferment carbohydrates and produce lactic acid as a major metabolic byproduct. Many species within this group have significant roles in food fermentation, human health, and industrial applications. Here, we report the complete genome sequence of Fructilactobacillus vespulae Mu01, the first sequenced genome of this species. The complete genome sequence of F. vespulae Mu01 is expected to provide valuable insights into the genetics and metabolism of this little-characterized species.</p><p><strong>Data description: </strong>A novel strain of Fructilactobacillus vespulae was isolated from nectar of Musa paradisiaca L. during a survey for LAB associated with wild and cultivated plants in the metropolitan area of Valencia, Spain. A complete genome was obtained by sequencing with Nanopore long read technology. The genome consists of a chromosome of 1506092 bp and a plasmid of 42437 bp, presenting a GC content of 36 % and 31 %, respectively. The genome includes 1541 genes, with 1450 CDSs, 7 pseudogenes, 18 rRNA encoding genes, 63 tRNAs and 3 ncRNAs.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"36"},"PeriodicalIF":1.9,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12101011/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144129667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Cystinosis is a rare autosomal recessive lysosomal storage disorder caused by mutations in the CTNS gene, which encodes cystinosin, a lysosomal cystine transporter. These mutations disrupt cystine efflux, leading to its accumulation in lysosomes and subsequent cellular damage. While more than 140 mutations have been identified, the functional and structural impacts of many nonsynonymous single nucleotide polymorphisms (nsSNPs) remain poorly understood. Nonsynonymous SNPs are of particular interest because they can directly alter protein structure and function, potentially leading to disease. Clinically, cystinosis most often presents with renal Fanconi syndrome, photophobia and vision loss due to corneal cystine crystals, and progressive neuromuscular complications such as distal myopathy and swallowing difficulties This study aimed to identify deleterious nsSNPs in the CTNS gene and evaluate their effects on cystinosin stability, structure, and function via computational tools and molecular dynamics simulations.
Results: From a dataset of 12,028 SNPs, 327 nsSNPs were identified, among which 19 were consistently classified as deleterious across multiple predictive tools, including SIFT, PolyPhen, and molecular dynamics simulations. Stability predictions revealed that most of these mutations destabilize cystinosin, with G308R and G308V located in the sixth transmembrane domain essential for transporter function having the most severe effects. Molecular dynamics simulations revealed that these mutations significantly increase local flexibility, alter hydrogen bonding patterns, and enhance solvent accessibility, resulting in structural perturbations. Notably, D305G and F142S disrupted the transmembrane domains essential for the function of cystinosin, whereas compared with the wild-type protein, G309V resulted in increased stability. Conservation analysis revealed that 16 of the 19 mutations affected highly conserved residues, indicating their crucial roles in the function of cystinosin. Additionally, protein interaction analyses suggested that mutations could impact associations with lysosomal and membrane transport proteins.
Conclusions: This study identified 19 deleterious nsSNPs in the CTNS gene that impair cystinosin stability and function. These findings highlight the structural and functional importance of key residues, such as G308, D305, and F142, which play critical roles in maintaining the active conformation and transport capacity of cystinosin. These insights provide a foundation for future experimental validation and the development of targeted therapeutic strategies to mitigate the effects of pathogenic mutations in cystinosis.
{"title":"Computational prediction of deleterious nonsynonymous SNPs in the CTNS gene: implications for cystinosis.","authors":"Leila Adda Neggaz, Amira Chahinez Dahmani, Ibtissem Derriche, Nawel Adda Neggaz, Abdallah Boudjema","doi":"10.1186/s12863-025-01325-2","DOIUrl":"https://doi.org/10.1186/s12863-025-01325-2","url":null,"abstract":"<p><strong>Background: </strong>Cystinosis is a rare autosomal recessive lysosomal storage disorder caused by mutations in the CTNS gene, which encodes cystinosin, a lysosomal cystine transporter. These mutations disrupt cystine efflux, leading to its accumulation in lysosomes and subsequent cellular damage. While more than 140 mutations have been identified, the functional and structural impacts of many nonsynonymous single nucleotide polymorphisms (nsSNPs) remain poorly understood. Nonsynonymous SNPs are of particular interest because they can directly alter protein structure and function, potentially leading to disease. Clinically, cystinosis most often presents with renal Fanconi syndrome, photophobia and vision loss due to corneal cystine crystals, and progressive neuromuscular complications such as distal myopathy and swallowing difficulties This study aimed to identify deleterious nsSNPs in the CTNS gene and evaluate their effects on cystinosin stability, structure, and function via computational tools and molecular dynamics simulations.</p><p><strong>Results: </strong>From a dataset of 12,028 SNPs, 327 nsSNPs were identified, among which 19 were consistently classified as deleterious across multiple predictive tools, including SIFT, PolyPhen, and molecular dynamics simulations. Stability predictions revealed that most of these mutations destabilize cystinosin, with G308R and G308V located in the sixth transmembrane domain essential for transporter function having the most severe effects. Molecular dynamics simulations revealed that these mutations significantly increase local flexibility, alter hydrogen bonding patterns, and enhance solvent accessibility, resulting in structural perturbations. Notably, D305G and F142S disrupted the transmembrane domains essential for the function of cystinosin, whereas compared with the wild-type protein, G309V resulted in increased stability. Conservation analysis revealed that 16 of the 19 mutations affected highly conserved residues, indicating their crucial roles in the function of cystinosin. Additionally, protein interaction analyses suggested that mutations could impact associations with lysosomal and membrane transport proteins.</p><p><strong>Conclusions: </strong>This study identified 19 deleterious nsSNPs in the CTNS gene that impair cystinosin stability and function. These findings highlight the structural and functional importance of key residues, such as G308, D305, and F142, which play critical roles in maintaining the active conformation and transport capacity of cystinosin. These insights provide a foundation for future experimental validation and the development of targeted therapeutic strategies to mitigate the effects of pathogenic mutations in cystinosis.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"35"},"PeriodicalIF":1.9,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12079974/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144082491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}