BMC genomic data最新文献_第6页

Complete genome sequence of Streptococcus hominis isolated from subgingival biofilm. 龈下生物膜分离的人链球菌全基因组序列。

IF 2.5 Q3 GENETICS & HEREDITY

BMC genomic data

Pub Date : 2025-09-29 DOI: 10.1186/s12863-025-01367-6

Seok Bin Yang, Doyun Ku, Ji-Hoi Moon, Jae-Hyung Lee, Sang Wook Kang, Hak Kyun Kim, Kyu Hwan Kwack

Objective: Streptococcus hominis is a recently described species within the genus Streptococcus, yet its genomic characteristics remain poorly understood, particularly in the context of the oral microbiome. Previously, only two complete genomes from non-oral sources were available. To address this gap, we sequenced and analyzed S. hominis strain KHUD_010, isolated from the subgingival biofilm of a healthy Korean adult.

Data description: Genomic DNA from KHUD_010 was extracted and confirmed as S. hominis by 16 S rRNA gene sequencing. Whole-genome sequencing using the PacBio Sequel II platform generated 135,974 HiFi reads (N50: 10,345 bp). De novo assembly with SMRT Link v11.0 produced a single circular chromosome of 1,883,665 bp with 39.04% GC content. Annotation via the NCBI Prokaryotic Genome Annotation Pipeline predicted 1,793 protein-coding genes, four rRNA operons (5 S, 16 S, 23 S), and 120 tRNAs. BUSCO analysis showed 99.1% completeness. Comparative genomics with NSJ-17 and UMB6992B revealed 1,416 core, 223 dispensable, and 398 strain-specific gene clusters. KHUD_010 harbored 18 unique gene clusters comprising 20 genes, mostly assigned to COG category L (replication, recombination, repair). This high-quality genome expands the genomic landscape of S. hominis and provides a valuable reference for future studies on oral microbiome diversity and host adaptation.

目的：人链球菌是链球菌属中最近被描述的一种，但其基因组特征仍然知之甚少，特别是在口腔微生物组的背景下。以前，只有两个来自非口服来源的完整基因组可用。为了解决这一空白，我们测序并分析了从健康韩国成年人牙龈下生物膜分离的人链球菌KHUD_010菌株。数据描述：提取KHUD_010的基因组DNA，经16s rRNA基因测序确认为人源链球菌。使用PacBio Sequel II平台进行全基因组测序，产生135,974个HiFi读数（N50: 10,345 bp）。用SMRT Link v11.0重新组装得到一条1,883,665 bp的单圆形染色体，GC含量为39.04%。通过NCBI原核基因组注释管道预测了1793个蛋白质编码基因，4个rRNA操纵子（5 S, 16 S, 23 S）和120个trna。BUSCO分析的完备性为99.1%。与NSJ-17和UMB6992B进行比较基因组学分析，共发现1416个核心基因簇、223个非必需基因簇和398个菌株特异性基因簇。KHUD_010有18个独特的基因簇，包括20个基因，主要归属于COG类L（复制、重组、修复）。这一高质量的基因组扩展了人类链球菌的基因组景观，为未来口腔微生物多样性和宿主适应性的研究提供了有价值的参考。

{"title":"Complete genome sequence of Streptococcus hominis isolated from subgingival biofilm.","authors":"Seok Bin Yang, Doyun Ku, Ji-Hoi Moon, Jae-Hyung Lee, Sang Wook Kang, Hak Kyun Kim, Kyu Hwan Kwack","doi":"10.1186/s12863-025-01367-6","DOIUrl":"10.1186/s12863-025-01367-6","url":null,"abstract":"Objective: Streptococcus hominis is a recently described species within the genus Streptococcus, yet its genomic characteristics remain poorly understood, particularly in the context of the oral microbiome. Previously, only two complete genomes from non-oral sources were available. To address this gap, we sequenced and analyzed S. hominis strain KHUD_010, isolated from the subgingival biofilm of a healthy Korean adult.Data description: Genomic DNA from KHUD_010 was extracted and confirmed as S. hominis by 16 S rRNA gene sequencing. Whole-genome sequencing using the PacBio Sequel II platform generated 135,974 HiFi reads (N50: 10,345 bp). De novo assembly with SMRT Link v11.0 produced a single circular chromosome of 1,883,665 bp with 39.04% GC content. Annotation via the NCBI Prokaryotic Genome Annotation Pipeline predicted 1,793 protein-coding genes, four rRNA operons (5 S, 16 S, 23 S), and 120 tRNAs. BUSCO analysis showed 99.1% completeness. Comparative genomics with NSJ-17 and UMB6992B revealed 1,416 core, 223 dispensable, and 398 strain-specific gene clusters. KHUD_010 harbored 18 unique gene clusters comprising 20 genes, mostly assigned to COG category L (replication, recombination, repair). This high-quality genome expands the genomic landscape of S. hominis and provides a valuable reference for future studies on oral microbiome diversity and host adaptation.","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"69"},"PeriodicalIF":2.5,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12482221/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145193983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Transcriptome characterization and metabolite accumulation: novel insights into metabolite biosynthesis during Angiopteris fokiensis leaf development. 转录组表征和代谢物积累：福山脉管叶片发育过程中代谢物生物合成的新见解。

IF 2.5 Q3 GENETICS & HEREDITY

BMC genomic data

Pub Date : 2025-09-29 DOI: 10.1186/s12863-025-01366-7

Hongyu Chen, Ye Yang, Bo Wang, Ying Yu, Qingwen Sun

Background: Leafdevelopment represents a crucial stage in the plant life cycle, involving complex morphogenetic and physiological processes governed by evolving molecular mechanisms and metabolite profiles. The growth and maturation of Angiopteris fokiensis Hieron, a species used in traditional Chinese medicine, are characterized by fluctuating metabolite accumulation patterns regulated by largely unknown molecular pathways.

Results: Touncover these pathways, we employed next-generation sequencing to construct the A. fokiensis leaf transcriptome at two distinct developmental stages, allowing for a comprehensive analysis of gene expression dynamics while emphasizing the identification of genes that regulate leaf development and metabolite synthesis. The de novo assembly of high-quality sequencing reads generated 117,627 unigenes averaging 1,308 base pairs in length. FPKM analysis uncovered significant transcriptomic alterations during leaf development. Additionally, non-targeted metabolomics identified 1,494 distinct analytes, with lipids representing the most abundant metabolite class in both A. fokiensis samples. In the 'phenylalanine, tyrosine and tryptophan biosynthesis' pathway, two downregulated arogenate dehydrogenase (NADP+) genes (Unigene23378-S4 and Unigene47537-S2) in Stage1 correlated with reduced L-tyrosine levels. In the 'galactose metabolism' pathway, the upregulation of three beta-galactosidase genes (Unigene43641-S6, Unigene43648-S6, Unigene47074-S1) and the downregulation of one (Unigene28294-S2) corresponded to decreased alpha-lactose levels.

Conclusions: This study provides an in-depth examination of the dynamic transcriptomic and metabolomic changes occurring during A. fokiensis leaf development, revealing key regulatory networks and enhancing the annotation of theA. fokiensis genome. These findings lay a crucial groundwork for future research on this medicinal plant.

背景：叶片发育是植物生命周期的一个关键阶段，涉及复杂的形态发生和生理过程，受不断进化的分子机制和代谢物谱的支配。福山脉管蕨（Angiopteris fokiensis Hieron）是一种中药药用植物，其生长和成熟过程中代谢产物的积累模式是由未知的分子途径调控的。结果：为了发现这些途径，我们利用新一代测序技术构建了两个不同发育阶段的福杉叶片转录组，从而全面分析了基因表达动态，同时重点鉴定了调节叶片发育和代谢物合成的基因。高质量测序reads的从头组装产生117,627个unigenes，平均长度为1,308个碱基对。FPKM分析揭示了叶片发育过程中显著的转录组变化。此外，非靶向代谢组学鉴定了1494种不同的分析物，其中脂类代表了两种福山猿猴样本中最丰富的代谢物类别。在“苯丙氨酸、酪氨酸和色氨酸生物合成”途径中，Stage1中两个基因（Unigene23378-S4和Unigene47537-S2）的下调与l -酪氨酸水平降低相关。在“半乳糖代谢”途径中，三个β -半乳糖苷酶基因（Unigene43641-S6、Unigene43648-S6、Unigene47074-S1）的上调和一个基因（Unigene28294-S2）的下调对应于α -乳糖水平的降低。结论：本研究深入研究了福杉叶片发育过程中发生的动态转录组学和代谢组学变化，揭示了关键的调控网络，增强了对theA的注释。fokiensis基因组。这些发现为该药用植物的进一步研究奠定了重要的基础。

{"title":"Transcriptome characterization and metabolite accumulation: novel insights into metabolite biosynthesis during Angiopteris fokiensis leaf development.","authors":"Hongyu Chen, Ye Yang, Bo Wang, Ying Yu, Qingwen Sun","doi":"10.1186/s12863-025-01366-7","DOIUrl":"10.1186/s12863-025-01366-7","url":null,"abstract":"Background: Leafdevelopment represents a crucial stage in the plant life cycle, involving complex morphogenetic and physiological processes governed by evolving molecular mechanisms and metabolite profiles. The growth and maturation of Angiopteris fokiensis Hieron, a species used in traditional Chinese medicine, are characterized by fluctuating metabolite accumulation patterns regulated by largely unknown molecular pathways.Results: Touncover these pathways, we employed next-generation sequencing to construct the A. fokiensis leaf transcriptome at two distinct developmental stages, allowing for a comprehensive analysis of gene expression dynamics while emphasizing the identification of genes that regulate leaf development and metabolite synthesis. The de novo assembly of high-quality sequencing reads generated 117,627 unigenes averaging 1,308 base pairs in length. FPKM analysis uncovered significant transcriptomic alterations during leaf development. Additionally, non-targeted metabolomics identified 1,494 distinct analytes, with lipids representing the most abundant metabolite class in both A. fokiensis samples. In the 'phenylalanine, tyrosine and tryptophan biosynthesis' pathway, two downregulated arogenate dehydrogenase (NADP+) genes (Unigene23378-S4 and Unigene47537-S2) in Stage1 correlated with reduced L-tyrosine levels. In the 'galactose metabolism' pathway, the upregulation of three beta-galactosidase genes (Unigene43641-S6, Unigene43648-S6, Unigene47074-S1) and the downregulation of one (Unigene28294-S2) corresponded to decreased alpha-lactose levels.Conclusions: This study provides an in-depth examination of the dynamic transcriptomic and metabolomic changes occurring during A. fokiensis leaf development, revealing key regulatory networks and enhancing the annotation of theA. fokiensis genome. These findings lay a crucial groundwork for future research on this medicinal plant.","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"70"},"PeriodicalIF":2.5,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12482733/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145194041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Genome-wide identification of QTNs and candidate genes in Ethiopian sorghum (Sorghum bicolor (L.) moench) landraces using SNP-based approaches. 基于snp的埃塞俄比亚高粱（sorghum bicolor (L.) moench）地方品种QTNs和候选基因全基因组鉴定

IF 2.5 Q3 GENETICS & HEREDITY

BMC genomic data

Pub Date : 2025-09-26 DOI: 10.1186/s12863-025-01350-1

Addisu Getahun, Habte Nida, Adugna Abdi Woldesemayat

Background: Sorghum, a diploid C4 cereal (2n = 2x = 20) with a 750 Mbp genome, is widely adaptable to tropical and temperate climates. As its center of origin and diversity, Ethiopia holds valuable genetic variation for improving yield and nutritional traits. This study aimed to identify and functionally characterize quantitative trait nucleotides (QTNs) linked to key agronomic and yield-related traits and their associated candidate genes.Methods: Two hundred sixteen sorghum genotypes were evaluated over two seasons in northwestern Ethiopia using an alpha lattice design. Agronomic traits assessed included days to flowering, days to maturity, plant height, seed number per plant, seed yield, and thousand-seed weight. Genotyping-by-sequencing (GBS) generated 351,692 SNPs, with 50,165 high-quality markers retained. Candidate gene identification and functional characterization were carried out using a combination of bioinformatics tools and publicly available databases. Data normalization and analysis were conducted using META-R and SAS JMP. Linkage disequilibrium was assessed via TASSEL 5.0, and multi-locus genome-wide association study (ML-GWAS) identified significant QTNs (LOD ≥ 4.0) associated with phenotypic traits.Result: This study investigates the genetic basis of key agronomic and yield related traits in sorghum by identifying QTNs associated with phenotypic variation. Descriptive statistics revealed notable variability in traits such as days to flowering (101 days), days to maturity (145.77 days), plant height (357.47 cm), seed number per plant (1808.92 count), seed yield (45.07 g), and thousand-seed weight (23.44 g). Correlation analysis showed strong relationships, particularly between days to flowering and maturity (r = 0.7058). ML-GWAS detected 176 QTNs across all 10 chromosomes, with 34 considered reliable Due to their consistent identification across multiple models. 117 candidate genes were mapped to these QTNs, associated with six major traits: 20 for flowering time, 16 for maturity, 16 for plant height, 17 for seed number per plant, 38 for seed yield, and 10 for seed weight. Key genes included Sobic.001G196700 (flowering time) and Sobic.005G176100 (stress responses). Two important regulatory genes, SbMADS1 and SbFT, were highlighted for their roles in flowering regulation. SbMADS1 influences days to flowering, while SbFT acts as a mobile signal integrating photoperiod cues. These genes are involved in starch and sucrose metabolism pathways, essential for energy storage and mobilization, thereby supporting improved growth and yield in sorghum.Conclusion: This study highlights the complexity of trait inheritance shaped by diverse genetic factors and underscores the significance of major, stable, and unique QTNs for marker-assisted selection. Functional genome annotation revealed that candidate genes are involved in key biological processes and

背景：高粱是一种二倍体C4谷物（2n = 2x = 20），基因组为750mbp，广泛适应热带和温带气候。作为其起源和多样性的中心，埃塞俄比亚拥有提高产量和营养性状的宝贵遗传变异。本研究旨在鉴定和功能表征与关键农艺和产量相关性状及其相关候选基因相关的数量性状核苷酸（QTNs）。方法：采用α晶格设计对埃塞俄比亚西北部两个季节的216种高粱基因型进行了评估。评估的农艺性状包括开花天数、成熟天数、株高、每株种子数、种子产量和千粒重。基因分型测序（GBS）产生351,692个snp，保留50,165个高质量标记。利用生物信息学工具和公开数据库进行候选基因鉴定和功能表征。采用META-R和SAS JMP对数据进行归一化和分析。通过TASSEL 5.0评估连锁不平衡，多位点全基因组关联研究（ML-GWAS）发现与表型性状相关的显著QTNs （LOD≥4.0）。结果：通过鉴定与表型变异相关的qtn，研究了高粱关键农艺性状和产量相关性状的遗传基础。描述性统计结果显示，花期（101天）、成熟期（145.77天）、株高（357.47 cm）、单株种子数（1808.92粒）、产量（45.07 g）和千粒重（23.44 g）等性状存在显著差异。相关分析表明，花期与成熟期之间存在较强的相关性（r = 0.7058）。ML-GWAS在所有10条染色体中检测到176个qtn，其中34个被认为是可靠的，因为它们在多个模型中具有一致的鉴定。117个候选基因被定位到这些QTNs上，与6个主要性状相关：20个与开花时间有关，16个与成熟度有关，16个与株高有关，17个与单株种子数有关，38个与种子产量有关，10个与种子重量有关。关键基因包括Sobic.001G196700（开花时间）和Sobic.005G176100（胁迫反应）。两个重要的调控基因SbMADS1和SbFT在开花调控中发挥了重要作用。SbMADS1影响开花天数，而SbFT则作为整合光周期线索的移动信号。这些基因参与淀粉和蔗糖代谢途径，对能量储存和动员至关重要，从而支持高粱的生长和产量的提高。结论：本研究强调了性状遗传受多种遗传因素影响的复杂性，强调了主要、稳定和独特的qtn对标记辅助选择的重要性。功能基因组注释显示，候选基因参与关键的生物过程和代谢途径，包括淀粉和蔗糖代谢、次级代谢和激素信号传导。

{"title":"Genome-wide identification of QTNs and candidate genes in Ethiopian sorghum (Sorghum bicolor (L.) moench) landraces using SNP-based approaches.","authors":"Addisu Getahun, Habte Nida, Adugna Abdi Woldesemayat","doi":"10.1186/s12863-025-01350-1","DOIUrl":"10.1186/s12863-025-01350-1","url":null,"abstract":"Background: Sorghum, a diploid C4 cereal (2n = 2x = 20) with a 750 Mbp genome, is widely adaptable to tropical and temperate climates. As its center of origin and diversity, Ethiopia holds valuable genetic variation for improving yield and nutritional traits. This study aimed to identify and functionally characterize quantitative trait nucleotides (QTNs) linked to key agronomic and yield-related traits and their associated candidate genes.Methods: Two hundred sixteen sorghum genotypes were evaluated over two seasons in northwestern Ethiopia using an alpha lattice design. Agronomic traits assessed included days to flowering, days to maturity, plant height, seed number per plant, seed yield, and thousand-seed weight. Genotyping-by-sequencing (GBS) generated 351,692 SNPs, with 50,165 high-quality markers retained. Candidate gene identification and functional characterization were carried out using a combination of bioinformatics tools and publicly available databases. Data normalization and analysis were conducted using META-R and SAS JMP. Linkage disequilibrium was assessed via TASSEL 5.0, and multi-locus genome-wide association study (ML-GWAS) identified significant QTNs (LOD ≥ 4.0) associated with phenotypic traits.Result: This study investigates the genetic basis of key agronomic and yield related traits in sorghum by identifying QTNs associated with phenotypic variation. Descriptive statistics revealed notable variability in traits such as days to flowering (101 days), days to maturity (145.77 days), plant height (357.47 cm), seed number per plant (1808.92 count), seed yield (45.07 g), and thousand-seed weight (23.44 g). Correlation analysis showed strong relationships, particularly between days to flowering and maturity (r = 0.7058). ML-GWAS detected 176 QTNs across all 10 chromosomes, with 34 considered reliable Due to their consistent identification across multiple models. 117 candidate genes were mapped to these QTNs, associated with six major traits: 20 for flowering time, 16 for maturity, 16 for plant height, 17 for seed number per plant, 38 for seed yield, and 10 for seed weight. Key genes included Sobic.001G196700 (flowering time) and Sobic.005G176100 (stress responses). Two important regulatory genes, SbMADS1 and SbFT, were highlighted for their roles in flowering regulation. SbMADS1 influences days to flowering, while SbFT acts as a mobile signal integrating photoperiod cues. These genes are involved in starch and sucrose metabolism pathways, essential for energy storage and mobilization, thereby supporting improved growth and yield in sorghum.Conclusion: This study highlights the complexity of trait inheritance shaped by diverse genetic factors and underscores the significance of major, stable, and unique QTNs for marker-assisted selection. Functional genome annotation revealed that candidate genes are involved in key biological processes and ","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"67"},"PeriodicalIF":2.5,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12465425/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145180591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Genomic typing, antimicrobial resistance gene, virulence factor and plasmid replicon database for the important pathogenic bacteria Staphylococcus aureus. 重要致病菌金黄色葡萄球菌基因组分型、耐药基因、毒力因子及质粒复制子数据库。

IF 2.5 Q3 GENETICS & HEREDITY

BMC genomic data

Pub Date : 2025-09-26 DOI: 10.1186/s12863-025-01363-w

Andrey Shelenkov, Anna Slavokhotova, Mariyam Yunusova, Vladimir Kulikov, Yulia Mikhaylova, Vasiliy Akimkin

Background: Bacterial infections pose a global health threat across clinical and community settings. Over the past decade, the alarming expansion of antimicrobial resistance (AMR) has progressively narrowed therapeutic options, particularly for healthcare-associated infections. This critical situation has been formally recognized by the World Health Organization as a major public health concern. Epidemiological studies have demonstrated that the dissemination of AMR is frequently mediated by specific high-risk bacterial lineages, often designated as "global clones" or "clonal complexes." Consequently, surveillance of these epidemic clones and elucidation of their pathogenic mechanisms and AMR acquisition pathways have become essential research priorities. The advent of whole genome sequencing has revolutionized these investigations, enabling comprehensive epidemiological tracking and detailed analysis of mobile genetic elements responsible for resistance gene transfer. However, despite the exponential increase in available bacterial genome sequences, significant challenges persist. Current genomic datasets often suffer from uneven representation of clinically relevant strains and inconsistent availability of accompanying metadata. These limitations create substantial obstacles for large-scale comparative studies and hinder effective surveillance efforts.

Description: This database represents a comprehensive genomic analysis of 98,950 Staphylococcus aureus isolates, a high-priority bacterial pathogen of global clinical significance. We provide detailed isolate characterization through several established typing schemes including multilocus sequence typing (MLST), clonal complex (CC) assignments, spa typing results, and core genome MLST (cgMLST) profiles. The dataset also documents the presence of CRISPR-Cas systems in these isolates. Beyond fundamental typing data, our resource incorporates the distribution of antimicrobial resistance determinants, virulence factors, and plasmid replicons. These systematically curated genomic features offer researchers valuable insights into isolate epidemiology, resistance mechanisms, and horizontal gene transfer patterns in this highly concerning pathogen.

Conclusion: This database is freely available under CC BY-NC-SA at https://doi.org/10.5281/zenodo.14833440 . The data provided enables researchers to identify optimal reference isolates for various genomic studies, supporting critical investigations into S. aureus epidemiology and antimicrobial resistance evolution. This resource will ultimately inform the development of more effective prevention and control measures against this high-priority pathogen.

背景：细菌感染在临床和社区环境中构成全球健康威胁。在过去十年中，抗菌素耐药性（AMR）的惊人扩张逐渐缩小了治疗选择，特别是针对卫生保健相关感染。这一危急情况已被世界卫生组织正式确认为一个重大公共卫生问题。流行病学研究表明，AMR的传播经常是由特定的高风险细菌谱系介导的，通常被称为“全球克隆”或“克隆复合物”。因此，监测这些流行病克隆并阐明其致病机制和抗菌素耐药性获得途径已成为重要的研究重点。全基因组测序的出现彻底改变了这些调查，使全面的流行病学跟踪和详细分析负责抗性基因转移的移动遗传元件成为可能。然而，尽管可用的细菌基因组序列呈指数增长，但重大挑战仍然存在。目前的基因组数据集通常存在临床相关菌株的不均匀代表和附带元数据的不一致可用性的问题。这些限制为大规模比较研究造成了重大障碍，并阻碍了有效的监测工作。描述：该数据库对98,950株金黄色葡萄球菌进行了全面的基因组分析，金黄色葡萄球菌是一种具有全球临床意义的高优先级细菌病原体。我们通过几种已建立的分型方案，包括多位点序列分型（MLST）、克隆复合体（CC）分配、spa分型结果和核心基因组MLST （cgMLST）谱，提供了详细的分离物特征。该数据集还记录了这些分离株中CRISPR-Cas系统的存在。除了基本的分型数据，我们的资源还包括抗菌素耐药性决定因素、毒力因子和质粒复制子的分布。这些系统整理的基因组特征为研究人员对这种高度关注的病原体的分离流行病学、耐药性机制和水平基因转移模式提供了有价值的见解。结论：该数据库在https://doi.org/10.5281/zenodo.14833440的CC BY-NC-SA下免费提供。提供的数据使研究人员能够确定各种基因组研究的最佳参考分离株，支持对金黄色葡萄球菌流行病学和抗菌素耐药性进化的关键调查。这一资源最终将为制定针对这一高度优先病原体的更有效的预防和控制措施提供信息。

{"title":"Genomic typing, antimicrobial resistance gene, virulence factor and plasmid replicon database for the important pathogenic bacteria Staphylococcus aureus.","authors":"Andrey Shelenkov, Anna Slavokhotova, Mariyam Yunusova, Vladimir Kulikov, Yulia Mikhaylova, Vasiliy Akimkin","doi":"10.1186/s12863-025-01363-w","DOIUrl":"10.1186/s12863-025-01363-w","url":null,"abstract":"Background: Bacterial infections pose a global health threat across clinical and community settings. Over the past decade, the alarming expansion of antimicrobial resistance (AMR) has progressively narrowed therapeutic options, particularly for healthcare-associated infections. This critical situation has been formally recognized by the World Health Organization as a major public health concern. Epidemiological studies have demonstrated that the dissemination of AMR is frequently mediated by specific high-risk bacterial lineages, often designated as \"global clones\" or \"clonal complexes.\" Consequently, surveillance of these epidemic clones and elucidation of their pathogenic mechanisms and AMR acquisition pathways have become essential research priorities. The advent of whole genome sequencing has revolutionized these investigations, enabling comprehensive epidemiological tracking and detailed analysis of mobile genetic elements responsible for resistance gene transfer. However, despite the exponential increase in available bacterial genome sequences, significant challenges persist. Current genomic datasets often suffer from uneven representation of clinically relevant strains and inconsistent availability of accompanying metadata. These limitations create substantial obstacles for large-scale comparative studies and hinder effective surveillance efforts.Description: This database represents a comprehensive genomic analysis of 98,950 Staphylococcus aureus isolates, a high-priority bacterial pathogen of global clinical significance. We provide detailed isolate characterization through several established typing schemes including multilocus sequence typing (MLST), clonal complex (CC) assignments, spa typing results, and core genome MLST (cgMLST) profiles. The dataset also documents the presence of CRISPR-Cas systems in these isolates. Beyond fundamental typing data, our resource incorporates the distribution of antimicrobial resistance determinants, virulence factors, and plasmid replicons. These systematically curated genomic features offer researchers valuable insights into isolate epidemiology, resistance mechanisms, and horizontal gene transfer patterns in this highly concerning pathogen.Conclusion: This database is freely available under CC BY-NC-SA at https://doi.org/10.5281/zenodo.14833440 . The data provided enables researchers to identify optimal reference isolates for various genomic studies, supporting critical investigations into S. aureus epidemiology and antimicrobial resistance evolution. This resource will ultimately inform the development of more effective prevention and control measures against this high-priority pathogen.","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"65"},"PeriodicalIF":2.5,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12465433/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145180607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multi-omics mediation pipeline reveals differential pathways of maternal SNPs affecting newborn adiposity outcomes. 多组学中介管道揭示了母亲snp影响新生儿肥胖结局的不同途径。

IF 2.5 Q3 GENETICS & HEREDITY

BMC genomic data

Pub Date : 2025-09-26 DOI: 10.1186/s12863-025-01355-w

Nathan P Gill, Alan Kuang, Denise M Scholtens

Background: A great deal of previous research describes the impact of the maternal metabolic and genetic milieu on newborn adiposity outcomes. However, much of this research does not focus on all aspects of the problem simultaneously. Studies focusing on metabolic factors may not distinguish between maternal and fetal genetic pathways, while studies that do focus on these different genetic pathways may not incorporate metabolic information into effect estimates or variant classifications. In this paper, we introduce a novel multi-omics pipeline for maternal genetic variant selection and mediation effect testing that can handle all these pathways, and use it to investigate broad patterns in the effects of maternal genetic variants on newborn adiposity outcomes.

Results: A Bayesian network model is used to incorporate both metabolomic and genomic data into an initial filter for maternal variants likely to affect newborn adiposity outcomes through a direct maternal genetic effect, an indirect fetal genetic effect, a maternal metabolic effect, or some combination of these pathways. A mediation model is then fit to these candidate variants and associated outcomes to identify which of these pathways, if any, mediate the total effect. We then group maternal genetic variants according to the relative magnitudes of these three effect pathways. In an application to existing mother-newborn data from the HAPO study, we find that of 78 candidate variants, the majority influence newborn birthweight solely through either a direct maternal or indirect fetal genetic effect (37% and 40%, respectively), a smaller number through both of these (14%), relatively few exclusively through the maternal metabolic pathway (6%), and almost none through a combination of the maternal metabolic pathway with either of the two genetic pathways (3%). We also find that these overall patterns of mediation effects are similar across outcomes.

Conclusions: Our results reveal broad patterns in the effects of maternal genetic variants on newborn adiposity, and identify both new genetic loci and loci known from previous literature to influence newborn adiposity. These results demonstrate the potential for scientific discovery enabled by our multi-omics mediation pipeline, and the approach is broadly applicable for untangling path-specific contributions in the modern integrated multi-omics landscape.

背景：大量先前的研究描述了母体代谢和遗传环境对新生儿肥胖结局的影响。然而，很多研究并没有同时关注这个问题的所有方面。关注代谢因素的研究可能无法区分母体和胎儿的遗传途径，而关注这些不同遗传途径的研究可能不会将代谢信息纳入影响估计或变异分类。在本文中，我们介绍了一种新的多组学管道，用于母体遗传变异选择和中介效应测试，可以处理所有这些途径，并利用它来研究母体遗传变异对新生儿肥胖结局的影响的广泛模式。结果：使用贝叶斯网络模型将代谢组学和基因组学数据合并到母体变异的初始过滤器中，这些变异可能通过直接的母体遗传效应、间接的胎儿遗传效应、母体代谢效应或这些途径的某种组合影响新生儿肥胖结局。然后，将中介模型拟合到这些候选变体和相关结果中，以确定哪些途径（如果有的话）调解了总体效果。然后，我们根据这三种影响途径的相对大小对母体遗传变异进行分组。在对HAPO研究中现有的母婴数据的应用中，我们发现78个候选变异中，大多数仅通过直接母体或间接胎儿遗传效应影响新生儿出生体重（分别为37%和40%），通过这两种遗传效应影响新生儿出生体重的数量较少（14%），完全通过母体代谢途径影响新生儿出生体重的相对较少（6%），几乎没有通过母体代谢途径与两种遗传途径中的任何一种结合影响新生儿出生体重（3%）。我们还发现，这些中介效应的总体模式在不同的结果中是相似的。结论：我们的研究结果揭示了母体遗传变异对新生儿肥胖影响的广泛模式，并确定了新的遗传位点和先前文献中已知的影响新生儿肥胖的基因位点。这些结果表明，我们的多组学中介管道具有科学发现的潜力，并且该方法广泛适用于解开现代集成多组学领域中特定路径的贡献。

{"title":"Multi-omics mediation pipeline reveals differential pathways of maternal SNPs affecting newborn adiposity outcomes.","authors":"Nathan P Gill, Alan Kuang, Denise M Scholtens","doi":"10.1186/s12863-025-01355-w","DOIUrl":"10.1186/s12863-025-01355-w","url":null,"abstract":"Background: A great deal of previous research describes the impact of the maternal metabolic and genetic milieu on newborn adiposity outcomes. However, much of this research does not focus on all aspects of the problem simultaneously. Studies focusing on metabolic factors may not distinguish between maternal and fetal genetic pathways, while studies that do focus on these different genetic pathways may not incorporate metabolic information into effect estimates or variant classifications. In this paper, we introduce a novel multi-omics pipeline for maternal genetic variant selection and mediation effect testing that can handle all these pathways, and use it to investigate broad patterns in the effects of maternal genetic variants on newborn adiposity outcomes.Results: A Bayesian network model is used to incorporate both metabolomic and genomic data into an initial filter for maternal variants likely to affect newborn adiposity outcomes through a direct maternal genetic effect, an indirect fetal genetic effect, a maternal metabolic effect, or some combination of these pathways. A mediation model is then fit to these candidate variants and associated outcomes to identify which of these pathways, if any, mediate the total effect. We then group maternal genetic variants according to the relative magnitudes of these three effect pathways. In an application to existing mother-newborn data from the HAPO study, we find that of 78 candidate variants, the majority influence newborn birthweight solely through either a direct maternal or indirect fetal genetic effect (37% and 40%, respectively), a smaller number through both of these (14%), relatively few exclusively through the maternal metabolic pathway (6%), and almost none through a combination of the maternal metabolic pathway with either of the two genetic pathways (3%). We also find that these overall patterns of mediation effects are similar across outcomes.Conclusions: Our results reveal broad patterns in the effects of maternal genetic variants on newborn adiposity, and identify both new genetic loci and loci known from previous literature to influence newborn adiposity. These results demonstrate the potential for scientific discovery enabled by our multi-omics mediation pipeline, and the approach is broadly applicable for untangling path-specific contributions in the modern integrated multi-omics landscape.","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"66"},"PeriodicalIF":2.5,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12466079/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145180589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Genome-wide association study meta-analysis uncovers novel genetic variants associated with olfactory dysfunction. 全基因组关联研究荟萃分析揭示了与嗅觉功能障碍相关的新型遗传变异。

IF 2.5 Q3 GENETICS & HEREDITY

BMC genomic data

Pub Date : 2025-09-17 DOI: 10.1186/s12863-025-01360-z

Mohammed Aslam Imtiaz, Konstantinos Melas, Adrienne Tin, Valentina Talevi, Honglei Chen, Myriam Fornage, Srishti Shrestha, Martin Gögele, David Emmert, Cristian Pattaro, Peter Pramstaller, Franz Förster, Katrin Horn, Thomas H Mosley, Christian Fuchsberger, Markus Scholz, Monique M B Breteler, N Ahmad Aziz

Background: Olfactory dysfunction is among the earliest signs of many age-related neurodegenerative diseases and has been associated with increased mortality in older adults; however, its genetic basis remains largely unknown. Therefore, here we aimed to elucidate its genetic architecture through a genome-wide association study meta-analysis (GWMA).

Methods: This GWMA included the participants of European ancestry (N = 22,730) enrolled in four different large population-based studies followed by a multi-ancestry GWMA including participants of African ancestry (N = 1,030). Olfactory dysfunction was assessed using a 12-item smell identification test.

Results: GWMA revealed a novel genome-wide significant locus (tagged by single nucleotide polymorphism rs11228623 at the 11q12 locus) associated with olfactory dysfunction. Gene-based analysis revealed a high enrichment for olfactory receptor genes in this region. Phenome-wide association studies demonstrated associations between genetic variants related to olfactory dysfunction and blood cell counts, kidney function, skeletal muscle mass, cholesterol levels and cardiovascular disease. Using individual-level data, we also confirmed and quantified the strength of these associations on a phenotypic level. Moreover, employing two-sample Mendelian Randomization analyses, we found evidence for causal associations between olfactory dysfunction and these phenotypes.

Conclusions: Our findings provide novel insights into the genetic architecture of the sense of smell and highlight its importance for many aspects of human health. Moreover, these findings could facilitate the identification and monitoring of individuals at increased risk of olfactory dysfunction and associated diseases.

背景：嗅觉功能障碍是许多与年龄相关的神经退行性疾病的早期症状之一，并与老年人死亡率增加有关；然而，其遗传基础在很大程度上仍然未知。因此，本研究旨在通过全基因组关联研究荟萃分析（GWMA）阐明其遗传结构。方法：该GWMA纳入了欧洲血统的参与者（N = 22730），他们参加了四项不同的基于人群的大型研究，随后是一项多血统的GWMA，包括非洲血统的参与者（N = 1030）。嗅觉功能障碍评估采用12项嗅觉识别测试。结果：GWMA发现了一个新的与嗅觉功能障碍相关的全基因组显著位点（在11q12位点上以单核苷酸多态性rs11228623标记）。基因分析显示该区域嗅觉受体基因高度富集。全现象关联研究表明，与嗅觉功能障碍相关的遗传变异与血细胞计数、肾功能、骨骼肌质量、胆固醇水平和心血管疾病之间存在关联。使用个体水平的数据，我们也在表型水平上证实并量化了这些关联的强度。此外，采用双样本孟德尔随机化分析，我们发现嗅觉功能障碍与这些表型之间存在因果关系的证据。结论：我们的发现为嗅觉的遗传结构提供了新的见解，并强调了嗅觉对人类健康的许多方面的重要性。此外，这些发现有助于识别和监测嗅觉功能障碍和相关疾病风险增加的个体。

{"title":"Genome-wide association study meta-analysis uncovers novel genetic variants associated with olfactory dysfunction.","authors":"Mohammed Aslam Imtiaz, Konstantinos Melas, Adrienne Tin, Valentina Talevi, Honglei Chen, Myriam Fornage, Srishti Shrestha, Martin Gögele, David Emmert, Cristian Pattaro, Peter Pramstaller, Franz Förster, Katrin Horn, Thomas H Mosley, Christian Fuchsberger, Markus Scholz, Monique M B Breteler, N Ahmad Aziz","doi":"10.1186/s12863-025-01360-z","DOIUrl":"10.1186/s12863-025-01360-z","url":null,"abstract":"Background: Olfactory dysfunction is among the earliest signs of many age-related neurodegenerative diseases and has been associated with increased mortality in older adults; however, its genetic basis remains largely unknown. Therefore, here we aimed to elucidate its genetic architecture through a genome-wide association study meta-analysis (GWMA).Methods: This GWMA included the participants of European ancestry (N = 22,730) enrolled in four different large population-based studies followed by a multi-ancestry GWMA including participants of African ancestry (N = 1,030). Olfactory dysfunction was assessed using a 12-item smell identification test.Results: GWMA revealed a novel genome-wide significant locus (tagged by single nucleotide polymorphism rs11228623 at the 11q12 locus) associated with olfactory dysfunction. Gene-based analysis revealed a high enrichment for olfactory receptor genes in this region. Phenome-wide association studies demonstrated associations between genetic variants related to olfactory dysfunction and blood cell counts, kidney function, skeletal muscle mass, cholesterol levels and cardiovascular disease. Using individual-level data, we also confirmed and quantified the strength of these associations on a phenotypic level. Moreover, employing two-sample Mendelian Randomization analyses, we found evidence for causal associations between olfactory dysfunction and these phenotypes.Conclusions: Our findings provide novel insights into the genetic architecture of the sense of smell and highlight its importance for many aspects of human health. Moreover, these findings could facilitate the identification and monitoring of individuals at increased risk of olfactory dysfunction and associated diseases.","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"64"},"PeriodicalIF":2.5,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12445039/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145082371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Draft genome of the Cuban Painted Landsnail Polymita picta, International Mollusc of the year 2022. 古巴彩绘陆地蜗牛Polymita picta基因组草图，2022年国际软体动物。

IF 2.5 Q3 GENETICS & HEREDITY

BMC genomic data

Pub Date : 2025-09-03 DOI: 10.1186/s12863-025-01356-9

Bernardo Reyes-Tur, Zeyuan Chen, Mario Juan Gordillo-Pérez, Alexander Ben Hamadou, Charlotte Gerheim, Carola Greve, Julia D Sigwart

Objective: The Cuban Painted Landsnail is an iconic endemic tree snail species with distinctive colourful shells used in traditional handicrafts. This species won the International Mollusc of the Year 2022 competition in an open public vote. As the competition prize, we have assembled the draft genome of this species.

Data description: Genomic DNA from Polymita picta (Born, 1778) was sequenced using PacBio HiFi sequencing with a yield of 5.3 million reads (41.4 Gb) and an N50 of 8.1 Kb. The genome size of P. picta was estimated to be 2.9 Gb, and the final assembly was 1.85 Gb, with a total of 22,619 contigs and a contig N50 of 124.2 Kb. BUSCO analysis of the genome assembly indicated a genome completeness of 88.4%, with 7% complete duplicated BUSCOs in metazoa_odb10. The draft genome will be a valuable resource for work on the endangered Cuban Painted Landsnail including monitoring genetic diversity and establishing captive breeding for conservation.

目的：古巴彩绘蜗牛是一种标志性的地方性树蜗牛，其独特的彩色外壳用于传统手工艺品。这个物种在公开投票中赢得了2022年国际软体动物大赛。作为比赛的奖品，我们已经组装了这个物种的基因组草图。数据描述：对Polymita picta（生于1778年）的基因组DNA进行PacBio HiFi测序，产率为530万reads (41.4 Gb)， N50为8.1 Kb。picta的基因组大小估计为2.9 Gb，最终组装量为1.85 Gb，共22,619个contigs， contigs N50为124.2 Kb。基因组组装的BUSCO分析表明，metazoa_odb10的基因组完整性为88.4%，其中有7%的基因组完全重复。基因组草案将成为研究濒临灭绝的古巴彩绘蜗牛的宝贵资源，包括监测遗传多样性和建立圈养繁殖保护。

{"title":"Draft genome of the Cuban Painted Landsnail Polymita picta, International Mollusc of the year 2022.","authors":"Bernardo Reyes-Tur, Zeyuan Chen, Mario Juan Gordillo-Pérez, Alexander Ben Hamadou, Charlotte Gerheim, Carola Greve, Julia D Sigwart","doi":"10.1186/s12863-025-01356-9","DOIUrl":"10.1186/s12863-025-01356-9","url":null,"abstract":"Objective: The Cuban Painted Landsnail is an iconic endemic tree snail species with distinctive colourful shells used in traditional handicrafts. This species won the International Mollusc of the Year 2022 competition in an open public vote. As the competition prize, we have assembled the draft genome of this species.Data description: Genomic DNA from Polymita picta (Born, 1778) was sequenced using PacBio HiFi sequencing with a yield of 5.3 million reads (41.4 Gb) and an N50 of 8.1 Kb. The genome size of P. picta was estimated to be 2.9 Gb, and the final assembly was 1.85 Gb, with a total of 22,619 contigs and a contig N50 of 124.2 Kb. BUSCO analysis of the genome assembly indicated a genome completeness of 88.4%, with 7% complete duplicated BUSCOs in metazoa_odb10. The draft genome will be a valuable resource for work on the endangered Cuban Painted Landsnail including monitoring genetic diversity and establishing captive breeding for conservation.","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"63"},"PeriodicalIF":2.5,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12409939/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144994581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

High-quality genome assembly and annotation of live animal vaccine bacteria strains in South Korea. 韩国活体动物疫苗菌株的高质量基因组组装和注释。

IF 2.5 Q3 GENETICS & HEREDITY

BMC genomic data

Pub Date : 2025-09-02 DOI: 10.1186/s12863-025-01357-8

Yeonkyeong Lee, Jin-Ju Nah, Hyun-Ok Ku, Il Jang

引用次数: 0

Complete genome sequence of the probiotic candidate strain Lacticaseibacillus rhamnosus B3421 isolated from Panax ginseng C. A. Meyer in South Korea. 韩国人参中益生菌候选菌株鼠李糖乳杆菌B3421的全基因组序列

IF 2.5 Q3 GENETICS & HEREDITY

BMC genomic data

Pub Date : 2025-08-28 DOI: 10.1186/s12863-025-01344-z

Gwi-Deuk Jin, Ho-Youn Kim, Eun Bae Kim, Bokyung Lee

Objectives: Lacticaseibacillus rhamnosus is a widely recognized probiotic bacteria with therapeutic applications in human and animal health. The L. rhamnosus B3421 strain, isolated from Panax ginseng, has been reported to be associated with antioxidant and anti-inflammatory properties, supporting its functional potential. We sequenced and analyzed the genome of L. rhamnosus B3421 to evaluate its probiotic potential for human healthcare and animal applications, focusing on genomic features related to safety and functionality.

Data description: In this study, we isolated L. rhamnosus B3421 from Panax ginseng C. A. Meyer (Ginseng) and performed whole-genome sequencing. The genome of L. rhamnosus B3421 consists of 3,000,051 base pairs (bp) with a guanine + cytosine (G + C) content of 46.70%. It encodes 59 transfer RNAs, 15 ribosomal RNAs, and 2,807 coding sequences (CDSs). Of these CDSs, 99.13% (2,758 proteins) were assigned to functional categories in the Clusters of Orthologous Group (COGs) classification system, while 49 proteins remained uncharacterized. Our genome analysis identified no antibiotic resistance (ABR) or antimicrobial resistance (AMR) genes, indicating that L. rhamnosus B3421 is a safe probiotic bacterium with minimal risk of contributing to the horizontal transfer of antibiotic resistance within the gut microbiome. Additionally, the genome contains genes associated with the ggmotif (PF10439), Enterocin X chain beta, and Carnocin CP52, as identified through BAGEL4 analysis, along with 24 other genes related to reductase or peroxidase activities. These genes may confer competitive advantages against pathogenic bacteria and oxidative stress. Our findings highlight the probiotic potential of L. rhamnosus B3421 and its prospective applications in promoting human and animal health.

目的：鼠李糖乳杆菌是一种广泛认可的益生菌，在人类和动物健康中具有治疗作用。L. rhamnosus B3421菌株是从人参中分离出来的，据报道具有抗氧化和抗炎特性，支持其功能潜力。我们对L. rhamnosus B3421的基因组进行了测序和分析，以评估其在人类保健和动物应用中的益生菌潜力，重点关注与安全性和功能相关的基因组特征。资料描述：本研究从人参中分离得到L. rhamnosus B3421，并进行全基因组测序。鼠李糖B3421基因组全长3,000,051个碱基对，鸟嘌呤+胞嘧啶（G + C）含量为46.70%。它编码59个转移rna， 15个核糖体rna和2807个编码序列（CDSs）。在这些CDSs中，99.13%（2,758个蛋白）在COGs分类系统中被分配到功能类别，而49个蛋白仍未被表征。我们的基因组分析未发现抗生素耐药（ABR）或抗菌素耐药（AMR）基因，这表明鼠李糖乳杆菌B3421是一种安全的益生菌，在肠道微生物群中导致抗生素耐药性水平转移的风险很小。此外，通过BAGEL4分析发现，该基因组包含与ggmotif （PF10439）、Enterocin X链β和Carnocin CP52相关的基因，以及其他24个与还原酶或过氧化物酶活性相关的基因。这些基因可能赋予对抗致病菌和氧化应激的竞争优势。我们的研究结果强调了鼠李糖B3421益生菌的潜力及其在促进人类和动物健康方面的潜在应用。

{"title":"Complete genome sequence of the probiotic candidate strain Lacticaseibacillus rhamnosus B3421 isolated from Panax ginseng C. A. Meyer in South Korea.","authors":"Gwi-Deuk Jin, Ho-Youn Kim, Eun Bae Kim, Bokyung Lee","doi":"10.1186/s12863-025-01344-z","DOIUrl":"https://doi.org/10.1186/s12863-025-01344-z","url":null,"abstract":"Objectives: Lacticaseibacillus rhamnosus is a widely recognized probiotic bacteria with therapeutic applications in human and animal health. The L. rhamnosus B3421 strain, isolated from Panax ginseng, has been reported to be associated with antioxidant and anti-inflammatory properties, supporting its functional potential. We sequenced and analyzed the genome of L. rhamnosus B3421 to evaluate its probiotic potential for human healthcare and animal applications, focusing on genomic features related to safety and functionality.Data description: In this study, we isolated L. rhamnosus B3421 from Panax ginseng C. A. Meyer (Ginseng) and performed whole-genome sequencing. The genome of L. rhamnosus B3421 consists of 3,000,051 base pairs (bp) with a guanine + cytosine (G + C) content of 46.70%. It encodes 59 transfer RNAs, 15 ribosomal RNAs, and 2,807 coding sequences (CDSs). Of these CDSs, 99.13% (2,758 proteins) were assigned to functional categories in the Clusters of Orthologous Group (COGs) classification system, while 49 proteins remained uncharacterized. Our genome analysis identified no antibiotic resistance (ABR) or antimicrobial resistance (AMR) genes, indicating that L. rhamnosus B3421 is a safe probiotic bacterium with minimal risk of contributing to the horizontal transfer of antibiotic resistance within the gut microbiome. Additionally, the genome contains genes associated with the ggmotif (PF10439), Enterocin X chain beta, and Carnocin CP52, as identified through BAGEL4 analysis, along with 24 other genes related to reductase or peroxidase activities. These genes may confer competitive advantages against pathogenic bacteria and oxidative stress. Our findings highlight the probiotic potential of L. rhamnosus B3421 and its prospective applications in promoting human and animal health.","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"61"},"PeriodicalIF":2.5,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12395871/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144980728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Dataset of 16S rRNA and ITS gene amplicon sequencing of celery and parsley rhizosphere soils. 芹菜和欧芹根际土壤16S rRNA和ITS基因扩增子测序数据集。

IF 2.5 Q3 GENETICS & HEREDITY

BMC genomic data

Pub Date : 2025-08-25 DOI: 10.1186/s12863-025-01351-0

Olubukola Oluranti Babalola, Florence Oluwayemisi Ogundeji, Akinlolu Olalekan Akanmu

Objectives: This amplicon metagenomic study examines the relative abundance, taxonomic profiles and community structure of bacterial and fungal communities associated with the roots of parsley (Petroselinum crispum) and celery (Apium graveolens) under monocropping and intercropping systems. The study aims to provide a baseline understanding of how intercropping influences rhizosphere microbial dynamics.

Data description: The dataset provides insight into the effects of parsley-celery intercropping system on soil microbial richness, diversity and community structure. Amplicon metagenomic sequencing was performed on the DNA samples, targeting the 16S rRNA gene (V3-V4 region) and the ITS region for bacterial and fungal communities, respectively. The quantified libraries were pooled and sequenced using Illumina platforms, and the raw sequences were analyzed using the Quantitative Insights Into Microbial Ecology (QIIME 2 version 2019.1.) pipeline. The resulting Amplicon Sequence Variant (ASV) profiles revealed Actinobacteria and Protobacteria as the most predominant bacteria phyla, followed by Bacteroidota, Gemmatimonadota and Acidobacteriaota. The most predominant taxonomic distribution of fungi at the phylum level includes Ascomycota and Mortierellomycota. The dataset includes raw sequence reads in FASTQ format (.fastq.gz), which have been deposited in the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI) under the Bioproject Accession numbers; SRP540554 (16S rRNA) and SRP540675 (ITS).

目的：通过扩增子宏基因组研究，研究了单作和间作条件下欧芹（Petroselinum crispum）和芹菜（Apium graveolens）根系相关细菌和真菌群落的相对丰度、分类特征和群落结构。该研究旨在为间作如何影响根际微生物动力学提供一个基本的认识。数据说明：该数据集揭示了欧芹间作制度对土壤微生物丰富度、多样性和群落结构的影响。对DNA样本进行扩增子宏基因组测序，分别针对细菌群落的16S rRNA基因（V3-V4区）和真菌群落的ITS区。使用Illumina平台对定量文库进行汇总和测序，使用Quantitative Insights Into Microbial Ecology （QIIME 2 version 2019.1.）流水线对原始序列进行分析。扩增子序列变异（Amplicon Sequence Variant， ASV）显示放线菌门和原细菌门是最主要的菌门，其次是拟杆菌门、双歧杆菌门和酸杆菌门。在门水平上，真菌最主要的分类分布包括子囊菌门和Mortierellomycota门。该数据集包括FASTQ格式（.fastq.gz）的原始序列读取，已存放在国家生物技术信息中心（NCBI）的序列读取档案（SRA）中，编号为Bioproject Accession number；SRP540554 （16S rRNA）和SRP540675 （ITS）。

{"title":"Dataset of 16S rRNA and ITS gene amplicon sequencing of celery and parsley rhizosphere soils.","authors":"Olubukola Oluranti Babalola, Florence Oluwayemisi Ogundeji, Akinlolu Olalekan Akanmu","doi":"10.1186/s12863-025-01351-0","DOIUrl":"https://doi.org/10.1186/s12863-025-01351-0","url":null,"abstract":"Objectives: This amplicon metagenomic study examines the relative abundance, taxonomic profiles and community structure of bacterial and fungal communities associated with the roots of parsley (Petroselinum crispum) and celery (Apium graveolens) under monocropping and intercropping systems. The study aims to provide a baseline understanding of how intercropping influences rhizosphere microbial dynamics.Data description: The dataset provides insight into the effects of parsley-celery intercropping system on soil microbial richness, diversity and community structure. Amplicon metagenomic sequencing was performed on the DNA samples, targeting the 16S rRNA gene (V3-V4 region) and the ITS region for bacterial and fungal communities, respectively. The quantified libraries were pooled and sequenced using Illumina platforms, and the raw sequences were analyzed using the Quantitative Insights Into Microbial Ecology (QIIME 2 version 2019.1.) pipeline. The resulting Amplicon Sequence Variant (ASV) profiles revealed Actinobacteria and Protobacteria as the most predominant bacteria phyla, followed by Bacteroidota, Gemmatimonadota and Acidobacteriaota. The most predominant taxonomic distribution of fungi at the phylum level includes Ascomycota and Mortierellomycota. The dataset includes raw sequence reads in FASTQ format (.fastq.gz), which have been deposited in the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI) under the Bioproject Accession numbers; SRP540554 (16S rRNA) and SRP540675 (ITS).","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"60"},"PeriodicalIF":2.5,"publicationDate":"2025-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12376418/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144980661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0