Pub Date : 2025-08-12DOI: 10.1186/s12863-025-01346-x
Anna Baborski, Stefanie A Barth, Elke Martina Jung, Frank Bloos, Jürgen Rödel, Bettina Löffler, Michael Bauer, Anne Busch
Enterobacter cloacae complex is a group of common opportunistic pathogens on intensive care units. On intensive care units sepsis is treated with high doses of antibiotics. This treatment does not only eliminate pathogenic bacteria but parts of the microbiome community as well. This leads to an imbalance of the gut microbiome. However, some bacteria can survive such treatment due to certain survival and resistance mechanisms. Not only antibiotic resistance mechanisms but also forming strong communities via biofilm formation promotes cell survival. Here, we investigated the properties of the isolate AT70PIP076 from a sepsis patient treated with piperacillin and tazobactam. After biochemical analysis and MALDI-TOF analysis, the strain was found to be Enterobacter cloacae. In addition to in vitro, antimicrobial susceptibility testing the genome was further investigated in situ regarding antibiotic resistance. Further live/dead staining was performed, and the biofilm formation was investigated using confocal laser microscopy (cLSM). The genome shows the presence of biofilm-associated genes EU554560, bcsABZC_AP010953, ehaB, KF662843, and crl. The understanding of the underlying mechanism of survival of potential pathogens might contribute to elucidate potential treatment options.ObjectivesGenomic analysis of a bacterium that can survive antibiotic treatment within the gut of an antibiotictreated patient to elucidate survival and resistance mechanisms.Data descriptionThe isolate AT70PIP076 was isolated in 2021 from feces collected from a patient treated with Piperacillin and tazobactam. Whole genome DNA was isolated using the Nextera DNA Flex microbial colony extraction protocol and the Nextera Flex DNA preparation kit according to the manufacturer's instructions. Following paired-end sequencing was performed on the MiSeq platform (Illumina, Inc., San Diego, CA, USA) using a 300-cycle MiSeq reagent kit and a read length of 151 bp. Contamination check and identification of 16 S RNA sequences was done by using ContESt16S. The genomic sequence contained 4,988,237 bp and the G + C content is represented at 54.80%. This genome and its associated data set will serve as a useful resource for further analyses.
阴沟肠杆菌复合体是重症监护病房中一组常见的机会致病菌。在重症监护病房,败血症是用大剂量抗生素治疗的。这种治疗不仅可以消除致病菌,还可以消除部分微生物群落。这会导致肠道微生物群的不平衡。然而,由于某些生存和耐药机制,一些细菌可以在这种治疗中存活下来。不仅抗生素耐药机制,而且通过生物膜形成强大的群落促进细胞存活。在这里,我们研究了从接受哌拉西林和他唑巴坦治疗的脓毒症患者中分离出的AT70PIP076的性质。经生化分析和MALDI-TOF分析,菌株为阴沟肠杆菌。在体外试验的基础上,进一步研究了基因组对抗生素耐药性的敏感性。进一步进行活/死染色,并使用共聚焦激光显微镜(cLSM)研究生物膜的形成。基因组显示存在生物膜相关基因EU554560、bcsABZC_AP010953、ehaB、KF662843和crl。了解潜在病原体生存的潜在机制可能有助于阐明潜在的治疗选择。目的对一种在接受抗生素治疗的患者肠道内存活的细菌进行基因组分析,以阐明其存活和耐药机制。分离物AT70PIP076于2021年从接受哌拉西林和他唑巴坦治疗的患者收集的粪便中分离出来。根据制造商的说明,使用Nextera DNA Flex微生物菌落提取方案和Nextera Flex DNA制备试剂盒分离全基因组DNA。随后在MiSeq平台(Illumina, Inc., San Diego, CA, USA)上进行配对端测序,使用300循环MiSeq试剂盒,读取长度为151 bp。采用ContESt16S软件对16s RNA序列进行污染检查和鉴定。基因组序列长度为4,988,237 bp, G + C含量为54.80%。该基因组及其相关数据集将为进一步分析提供有用的资源。
{"title":"Surviving antibiotic treatment as a gut bacterium: genomic characterization of an Enterobacter cloacae.","authors":"Anna Baborski, Stefanie A Barth, Elke Martina Jung, Frank Bloos, Jürgen Rödel, Bettina Löffler, Michael Bauer, Anne Busch","doi":"10.1186/s12863-025-01346-x","DOIUrl":"10.1186/s12863-025-01346-x","url":null,"abstract":"<p><p>Enterobacter cloacae complex is a group of common opportunistic pathogens on intensive care units. On intensive care units sepsis is treated with high doses of antibiotics. This treatment does not only eliminate pathogenic bacteria but parts of the microbiome community as well. This leads to an imbalance of the gut microbiome. However, some bacteria can survive such treatment due to certain survival and resistance mechanisms. Not only antibiotic resistance mechanisms but also forming strong communities via biofilm formation promotes cell survival. Here, we investigated the properties of the isolate AT70PIP076 from a sepsis patient treated with piperacillin and tazobactam. After biochemical analysis and MALDI-TOF analysis, the strain was found to be Enterobacter cloacae. In addition to in vitro, antimicrobial susceptibility testing the genome was further investigated in situ regarding antibiotic resistance. Further live/dead staining was performed, and the biofilm formation was investigated using confocal laser microscopy (cLSM). The genome shows the presence of biofilm-associated genes EU554560, bcsABZC_AP010953, ehaB, KF662843, and crl. The understanding of the underlying mechanism of survival of potential pathogens might contribute to elucidate potential treatment options.ObjectivesGenomic analysis of a bacterium that can survive antibiotic treatment within the gut of an antibiotictreated patient to elucidate survival and resistance mechanisms.Data descriptionThe isolate AT70PIP076 was isolated in 2021 from feces collected from a patient treated with Piperacillin and tazobactam. Whole genome DNA was isolated using the Nextera DNA Flex microbial colony extraction protocol and the Nextera Flex DNA preparation kit according to the manufacturer's instructions. Following paired-end sequencing was performed on the MiSeq platform (Illumina, Inc., San Diego, CA, USA) using a 300-cycle MiSeq reagent kit and a read length of 151 bp. Contamination check and identification of 16 S RNA sequences was done by using ContESt16S. The genomic sequence contained 4,988,237 bp and the G + C content is represented at 54.80%. This genome and its associated data set will serve as a useful resource for further analyses.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"56"},"PeriodicalIF":2.5,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12345040/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144838713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: Plant endophytic fungusHormonema carpetanum is the only organism known to produce enfumafungin to date. Enfumafungin is a fernane-type triterpenoid glycoside compound with broad-spectrum antifungal activity, and this leading compound has been developed to antifungal agent ibrexafungerp approved by Food and Drug Administration (FDA). Here we report the first de novo assembled and annotated whole-genome sequence of H. carpetanum ATCC 74360, the original enfumafungin producing strain from the American Type Culture Collection (ATCC).
Data description: We sequenced the genome of H. carpetanum ATCC 74360 through a hybrid sequencing strategy combining Illumina NovaSeq X Plus short-read and PacBio Sequel IIe long-read technologies. Data from the two platforms were combined and the de novo assembly estimated a 32.93 Mbp genome divided into 16 contigs with a GC content of 49.85%. Genome annotation identified 10,433 genes, 253 tRNAs, and 62 rRNAs. antiSMASH predicted biosynthesis gene cluster (BGC) of secondary metabolite, including enfumafungin BGC. This genome and related data files presented here will provide a foundation for further gene editing of this important antibiotic-producing microorganism.
{"title":"Draft genome sequence and annotation of the enfumafungin producing fungus Hormonema carpetanum ATCC 74360.","authors":"Haodong Chen, Junyi Wang, Songbai Yang, Jiawei Tang, Wen Tang, Shaoxin Chen","doi":"10.1186/s12863-025-01348-9","DOIUrl":"10.1186/s12863-025-01348-9","url":null,"abstract":"<p><strong>Objectives: </strong>Plant endophytic fungusHormonema carpetanum is the only organism known to produce enfumafungin to date. Enfumafungin is a fernane-type triterpenoid glycoside compound with broad-spectrum antifungal activity, and this leading compound has been developed to antifungal agent ibrexafungerp approved by Food and Drug Administration (FDA). Here we report the first de novo assembled and annotated whole-genome sequence of H. carpetanum ATCC 74360, the original enfumafungin producing strain from the American Type Culture Collection (ATCC).</p><p><strong>Data description: </strong>We sequenced the genome of H. carpetanum ATCC 74360 through a hybrid sequencing strategy combining Illumina NovaSeq X Plus short-read and PacBio Sequel IIe long-read technologies. Data from the two platforms were combined and the de novo assembly estimated a 32.93 Mbp genome divided into 16 contigs with a GC content of 49.85%. Genome annotation identified 10,433 genes, 253 tRNAs, and 62 rRNAs. antiSMASH predicted biosynthesis gene cluster (BGC) of secondary metabolite, including enfumafungin BGC. This genome and related data files presented here will provide a foundation for further gene editing of this important antibiotic-producing microorganism.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"55"},"PeriodicalIF":2.5,"publicationDate":"2025-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12333168/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144805400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-08-06DOI: 10.1186/s12863-025-01349-8
Olubukola Oluranti Babalola, Oluwaseun Emmanuel Shittu, Ben Jesuorsemwen Enagbonma
Objectives: Leek (Allium ampeloprasum) is a nutritious vegetable popularly cultivated in South Africa and most regions of the world. It is generally recognised as a source of vitamins and vegetables. Nevertheless, little is known about its rhizosphere microbiome and its microbial functional dataset under various fertilization systems. Therefore, this study intended to unveil the rhizosphere microbiome of Allium ampeloprasum and their functional datasets through shotgun metagenomics sequencing analysis.
{"title":"Unveiling the Allium ampeloprasum rhizosphere microbiome and its functional dataset under different fertilization systems.","authors":"Olubukola Oluranti Babalola, Oluwaseun Emmanuel Shittu, Ben Jesuorsemwen Enagbonma","doi":"10.1186/s12863-025-01349-8","DOIUrl":"10.1186/s12863-025-01349-8","url":null,"abstract":"<p><strong>Objectives: </strong>Leek (Allium ampeloprasum) is a nutritious vegetable popularly cultivated in South Africa and most regions of the world. It is generally recognised as a source of vitamins and vegetables. Nevertheless, little is known about its rhizosphere microbiome and its microbial functional dataset under various fertilization systems. Therefore, this study intended to unveil the rhizosphere microbiome of Allium ampeloprasum and their functional datasets through shotgun metagenomics sequencing analysis.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"54"},"PeriodicalIF":2.5,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12330190/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144796273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-31DOI: 10.1186/s12863-025-01345-y
Sarah Babington, Luoyang Ding, Alan J Tilbrook, Shane K Maloney, Dominique Blache
{"title":"A comparison of normalization methods for the expression of genes associated with oxidative stress in the liver of sheep.","authors":"Sarah Babington, Luoyang Ding, Alan J Tilbrook, Shane K Maloney, Dominique Blache","doi":"10.1186/s12863-025-01345-y","DOIUrl":"10.1186/s12863-025-01345-y","url":null,"abstract":"","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"53"},"PeriodicalIF":2.5,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12312509/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144762527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Respiratory Syncytial Virus (RSV) is a major cause of acute respiratory tract infections (ARTI) worldwide, with seasonal outbreaks often influenced by climatic factors. This study investigates the genetic diversity and phylogenetic relationships of RSV strains circulated in Sri Lanka between 2022 December and 2024 February.
Results: Subtyping of 12 RSV-positive samples identified both RSV A and B, with full-length F gene sequences obtained for four samples. Phylogenetic analysis revealed clustering with globally circulating strains. Mutation profiling highlighted key amino acid substitutions, including a rare mutation of N116D in the F protein, which may have functional implications. The limited sample size necessitates further studies with broader geographic coverage for more robust conclusions.
Conclusion: This study provides insights into the genetic diversity of RSV F gene in Sri Lanka.
{"title":"Genomic characterization and phylogenetic analysis of respiratory syncytial virus F gene in Sri Lanka: a comparative study.","authors":"Thejanee Perera, Ishani De Silva, Pavithri Bandara, Asanka Bowatte, Dinithi Rathnayake, Shanika Perera, Saranga Sumathipala, Rohitha Muthugala","doi":"10.1186/s12863-025-01343-0","DOIUrl":"10.1186/s12863-025-01343-0","url":null,"abstract":"<p><strong>Background: </strong>Respiratory Syncytial Virus (RSV) is a major cause of acute respiratory tract infections (ARTI) worldwide, with seasonal outbreaks often influenced by climatic factors. This study investigates the genetic diversity and phylogenetic relationships of RSV strains circulated in Sri Lanka between 2022 December and 2024 February.</p><p><strong>Results: </strong>Subtyping of 12 RSV-positive samples identified both RSV A and B, with full-length F gene sequences obtained for four samples. Phylogenetic analysis revealed clustering with globally circulating strains. Mutation profiling highlighted key amino acid substitutions, including a rare mutation of N116D in the F protein, which may have functional implications. The limited sample size necessitates further studies with broader geographic coverage for more robust conclusions.</p><p><strong>Conclusion: </strong>This study provides insights into the genetic diversity of RSV F gene in Sri Lanka.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"52"},"PeriodicalIF":2.5,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12308955/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144746315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-22DOI: 10.1186/s12863-025-01339-w
Hayley Goss, Paige Miller, Susan F Zaleski, Robert J Miller, Donna M Schroeder, Henry M Page
{"title":"Publisher Correction: Draft genome assembly for the purple-hinged rock scallop (Crassadoma gigantea).","authors":"Hayley Goss, Paige Miller, Susan F Zaleski, Robert J Miller, Donna M Schroeder, Henry M Page","doi":"10.1186/s12863-025-01339-w","DOIUrl":"10.1186/s12863-025-01339-w","url":null,"abstract":"","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"51"},"PeriodicalIF":2.5,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12281668/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144692575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-12DOI: 10.1186/s12863-025-01342-1
Danladi Makeri, Emmanuel Eilu, Martin Odoki, Ismail Abiola Adebayo, Reuben Maghembe, Musoba Abubakar, Reagan Muhwezi, Theophilus Pius, Priscilla Peter Dilli, Saheed Adekunle Akinola, Ezera Agwu
Objective: This study presents the whole-genome sequence of Staphylococcus aureus strain MD02, isolated from a diabetic foot ulcer at a tertiary hospital in southwestern Uganda. The objective was to characterize the genome to understand the isolate's resistance and virulence potential.
Data description: Genomic DNA of S. aureus strain MD02 was extracted and sequenced using the Illumina platform, generating high-quality paired-end reads with an average genome coverage of 32.6×. Quality control was performed with FastQC and reads were trimmed using Trimmomatic. De novo assembly was carried out using SPAdes, resulting in a draft genome of 2,815,980 bp assembled into 68 contigs with a GC content of 32.5%. Quality analysis of the genome revealed 98% completeness, and 2.48% contamination and the closest type strain to our isolate was Staphylococcus aureus (GCA_000330825.2) with an average nucleotide identity of 97.14%% and genome coverage of 89.38% confirming species level identity and close relatedness. Annotation using NCBI Prokaryotic Genome Annotation Pipeline identified 2,701 protein coding genes.
{"title":"Whole genome sequence of a virulent and multidrug resistant Staphylococcus aureus strain MD02 isolated from a diabetic foot ulcer in Uganda.","authors":"Danladi Makeri, Emmanuel Eilu, Martin Odoki, Ismail Abiola Adebayo, Reuben Maghembe, Musoba Abubakar, Reagan Muhwezi, Theophilus Pius, Priscilla Peter Dilli, Saheed Adekunle Akinola, Ezera Agwu","doi":"10.1186/s12863-025-01342-1","DOIUrl":"10.1186/s12863-025-01342-1","url":null,"abstract":"<p><strong>Objective: </strong>This study presents the whole-genome sequence of Staphylococcus aureus strain MD02, isolated from a diabetic foot ulcer at a tertiary hospital in southwestern Uganda. The objective was to characterize the genome to understand the isolate's resistance and virulence potential.</p><p><strong>Data description: </strong>Genomic DNA of S. aureus strain MD02 was extracted and sequenced using the Illumina platform, generating high-quality paired-end reads with an average genome coverage of 32.6×. Quality control was performed with FastQC and reads were trimmed using Trimmomatic. De novo assembly was carried out using SPAdes, resulting in a draft genome of 2,815,980 bp assembled into 68 contigs with a GC content of 32.5%. Quality analysis of the genome revealed 98% completeness, and 2.48% contamination and the closest type strain to our isolate was Staphylococcus aureus (GCA_000330825.2) with an average nucleotide identity of 97.14%% and genome coverage of 89.38% confirming species level identity and close relatedness. Annotation using NCBI Prokaryotic Genome Annotation Pipeline identified 2,701 protein coding genes.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"50"},"PeriodicalIF":1.9,"publicationDate":"2025-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12256001/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144621492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-12DOI: 10.1186/s12863-025-01324-3
Salem Ahmed Bamusa, Wardah Qureshi, Atia Gohar, Muhammad Irfan, Ishtiaq Ahmad Khan, Muhammad Shakeel
The ASXL1 gene is one of the most frequently mutated genes in acute myeloid leukemia (AML). It is associated with signs of aggressiveness and adverse clinical outcomes. The aim of the current study was to analyze the genetic profile of ASXL1 gene mutations and its impact on the overall survival in AML patients from Pakistan.Thirty-eight well characterized AML patients were enrolled, and DNA sequencing of the ASXL1 was performed using the Illumina NextSeq500 next generation sequencing (NGS) system. Standard pipeline of bioinformatics tools was used to determine the mutational profile. The mutational profile of the enrolled AML patients was compared with that of 1000 Genomes project, and TCGA AML datasets.The analysis revealed 43 genetic variants in ASXL1 across the 38 AML patients (1.13 variant/patient). Eight rare variants were observed in exons 12, 13 of the ASXL1 gene. Notably, a recurrent rare nonsynonymous deleterious variant p.G1336S in exon 13 (NM_015338 transcript) was found in two patients (5.26%). The overall survival of the ASXL1+ (but TP53, FLT3, NPM1, EZH2, and WT1 negative) AML was shorter compared with the ASXL1- (p < 0.05). Further, the overall survival of current study ASXL1 + AML was found comparable with that of the TCGA AML.In conclusion, the non-silent mutations in ASXL1 were associated with lower survival in AML. Further studies with larger cohort are suggested for subsequent clinical implementation.
{"title":"Genetic profile of ASXL1 gene in risk assessment in acute myeloid leukemia.","authors":"Salem Ahmed Bamusa, Wardah Qureshi, Atia Gohar, Muhammad Irfan, Ishtiaq Ahmad Khan, Muhammad Shakeel","doi":"10.1186/s12863-025-01324-3","DOIUrl":"10.1186/s12863-025-01324-3","url":null,"abstract":"<p><p>The ASXL1 gene is one of the most frequently mutated genes in acute myeloid leukemia (AML). It is associated with signs of aggressiveness and adverse clinical outcomes. The aim of the current study was to analyze the genetic profile of ASXL1 gene mutations and its impact on the overall survival in AML patients from Pakistan.Thirty-eight well characterized AML patients were enrolled, and DNA sequencing of the ASXL1 was performed using the Illumina NextSeq500 next generation sequencing (NGS) system. Standard pipeline of bioinformatics tools was used to determine the mutational profile. The mutational profile of the enrolled AML patients was compared with that of 1000 Genomes project, and TCGA AML datasets.The analysis revealed 43 genetic variants in ASXL1 across the 38 AML patients (1.13 variant/patient). Eight rare variants were observed in exons 12, 13 of the ASXL1 gene. Notably, a recurrent rare nonsynonymous deleterious variant p.G1336S in exon 13 (NM_015338 transcript) was found in two patients (5.26%). The overall survival of the ASXL1+ (but TP53, FLT3, NPM1, EZH2, and WT1 negative) AML was shorter compared with the ASXL1- (p < 0.05). Further, the overall survival of current study ASXL1 + AML was found comparable with that of the TCGA AML.In conclusion, the non-silent mutations in ASXL1 were associated with lower survival in AML. Further studies with larger cohort are suggested for subsequent clinical implementation.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"49"},"PeriodicalIF":1.9,"publicationDate":"2025-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12255144/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144621490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-12DOI: 10.1186/s12863-025-01338-x
Yi Xu, Ying Mao, Xiaoting Hua, Yan Jiang, Yi Zou, Zhichao Wang, Zubi Liu, Hongrui Zhang, Lingling Lu, Yunsong Yu
Background: Mycobacterium tuberculosis (MTB) is a human-specific pathogen that primarily infects humans, causing tuberculosis (TB). Antimicrobial resistance (AMR) in MTB presents a formidable challenge to global health. The employment of machine learning on whole-genome sequencing data (WGS) presents significant potential for uncovering the genomic mechanisms underlying drug resistance in MTB.
Methods: We used 18 binary matrices, each consisting of genotypes and antimicrobial susceptibility testing phenotypes from a specific MTB-antimicrobial dataset. By constructing training and test datasets on all SNPs, intersected SNPs, and randomly generated SNPs, we developed a Machine learning (ML) framework using twelve different algorithms. Then, we compared the performances of the various ML models and used the SHapley Additive exPlanations (SHAP) framework to decipher why and how decisions are made within the optimal algorithm. Lastly, we applied the models to predict the resistance phenotype to rifampicin (RIF) and isoniazid (INH) in the additional independent MTB isolate datasets from India and Israel.
Results: In our study, the Gradient Boosting Classifier (GBC) model was the best in terms of correctly identified percentages (97.28%, 96.06%, 94.19%, and 92.81% for the four first-line drugs, RIF, INH, pyrazinamide, and ethambutol respectively). By estimating the contributions of AMR-related SNPs by SHAP values, we found that position 761,155 (rpoB_p.Ser450), 2,155,168 (katG_p.Ser315) rank top in RIF and INH, their higher values (1 for alternative allele) tend to predict the resistance trait for these two drugs. In addition, the best model GBC generalizes well in predicting the resistance phenotypes for RIF and INH in the external independent MTB isolate datasets from India and Israel.
Conclusions: This study integrates ML methods into antimicrobial resistance research, develops a framework for predicting resistance phenotypes, and explores AMR-related SNPs in MTB. Quantifying the important SNPs' contribution to model decisions makes the ML algorithmic process more transparent, interpretable enabling and enables clinical practice.
{"title":"Machine learning-based prediction of antimicrobial resistance and identification of AMR-related SNPs in Mycobacterium tuberculosis.","authors":"Yi Xu, Ying Mao, Xiaoting Hua, Yan Jiang, Yi Zou, Zhichao Wang, Zubi Liu, Hongrui Zhang, Lingling Lu, Yunsong Yu","doi":"10.1186/s12863-025-01338-x","DOIUrl":"10.1186/s12863-025-01338-x","url":null,"abstract":"<p><strong>Background: </strong>Mycobacterium tuberculosis (MTB) is a human-specific pathogen that primarily infects humans, causing tuberculosis (TB). Antimicrobial resistance (AMR) in MTB presents a formidable challenge to global health. The employment of machine learning on whole-genome sequencing data (WGS) presents significant potential for uncovering the genomic mechanisms underlying drug resistance in MTB.</p><p><strong>Methods: </strong>We used 18 binary matrices, each consisting of genotypes and antimicrobial susceptibility testing phenotypes from a specific MTB-antimicrobial dataset. By constructing training and test datasets on all SNPs, intersected SNPs, and randomly generated SNPs, we developed a Machine learning (ML) framework using twelve different algorithms. Then, we compared the performances of the various ML models and used the SHapley Additive exPlanations (SHAP) framework to decipher why and how decisions are made within the optimal algorithm. Lastly, we applied the models to predict the resistance phenotype to rifampicin (RIF) and isoniazid (INH) in the additional independent MTB isolate datasets from India and Israel.</p><p><strong>Results: </strong>In our study, the Gradient Boosting Classifier (GBC) model was the best in terms of correctly identified percentages (97.28%, 96.06%, 94.19%, and 92.81% for the four first-line drugs, RIF, INH, pyrazinamide, and ethambutol respectively). By estimating the contributions of AMR-related SNPs by SHAP values, we found that position 761,155 (rpoB_p.Ser450), 2,155,168 (katG_p.Ser315) rank top in RIF and INH, their higher values (1 for alternative allele) tend to predict the resistance trait for these two drugs. In addition, the best model GBC generalizes well in predicting the resistance phenotypes for RIF and INH in the external independent MTB isolate datasets from India and Israel.</p><p><strong>Conclusions: </strong>This study integrates ML methods into antimicrobial resistance research, develops a framework for predicting resistance phenotypes, and explores AMR-related SNPs in MTB. Quantifying the important SNPs' contribution to model decisions makes the ML algorithmic process more transparent, interpretable enabling and enables clinical practice.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"48"},"PeriodicalIF":1.9,"publicationDate":"2025-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12255030/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144621491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-11DOI: 10.1186/s12863-025-01337-y
Viviana Floridia, Arianna Bionda, Katherine Daniela Arias, Annalisa Amato, Matteo Cortellari, Enrico D'Alessandro, Felix Goyache, Vincenzo Lopreiato, Paola Crepaldi, Luigi Liotta, Mario Barbato
Background: Local livestock breeds play a pivotal role in maintaining agricultural sustainability, conserving biodiversity, and preserving cultural heritage. These breeds often possess unique genetic characteristics tailored to their specific environments. The Cinisara is a dual-purpose local cattle breed of Podolian origin, primarily farmed in western Sicily, Italy. However, reports of spurious crossbreeding with cosmopolitan breeds aimed at improving the breed productivity exist. To assess the conservation status and ongoing selective pressures on this unique breed, we genotyped 71 unrelated Cinisara cattle (CIN_A) at 65k SNPs, and extended the dataset with publicly available genotype data of 30 Cinisara individuals sampled 20 years ago (CIN_B). We also included 194 individuals from seven cattle breeds, including the Podolica (POD) breed and the cosmopolitan Holstein (HOL) and Brown Swiss (BRW) breeds. We assessed the genetic diversity, population structure, and determined the extent of introgression from cosmopolitan breeds into Cinisara using local ancestry inference.
Results: Population structure analyses confirmed the Cinisara's Podolian lineage and revealed significant HOL and BRW introgression. While both Cinisara populations, CIN_A and CIN_B, displayed broadly comparable genetic diversity to larger breeds, CIN_B showed reduced heterozygosity and increased inbreeding. CIN_A exhibited higher introgression, suggesting ongoing crossbreeding. Local ancestry was inferred using POD, HOL, and BRW references. CIN_A showed about 258/257 HOL/BRW introgressed SNPs, intercepting 186/131 genes and 1,584/1,772 QTLs. CIN_B had approximately 256/254 HOL/BRW introgressed SNPs, intercepting 218/184 genes and 547/437 QTLs. Predominantly, these regions overlapped with milk production QTLs, but some intercepted genes linked to unique Cinisara traits, like milk quality and climate adaptation, potentially altering breed typicality. Notably, CIN_B shows a potentially higher relative BRW contribution, while CIN_A shows a higher HOL contribution.
Conclusion: Our findings align with the reports of crossbreeding with cosmopolitan breeds to enhance the production performance of Cinisara, and reflect breeding choices such as a reduction in BRW crossing or a preference for HOL. This raises significant concerns regarding the preservation of local breeds, livestock biodiversity, and their cultural and economic value, and highlights the importance of developing informed breeding strategies that balance production improvements with the conservation of genetic heritage.
{"title":"Uncovering the architecture of production-driven introgression in Cinisara cattle breed.","authors":"Viviana Floridia, Arianna Bionda, Katherine Daniela Arias, Annalisa Amato, Matteo Cortellari, Enrico D'Alessandro, Felix Goyache, Vincenzo Lopreiato, Paola Crepaldi, Luigi Liotta, Mario Barbato","doi":"10.1186/s12863-025-01337-y","DOIUrl":"10.1186/s12863-025-01337-y","url":null,"abstract":"<p><strong>Background: </strong>Local livestock breeds play a pivotal role in maintaining agricultural sustainability, conserving biodiversity, and preserving cultural heritage. These breeds often possess unique genetic characteristics tailored to their specific environments. The Cinisara is a dual-purpose local cattle breed of Podolian origin, primarily farmed in western Sicily, Italy. However, reports of spurious crossbreeding with cosmopolitan breeds aimed at improving the breed productivity exist. To assess the conservation status and ongoing selective pressures on this unique breed, we genotyped 71 unrelated Cinisara cattle (CIN_A) at 65k SNPs, and extended the dataset with publicly available genotype data of 30 Cinisara individuals sampled 20 years ago (CIN_B). We also included 194 individuals from seven cattle breeds, including the Podolica (POD) breed and the cosmopolitan Holstein (HOL) and Brown Swiss (BRW) breeds. We assessed the genetic diversity, population structure, and determined the extent of introgression from cosmopolitan breeds into Cinisara using local ancestry inference.</p><p><strong>Results: </strong>Population structure analyses confirmed the Cinisara's Podolian lineage and revealed significant HOL and BRW introgression. While both Cinisara populations, CIN_A and CIN_B, displayed broadly comparable genetic diversity to larger breeds, CIN_B showed reduced heterozygosity and increased inbreeding. CIN_A exhibited higher introgression, suggesting ongoing crossbreeding. Local ancestry was inferred using POD, HOL, and BRW references. CIN_A showed about 258/257 HOL/BRW introgressed SNPs, intercepting 186/131 genes and 1,584/1,772 QTLs. CIN_B had approximately 256/254 HOL/BRW introgressed SNPs, intercepting 218/184 genes and 547/437 QTLs. Predominantly, these regions overlapped with milk production QTLs, but some intercepted genes linked to unique Cinisara traits, like milk quality and climate adaptation, potentially altering breed typicality. Notably, CIN_B shows a potentially higher relative BRW contribution, while CIN_A shows a higher HOL contribution.</p><p><strong>Conclusion: </strong>Our findings align with the reports of crossbreeding with cosmopolitan breeds to enhance the production performance of Cinisara, and reflect breeding choices such as a reduction in BRW crossing or a preference for HOL. This raises significant concerns regarding the preservation of local breeds, livestock biodiversity, and their cultural and economic value, and highlights the importance of developing informed breeding strategies that balance production improvements with the conservation of genetic heritage.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"47"},"PeriodicalIF":1.9,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247468/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144610449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}