Pub Date : 2025-07-12DOI: 10.1186/s12863-025-01338-x
Yi Xu, Ying Mao, Xiaoting Hua, Yan Jiang, Yi Zou, Zhichao Wang, Zubi Liu, Hongrui Zhang, Lingling Lu, Yunsong Yu
Background: Mycobacterium tuberculosis (MTB) is a human-specific pathogen that primarily infects humans, causing tuberculosis (TB). Antimicrobial resistance (AMR) in MTB presents a formidable challenge to global health. The employment of machine learning on whole-genome sequencing data (WGS) presents significant potential for uncovering the genomic mechanisms underlying drug resistance in MTB.
Methods: We used 18 binary matrices, each consisting of genotypes and antimicrobial susceptibility testing phenotypes from a specific MTB-antimicrobial dataset. By constructing training and test datasets on all SNPs, intersected SNPs, and randomly generated SNPs, we developed a Machine learning (ML) framework using twelve different algorithms. Then, we compared the performances of the various ML models and used the SHapley Additive exPlanations (SHAP) framework to decipher why and how decisions are made within the optimal algorithm. Lastly, we applied the models to predict the resistance phenotype to rifampicin (RIF) and isoniazid (INH) in the additional independent MTB isolate datasets from India and Israel.
Results: In our study, the Gradient Boosting Classifier (GBC) model was the best in terms of correctly identified percentages (97.28%, 96.06%, 94.19%, and 92.81% for the four first-line drugs, RIF, INH, pyrazinamide, and ethambutol respectively). By estimating the contributions of AMR-related SNPs by SHAP values, we found that position 761,155 (rpoB_p.Ser450), 2,155,168 (katG_p.Ser315) rank top in RIF and INH, their higher values (1 for alternative allele) tend to predict the resistance trait for these two drugs. In addition, the best model GBC generalizes well in predicting the resistance phenotypes for RIF and INH in the external independent MTB isolate datasets from India and Israel.
Conclusions: This study integrates ML methods into antimicrobial resistance research, develops a framework for predicting resistance phenotypes, and explores AMR-related SNPs in MTB. Quantifying the important SNPs' contribution to model decisions makes the ML algorithmic process more transparent, interpretable enabling and enables clinical practice.
{"title":"Machine learning-based prediction of antimicrobial resistance and identification of AMR-related SNPs in Mycobacterium tuberculosis.","authors":"Yi Xu, Ying Mao, Xiaoting Hua, Yan Jiang, Yi Zou, Zhichao Wang, Zubi Liu, Hongrui Zhang, Lingling Lu, Yunsong Yu","doi":"10.1186/s12863-025-01338-x","DOIUrl":"10.1186/s12863-025-01338-x","url":null,"abstract":"<p><strong>Background: </strong>Mycobacterium tuberculosis (MTB) is a human-specific pathogen that primarily infects humans, causing tuberculosis (TB). Antimicrobial resistance (AMR) in MTB presents a formidable challenge to global health. The employment of machine learning on whole-genome sequencing data (WGS) presents significant potential for uncovering the genomic mechanisms underlying drug resistance in MTB.</p><p><strong>Methods: </strong>We used 18 binary matrices, each consisting of genotypes and antimicrobial susceptibility testing phenotypes from a specific MTB-antimicrobial dataset. By constructing training and test datasets on all SNPs, intersected SNPs, and randomly generated SNPs, we developed a Machine learning (ML) framework using twelve different algorithms. Then, we compared the performances of the various ML models and used the SHapley Additive exPlanations (SHAP) framework to decipher why and how decisions are made within the optimal algorithm. Lastly, we applied the models to predict the resistance phenotype to rifampicin (RIF) and isoniazid (INH) in the additional independent MTB isolate datasets from India and Israel.</p><p><strong>Results: </strong>In our study, the Gradient Boosting Classifier (GBC) model was the best in terms of correctly identified percentages (97.28%, 96.06%, 94.19%, and 92.81% for the four first-line drugs, RIF, INH, pyrazinamide, and ethambutol respectively). By estimating the contributions of AMR-related SNPs by SHAP values, we found that position 761,155 (rpoB_p.Ser450), 2,155,168 (katG_p.Ser315) rank top in RIF and INH, their higher values (1 for alternative allele) tend to predict the resistance trait for these two drugs. In addition, the best model GBC generalizes well in predicting the resistance phenotypes for RIF and INH in the external independent MTB isolate datasets from India and Israel.</p><p><strong>Conclusions: </strong>This study integrates ML methods into antimicrobial resistance research, develops a framework for predicting resistance phenotypes, and explores AMR-related SNPs in MTB. Quantifying the important SNPs' contribution to model decisions makes the ML algorithmic process more transparent, interpretable enabling and enables clinical practice.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"48"},"PeriodicalIF":1.9,"publicationDate":"2025-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12255030/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144621491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-11DOI: 10.1186/s12863-025-01337-y
Viviana Floridia, Arianna Bionda, Katherine Daniela Arias, Annalisa Amato, Matteo Cortellari, Enrico D'Alessandro, Felix Goyache, Vincenzo Lopreiato, Paola Crepaldi, Luigi Liotta, Mario Barbato
Background: Local livestock breeds play a pivotal role in maintaining agricultural sustainability, conserving biodiversity, and preserving cultural heritage. These breeds often possess unique genetic characteristics tailored to their specific environments. The Cinisara is a dual-purpose local cattle breed of Podolian origin, primarily farmed in western Sicily, Italy. However, reports of spurious crossbreeding with cosmopolitan breeds aimed at improving the breed productivity exist. To assess the conservation status and ongoing selective pressures on this unique breed, we genotyped 71 unrelated Cinisara cattle (CIN_A) at 65k SNPs, and extended the dataset with publicly available genotype data of 30 Cinisara individuals sampled 20 years ago (CIN_B). We also included 194 individuals from seven cattle breeds, including the Podolica (POD) breed and the cosmopolitan Holstein (HOL) and Brown Swiss (BRW) breeds. We assessed the genetic diversity, population structure, and determined the extent of introgression from cosmopolitan breeds into Cinisara using local ancestry inference.
Results: Population structure analyses confirmed the Cinisara's Podolian lineage and revealed significant HOL and BRW introgression. While both Cinisara populations, CIN_A and CIN_B, displayed broadly comparable genetic diversity to larger breeds, CIN_B showed reduced heterozygosity and increased inbreeding. CIN_A exhibited higher introgression, suggesting ongoing crossbreeding. Local ancestry was inferred using POD, HOL, and BRW references. CIN_A showed about 258/257 HOL/BRW introgressed SNPs, intercepting 186/131 genes and 1,584/1,772 QTLs. CIN_B had approximately 256/254 HOL/BRW introgressed SNPs, intercepting 218/184 genes and 547/437 QTLs. Predominantly, these regions overlapped with milk production QTLs, but some intercepted genes linked to unique Cinisara traits, like milk quality and climate adaptation, potentially altering breed typicality. Notably, CIN_B shows a potentially higher relative BRW contribution, while CIN_A shows a higher HOL contribution.
Conclusion: Our findings align with the reports of crossbreeding with cosmopolitan breeds to enhance the production performance of Cinisara, and reflect breeding choices such as a reduction in BRW crossing or a preference for HOL. This raises significant concerns regarding the preservation of local breeds, livestock biodiversity, and their cultural and economic value, and highlights the importance of developing informed breeding strategies that balance production improvements with the conservation of genetic heritage.
{"title":"Uncovering the architecture of production-driven introgression in Cinisara cattle breed.","authors":"Viviana Floridia, Arianna Bionda, Katherine Daniela Arias, Annalisa Amato, Matteo Cortellari, Enrico D'Alessandro, Felix Goyache, Vincenzo Lopreiato, Paola Crepaldi, Luigi Liotta, Mario Barbato","doi":"10.1186/s12863-025-01337-y","DOIUrl":"10.1186/s12863-025-01337-y","url":null,"abstract":"<p><strong>Background: </strong>Local livestock breeds play a pivotal role in maintaining agricultural sustainability, conserving biodiversity, and preserving cultural heritage. These breeds often possess unique genetic characteristics tailored to their specific environments. The Cinisara is a dual-purpose local cattle breed of Podolian origin, primarily farmed in western Sicily, Italy. However, reports of spurious crossbreeding with cosmopolitan breeds aimed at improving the breed productivity exist. To assess the conservation status and ongoing selective pressures on this unique breed, we genotyped 71 unrelated Cinisara cattle (CIN_A) at 65k SNPs, and extended the dataset with publicly available genotype data of 30 Cinisara individuals sampled 20 years ago (CIN_B). We also included 194 individuals from seven cattle breeds, including the Podolica (POD) breed and the cosmopolitan Holstein (HOL) and Brown Swiss (BRW) breeds. We assessed the genetic diversity, population structure, and determined the extent of introgression from cosmopolitan breeds into Cinisara using local ancestry inference.</p><p><strong>Results: </strong>Population structure analyses confirmed the Cinisara's Podolian lineage and revealed significant HOL and BRW introgression. While both Cinisara populations, CIN_A and CIN_B, displayed broadly comparable genetic diversity to larger breeds, CIN_B showed reduced heterozygosity and increased inbreeding. CIN_A exhibited higher introgression, suggesting ongoing crossbreeding. Local ancestry was inferred using POD, HOL, and BRW references. CIN_A showed about 258/257 HOL/BRW introgressed SNPs, intercepting 186/131 genes and 1,584/1,772 QTLs. CIN_B had approximately 256/254 HOL/BRW introgressed SNPs, intercepting 218/184 genes and 547/437 QTLs. Predominantly, these regions overlapped with milk production QTLs, but some intercepted genes linked to unique Cinisara traits, like milk quality and climate adaptation, potentially altering breed typicality. Notably, CIN_B shows a potentially higher relative BRW contribution, while CIN_A shows a higher HOL contribution.</p><p><strong>Conclusion: </strong>Our findings align with the reports of crossbreeding with cosmopolitan breeds to enhance the production performance of Cinisara, and reflect breeding choices such as a reduction in BRW crossing or a preference for HOL. This raises significant concerns regarding the preservation of local breeds, livestock biodiversity, and their cultural and economic value, and highlights the importance of developing informed breeding strategies that balance production improvements with the conservation of genetic heritage.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"47"},"PeriodicalIF":1.9,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247468/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144610449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-08DOI: 10.1186/s12863-025-01341-2
Carmen Aguilar, Ruth Lydia Olga Lambertz, Mark Heise, Ana Eulalio, Klaus Schughart
Objective: The innate immune response of an infected host is an essential defense mechanism to fight influenza virus infections in the respiratory tract. This response is essential to limit virus replication and spread. However, an exacerbated response may cause severe immune-pathologies. Therefore, it is very important to better understand innate immune responses at the level of its molecular networks in the context of viral infections.
Data: We infected human lung adenocarcinoma (A549) and human monocytic (THP-1) cells with H3N2 influenza virus A virus and performed transcriptome analysis using next generation RNA sequencing at various times post infection. We report raw sequence data and normalized log2 transformed gene expression values. This data will allow researchers in the field to identify differentially expressed genes and pathways between the two cell types and over times post infection. Furthermore, our data enables comparisons to molecular studies performed in humans and animal models in the context of respiratory viral infections.
{"title":"The host response to influenza infections in human lung and macrophages cell lines.","authors":"Carmen Aguilar, Ruth Lydia Olga Lambertz, Mark Heise, Ana Eulalio, Klaus Schughart","doi":"10.1186/s12863-025-01341-2","DOIUrl":"10.1186/s12863-025-01341-2","url":null,"abstract":"<p><strong>Objective: </strong>The innate immune response of an infected host is an essential defense mechanism to fight influenza virus infections in the respiratory tract. This response is essential to limit virus replication and spread. However, an exacerbated response may cause severe immune-pathologies. Therefore, it is very important to better understand innate immune responses at the level of its molecular networks in the context of viral infections.</p><p><strong>Data: </strong>We infected human lung adenocarcinoma (A549) and human monocytic (THP-1) cells with H3N2 influenza virus A virus and performed transcriptome analysis using next generation RNA sequencing at various times post infection. We report raw sequence data and normalized log<sub>2</sub> transformed gene expression values. This data will allow researchers in the field to identify differentially expressed genes and pathways between the two cell types and over times post infection. Furthermore, our data enables comparisons to molecular studies performed in humans and animal models in the context of respiratory viral infections.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"46"},"PeriodicalIF":1.9,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12239443/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144593050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-08DOI: 10.1186/s12863-025-01340-3
Mi Jeong Jo, Hye-Jin Go, Jeong Gyu Kim, Gun-Do Kim
Objectives: This study investigates the nerve cord transcriptome of Asterias amurensis to explore its regenerative abilities. By comparing gene expression between a normal group and a group 72 h post-amputation, key genes involved in regeneration were identified. Functional annotation using GO, KEGG, NR, and UniProt databases provided insights into the biological roles of these genes. This research enhances the understanding of A. amurensis regeneration and highlights the need for further transcriptome analysis across different tissues.
Data description: A. amurensis, a starfish species found in the northwestern Pacific, is known for its strong predatory behavior and impact on marine biodiversity. In this study, individuals were divided into a normal group and a 72-hour post-amputation group. De novo transcriptome assembly of the nerve cord identified 257,769 unigenes, which were functionally annotated using GO, KEGG, NR, and UniProt databases. Since only nerve cord tissue was analyzed, additional transcriptome studies on various tissues are required for a more comprehensive understanding of A. amurensis biology.
{"title":"De novo assembly of transcriptome during regeneration post-arm amputation in the starfish, Asterias amurensis.","authors":"Mi Jeong Jo, Hye-Jin Go, Jeong Gyu Kim, Gun-Do Kim","doi":"10.1186/s12863-025-01340-3","DOIUrl":"10.1186/s12863-025-01340-3","url":null,"abstract":"<p><strong>Objectives: </strong>This study investigates the nerve cord transcriptome of Asterias amurensis to explore its regenerative abilities. By comparing gene expression between a normal group and a group 72 h post-amputation, key genes involved in regeneration were identified. Functional annotation using GO, KEGG, NR, and UniProt databases provided insights into the biological roles of these genes. This research enhances the understanding of A. amurensis regeneration and highlights the need for further transcriptome analysis across different tissues.</p><p><strong>Data description: </strong>A. amurensis, a starfish species found in the northwestern Pacific, is known for its strong predatory behavior and impact on marine biodiversity. In this study, individuals were divided into a normal group and a 72-hour post-amputation group. De novo transcriptome assembly of the nerve cord identified 257,769 unigenes, which were functionally annotated using GO, KEGG, NR, and UniProt databases. Since only nerve cord tissue was analyzed, additional transcriptome studies on various tissues are required for a more comprehensive understanding of A. amurensis biology.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"45"},"PeriodicalIF":2.5,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12236018/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-07DOI: 10.1186/s12863-025-01336-z
Liu Sun, Ke Yao, Hang-Jing Wu
Background: Primary carnitine deficiency (PCD) is an autosomal recessive rare disorder of carnitine cycle and carnitine transport caused by pathogenic variants in the SLC22A5 gene. The prevalence of PCD is unclear. This study aimed to estimate the carrier frequency and genetic prevalence of PCD using Genome Aggregation Database (gnomAD) data.
Methods: The pathogenicity of SLC22A5 variants was interpreted according to the American College of Medical Genetics and Genomics (ACMG) standards and guidelines. The minor allele frequency (MAF) of the variants of the SLC22A5 gene in 807,162 individuals was examined to estimate the global prevalence of PCD in nine ethnicities: African/African American (afr), Admixed American (amr), East Asian (eas), Non-Finnish European (nfe), South Asian (sas), Ashkenazi Jewish (asj), Middle Eastern (mid), Finnish (fin) and Remaining individuals (rmi). The global and population-specific carrier frequency and genetic prevalence of PCD were calculated using the Hardy-Weinberg equation.
Results: Total of 213 pathogenic/likely pathogenic variants (PV/LPV) of the SLC22A5 gene were identified according to the ACMG standards and guidelines. The global carrier frequency and genetic prevalence of PCD were 10.6 per thousand (1/95) and 28.2 per million (1/35427), respectively.
Conclusions: The prevalence of PCD is estimated to be 1/35,000 globally, with a range of between 1/450,000 and 1/20,000 depending on ethnicity.
{"title":"The global prevalence and genetic spectrum of primary carnitine deficiency.","authors":"Liu Sun, Ke Yao, Hang-Jing Wu","doi":"10.1186/s12863-025-01336-z","DOIUrl":"10.1186/s12863-025-01336-z","url":null,"abstract":"<p><strong>Background: </strong>Primary carnitine deficiency (PCD) is an autosomal recessive rare disorder of carnitine cycle and carnitine transport caused by pathogenic variants in the SLC22A5 gene. The prevalence of PCD is unclear. This study aimed to estimate the carrier frequency and genetic prevalence of PCD using Genome Aggregation Database (gnomAD) data.</p><p><strong>Methods: </strong>The pathogenicity of SLC22A5 variants was interpreted according to the American College of Medical Genetics and Genomics (ACMG) standards and guidelines. The minor allele frequency (MAF) of the variants of the SLC22A5 gene in 807,162 individuals was examined to estimate the global prevalence of PCD in nine ethnicities: African/African American (afr), Admixed American (amr), East Asian (eas), Non-Finnish European (nfe), South Asian (sas), Ashkenazi Jewish (asj), Middle Eastern (mid), Finnish (fin) and Remaining individuals (rmi). The global and population-specific carrier frequency and genetic prevalence of PCD were calculated using the Hardy-Weinberg equation.</p><p><strong>Results: </strong>Total of 213 pathogenic/likely pathogenic variants (PV/LPV) of the SLC22A5 gene were identified according to the ACMG standards and guidelines. The global carrier frequency and genetic prevalence of PCD were 10.6 per thousand (1/95) and 28.2 per million (1/35427), respectively.</p><p><strong>Conclusions: </strong>The prevalence of PCD is estimated to be 1/35,000 globally, with a range of between 1/450,000 and 1/20,000 depending on ethnicity.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"44"},"PeriodicalIF":1.9,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12235876/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: The degradation products of chitin exhibit various biological activities, giving them significant application potential. Chitinase-producing bacteria can be isolated from diverse environments such as soil, natural waters, and rhizospheres. However, their chitinolytic activity is often limited, particularly at low temperatures.
Data description: In this study, complete genome sequencing of a cold-adapted chitinolytic Pseudomonas strain, GWSMS-1, revealed a 4,606,781-bp linear chromosome with a G+C content of 59%. The genome encodes 4,599 protein-coding genes, 73 tRNA genes, and 27 rRNA genes. Functional annotation through GO, KEGG, and CAZy databases identified a substantial number of chitinase-encoding genes, which likely contribute to its high chitin-degrading capacity. The genomic insights into GWSMS-1 highlight its potential for applications in chitin degradation and offer valuable gene candidates for further research.
{"title":"Genomic analysis of Pseudomonas sp. GWSMS-1 isolated from Antarctica reveals its potential in Chitin hydrolysis.","authors":"Haiyu Zeng, Zheng Wang, Jianjun Wang, Yong Yu, Wei Luo, Huirong Li, Haitao Ding","doi":"10.1186/s12863-025-01335-0","DOIUrl":"10.1186/s12863-025-01335-0","url":null,"abstract":"<p><strong>Objectives: </strong>The degradation products of chitin exhibit various biological activities, giving them significant application potential. Chitinase-producing bacteria can be isolated from diverse environments such as soil, natural waters, and rhizospheres. However, their chitinolytic activity is often limited, particularly at low temperatures.</p><p><strong>Data description: </strong>In this study, complete genome sequencing of a cold-adapted chitinolytic Pseudomonas strain, GWSMS-1, revealed a 4,606,781-bp linear chromosome with a G+C content of 59%. The genome encodes 4,599 protein-coding genes, 73 tRNA genes, and 27 rRNA genes. Functional annotation through GO, KEGG, and CAZy databases identified a substantial number of chitinase-encoding genes, which likely contribute to its high chitin-degrading capacity. The genomic insights into GWSMS-1 highlight its potential for applications in chitin degradation and offer valuable gene candidates for further research.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"43"},"PeriodicalIF":1.9,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12228359/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144565488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-01DOI: 10.1186/s12863-025-01334-1
Mesfin Tafesse Gemeda, Abu Feyisa Meka, Asefa Niguse Mamo, Gessesse Kebede Bekele, Jemal Ali, Musin Kelel Abas
Antibiotic resistance is a major global health concern that caused by the overuse and misuse of antibiotics. Mobile genetic elements have a roles in the transmission of antibiotic resistance genes. The distribution and diversity of antibiotic resistance genes and mobile genetic elements in the microbiome of Sof Umer Cave have yet to be explored. To map the distribution and diversity of antibiotic resistance genes and mobile genetic elements in the microbiome of Sof Umer Cave using high-throughput shotgun sequencing. High-molecular-weight DNA was extracted from homogenized sample using the GeneAll DNA Soil Mini Kit. Purified environmental DNA was sequenced using a NovaSeq PE150. Analysis of the pathogen host interaction database revealed the predominance of pathogenic organisms such as Xanthomonas oryzae, Acinetobacter baumannii, Erwinia amylovora, and Mycobacterium tuberculosis. Similarly, analysis of the virulence factor database confirmed the presence of Type IV pili (VF1240), lipopolysaccharides, capsules, heme biosynthesis (VF0758), and alginate. More than 800 antibiotic resistance genes were identified, with 50% related to glycopeptide resistance, followed by antibiotic resistance genes associated with multidrug efflux pumps (30%), aminoglycoside resistance genes (10%), and unknown genes. A variety of mobile genetic elements were also identified, highlighting their importance in the genetic diversity and adaptation of the microbiome of Sof Umer Cave. These findings underscore the importance of the Sof Umer Cave habitat as a reservoir for antibiotic resistance, emphasizing the need for ongoing monitoring to enhance the understanding and control of antibiotic resistance genes.
抗生素耐药性是一个主要的全球健康问题,它是由抗生素的过度使用和误用引起的。移动遗传元件在抗生素耐药基因的传播中起着重要作用。sofumer洞穴微生物群中抗生素耐药基因和可移动遗传元件的分布和多样性有待探索。利用高通量霰弹枪测序技术研究sofumer Cave微生物组中抗生素耐药基因和可移动遗传元件的分布和多样性。使用GeneAll DNA Soil Mini Kit从匀浆样品中提取高分子量DNA。使用NovaSeq PE150对纯化的环境DNA进行测序。病原菌宿主相互作用数据库分析显示,米黄单胞菌、鲍曼不动杆菌、淀粉杆菌和结核分枝杆菌等致病菌占主导地位。同样,对毒力因子数据库的分析证实了IV型菌毛(VF1240)、脂多糖、胶囊、血红素生物合成(VF0758)和海藻酸盐的存在。共鉴定出800多个抗生素耐药基因,其中50%与糖肽耐药有关,其次是与多药外排泵相关的抗生素耐药基因(30%)、氨基糖苷耐药基因(10%)和未知基因。研究还发现了多种可移动的遗传元件,强调了它们在sofumer洞穴微生物群遗传多样性和适应性中的重要性。这些发现强调了sofumer洞穴栖息地作为抗生素耐药性储存库的重要性,强调了持续监测以提高对抗生素耐药基因的理解和控制的必要性。
{"title":"Diversity of antibiotic resistance genes and mobile genetic elements of Sof Umer Cave microbiomes, Ethiopia.","authors":"Mesfin Tafesse Gemeda, Abu Feyisa Meka, Asefa Niguse Mamo, Gessesse Kebede Bekele, Jemal Ali, Musin Kelel Abas","doi":"10.1186/s12863-025-01334-1","DOIUrl":"10.1186/s12863-025-01334-1","url":null,"abstract":"<p><p>Antibiotic resistance is a major global health concern that caused by the overuse and misuse of antibiotics. Mobile genetic elements have a roles in the transmission of antibiotic resistance genes. The distribution and diversity of antibiotic resistance genes and mobile genetic elements in the microbiome of Sof Umer Cave have yet to be explored. To map the distribution and diversity of antibiotic resistance genes and mobile genetic elements in the microbiome of Sof Umer Cave using high-throughput shotgun sequencing. High-molecular-weight DNA was extracted from homogenized sample using the GeneAll DNA Soil Mini Kit. Purified environmental DNA was sequenced using a NovaSeq PE150. Analysis of the pathogen host interaction database revealed the predominance of pathogenic organisms such as Xanthomonas oryzae, Acinetobacter baumannii, Erwinia amylovora, and Mycobacterium tuberculosis. Similarly, analysis of the virulence factor database confirmed the presence of Type IV pili (VF1240), lipopolysaccharides, capsules, heme biosynthesis (VF0758), and alginate. More than 800 antibiotic resistance genes were identified, with 50% related to glycopeptide resistance, followed by antibiotic resistance genes associated with multidrug efflux pumps (30%), aminoglycoside resistance genes (10%), and unknown genes. A variety of mobile genetic elements were also identified, highlighting their importance in the genetic diversity and adaptation of the microbiome of Sof Umer Cave. These findings underscore the importance of the Sof Umer Cave habitat as a reservoir for antibiotic resistance, emphasizing the need for ongoing monitoring to enhance the understanding and control of antibiotic resistance genes.</p>","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"41"},"PeriodicalIF":1.9,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12220799/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144546298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Genome assembly of the Bos gaurus in Vietnam using nanopore sequencing.","authors":"Thi Dieu Thuy Nguyen, Thi Binh Nguyen Tran, Loan T Nguyen","doi":"10.1186/s12863-025-01331-4","DOIUrl":"10.1186/s12863-025-01331-4","url":null,"abstract":"","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"42"},"PeriodicalIF":1.9,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12219697/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144546299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Amplicon-based metagenomic survey of microbes associated with the organic and inorganic rhizosphere soil of Glycine max L.","authors":"Olubukola Oluranti Babalola, Ijeoma Emelda Osuji, Akinlolu Olalekan Akanmu","doi":"10.1186/s12863-025-01333-2","DOIUrl":"10.1186/s12863-025-01333-2","url":null,"abstract":"","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"40"},"PeriodicalIF":1.9,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12142974/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144251098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-05-28DOI: 10.1186/s12863-025-01330-5
Hayley Goss, Paige Miller, Susan F Zaleski, Robert J Miller, Donna M Schroeder, Henry M Page
{"title":"Draft genome assembly for the purple-hinged rock scallop (Crassadoma gigantea).","authors":"Hayley Goss, Paige Miller, Susan F Zaleski, Robert J Miller, Donna M Schroeder, Henry M Page","doi":"10.1186/s12863-025-01330-5","DOIUrl":"10.1186/s12863-025-01330-5","url":null,"abstract":"","PeriodicalId":72427,"journal":{"name":"BMC genomic data","volume":"26 1","pages":"39"},"PeriodicalIF":1.9,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12121003/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144176070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}