Development and evaluation of statistical and artificial intelligence approaches with microbial shotgun metagenomics data as an untargeted screening tool for use in food production.
Kristen L Beck, Niina Haiminen, Akshay Agarwal, Anna Paola Carrieri, Matthew Madgwick, Jennifer Kelly, Victor Pylro, Ban Kawas, Martin Wiedmann, Erika Ganda
Journal: mSystems, e0084024 (Journal Article; Epub 2024-10-10; published 2024-11-19)
DOI: 10.1128/msystems.00840-24 (https://doi.org/10.1128/msystems.00840-24)
Impact factor: 5.0; JCR: Q1 (Microbiology)
Citations: 0
Abstract
Growing knowledge of the microbial ecology of food products as it relates to quality and safety, together with the established usefulness of machine learning algorithms for anomaly detection in multiple scenarios, suggests that applying microbiome data to anomaly detection could be a valuable approach in food production systems. These methods could be used to identify ingredients that deviate from their typical microbial composition, which could indicate food fraud or safety issues. The objective of this study was to assess the feasibility of using shotgun sequencing data as input to anomaly detection algorithms, with fluid milk as a model system. Contrastive principal component analysis (PCA), cluster-based methods, and explainable artificial intelligence (AI) were evaluated for the detection of two anomalous sample classes using longitudinal metagenomic profiling of fluid milk compared to baseline (BL) samples collected under comparable circumstances. Traditional methods (alpha and beta diversity, clustering-based contrastive PCA, multidimensional scaling, and dendrograms) failed to differentiate anomalous sample classes; however, explainable AI was able to classify anomalous vs baseline samples and indicate microbial drivers in association with antibiotic use. We validated the potential for explainable AI to classify different milk sources using larger publicly available fluid milk 16S rDNA sequencing data sets and demonstrated that explainable AI is able to differentiate between milk storage methods, processing stages, and seasons. Our results indicate that the application of artificial intelligence continues to hold promise in the realm of microbiome data analysis and could present further opportunities for downstream analytic automation to aid in food safety and quality.
Importance: We evaluated the feasibility of using untargeted metagenomic sequencing of raw milk to detect anomalous food ingredient content with artificial intelligence methods, in a study specifically designed to test this hypothesis. We also show, through analysis of publicly available fluid milk microbial data, that our artificial intelligence approach is able to distinguish milk at different stages of processing. The approach could potentially be applied in the food industry for safety and quality control.
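For readers unfamiliar with contrastive PCA, one of the methods named in the abstract, the following is a minimal sketch of the general technique, not the authors' pipeline. It looks for directions that have high variance in a target data set and low variance in a background data set by eigen-decomposing the difference of their covariance matrices. The function name `contrastive_pca`, the contrast weight `alpha`, and the synthetic abundance matrices are all assumptions made for illustration; none come from the paper.

```python
import numpy as np


def contrastive_pca(target, background, alpha=1.0, n_components=2):
    """Project `target` onto directions of high target and low background variance.

    Hypothetical helper for illustration; rows are samples, columns are features.
    """
    # Center each data set on its own feature means
    t = target - target.mean(axis=0)
    b = background - background.mean(axis=0)
    # Sample covariance matrices (features x features)
    cov_t = t.T @ t / (len(t) - 1)
    cov_b = b.T @ b / (len(b) - 1)
    # The contrast matrix is symmetric, so eigh is appropriate
    evals, evecs = np.linalg.eigh(cov_t - alpha * cov_b)
    # eigh returns eigenvalues in ascending order; keep the largest ones
    order = np.argsort(evals)[::-1][:n_components]
    components = evecs[:, order]
    return t @ components, components


# Synthetic stand-ins for taxon abundance tables (assumed data, not from the study)
rng = np.random.default_rng(0)
n_taxa = 20
background = rng.normal(size=(60, n_taxa))
target = rng.normal(size=(40, n_taxa))
target[:20, 0] += 3.0  # hypothetical shift in one taxon for half of the target samples

projected, components = contrastive_pca(target, background, alpha=1.0)
print(projected.shape)
```

Larger values of `alpha` penalize background variance more heavily; in practice several values are usually explored, since no single setting is correct for every data set.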
Journal: mSystems (Biochemistry, Genetics and Molecular Biology: Biochemistry)
CiteScore: 10.50
Self-citation rate: 3.10%
Articles published: 308
Review time: 13 weeks
About the journal:
mSystems™ will publish preeminent work that stems from applying technologies for high-throughput analyses to achieve insights into the metabolic and regulatory systems at the scale of both the single cell and microbial communities. The scope of mSystems™ encompasses all important biological and biochemical findings drawn from analyses of large data sets, as well as new computational approaches for deriving these insights. mSystems™ will welcome submissions from researchers who focus on the microbiome, genomics, metagenomics, transcriptomics, metabolomics, proteomics, glycomics, bioinformatics, and computational microbiology. mSystems™ will provide streamlined decisions, while carrying on ASM's tradition of rigorous peer review.