Erica Billig Rose, Molly K. Steele, Beth Tolar, James Pettengill, Michael Batz, Michael Bazaco, Berhanu Tameru, Zhaohui Cui, Rebecca L. Lindsey, Mustafa Simmons, Jess Chen, Drew Posny, Heather Carleton, Beau B. Bruce
{"title":"Attribution of Salmonella enterica to Food Sources by Using Whole-Genome Sequencing Data","authors":"Erica Billig Rose, Molly K. Steele, Beth Tolar, James Pettengill, Michael Batz, Michael Bazaco, Berhanu Tameru, Zhaohui Cui, Rebecca L. Lindsey, Mustafa Simmons, Jess Chen, Drew Posny, Heather Carleton, Beau B. Bruce","doi":"10.3201/eid3104.241172","DOIUrl":null,"url":null,"abstract":"<p><em>Salmonella enterica</em> bacteria are a leading cause of foodborne illness in the United States; however, most <em>Salmonella</em> illnesses are not associated with known outbreaks, and predicting the source of sporadic illnesses remains a challenge. We used a supervised random forest model to determine the most likely sources responsible for human salmonellosis cases in the United States. We trained the model by using whole-genome multilocus sequence typing data from 18,661 <em>Salmonella</em> isolates from collected single food sources and used feature selection to determine the subset of loci most influential for prediction. The overall out-of-bag accuracy of the trained model was 91%; the highest prediction accuracy was for chicken (97%). We applied the trained model to 6,470 isolates from humans with unknown exposure to predict the source of infection. Our model predicted that >33% of the human-derived <em>Salmonella</em> isolates originated from chicken and 27% were from vegetables.</p>","PeriodicalId":11595,"journal":{"name":"Emerging Infectious Diseases","volume":"63 1","pages":""},"PeriodicalIF":7.2000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Emerging Infectious Diseases","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3201/eid3104.241172","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Salmonella enterica bacteria are a leading cause of foodborne illness in the United States; however, most Salmonella illnesses are not associated with known outbreaks, and predicting the source of sporadic illnesses remains a challenge. We used a supervised random forest model to determine the most likely sources responsible for human salmonellosis cases in the United States. We trained the model by using whole-genome multilocus sequence typing data from 18,661 Salmonella isolates from collected single food sources and used feature selection to determine the subset of loci most influential for prediction. The overall out-of-bag accuracy of the trained model was 91%; the highest prediction accuracy was for chicken (97%). We applied the trained model to 6,470 isolates from humans with unknown exposure to predict the source of infection. Our model predicted that >33% of the human-derived Salmonella isolates originated from chicken and 27% were from vegetables.
期刊介绍:
Emerging Infectious Diseases is a monthly open access journal published by the Centers for Disease Control and Prevention. The primary goal of this peer-reviewed journal is to advance the global recognition of both new and reemerging infectious diseases, while also enhancing our understanding of the underlying factors that contribute to disease emergence, prevention, and elimination.
Targeted towards professionals in the field of infectious diseases and related sciences, the journal encourages diverse contributions from experts in academic research, industry, clinical practice, public health, as well as specialists in economics, social sciences, and other relevant disciplines. By fostering a collaborative approach, Emerging Infectious Diseases aims to facilitate interdisciplinary dialogue and address the multifaceted challenges posed by infectious diseases.