Hieu M Le, Tianqi Li, Jimena G Villareal, Jie Gao, Yaxi Hu
{"title":"Rapid Authentication of Plant-Based Milk Alternatives by Coupling Portable Raman Spectroscopy with Machine Learning.","authors":"Hieu M Le, Tianqi Li, Jimena G Villareal, Jie Gao, Yaxi Hu","doi":"10.1093/jaoacint/qsaf022","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Plant-based milk alternatives (PBMA) are increasingly popular due to rising lactose intolerance and environmental concerns over traditional dairy products. However, limited efforts have been made to develop rapid authentication methods to verify their biological origin.</p><p><strong>Objective: </strong>In this study, we developed a rapid, on-site analytical method for the authentication and identification of PBMA made by six different plant species utilizing a portable Raman spectrometer coupled with machine learning.</p><p><strong>Methods: </strong>Unprocessed PBMA (i.e., blended raw nut/grain) and processed PBMA that mimic the industrial processing procedures (i.e., filtration and pasteurization) were prepared in lab and subjected to Raman spectral collection without any sample preparation. Three machine learning algorithms [i.e., k-nearest neighbor (KNN), support vector machine (SVM) and random forest (RF)] were tested and compared.</p><p><strong>Results: </strong>RF achieved the best performance in recognizing the plant sources for the unprocessed PBMA, with accuracies of 96.88% and 95.83% in the cross-validation and test set prediction, respectively. Due to small sample size and risk of overfitting, classification models for the biological origin of processed PBMA were constructed by combining Raman spectra of the unprocessed and processed samples. Again, RF models achieved the highest accuracy in identifying the species, i.e., 94.27% in cross-validation and 94.44% in prediction.</p><p><strong>Conclusions: </strong>These results indicated that the portable Raman spectrometer captured the chemical fingerprints that can effectively identify the plant species of different PBMA. Using this non-destructive Raman spectroscopic based method, the overall analysis from sample to answer was completed within 5 min, providing inspection laboratories a rapid and reliable screening tool to ensure the authenticity of the biological origin of PBMA.</p><p><strong>Highlights: </strong>This study presents a novel method for rapid and non-destructive identification of the plant sources of PBMA (both unprocessed and processed) based on the Raman spectroscopic technique and machine learning algorithms.</p>","PeriodicalId":94064,"journal":{"name":"Journal of AOAC International","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of AOAC International","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jaoacint/qsaf022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Plant-based milk alternatives (PBMA) are increasingly popular due to rising lactose intolerance and environmental concerns over traditional dairy products. However, limited efforts have been made to develop rapid authentication methods to verify their biological origin.
Objective: In this study, we developed a rapid, on-site analytical method for the authentication and identification of PBMA made by six different plant species utilizing a portable Raman spectrometer coupled with machine learning.
Methods: Unprocessed PBMA (i.e., blended raw nut/grain) and processed PBMA that mimic the industrial processing procedures (i.e., filtration and pasteurization) were prepared in lab and subjected to Raman spectral collection without any sample preparation. Three machine learning algorithms [i.e., k-nearest neighbor (KNN), support vector machine (SVM) and random forest (RF)] were tested and compared.
Results: RF achieved the best performance in recognizing the plant sources for the unprocessed PBMA, with accuracies of 96.88% and 95.83% in the cross-validation and test set prediction, respectively. Due to small sample size and risk of overfitting, classification models for the biological origin of processed PBMA were constructed by combining Raman spectra of the unprocessed and processed samples. Again, RF models achieved the highest accuracy in identifying the species, i.e., 94.27% in cross-validation and 94.44% in prediction.
Conclusions: These results indicated that the portable Raman spectrometer captured the chemical fingerprints that can effectively identify the plant species of different PBMA. Using this non-destructive Raman spectroscopic based method, the overall analysis from sample to answer was completed within 5 min, providing inspection laboratories a rapid and reliable screening tool to ensure the authenticity of the biological origin of PBMA.
Highlights: This study presents a novel method for rapid and non-destructive identification of the plant sources of PBMA (both unprocessed and processed) based on the Raman spectroscopic technique and machine learning algorithms.