Comparison of machine learning models for classifying edible oils using Fourier-transform infrared spectroscopy

IF 1.7 4区 化学 Bulletin of the Korean Chemical Society Pub Date : 2024-12-23 DOI:10.1002/bkcs.12932
Hyeona Lim, Seon Yeong Lee, Jin Young Kim, Yeon Ju Shin, Yerin Jang, Hyeonjin Kim, Byung Hee Kim, Sangdoo Ahn
{"title":"Comparison of machine learning models for classifying edible oils using Fourier-transform infrared spectroscopy","authors":"Hyeona Lim,&nbsp;Seon Yeong Lee,&nbsp;Jin Young Kim,&nbsp;Yeon Ju Shin,&nbsp;Yerin Jang,&nbsp;Hyeonjin Kim,&nbsp;Byung Hee Kim,&nbsp;Sangdoo Ahn","doi":"10.1002/bkcs.12932","DOIUrl":null,"url":null,"abstract":"<p>Accurate classification and authentication of edible oils are essential for maintaining product quality, ensuring consumer safety, and preserving market integrity. Therefore, this study aims to propose Fourier-transform infrared (FT-IR) spectroscopy, combined with advanced machine learning models, as a rapid and non-destructive technique for classifying edible oils. The FT-IR spectra of seven edible oil types were analyzed across three spectral regions: the full range, the C-H stretching range, and the fingerprint region. Both absorbance and second derivative spectra were used to evaluate the influence of spectral preprocessing on classification accuracy. Six machine learning models—principal component analysis followed by linear discriminant analysis (PCA-LDA), k-nearest neighbors, decision tree, random forest, eXtreme Gradient Boosting, and support vector machines (SVM)—were employed to classify the oils, achieving training accuracies of 96.4%–100% and testing accuracies of 88.1%–100%. The second derivative spectra enhanced model performance by improving the resolution of overlapping peaks, particularly in the C<span></span>H and C<span></span>O stretching regions. Additionally, the SHapley Additive exPlanations analysis further revealed the most critical spectral features influencing model predictions, offering valuable insights into the decision-making processes. This study demonstrates the effectiveness of combining FT-IR spectroscopy, second derivative preprocessing, and machine learning techniques for classifying edible oils. The findings highlight the benefits of second derivative spectra in enhancing spectral resolution and the superior classification performance of PCA-LDA and SVM models. These results offer a robust framework for advancing edible oil analysis and emphasize the potential of artificial intelligence in food authentication and quality control.</p>","PeriodicalId":54252,"journal":{"name":"Bulletin of the Korean Chemical Society","volume":"46 2","pages":"131-137"},"PeriodicalIF":1.7000,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of the Korean Chemical Society","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/bkcs.12932","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate classification and authentication of edible oils are essential for maintaining product quality, ensuring consumer safety, and preserving market integrity. Therefore, this study aims to propose Fourier-transform infrared (FT-IR) spectroscopy, combined with advanced machine learning models, as a rapid and non-destructive technique for classifying edible oils. The FT-IR spectra of seven edible oil types were analyzed across three spectral regions: the full range, the C-H stretching range, and the fingerprint region. Both absorbance and second derivative spectra were used to evaluate the influence of spectral preprocessing on classification accuracy. Six machine learning models—principal component analysis followed by linear discriminant analysis (PCA-LDA), k-nearest neighbors, decision tree, random forest, eXtreme Gradient Boosting, and support vector machines (SVM)—were employed to classify the oils, achieving training accuracies of 96.4%–100% and testing accuracies of 88.1%–100%. The second derivative spectra enhanced model performance by improving the resolution of overlapping peaks, particularly in the CH and CO stretching regions. Additionally, the SHapley Additive exPlanations analysis further revealed the most critical spectral features influencing model predictions, offering valuable insights into the decision-making processes. This study demonstrates the effectiveness of combining FT-IR spectroscopy, second derivative preprocessing, and machine learning techniques for classifying edible oils. The findings highlight the benefits of second derivative spectra in enhancing spectral resolution and the superior classification performance of PCA-LDA and SVM models. These results offer a robust framework for advancing edible oil analysis and emphasize the potential of artificial intelligence in food authentication and quality control.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
求助全文
约1分钟内获得全文 去求助
来源期刊
Bulletin of the Korean Chemical Society
Bulletin of the Korean Chemical Society Chemistry-General Chemistry
自引率
23.50%
发文量
182
期刊介绍: The Bulletin of the Korean Chemical Society is an official research journal of the Korean Chemical Society. It was founded in 1980 and reaches out to the chemical community worldwide. It is strictly peer-reviewed and welcomes Accounts, Communications, Articles, and Notes written in English. The scope of the journal covers all major areas of chemistry: analytical chemistry, electrochemistry, industrial chemistry, inorganic chemistry, life-science chemistry, macromolecular chemistry, organic synthesis, non-synthetic organic chemistry, physical chemistry, and materials chemistry.
期刊最新文献
Masthead Cover Picture: Enhancing electrochemical xanthine detection: a two-step incubation strategy to minimize interference from ascorbic acid (BKCS 2/2025) Taeyeon Yoo, Seonhwa Park, Hyoeun Lee, Subin Park, Youngsuk Kim, Haesik Yang Correction to “Highly active cobalt(II) and copper(II) complexes supported by aminomethylquinoline mediating stereoselective ring-opening polymerization of rac-lactide” Highly blue-emissive CBZ-functionalized salen–In complexes: Influence of structural rigidity and donor substituent quantity Quantitative analysis of disaggregation properties of aggregation-induced emission luminogens (AIEgens) and off-the-shelf dyes
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1