Machine Learning on Spectral Data from Miniature Devices for Food Quality Analysis - A Case Study

Fayas Asharindavida, O. Nibouche, J. Uhomoibhi, Jun Liu, Hui Wang
{"title":"Machine Learning on Spectral Data from Miniature Devices for Food Quality Analysis - A Case Study","authors":"Fayas Asharindavida, O. Nibouche, J. Uhomoibhi, Jun Liu, Hui Wang","doi":"10.1145/3583788.3583801","DOIUrl":null,"url":null,"abstract":"Food quality analysis can be carried out by spectral data acquired from spectrometers with its advantage of non-destructive way of testing. Portable and miniature spectroscopy can be a suitable solution when it meets the specifications such as portability, cost, and short processing time requirements, to enable ordinary citizens to use such a device in the fight against food fraud. Compared to more expensive, bulky, and non-portable devices, the data collected using miniature and portable spectrometers is of a lower quality and thus adversely affect the quality of the analysis. Research have been carried out to use machine learning (ML) classifiers on spectral data analysis for food quality assessment. The present work focuses on two aspects: firstly, preliminary exploratory statistical analysis is conducted on the real spectral data on different food products including oils, fruits and spices acquired from such miniature devices, which aims to evaluate and illustrate the distinctive characteristics of such spectral data, data distribution and difference in the spectra across multiple data acquisitions etc. along with a summary of the key challenges to face and explore. Secondly, a case study for the differentiation of extra virgin olive from adulterated with vegetable oil is provided to analyze and evaluate how some commonly used ML classifiers can be used for classification, while the impact of different preprocessing methods to improve the accuracy and efficiency is also provided. The case study demonstrates the good potential of using data analytics for spectral data from miniature device, although the overall performance of those ML classifiers is not exceptional (the classification rates of up to 83.32%) which is partially due to the quality of data, and partially due to limiting to only some classifiers. More elaborate data pre-processing and cleaning methods can be used to address the key challenges of the spectral data from miniature device, and other types of classifiers can be also explored further in future work.","PeriodicalId":292167,"journal":{"name":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3583788.3583801","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Food quality analysis can be carried out by spectral data acquired from spectrometers with its advantage of non-destructive way of testing. Portable and miniature spectroscopy can be a suitable solution when it meets the specifications such as portability, cost, and short processing time requirements, to enable ordinary citizens to use such a device in the fight against food fraud. Compared to more expensive, bulky, and non-portable devices, the data collected using miniature and portable spectrometers is of a lower quality and thus adversely affect the quality of the analysis. Research have been carried out to use machine learning (ML) classifiers on spectral data analysis for food quality assessment. The present work focuses on two aspects: firstly, preliminary exploratory statistical analysis is conducted on the real spectral data on different food products including oils, fruits and spices acquired from such miniature devices, which aims to evaluate and illustrate the distinctive characteristics of such spectral data, data distribution and difference in the spectra across multiple data acquisitions etc. along with a summary of the key challenges to face and explore. Secondly, a case study for the differentiation of extra virgin olive from adulterated with vegetable oil is provided to analyze and evaluate how some commonly used ML classifiers can be used for classification, while the impact of different preprocessing methods to improve the accuracy and efficiency is also provided. The case study demonstrates the good potential of using data analytics for spectral data from miniature device, although the overall performance of those ML classifiers is not exceptional (the classification rates of up to 83.32%) which is partially due to the quality of data, and partially due to limiting to only some classifiers. More elaborate data pre-processing and cleaning methods can be used to address the key challenges of the spectral data from miniature device, and other types of classifiers can be also explored further in future work.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于食品质量分析的微型设备光谱数据的机器学习-一个案例研究
利用光谱仪采集的光谱数据进行食品质量分析,具有无损检测的优点。便携式和微型光谱仪在满足便携性、成本和处理时间短等规格要求时,可以成为一种合适的解决方案,使普通公民能够使用这种设备来打击食品欺诈。与更昂贵、体积更大、不可携带的设备相比,使用微型和便携式光谱仪收集的数据质量较低,从而对分析质量产生不利影响。已经开展了使用机器学习(ML)分类器对光谱数据分析进行食品质量评估的研究。本工作主要集中在两个方面:首先,对该微型装置获取的油脂、水果、香料等不同食品的真实光谱数据进行初步探索性统计分析,评价和说明该光谱数据的鲜明特征、数据分布、多次数据采集的光谱差异等,总结需要面临和探索的关键挑战;其次,以特级初榨橄榄油与掺假植物油的鉴别为例,分析和评价了几种常用的ML分类器如何进行分类,以及不同预处理方法对提高准确率和效率的影响。该案例研究显示了对来自微型设备的光谱数据使用数据分析的良好潜力,尽管这些ML分类器的总体性能并不出色(分类率高达83.32%),这部分是由于数据质量,部分是由于仅限于某些分类器。更精细的数据预处理和清洗方法可以用来解决微型设备光谱数据的关键挑战,其他类型的分类器也可以在未来的工作中进一步探索。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Decision Model of Ship Intelligent Collision Avoidance Based on Automatic Information System Data and Generic Adversary Imitation Learning-Deep Deterministic Policy Gradient Joint Action Representation and Prioritized Experience Replay for Reinforcement Learning in Large Discrete Action Spaces Neural Network Optimization Objective Vector Representation based on Genetic Algorithm and Its Multi-objective Optimization Method Deep Learning-Enabled Prediction of Daily Solar Irradiance from Simulated Climate Data CascadeTransformer: Multi-label Classification with Transformer in Chronic Disease Prediction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1