Robust deep-learning based refrigerator food recognition.

IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Frontiers in Artificial Intelligence Pub Date : 2024-12-04 eCollection Date: 2024-01-01 DOI:10.3389/frai.2024.1442948

Xiaoyan Dai

{"title":"Robust deep-learning based refrigerator food recognition.","authors":"Xiaoyan Dai","doi":"10.3389/frai.2024.1442948","DOIUrl":null,"url":null,"abstract":"<p><p>Automatic food identification utilizing artificial intelligence (AI) technology in smart refrigerators presents an innovative solution. However, existing studies exhibit significant limitations. Achieving consistent high performance in recognition across varying camera distances and diverse real-world conditions remain a formidable challenge. Current approaches often struggle to accurately recognize items in scenarios involving occlusions, variable distortions, and complex backgrounds, thereby limiting their practical applicability in household environments. This study addresses these deficiencies by enhancing the Feature Pyramid Network (FPN) of YOLACT with an additional layer designed to capture nuanced information. Furthermore, we propose a two-stage data augmentation method that simulates diverse conditions including distortion and occlusion, to generate images that reflect various backgrounds and handheld scenarios. Comparative analyses with previous research and evaluations on our original dataset demonstrate that our approach significantly improves recognition rates for both typical and challenging real-world images. These enhancements contribute to more effective food waste management in households and indicate broader applications for automatic identification systems.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1442948"},"PeriodicalIF":4.7000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11652515/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frai.2024.1442948","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Automatic food identification utilizing artificial intelligence (AI) technology in smart refrigerators presents an innovative solution. However, existing studies exhibit significant limitations. Achieving consistent high performance in recognition across varying camera distances and diverse real-world conditions remain a formidable challenge. Current approaches often struggle to accurately recognize items in scenarios involving occlusions, variable distortions, and complex backgrounds, thereby limiting their practical applicability in household environments. This study addresses these deficiencies by enhancing the Feature Pyramid Network (FPN) of YOLACT with an additional layer designed to capture nuanced information. Furthermore, we propose a two-stage data augmentation method that simulates diverse conditions including distortion and occlusion, to generate images that reflect various backgrounds and handheld scenarios. Comparative analyses with previous research and evaluations on our original dataset demonstrate that our approach significantly improves recognition rates for both typical and challenging real-world images. These enhancements contribute to more effective food waste management in households and indicate broader applications for automatic identification systems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度学习的冰箱食品识别。

利用人工智能（AI）技术在智能冰箱中自动识别食品是一种创新的解决方案。然而，现有的研究显示出明显的局限性。在不同的相机距离和不同的现实条件下实现一致的高性能识别仍然是一个艰巨的挑战。目前的方法往往难以准确识别遮挡、可变扭曲和复杂背景下的物品，从而限制了它们在家庭环境中的实际适用性。本研究通过增强YOLACT的特征金字塔网络（FPN），增加一个额外的层来捕获细微的信息，从而解决了这些缺陷。此外，我们提出了一种两阶段的数据增强方法，该方法模拟了包括失真和遮挡在内的多种条件，以生成反映各种背景和手持场景的图像。与之前对原始数据集的研究和评估进行比较分析表明，我们的方法显着提高了典型和具有挑战性的真实世界图像的识别率。这些改进有助于在家庭中更有效地管理食物垃圾，并预示着自动识别系统的更广泛应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊