Anna Iliushina , Gleb Mazanov , Sergey Nesteruk , Andrey Pimenov , Anton Stepanov , Nadezhda Mikhaylova , Anna Baldycheva , Andrey Somov
{"title":"以数据为中心的光学垃圾分类实例分割方法。","authors":"Anna Iliushina , Gleb Mazanov , Sergey Nesteruk , Andrey Pimenov , Anton Stepanov , Nadezhda Mikhaylova , Anna Baldycheva , Andrey Somov","doi":"10.1016/j.wasman.2024.11.002","DOIUrl":null,"url":null,"abstract":"<div><div>Computer vision systems have been integrated into facilities dealing with the sorting of household waste. This solution allows for the sorting efficiency improvement and cost reduction. However, challenges associated with the poor annotation quality of existing waste segmentation datasets, unsuitable environment for recognition on a conveyor belt, or limited data for creating an effective and cost-efficient sorting system using visible range cameras significantly limit the application efficiency of computer vision systems. In this article, we report on the data-centric pipeline for enhancing the precision of predictions in multiclass household waste segmentation on a conveyor belt. In particular, we have demonstrated that by employing a pseudo-annotation approach combined with an object-based data augmentation algorithm, it is possible to train a model on a set of ’simple’ images and achieve satisfactory results when estimating the model on a set of ’complex’ images. We collected and prepared the dataset consisting of 5 k manually labeled data and additionally 10 k pseudo-labeled data by object-based augmentation. The proposed pipeline incorporates data balancing, transfer learning, and pseudo-labeling to improve the mean Average Precision (mAP) of the YOLOV8 segmentation model from 67 % to 83 % for ’simple’ use case scenarios and from 42 % to 59 % or ’complex’ industrial solutions.</div></div>","PeriodicalId":23969,"journal":{"name":"Waste management","volume":"191 ","pages":"Pages 70-80"},"PeriodicalIF":7.1000,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data-centric approach for instance segmentation in optical waste sorting\",\"authors\":\"Anna Iliushina , Gleb Mazanov , Sergey Nesteruk , Andrey Pimenov , Anton Stepanov , Nadezhda Mikhaylova , Anna Baldycheva , Andrey Somov\",\"doi\":\"10.1016/j.wasman.2024.11.002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Computer vision systems have been integrated into facilities dealing with the sorting of household waste. This solution allows for the sorting efficiency improvement and cost reduction. However, challenges associated with the poor annotation quality of existing waste segmentation datasets, unsuitable environment for recognition on a conveyor belt, or limited data for creating an effective and cost-efficient sorting system using visible range cameras significantly limit the application efficiency of computer vision systems. In this article, we report on the data-centric pipeline for enhancing the precision of predictions in multiclass household waste segmentation on a conveyor belt. In particular, we have demonstrated that by employing a pseudo-annotation approach combined with an object-based data augmentation algorithm, it is possible to train a model on a set of ’simple’ images and achieve satisfactory results when estimating the model on a set of ’complex’ images. We collected and prepared the dataset consisting of 5 k manually labeled data and additionally 10 k pseudo-labeled data by object-based augmentation. The proposed pipeline incorporates data balancing, transfer learning, and pseudo-labeling to improve the mean Average Precision (mAP) of the YOLOV8 segmentation model from 67 % to 83 % for ’simple’ use case scenarios and from 42 % to 59 % or ’complex’ industrial solutions.</div></div>\",\"PeriodicalId\":23969,\"journal\":{\"name\":\"Waste management\",\"volume\":\"191 \",\"pages\":\"Pages 70-80\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2024-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Waste management\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0956053X24005592\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Waste management","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0956053X24005592","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
Data-centric approach for instance segmentation in optical waste sorting
Computer vision systems have been integrated into facilities dealing with the sorting of household waste. This solution allows for the sorting efficiency improvement and cost reduction. However, challenges associated with the poor annotation quality of existing waste segmentation datasets, unsuitable environment for recognition on a conveyor belt, or limited data for creating an effective and cost-efficient sorting system using visible range cameras significantly limit the application efficiency of computer vision systems. In this article, we report on the data-centric pipeline for enhancing the precision of predictions in multiclass household waste segmentation on a conveyor belt. In particular, we have demonstrated that by employing a pseudo-annotation approach combined with an object-based data augmentation algorithm, it is possible to train a model on a set of ’simple’ images and achieve satisfactory results when estimating the model on a set of ’complex’ images. We collected and prepared the dataset consisting of 5 k manually labeled data and additionally 10 k pseudo-labeled data by object-based augmentation. The proposed pipeline incorporates data balancing, transfer learning, and pseudo-labeling to improve the mean Average Precision (mAP) of the YOLOV8 segmentation model from 67 % to 83 % for ’simple’ use case scenarios and from 42 % to 59 % or ’complex’ industrial solutions.
期刊介绍:
Waste Management is devoted to the presentation and discussion of information on solid wastes,it covers the entire lifecycle of solid. wastes.
Scope:
Addresses solid wastes in both industrialized and economically developing countries
Covers various types of solid wastes, including:
Municipal (e.g., residential, institutional, commercial, light industrial)
Agricultural
Special (e.g., C and D, healthcare, household hazardous wastes, sewage sludge)