基于实景图像数据集的货架物品检测训练图像合成

Computer Science Research Notes Pub Date : 2023-07-01 DOI:10.24132/csrn.3301.11

Tomokazu Kaneko, Ryosuke Sakai, Soma Shiraishi

{"title":"基于实景图像数据集的货架物品检测训练图像合成","authors":"Tomokazu Kaneko, Ryosuke Sakai, Soma Shiraishi","doi":"10.24132/csrn.3301.11","DOIUrl":null,"url":null,"abstract":"We propose a novel cut-and-paste approach to synthesize a training dataset for shelf item detection, reflecting the alignments of items in the real image dataset. The conventional cut-and-paste approach synthesizes large numbers of training images by pasting foregrounds on background images and is effective for training object detection. However, the previous method pastes foregrounds on random positions of the background, so the alignment of items on shelves is not reflected, and unrealistic images are generated. Generating realistic images that reflect actual positional relationships between items is necessary for efficient learning of item detection. The proposed method determines the pasting positions for the foreground images by referring to the alignment of the items in the real image dataset, so it can generate more realistic images that reflect the alignment of the real-world items. Since our method can synthesize more realistic images, the trained models can perform better.","PeriodicalId":322214,"journal":{"name":"Computer Science Research Notes","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Training Image Synthesis for Shelf Item Detection reflecting Alignments of Items in Real Image Dataset\",\"authors\":\"Tomokazu Kaneko, Ryosuke Sakai, Soma Shiraishi\",\"doi\":\"10.24132/csrn.3301.11\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a novel cut-and-paste approach to synthesize a training dataset for shelf item detection, reflecting the alignments of items in the real image dataset. The conventional cut-and-paste approach synthesizes large numbers of training images by pasting foregrounds on background images and is effective for training object detection. However, the previous method pastes foregrounds on random positions of the background, so the alignment of items on shelves is not reflected, and unrealistic images are generated. Generating realistic images that reflect actual positional relationships between items is necessary for efficient learning of item detection. The proposed method determines the pasting positions for the foreground images by referring to the alignment of the items in the real image dataset, so it can generate more realistic images that reflect the alignment of the real-world items. Since our method can synthesize more realistic images, the trained models can perform better.\",\"PeriodicalId\":322214,\"journal\":{\"name\":\"Computer Science Research Notes\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Science Research Notes\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.24132/csrn.3301.11\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science Research Notes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24132/csrn.3301.11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

我们提出了一种新的剪切粘贴方法来合成货架物品检测的训练数据集，反映真实图像数据集中物品的排列。传统的剪切粘贴方法通过在背景图像上粘贴前景来合成大量的训练图像，是一种有效的训练目标检测方法。然而，之前的方法将前景粘贴在背景的随机位置上，因此不会反映货架上物品的对齐，并且会生成不真实的图像。生成反映物品之间实际位置关系的逼真图像是有效学习物品检测的必要条件。该方法通过参考真实图像数据集中物品的对齐方式来确定前景图像的粘贴位置，从而生成更真实的反映真实世界物品对齐方式的图像。由于我们的方法可以合成更真实的图像，训练后的模型可以表现得更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Training Image Synthesis for Shelf Item Detection reflecting Alignments of Items in Real Image Dataset

We propose a novel cut-and-paste approach to synthesize a training dataset for shelf item detection, reflecting the alignments of items in the real image dataset. The conventional cut-and-paste approach synthesizes large numbers of training images by pasting foregrounds on background images and is effective for training object detection. However, the previous method pastes foregrounds on random positions of the background, so the alignment of items on shelves is not reflected, and unrealistic images are generated. Generating realistic images that reflect actual positional relationships between items is necessary for efficient learning of item detection. The proposed method determines the pasting positions for the foreground images by referring to the alignment of the items in the real image dataset, so it can generate more realistic images that reflect the alignment of the real-world items. Since our method can synthesize more realistic images, the trained models can perform better.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computer Science Research Notes

自引率

0.00%

发文量

期刊最新文献

The Usage of the BP-Layers Stereo Matching Algorithm with the EBCA Camera Set Evolutionary-Edge Bundling with Concatenation Process of Control Points Fast Incremental Image Reconstruction with CNN-enhanced Poisson Interpolation Temporal Segmentation of Actions in Fencing Footwork Training Low-Rank Rational Approximation of Natural Trochoid Parameterizations