Data normalization for training and analysis of the MaskRCNN model using the k-means method for computer vision of smart refrigerator

M. Dorrer, A. Alekhina
{"title":"Data normalization for training and analysis of the MaskRCNN model using the k-means method for computer vision of smart refrigerator","authors":"M. Dorrer, A. Alekhina","doi":"10.34219/2078-8320-2021-12-4-74-80","DOIUrl":null,"url":null,"abstract":"This paper proposes using the k-means method for the controlled adjustment of the training sample for semantic image segmentation in the artificial vision of a smart refrigerator. To solve this problem, a new two-stage architecture for computer vision is proposed. In the proposed architecture, various sets of settings for optimizing the contrast of images are used to classify pixels according to their belonging to fragments of the studied image. Extensive experimental evaluation shows that the proposed method has critical advantages over existing work. Firstly, the obtained pixel classes can be directly clustered into semantic groups using k-means. Secondly, the method can be used for additional training of artificial intelligence in solving the semantic segmentation problem. The developers propose an approach to the correct choice of the number k of centroids to obtain good quality clusters, which is difficult to determine at a high k value. To overcome the problem of initializing the k-means method, an incremental k-means clustering method is proposed, which improves the quality of clusters to reduce the sum of squared errors. Comprehensive experiments have been carried out compared to the traditional k-means algorithm and its new versions to evaluate the performance of the proposed method on synthetically generated datasets and some real-world datasets.","PeriodicalId":299496,"journal":{"name":"Informatization and communication","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatization and communication","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34219/2078-8320-2021-12-4-74-80","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper proposes using the k-means method for the controlled adjustment of the training sample for semantic image segmentation in the artificial vision of a smart refrigerator. To solve this problem, a new two-stage architecture for computer vision is proposed. In the proposed architecture, various sets of settings for optimizing the contrast of images are used to classify pixels according to their belonging to fragments of the studied image. Extensive experimental evaluation shows that the proposed method has critical advantages over existing work. Firstly, the obtained pixel classes can be directly clustered into semantic groups using k-means. Secondly, the method can be used for additional training of artificial intelligence in solving the semantic segmentation problem. The developers propose an approach to the correct choice of the number k of centroids to obtain good quality clusters, which is difficult to determine at a high k value. To overcome the problem of initializing the k-means method, an incremental k-means clustering method is proposed, which improves the quality of clusters to reduce the sum of squared errors. Comprehensive experiments have been carried out compared to the traditional k-means algorithm and its new versions to evaluate the performance of the proposed method on synthetically generated datasets and some real-world datasets.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用k-means方法对智能冰箱计算机视觉的MaskRCNN模型进行数据归一化训练和分析
提出了一种基于k-means的智能冰箱人工视觉语义图像分割训练样本的控制调整方法。为了解决这一问题,提出了一种新的两阶段计算机视觉体系结构。在所提出的体系结构中,使用各种用于优化图像对比度的设置集来根据像素属于所研究图像的片段对像素进行分类。大量的实验评估表明,所提出的方法比现有的方法具有重要的优势。首先,使用k-means将得到的像素类直接聚类成语义组;其次,该方法可用于人工智能的附加训练,以解决语义分割问题。开发人员提出了一种方法来正确选择k个质心的数量以获得高质量的聚类,这在高k值时很难确定。为了克服初始化k-means方法的问题,提出了一种增量k-means聚类方法,提高聚类质量,减小误差平方和。通过对传统的k-means算法及其新版本进行综合实验,评估了该方法在综合生成的数据集和一些实际数据集上的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The selection of the indicators and the mathematical model for the evaluation of resistance of electronics and information systems to electromagnetic radiation. Intelligent programming support system: machine learning feat. Fast development of secure programs. Nonlinear differential game “pursuit-evasion”: information aspect. Dynamic hypergraphs of renewal processes in mobile networks. Basic requirements for television communications with laser illumination when creating integrated underwater vehicle search systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1