Unsupervised Construction of Task-Specific Datasets for Object Re-identification

Petr Pulc, M. Holeňa
{"title":"Unsupervised Construction of Task-Specific Datasets for Object Re-identification","authors":"Petr Pulc, M. Holeňa","doi":"10.1145/3477911.3477922","DOIUrl":null,"url":null,"abstract":"In the last decade, we have seen a significant uprise of deep neural networks in image processing tasks and many other research areas. However, while various neural architectures have successfully solved numerous tasks, they constantly demand more and more processing time and training data. Moreover, the current trend of using existing pre-trained architectures just as backbones and attaching new processing branches on top not only increases this demand but diminishes the explainability of the whole model. Our research focuses on combinations of explainable building blocks for the image processing tasks, such as object tracking. We propose a combination of Mask R-CNN, state-of-the-art object detection and segmentation neural network, with our previously published method of sparse feature tracking [16]. Such a combination allows us to track objects by connecting detected masks using the proposed sparse feature tracklets. However, this method cannot recover from complete object occlusions and has to be assisted by an object re-identification. To this end, this paper uses our feature tracking method for a slightly different task: an unsupervised extraction of object representations that we can directly use to fine-tune an object re-identification algorithm, see Fig. 1 for visualisation. As we have to use objects masks already in the object tracking, our approach utilises the additional information as an alpha channel of the object representations, which further increases the precision of the re-identification. An additional benefit is that our fine-tuning method can be employed even in a fully online scenario.","PeriodicalId":174824,"journal":{"name":"Proceedings of the 2021 7th International Conference on Computer Technology Applications","volume":"101 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 7th International Conference on Computer Technology Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3477911.3477922","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the last decade, we have seen a significant uprise of deep neural networks in image processing tasks and many other research areas. However, while various neural architectures have successfully solved numerous tasks, they constantly demand more and more processing time and training data. Moreover, the current trend of using existing pre-trained architectures just as backbones and attaching new processing branches on top not only increases this demand but diminishes the explainability of the whole model. Our research focuses on combinations of explainable building blocks for the image processing tasks, such as object tracking. We propose a combination of Mask R-CNN, state-of-the-art object detection and segmentation neural network, with our previously published method of sparse feature tracking [16]. Such a combination allows us to track objects by connecting detected masks using the proposed sparse feature tracklets. However, this method cannot recover from complete object occlusions and has to be assisted by an object re-identification. To this end, this paper uses our feature tracking method for a slightly different task: an unsupervised extraction of object representations that we can directly use to fine-tune an object re-identification algorithm, see Fig. 1 for visualisation. As we have to use objects masks already in the object tracking, our approach utilises the additional information as an alpha channel of the object representations, which further increases the precision of the re-identification. An additional benefit is that our fine-tuning method can be employed even in a fully online scenario.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于对象再识别的任务特定数据集的无监督构建
在过去的十年中,我们已经看到深度神经网络在图像处理任务和许多其他研究领域的重大进步。然而,尽管各种神经结构已经成功地解决了许多任务,但它们不断地需要越来越多的处理时间和训练数据。此外,目前使用现有的预训练架构作为主干,并在其上附加新的处理分支的趋势不仅增加了这种需求,而且降低了整个模型的可解释性。我们的研究重点是图像处理任务的可解释构建块的组合,例如目标跟踪。我们提出将最先进的目标检测和分割神经网络Mask R-CNN与我们之前发表的稀疏特征跟踪方法相结合[16]。这样的组合允许我们通过使用所提出的稀疏特征tracklets连接检测到的掩模来跟踪对象。然而,这种方法不能从完全的物体遮挡中恢复,必须借助于物体的重新识别。为此,本文将我们的特征跟踪方法用于一个略有不同的任务:我们可以直接使用对象表征的无监督提取来微调对象重新识别算法,如图1所示。由于我们必须在对象跟踪中使用已经存在的对象掩码,我们的方法利用附加信息作为对象表示的alpha通道,这进一步提高了重新识别的精度。另一个好处是,我们的微调方法甚至可以在完全在线的场景中使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Applying Ethereum Smart Contracts to Blockchain-Based Crowdfunding System to Increase Trust and Information Symmetry Video Analytics Architecture with Metadata Event-Engine for Urban Safe Cities Integrated Planning of Operating Expenditures (OPEX) - A model to apply best practices when running ERP and DWH systems Converting Manufacturing Companies into Data-Driven Enterprises: an Evaluation of the Transformation Model Design Workflows and Algorithm Diagrams Interpretation Method in Software Development
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1