CompoundEyes: Near-duplicate detection in large scale online video systems in the cloud

IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications Pub Date : 2016-04-10 DOI:10.1109/INFOCOM.2016.7524429

Yixin Chen, Wenbo He, Yu Hua, Wen Wang

{"title":"CompoundEyes: Near-duplicate detection in large scale online video systems in the cloud","authors":"Yixin Chen, Wenbo He, Yu Hua, Wen Wang","doi":"10.1109/INFOCOM.2016.7524429","DOIUrl":null,"url":null,"abstract":"At the present time, billions of videos are hosted and shared in the cloud of which a sizable portion consists of near-duplicate video copies. An efficient and accurate content-based online near-duplicate video detection method is a fundamental research goal; as it would benefit applications such as duplication-aware storage, pirate video detection, polluted video tag detection, searching result diversification. Despite the recent progress made in near-duplicate video detection, it remains challenging to develop a practical detection system for large-scale applications that has good efficiency and accuracy performance. In this paper, we shift the focus from feature representation design to system design, and develop a novel system, called CompoundEyes, accordingly. The improvement in accuracy is achieved via well-organized classifiers instead of advanced feature design. Meanwhile, by applying simple features with reduced dimensionality and exploiting the parallelism of the detection architecture, we accelerate the detection speed. Through extensive experiments we demonstrate that the proposed detection system is accurate and fast. It takes approximately 1.45 seconds to process a video clip from a large video dataset, CC_WEB_VIDEO, with a 89% detection accuracy.","PeriodicalId":274591,"journal":{"name":"IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOCOM.2016.7524429","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 16

Abstract

At the present time, billions of videos are hosted and shared in the cloud of which a sizable portion consists of near-duplicate video copies. An efficient and accurate content-based online near-duplicate video detection method is a fundamental research goal; as it would benefit applications such as duplication-aware storage, pirate video detection, polluted video tag detection, searching result diversification. Despite the recent progress made in near-duplicate video detection, it remains challenging to develop a practical detection system for large-scale applications that has good efficiency and accuracy performance. In this paper, we shift the focus from feature representation design to system design, and develop a novel system, called CompoundEyes, accordingly. The improvement in accuracy is achieved via well-organized classifiers instead of advanced feature design. Meanwhile, by applying simple features with reduced dimensionality and exploiting the parallelism of the detection architecture, we accelerate the detection speed. Through extensive experiments we demonstrate that the proposed detection system is accurate and fast. It takes approximately 1.45 seconds to process a video clip from a large video dataset, CC_WEB_VIDEO, with a 89% detection accuracy.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

复合眼:云中的大规模在线视频系统中的近重复检测

目前，数以十亿计的视频在云上托管和共享，其中相当大一部分是由几乎重复的视频副本组成的。一种高效、准确的基于内容的在线近重复视频检测方法是一个基本的研究目标;因为它有利于重复感知存储、盗版视频检测、污染视频标签检测、搜索结果多样化等应用。尽管在近重复视频检测方面取得了一些进展，但开发一种具有良好效率和精度的大规模应用的实用检测系统仍然具有挑战性。在本文中，我们将重点从特征表示设计转移到系统设计，并相应地开发了一个新的系统，称为复合眼睛。准确度的提高是通过组织良好的分类器而不是高级特征设计来实现的。同时，通过采用简单的降维特征，利用检测体系结构的并行性，加快了检测速度。通过大量的实验，我们证明了所提出的检测系统是准确和快速的。处理来自大型视频数据集CC_WEB_VIDEO的视频剪辑大约需要1.45秒，检测准确率为89%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications

自引率

0.00%

发文量

期刊最新文献

Heavy-traffic analysis of QoE optimality for on-demand video streams over fading channels The quest for resilient (static) forwarding tables CSMA networks in a many-sources regime: A mean-field approach Variability-aware request replication for latency curtailment Apps on the move: A fine-grained analysis of usage behavior of mobile apps