Zehui Tang;Shengke Zeng;Song Han;Yawen Feng;Tao Li;Mingxing He
{"title":"Fuzzy Deduplication: Color-Aware Deduplication for Multi-Media Data","authors":"Zehui Tang;Shengke Zeng;Song Han;Yawen Feng;Tao Li;Mingxing He","doi":"10.1109/TSC.2024.3418351","DOIUrl":null,"url":null,"abstract":"Cloud storage technology is constantly evolving, resulting in a significant amount of duplicate data being stored in the cloud, particularly multimedia data such as images and videos. In terms of data privacy and storage optimization, the encrypted deduplication should be checked to save space overhead for cloud servers. Compared to exact deduplication, fuzzy deduplication is low-cost for encrypted multimedia data. Focusing on reducing the false deletion rate, an efficient and secure fuzzy deduplication system based on dual-feature without additional servers is proposed. We also propose a concept of pre-verification for label consistency to compensate for the loss that cannot be fixed through post-verification. Therefore, it is more practical. Finally, we conduct experiments on real-world datasets for performance evaluation. The experimental results show good performance in terms of both computational cost and deduplication efficiency.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"17 5","pages":"2459-2472"},"PeriodicalIF":5.8000,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10596969/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Cloud storage technology is constantly evolving, resulting in a significant amount of duplicate data being stored in the cloud, particularly multimedia data such as images and videos. In terms of data privacy and storage optimization, the encrypted deduplication should be checked to save space overhead for cloud servers. Compared to exact deduplication, fuzzy deduplication is low-cost for encrypted multimedia data. Focusing on reducing the false deletion rate, an efficient and secure fuzzy deduplication system based on dual-feature without additional servers is proposed. We also propose a concept of pre-verification for label consistency to compensate for the loss that cannot be fixed through post-verification. Therefore, it is more practical. Finally, we conduct experiments on real-world datasets for performance evaluation. The experimental results show good performance in terms of both computational cost and deduplication efficiency.
期刊介绍:
IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.