Yinfu Deng , Hua Dai , Zhangchen Li , Haiping Huang , Qian Zhou , Jian Xu , Geng Yang
{"title":"EDP-CVSM model-based multi-keyword ranked search scheme over encrypted cloud data","authors":"Yinfu Deng , Hua Dai , Zhangchen Li , Haiping Huang , Qian Zhou , Jian Xu , Geng Yang","doi":"10.1016/j.future.2025.107726","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional searchable encryption schemes for clouds are generally based on the term frequency-inverse document frequency (TF-IDF) vector space model, but they ignore the high-dimensional sparse characteristic of encrypted vectors. It could lead to substantial computational cost of the inner product. If the dimensionality and sparsity of encrypted vectors can be reduced or compressed, the search processing will be accelerated. To improve the search efficiency, we propose an encrypted two-layer balance binary tree index-based multi-keyword ranked search scheme (ETMRS) to address this problem in this paper. An equal-length dictionary partition-based compressed vector space model (EDP-CVSM) is presented, which introduces the dictionary partition strategy. It effectively compresses the document and search vectors, which benefits the efficiency of relevance score computation in search processing. In addition, to further improves the search efficiency, a two-layer balance binary tree index (TBBT-index) is proposed, which adopts secure inner product and symmetric encryption to preserve the privacy. The index is able to filter out the sub-dictionaries having no search keywords in the upper layer and identify the result documents in the lower layer, which speeds up the search processing. Experimental results show a good performance of the proposed scheme in file coverage rate, search precision, rank privacy, search efficiency and space consumption.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"167 ","pages":"Article 107726"},"PeriodicalIF":6.2000,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25000214","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Traditional searchable encryption schemes for clouds are generally based on the term frequency-inverse document frequency (TF-IDF) vector space model, but they ignore the high-dimensional sparse characteristic of encrypted vectors. It could lead to substantial computational cost of the inner product. If the dimensionality and sparsity of encrypted vectors can be reduced or compressed, the search processing will be accelerated. To improve the search efficiency, we propose an encrypted two-layer balance binary tree index-based multi-keyword ranked search scheme (ETMRS) to address this problem in this paper. An equal-length dictionary partition-based compressed vector space model (EDP-CVSM) is presented, which introduces the dictionary partition strategy. It effectively compresses the document and search vectors, which benefits the efficiency of relevance score computation in search processing. In addition, to further improves the search efficiency, a two-layer balance binary tree index (TBBT-index) is proposed, which adopts secure inner product and symmetric encryption to preserve the privacy. The index is able to filter out the sub-dictionaries having no search keywords in the upper layer and identify the result documents in the lower layer, which speeds up the search processing. Experimental results show a good performance of the proposed scheme in file coverage rate, search precision, rank privacy, search efficiency and space consumption.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.