使用树形浏览器生成加密文档索引结构

Q4 Biochemistry, Genetics and Molecular Biology Journal of Biomolecular Techniques Pub Date : 2023-06-26 DOI:10.51173/jt.v5i2.948

Doaa N. Mhawi, Haider W. Oleiwi, Heba L. Al-Taie

{"title":"使用树形浏览器生成加密文档索引结构","authors":"Doaa N. Mhawi, Haider W. Oleiwi, Heba L. Al-Taie","doi":"10.51173/jt.v5i2.948","DOIUrl":null,"url":null,"abstract":"The document indexing process aims to store documents in a manner that facilitates the process of retrieving specific documents efficiently in terms of accuracy and time complexity. Many information retrieval systems encounter security issues and execution time to retrieve relevant documents. In addition, these systems lead to ample storage. Therefore, it requires combining confidentiality with the indexed document, and a separate process is performed to encrypt the documents. Hence, a new indexing structure named tree browser (TB) was proposed in this paper to be applied to index files of the large document set in an encrypted manner. This method represents the keywords in a variable-length binary format before being stored in the index. This binary format provides additional encryption to the information stored and reduces the index size. The proposed method (TB) is applied to the WebKB dataset. This dataset is related to web page documents (semi-structured documents). The experimental results demonstrated that the storage size is reduced by using TB-tree to 48.5 MB, while the traditional index is 307 MB.","PeriodicalId":39617,"journal":{"name":"Journal of Biomolecular Techniques","volume":"92 4 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generating Encrypted Document Index Structure Using Tree Browser\",\"authors\":\"Doaa N. Mhawi, Haider W. Oleiwi, Heba L. Al-Taie\",\"doi\":\"10.51173/jt.v5i2.948\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The document indexing process aims to store documents in a manner that facilitates the process of retrieving specific documents efficiently in terms of accuracy and time complexity. Many information retrieval systems encounter security issues and execution time to retrieve relevant documents. In addition, these systems lead to ample storage. Therefore, it requires combining confidentiality with the indexed document, and a separate process is performed to encrypt the documents. Hence, a new indexing structure named tree browser (TB) was proposed in this paper to be applied to index files of the large document set in an encrypted manner. This method represents the keywords in a variable-length binary format before being stored in the index. This binary format provides additional encryption to the information stored and reduces the index size. The proposed method (TB) is applied to the WebKB dataset. This dataset is related to web page documents (semi-structured documents). The experimental results demonstrated that the storage size is reduced by using TB-tree to 48.5 MB, while the traditional index is 307 MB.\",\"PeriodicalId\":39617,\"journal\":{\"name\":\"Journal of Biomolecular Techniques\",\"volume\":\"92 4 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Biomolecular Techniques\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.51173/jt.v5i2.948\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Biochemistry, Genetics and Molecular Biology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomolecular Techniques","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.51173/jt.v5i2.948","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}

引用次数: 0

摘要

文档索引过程旨在以一种方式存储文档，以便在准确性和时间复杂度方面有效地检索特定文档。许多信息检索系统在检索相关文档时会遇到安全问题和执行时间问题。此外，这些系统带来了充足的存储空间。因此，它需要将机密性与索引文档结合起来，并执行一个单独的过程来加密文档。因此，本文提出了一种新的索引结构树浏览器(TB)，以加密的方式应用于大型文档集的索引文件。此方法以可变长度二进制格式表示关键字，然后将其存储在索引中。这种二进制格式为存储的信息提供了额外的加密，并减小了索引的大小。将提出的方法(TB)应用于WebKB数据集。这个数据集与网页文档(半结构化文档)相关。实验结果表明，使用TB-tree可以将索引的存储空间减小到48.5 MB，而传统索引的存储空间为307 MB。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Generating Encrypted Document Index Structure Using Tree Browser

The document indexing process aims to store documents in a manner that facilitates the process of retrieving specific documents efficiently in terms of accuracy and time complexity. Many information retrieval systems encounter security issues and execution time to retrieve relevant documents. In addition, these systems lead to ample storage. Therefore, it requires combining confidentiality with the indexed document, and a separate process is performed to encrypt the documents. Hence, a new indexing structure named tree browser (TB) was proposed in this paper to be applied to index files of the large document set in an encrypted manner. This method represents the keywords in a variable-length binary format before being stored in the index. This binary format provides additional encryption to the information stored and reduces the index size. The proposed method (TB) is applied to the WebKB dataset. This dataset is related to web page documents (semi-structured documents). The experimental results demonstrated that the storage size is reduced by using TB-tree to 48.5 MB, while the traditional index is 307 MB.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Biomolecular Techniques Biochemistry, Genetics and Molecular Biology-Molecular Biology

CiteScore

2.50

自引率

0.00%

发文量

期刊介绍： The Journal of Biomolecular Techniques is a peer-reviewed publication issued five times a year by the Association of Biomolecular Resource Facilities. The Journal was established to promote the central role biotechnology plays in contemporary research activities, to disseminate information among biomolecular resource facilities, and to communicate the biotechnology research conducted by the Association’s Research Groups and members, as well as other investigators.