Fábio Carlos Moreno, C. R. Barbosa, Edio Roberto Manfio
Title: Hash Tables for a Digital Lexicon
DOI: 10.22456/2175-2745.107128 (https://doi.org/10.22456/2175-2745.107128)
Journal: Research initiative, treatment action : RITA
Published: 2021-08-29
Publication type: Journal Article
Abstract
This paper addresses the construction of digital lexicons within the scope of Natural Language Processing. Hash tables are data structures that have been shown to produce good results in Natural Language Interfaces to Databases; their main features are data dispersion, response speed, and programming simplicity. Information is stored by associating it with a key, and a hash function applied to that key is responsible for distributing the information across the table. The objective of this paper is to present a tool called Visual TaHs, which applies a sparse table to a real lexicon (the Lexicon of Herbs) and improves the performance results of several implemented hash functions. This structure has achieved satisfactory results in terms of speed and storage when compared to conventional databases, and it can work on various platforms, such as desktop, Web, and mobile.
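To make the storage mechanism described in the abstract concrete, the following is a minimal sketch of a sparse hash table holding lexicon entries. It is not the Visual TaHs implementation: the polynomial hash function `poly_hash`, the class `SparseLexicon`, the table size of 101, the linear-probing collision strategy, and the three sample entries are all assumptions chosen for illustration.

```python
from typing import List, Optional, Tuple


def poly_hash(key: str, table_size: int, base: int = 31) -> int:
    """Polynomial string hash reduced modulo the table size.

    Assumption: one possible hash function; the paper evaluates several
    but the abstract does not name them.
    """
    h = 0
    for ch in key:
        h = (h * base + ord(ch)) % table_size
    return h


class SparseLexicon:
    """Open-addressing hash table kept sparse (many more slots than entries)."""

    def __init__(self, table_size: int = 101) -> None:
        self.table_size = table_size
        self.slots: List[Optional[Tuple[str, str]]] = [None] * table_size
        self.count = 0

    def insert(self, word: str, entry: str) -> int:
        """Store or update (word, entry); return the number of probes used."""
        start = poly_hash(word, self.table_size)
        for probe in range(self.table_size):
            i = (start + probe) % self.table_size
            slot = self.slots[i]
            if slot is None or slot[0] == word:
                if slot is None:
                    self.count += 1
                self.slots[i] = (word, entry)
                return probe + 1
        raise OverflowError("lexicon table is full")

    def lookup(self, word: str) -> Optional[str]:
        """Return the entry stored for word, or None if it is absent."""
        start = poly_hash(word, self.table_size)
        for probe in range(self.table_size):
            slot = self.slots[(start + probe) % self.table_size]
            if slot is None:
                return None  # an empty slot ends the probe sequence
            if slot[0] == word:
                return slot[1]
        return None

    def load_factor(self) -> float:
        """Fraction of occupied slots; low values mean a sparse table."""
        return self.count / self.table_size


# Hypothetical entries, not taken from the Lexicon of Herbs.
lexicon = SparseLexicon(table_size=101)
for word, gloss in [("basil", "herb"), ("mint", "herb"), ("rosemary", "shrub")]:
    lexicon.insert(word, gloss)

print(lexicon.lookup("mint"))
print(lexicon.lookup("thyme"))
print(round(lexicon.load_factor(), 3))
```

Keeping the table much larger than the number of stored entries trades memory for shorter probe sequences, which is one plausible reading of why a sparse table could improve the measured performance of a given hash function.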