International Journal of Database Theory and Application: Latest Articles

A Novel Framework for Data Extraction from Multiple Repositories and Generation of Ontologies using Inverted Indexing Technique
Pub Date : 2017-07-31 DOI: 10.14257/IJDTA.2017.10.7.07
Sudeepthi Govathoti, M. Babu
Recent years have seen tremendous growth of information across the large number of domains available on the web. Social media platforms (LinkedIn, Twitter, etc.) concentrate on handling massive data obtained from various sources. Information retrieval and data extraction are difficult tasks when handling large collections of web documents. The Semantic Web is a new technology used to handle massive raw data and transform it into a knowledge representation. Traditional search engines use page ranking algorithms to find data in large data sources. The proposed work aims to design a user interface for data extraction from multiple repositories using Uniform Resource Identifiers (URIs) and to apply inverted indexing techniques for the generation of ontologies. These methods may be used to develop efficient Semantic Web knowledge-based systems for retrieving relevant information from the web.
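As a rough illustration of the inverted indexing idea mentioned in this abstract, the following minimal sketch maps each term to the set of document URIs that contain it. The URIs, the whitespace tokenizer, and the function names are assumptions made for illustration; the abstract does not describe the authors' implementation.

```python
from collections import defaultdict


def build_inverted_index(documents):
    """Map each lowercase term to the set of URIs whose text contains it.

    `documents` is a dict of {uri: text}. Tokenization is a naive
    whitespace split (an assumption; real systems normalize further).
    """
    index = defaultdict(set)
    for uri, text in documents.items():
        for term in text.lower().split():
            index[term].add(uri)
    return index


def search(index, query):
    """Return the URIs that contain every term of the query."""
    terms = query.lower().split()
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical repository contents keyed by made-up URIs.
docs = {
    "http://example.org/a": "semantic web ontology",
    "http://example.org/b": "web search ranking",
}
index = build_inverted_index(docs)
print(search(index, "web ontology"))
```

Looking up a term is a single dictionary access, which is why an inverted index suits retrieval over large document collections better than scanning every document per query.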
Citations: 0
A Study of Dictionary Based Korean Semantic Role Labeling
Pub Date : 2017-07-31 DOI: 10.14257/ijdta.2017.10.7.06
Miran Seok, Hye-Jeong Song, Chan-Young Park, Jong-Dae Kim, Yu-Seop Kim
A semantic role is information that clarifies the role of each entity in the event a sentence describes, such as agent, theme, experiencer, object, and location. Semantic role labeling (SRL) is the process of determining the semantic relation between a predicate and its arguments in a sentence and, along with word sense disambiguation, is an important part of semantic analysis in natural language processing. To date, many manual semantic tagging tasks have been carried out; however, these tasks require a great deal of time and cost. To solve this problem, we propose a method for automatic SRL using the frame files included in the Korean version of the Proposition Bank (PropBank), which is one of the most widely used corpora. Frame files provide guidelines for PropBank annotators and include a list of framesets, each of which stands for a set of syntactic frames. First, we select the proper sense of the predicate from among its multiple senses in the frame files. Senses of the predicate are classified according to the semantic and syntactic properties of the predicate’s arguments. We collect the nouns in a sample sentence of a given sense; we also collect all of the nouns that appear in a given sentence. The semantic similarities between the nouns from the sample sentence and those from the given sentence are measured, and the sense with the highest similarity value is selected. The frame information of the selected sense is used for SRL of the given predicate and its arguments.
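The sense-selection step described in this abstract can be sketched as follows. This is only an illustration: the abstract does not say which semantic similarity measure is used, so plain set overlap (Jaccard) stands in for it here, and the sense identifiers and English nouns are hypothetical stand-ins for Korean PropBank frame-file data.

```python
def jaccard(a, b):
    """Set-overlap similarity in [0, 1]; a stand-in for semantic similarity."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def select_sense(sentence_nouns, sense_examples):
    """Pick the sense whose sample-sentence nouns best match the input nouns.

    `sense_examples` maps a sense id to the nouns collected from that
    sense's sample sentence.
    """
    return max(
        sense_examples,
        key=lambda sense: jaccard(sentence_nouns, sense_examples[sense]),
    )


# Hypothetical senses of one predicate with nouns from their sample sentences.
senses = {
    "run.01": ["machine", "engine", "factory"],
    "run.02": ["athlete", "race", "track"],
}
print(select_sense(["athlete", "track", "morning"], senses))
```

Once a sense is chosen, its frame information would supply the role labels for the predicate's arguments; that lookup is omitted here because it depends on the frame-file format.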
Citations: 0
A Cross-Domain Analysis using Morphological Sentence Pattern Approach for Extracting Aspect-based Lexicon
Pub Date : 2017-07-31 DOI: 10.14257/IJDTA.2017.10.7.02
Youngsub Han, Yanggon Kim, Jin-Hee Song
Citations: 3
Design and Implementation of the Symbol Table for Object-Oriented Programming Language
Pub Date : 2017-07-31 DOI: 10.14257/ijdta.2017.10.7.03
Yangsun Lee
The symbol table used in existing compilers stores the information for one symbol across a plurality of sub-tables, and the abstract syntax tree (AST) needed to generate symbols is a binary tree built from a single node data structure. This structure increases the source code complexity of the modules that generate symbols and the modules that reference symbol tables, and when designing a compiler for a new language, a new AST and symbol table structure must be designed around the characteristics of that language. In this paper, we apply object-oriented principles and the Visitor pattern to improve the AST structure, and we design and implement a symbol table for an object-oriented language. Designing the AST with object-oriented principles and the Visitor pattern reduces the time and cost of redesign, because language features can be added without redesigning the AST for a new object-oriented language. In addition, symbols are easy to create through the Visitor pattern. A symbol table that follows the open-closed principle and the dependency inversion principle can improve the reusability and readability of the source code that creates and refers to the table.
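To make the Visitor-based design in this abstract concrete, here is a minimal sketch in which each AST node type is its own class and a visitor walks the tree to populate a symbol table. The node classes, the qualified-name scheme, and the table layout are all assumptions for illustration and are not taken from the paper.

```python
class Node:
    def accept(self, visitor):
        # Dispatch to visit_<ClassName> on the visitor.
        method = getattr(visitor, "visit_" + type(self).__name__)
        return method(self)


class ClassDecl(Node):
    def __init__(self, name, members):
        self.name = name
        self.members = members


class FieldDecl(Node):
    def __init__(self, name, type_name):
        self.name = name
        self.type_name = type_name


class MethodDecl(Node):
    def __init__(self, name, return_type):
        self.name = name
        self.return_type = return_type


class SymbolTableBuilder:
    """Visitor that records one (kind, type) entry per declared symbol."""

    def __init__(self):
        self.symbols = {}
        self._scope = []  # stack of enclosing class names

    def _define(self, name, kind, type_name):
        qualified = ".".join(self._scope + [name])
        self.symbols[qualified] = (kind, type_name)

    def visit_ClassDecl(self, node):
        self._define(node.name, "class", node.name)
        self._scope.append(node.name)
        for member in node.members:
            member.accept(self)
        self._scope.pop()

    def visit_FieldDecl(self, node):
        self._define(node.name, "field", node.type_name)

    def visit_MethodDecl(self, node):
        self._define(node.name, "method", node.return_type)


tree = ClassDecl("Point", [FieldDecl("x", "int"), MethodDecl("norm", "float")])
builder = SymbolTableBuilder()
tree.accept(builder)
print(builder.symbols)
```

Supporting a new language construct in this sketch means adding one node class and one `visit_` method, while the existing node classes stay unchanged, which is the kind of extension the open-closed principle is meant to allow.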
Citations: 1
Integrated Processes of SDR Data for Real-time Processing
Pub Date : 2017-07-31 DOI: 10.14257/IJDTA.2017.10.7.05
Sang-Young Lee
In this paper, related data is classified using STDC, an efficient classification process based on ontology techniques. Classified data are saved in storage according to their SDR type. Integrated processes are used to reuse the saved SDR data. Thus, related data is organized in a systematic reuse system applying a total architecture. This overcomes the disadvantage of past processes, which required numerous join computations when handling questions and answers. The SDTC technique addresses the weakness of older methods, whose multiple join calculations caused a decline in functionality, and allows a normalized type of classification task.
Citations: 0
Integrated Design Solution for Distributed Databases Using Genetic Algorithms
Pub Date : 2017-06-30 DOI: 10.14257/IJDTA.2017.10.6.02
Sukkyu Song
The design of distributed database systems has prompted many research problems. Among others, the issue of interdependency and interaction among data fragmentation, data allocation, and distributed query optimization remains unanswered. These problems have been proven to be NP-complete or NP-hard, so most previous studies have addressed them in isolation by making simplifying assumptions. However, the problems are interdependent, and solving them independently results in an inefficient overall solution. In this research, we develop an integrated distributed database design solution for three problems: partitioning data sets, allocating partitioned data sets among the sites of a network, and allocating operations as a problem of distributed query optimization. We use a transaction-based approach, wherein the most important transactions are considered in determining the effective design of a distributed database, and consider two types of transactions, OLTP (on-line transaction processing) and DSS (decision support system), to reflect various distributed database design objectives such as total time minimization, response time minimization, and minimization of a combination of both. We employ genetic algorithms to search for the best distributed database design solution. The integrated design solutions are determined by analyzing interactions between the problems in four stages: 1) between vertical fragmentation and operation allocation, 2) between vertical fragmentation and data allocation, 3) between data allocation and operation allocation, and 4) integration of all three problems, with the objectives of cost minimization and load balancing. Our integrated approach resulted in a cost-effective distributed database design compared to designs that consider the problems in isolation.
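The genetic search mentioned in this abstract can be illustrated on a deliberately simplified version of the data allocation problem alone. In this sketch, each candidate assigns every fragment to one site and the cost is a fixed per-fragment, per-site access cost. The cost matrix, the GA parameters, and the function names are assumptions, and the sketch ignores fragmentation, operation allocation, and load balancing, which the paper treats jointly.

```python
import random


def allocation_cost(assignment, access_cost):
    """Total cost when fragment i is stored at site assignment[i]."""
    return sum(access_cost[frag][site] for frag, site in enumerate(assignment))


def genetic_allocate(access_cost, generations=50, population_size=20, seed=0):
    """Search for a low-cost fragment-to-site assignment with a simple GA."""
    rng = random.Random(seed)
    n_frags = len(access_cost)
    n_sites = len(access_cost[0])
    population = [
        [rng.randrange(n_sites) for _ in range(n_frags)]
        for _ in range(population_size)
    ]
    for _ in range(generations):
        population.sort(key=lambda a: allocation_cost(a, access_cost))
        survivors = population[: population_size // 2]  # elitist selection
        children = []
        while len(survivors) + len(children) < population_size:
            p1, p2 = rng.sample(survivors, 2)
            cut = rng.randrange(1, n_frags) if n_frags > 1 else 0
            child = p1[:cut] + p2[cut:]  # one-point crossover
            if rng.random() < 0.2:  # occasional random mutation
                child[rng.randrange(n_frags)] = rng.randrange(n_sites)
            children.append(child)
        population = survivors + children
    return min(population, key=lambda a: allocation_cost(a, access_cost))


# Hypothetical access costs: rows are fragments, columns are sites.
cost = [
    [4, 2, 9],
    [7, 3, 1],
    [5, 8, 6],
    [2, 6, 3],
]
best = genetic_allocate(cost)
print(best, allocation_cost(best, cost))
```

With a separable cost like this one, the optimum could be found directly by picking the cheapest site per fragment; a genetic algorithm becomes useful once the cost couples the decisions, as it does when fragmentation, allocation, and query processing interact.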
Citations: 0
The Impact of Some Socio-Economic Factors on Academic Performance: A Fuzzy Mining Decision Support System
Pub Date : 2017-06-30 DOI: 10.14257/IJDTA.2017.10.6.06
O. Oladipupo, A. I. Ehigbochie
Given the reported impact of some socio-economic factors on academic performance and on nations’ education value, there is a need for strong awareness to help students make the right decisions. To this end, this study proposes and designs a student decision support system for determining the extent to which different levels of involvement in some socio-economic factors can jointly affect academic performance. The factors are: student’s interest, relationship status, entrepreneurial activities, peer influence, health, and family background. The traditional decision support system architecture was extended in this study by introducing two components: a fuzzy engine and a mining engine. The fuzzy engine was introduced to capture intra-uncertainties in students' judgment about the data gathered, and the mining engine to extract hidden and previously unknown interesting patterns from the dataset. The predictive model was established using a fuzzy association rule mining technique. The dataset was gathered through one-on-one questionnaire interaction with students from 4 universities in Nigeria. The system evaluates students' linguistic levels of involvement and predicts their possible class of honours, with an explicit interpretation of the fired patterns. This system will assist students in deciding the extent to which they can be involved in some socio-economic activities, relative to their family and health status, in order to achieve their desired class of honours.
Citations: 4
An Effective Approach for Non-Numeric Relational Database Verification
Pub Date : 2017-06-30 DOI: 10.14257/IJDTA.2017.10.6.03
L. Camara, Demba Coulibaly, Ali Hamadou, Junyi Li
With the wide distribution of digital data, protecting its integrity has become necessary, and digital watermarking has been proposed as a solution for protecting the content of relational databases. Previous watermarking techniques mainly focus on authenticating numeric databases by inserting watermark bits into the data, which may greatly degrade data quality. In this paper, we present a distortion-free approach to verify the integrity of a combined numeric and non-numeric relational database. The technique first partitions the database into different groups of square matrices; the ASCII codes of the non-numeric data in the group attributes are then computed and used to generate the watermark. Security analyses and experiments demonstrate that the proposed technique is resilient against malicious attacks and that tampering can be detected down to the group level.
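A minimal sketch of the distortion-free idea in this abstract: compute a per-group watermark from the character codes of the non-numeric values and store it separately, so the data itself is never modified. The fixed-size row groups (instead of the paper's square-matrix partitioning), the use of SHA-256, and all names are assumptions for illustration. This sketch only covers string values, so it would not notice changes to numeric columns.

```python
import hashlib


def group_watermark(rows):
    """Digest the character codes of every non-numeric value in a group."""
    digest = hashlib.sha256()
    for row in rows:
        for value in row:
            if isinstance(value, str):
                codes = [ord(ch) for ch in value]
                digest.update(repr(codes).encode("ascii"))
    return digest.hexdigest()


def watermark_table(table, group_size):
    """Return one watermark per consecutive group of `group_size` rows."""
    groups = [table[i:i + group_size] for i in range(0, len(table), group_size)]
    return [group_watermark(g) for g in groups]


def tampered_groups(table, stored, group_size):
    """Indices of groups whose current watermark differs from the stored one."""
    current = watermark_table(table, group_size)
    return [i for i, (a, b) in enumerate(zip(current, stored)) if a != b]


# Hypothetical table; the stored watermarks are kept outside the database.
table = [("alice", "NY", 30), ("bob", "LA", 25),
         ("carol", "SF", 41), ("dave", "TX", 38)]
stored = watermark_table(table, 2)

modified = list(table)
modified[2] = ("carol", "LA", 41)  # simulate tampering with one attribute
print(tampered_groups(modified, stored, 2))
```

Because each group has its own watermark, a mismatch localizes tampering to that group rather than only flagging the whole table.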
Citations: 1
Query Optimization for Databases in Cloud Environment: A Survey
Pub Date : 2017-06-30 DOI: 10.14257/IJDTA.2017.10.6.01
Archana Bachhav, V. Kharat, M. Shelar
Nowadays, cloud computing plays an important role in the field of service-oriented technologies. The main aim of cloud computing is to let people compute and store resources easily and efficiently. A recent focus is on how data is expressed and searched. Improving performance in the cloud requires optimizing data processing time. Our study gives a comprehensive survey of the numerous models and approaches used for query optimization to minimize execution time and improve resource utilization. We have reviewed various research works on query optimization for conventional SQL and MapReduce platforms.
Citations: 8
Analysis of Criminal Profiling Utilizing Structured and Unstructured Data
Pub Date : 2017-06-30 DOI: 10.14257/ijdta.2017.10.6.04
Yonghoon Kim, Mokdong Chung
In general, structured data has a known meaning, whereas unstructured data has no predefined meaning. Although the quantity of structured information in the entire data and within organizations is increasing, the majority of information remains available only as unstructured data. While different in form, both unstructured and structured information sources provide information about entities in the world and their properties and relations. Due to recent rapid changes in society and the widespread use of information devices, diverse digital information is used in a variety of economic and social analyses. Crime statistics by type of crime have been used as a major factor in analyzing crime. However, statistical analysis that uses only structured data hampers investigations because it provides limited information to investigators and users. In this paper, structured and unstructured data are analyzed by applying Korean Natural Language Processing (Ko-NLP) and the Latent Semantic Analysis (LSA) technique. The aim is to provide an optimized crime profiling system that can be applied to crime profiling or statistical analysis [1].
一般来说,结构化数据知道句子的意思,非结构化数据指的是一种未知的手段。尽管整个数据和组织内部的结构化信息的数量正在增加,但大多数信息仍然只能在非结构化数据中获得。虽然形式不同,但非结构化和结构化信息源都提供有关世界上实体及其属性和关系的信息。由于近年来社会的快速变化和信息设备的广泛普及,各种各样的数字信息被用于各种经济和社会分析。按犯罪类型划分的犯罪统计资料已被用作犯罪的主要因素。然而,仅使用结构化数据进行统计分析,给调查人员和用户提供的信息有限,在调查中存在困难。本文采用朝鲜语自然语言处理(Ko-NLP)和潜在语义分析(LSA)技术对结构化数据和非结构化数据进行分析。它将提供一个犯罪侧写优化系统,可应用于犯罪侧写系统或统计分析[1]。
Citations: 0