Journal of Computer Science and Cybernetics: Latest Articles

HIGH UTILITY ITEM INTERVAL SEQUENTIAL PATTERN MINING ALGORITHM
Pub Date : 2020-02-27 DOI: 10.15625/1813-9663/36/1/14398
Trần Huy Dương, N. Thang, V. D. Thi
High utility sequential pattern mining is a popular topic in data mining whose main purpose is to extract sequential patterns with high utility from a sequence database. Many recent works have proposed methods to solve this problem. However, most of them do not consider the item intervals of sequential patterns, which can lead to the extraction of patterns whose item intervals are too long to be meaningful. In this paper, we propose a High Utility Item Interval Sequential Pattern (HUISP) algorithm to solve this problem. Our algorithm uses a pattern-growth approach together with several techniques to improve its performance.
Pages : 1-15
Citations: 0
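The item-interval constraint described in the HUISP abstract above can be illustrated with a small sketch. This is not the HUISP algorithm itself (the abstract does not give its details); it only shows, under assumed data structures, how the utility of one pattern in one sequence changes when occurrences with overly long gaps are rejected. The event layout `(timestamp, item, quantity)`, the `unit_profit` table, and the function name `best_utility` are all hypothetical.

```python
from typing import Dict, List, Tuple

# Assumed layout: one sequence is a time-ordered list of (timestamp, item, quantity).
Event = Tuple[int, str, int]


def best_utility(seq: List[Event], pattern: List[str],
                 unit_profit: Dict[str, int], max_interval: int) -> int:
    """Highest utility of any occurrence of `pattern` in `seq` whose
    consecutive matched items are at most `max_interval` time units apart.
    Returns 0 when no occurrence satisfies the interval constraint."""
    neg = float("-inf")
    # best[i]: best utility of the pattern prefix matched so far, ending at event i.
    best = [unit_profit[item] * qty if item == pattern[0] else neg
            for (_, item, qty) in seq]
    for target in pattern[1:]:
        nxt = [neg] * len(seq)
        for j, (tj, item_j, qty_j) in enumerate(seq):
            if item_j != target:
                continue
            for i in range(j):
                gap = tj - seq[i][0]
                # Extend only occurrences that respect the item interval.
                if best[i] != neg and 0 < gap <= max_interval:
                    nxt[j] = max(nxt[j], best[i] + unit_profit[item_j] * qty_j)
        best = nxt
    top = max(best, default=neg)
    return 0 if top == neg else int(top)


events = [(1, "a", 2), (2, "b", 1), (6, "c", 3), (7, "a", 1), (8, "c", 1)]
profit = {"a": 5, "b": 2, "c": 1}
print(best_utility(events, ["a", "c"], profit, max_interval=10))
print(best_utility(events, ["a", "c"], profit, max_interval=2))
```

With a loose interval the best occurrence pairs the early, high-quantity `a` with a much later `c`; with a tight interval only the adjacent pair qualifies, which is the kind of filtering the abstract motivates.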
AGGREGATION OF SYMBOLIC POSSIBILISTIC KNOWLEDGE BASES FROM THE POSTULATE POINT OF VIEW
Pub Date : 2020-02-27 DOI: 10.15625/1813-9663/36/1/13188
Thanh Do Van, Thi Thanh Luu Le
The aggregation of knowledge bases in propositional language was investigated early on, and the requirements for aggregation processes of propositional knowledge bases are largely agreed upon within the community of researchers and practitioners. The aggregation of standard possibilistic knowledge bases, where the weights of propositional formulas are numeric, has also been investigated and applied in building intelligent systems, in multi-criteria decision-making processes, and in decision-making processes carried out by many people. Symbolic possibilistic logic (SPL for short), where the weights of propositional formulas are symbols, was proposed, and it was recently proven that SPL is sound and complete. In order to apply SPL in building intelligent systems as well as in decision-making processes, it is necessary to solve the problem of aggregating symbolic possibilistic knowledge bases (SPK bases for short). This problem has not been studied so far. The purpose of this paper is to investigate aggregation processes of SPK bases from the postulate point of view in propositional language. These processes are implemented via impossibility distributions defined from SPK bases. The paper presents the characteristics of merging operators, including hierarchical merging operators, of symbolic impossibility distributions (SIDs for short) from the postulate point of view.
Pages : 17-32
Citations: 0
CHOOSING SEEDS FOR SEMI-SUPERVISED GRAPH BASED CLUSTERING
Pub Date : 2019-10-31 DOI: 10.15625/1813-9663/35/4/14123
C. Le, V. Vu, L. K. Oanh, Nguyen Thi Hai Yen
Although clustering algorithms have a long history, clustering still attracts a lot of attention because of the need for efficient data analysis tools in many applications such as social networks, electronic commerce, and GIS. Recently, semi-supervised clustering, which uses side information, has received a great deal of attention; examples include semi-supervised K-Means, semi-supervised DBSCAN, and semi-supervised graph-based clustering (SSGC). Generally, there are two forms of side information: seeds (labeled data) and constraints (must-link, cannot-link). By integrating information provided by the user or a domain expert, semi-supervised clustering can produce the expected results. In fact, clustering results usually depend on the side information provided, so different side information will produce different clustering results. In some cases, clustering performance may decrease if the side information is not carefully chosen. This paper addresses the problem of efficiently collecting seeds for semi-supervised clustering, especially for graph-based clustering by seeding (SSGC). Properly collected seeds can boost the quality of clustering and minimize the number of queries solicited from the user. For this purpose, we have developed an active learning algorithm (called SKMMM) for the seed collection task, which uses the K-Means and min-max algorithms to identify the candidates about which users are queried. Experiments conducted on real data sets from UCI and on a collected real-world document data set show the effectiveness of our approach compared with other methods.
Citations: 0
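The min-max step mentioned in the seed-selection abstract above can be sketched as a farthest-first traversal: each new candidate is the point whose distance to its nearest already-chosen candidate is largest. This is only an illustration of the generic min-max idea; how SKMMM combines it with K-Means and with user queries is not described in the abstract, and the function name `min_max_candidates` and its parameters are hypothetical.

```python
import math
from typing import List, Sequence


def min_max_candidates(points: Sequence[Sequence[float]], k: int,
                       first: int = 0) -> List[int]:
    """Return indices of k points chosen by farthest-first (min-max) selection,
    starting from the point at index `first`."""
    if not 0 < k <= len(points):
        raise ValueError("k must be between 1 and the number of points")
    chosen = [first]
    # nearest[i]: distance from point i to its closest chosen point so far.
    nearest = [math.dist(p, points[first]) for p in points]
    while len(chosen) < k:
        # Pick the point that maximizes the minimum distance to the chosen set.
        nxt = max(range(len(points)), key=lambda i: nearest[i])
        chosen.append(nxt)
        for i, p in enumerate(points):
            nearest[i] = min(nearest[i], math.dist(p, points[nxt]))
    return chosen


data = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0), (10.0, 0.0)]
print(min_max_candidates(data, k=3))
```

The selected points tend to fall in different regions of the data, which is why such a step is a plausible way to propose seed candidates that cover distinct clusters with few user queries.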
Pythagorean Picture Fuzzy Sets, Part 1- basic notions
Pub Date : 2019-10-31 DOI: 10.15625/1813-9663/35/4/13898
B. Cuong
The picture fuzzy set (2013) is a generalization of Zadeh's fuzzy set (1965) and Atanassov's intuitionistic fuzzy set. The new concept could be useful for many computational intelligence problems. Basic operators of picture fuzzy logic were studied by Cuong and Ngan [10, 11]. The new concept of a Pythagorean picture fuzzy set (PPFS) is a combination of the picture fuzzy set with Yager's Pythagorean fuzzy set [12-14]. First, in Part 1 of this paper, we consider basic notions on PPFS, such as set operators of PPFSs, the Pythagorean picture relation, and the Pythagorean picture fuzzy soft set. Next, Part 2 of the paper is devoted to the main operators of fuzzy logic on PPFS: the picture negation operator, picture t-norm, picture t-conorm, and picture implication operators on PPFS. As a result, we will have a new branch of picture fuzzy set theory.
Citations: 3
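For readers unfamiliar with the set families named in the abstract above, the membership constraints can be placed side by side. The first three lines are the standard definitions of the intuitionistic, picture, and Pythagorean fuzzy sets. The last line is an assumption: the abstract does not state the PPFS constraint, and the form shown is simply the natural combination of the two preceding ones. The symbols \mu_A, \eta_A, and \nu_A (positive, neutral, and negative membership degrees) are notation chosen for this sketch.

```latex
\begin{align*}
  \text{Intuitionistic fuzzy set:} \quad & 0 \le \mu_A(x) + \nu_A(x) \le 1 \\
  \text{Picture fuzzy set:} \quad & 0 \le \mu_A(x) + \eta_A(x) + \nu_A(x) \le 1 \\
  \text{Pythagorean fuzzy set:} \quad & 0 \le \mu_A(x)^2 + \nu_A(x)^2 \le 1 \\
  \text{PPFS (assumed form):} \quad & 0 \le \mu_A(x)^2 + \eta_A(x)^2 + \nu_A(x)^2 \le 1
\end{align*}
```

Under the assumed form, a triple such as (0.6, 0.5, 0.6) would be admissible (0.36 + 0.25 + 0.36 = 0.97) even though it violates the picture fuzzy set constraint (0.6 + 0.5 + 0.6 = 1.7).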
DISTORTION-BASED HEURISTIC METHOD FOR SENSITIVE ASSOCIATION RULE HIDING
Pub Date : 2019-10-31 DOI: 10.15625/1813-9663/35/4/14131
Bac Le, L. Kieu, Dat Tran
In the past few years, privacy issues in data mining have received considerable attention in the literature. However, the problem of data security cannot be solved simply by restricting data collection or preventing unauthorized access. It should be dealt with by providing solutions that not only protect sensitive information but also do not affect the accuracy of data mining results and do not compromise sensitive knowledge related to individual privacy or competitive advantage in business. Sensitive association rule hiding is an important issue in privacy-preserving data mining. The aim of association rule hiding is to minimize the side effects on the sanitized database, which means reducing the number of missing non-sensitive rules and the number of generated ghost rules. Current methods for hiding sensitive rules cause side effects and data loss. In this paper, we introduce a new distortion-based method to hide sensitive rules. This method determines critical transactions based on the number of non-sensitive maximal frequent itemsets that contain at least one item of the consequent of the sensitive rule; these itemsets can be directly affected by the modified transactions. Using this set, the number of non-sensitive itemsets that need to be considered is reduced dramatically. We compute the smallest number of transactions to modify in advance in order to minimize the damage to the database. Comparative experimental results on real datasets show that the proposed method achieves better results than other methods, with fewer side effects and less data loss.
Citations: 1
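The basic mechanics of distortion-based rule hiding referred to in the abstract above can be shown with a small sketch: an item of the rule's consequent is removed from supporting transactions until the rule's confidence falls below a threshold. This is a naive illustration only. It modifies transactions in database order and does not implement the paper's selection of critical transactions via non-sensitive maximal frequent itemsets. The function names and the `victim` parameter are hypothetical.

```python
from typing import FrozenSet, List, Set


def count(db: List[Set[str]], items: FrozenSet[str]) -> int:
    """Number of transactions that contain every item in `items`."""
    return sum(items <= t for t in db)


def confidence(db: List[Set[str]], antecedent: FrozenSet[str],
               consequent: FrozenSet[str]) -> float:
    """Confidence of the rule antecedent -> consequent."""
    base = count(db, antecedent)
    return count(db, antecedent | consequent) / base if base else 0.0


def hide_rule(db: List[Set[str]], antecedent: FrozenSet[str],
              consequent: FrozenSet[str], min_conf: float, victim: str) -> int:
    """Remove `victim` (an item of the consequent) from supporting transactions,
    in place, until confidence drops below `min_conf`.
    Returns the number of modified transactions."""
    modified = 0
    full = antecedent | consequent
    for t in db:
        if confidence(db, antecedent, consequent) < min_conf:
            break
        if full <= t:
            t.discard(victim)  # the distortion step
            modified += 1
    return modified


db = [{"a", "b", "c"}, {"a", "b"}, {"a", "b", "d"}, {"a", "c"}, {"b", "d"}]
A, B = frozenset({"a"}), frozenset({"b"})
print(confidence(db, A, B))
print(hide_rule(db, A, B, min_conf=0.5, victim="b"))
print(confidence(db, A, B))
```

Every removal can also lower the support of non-sensitive itemsets containing the removed item, which is the side effect the abstract's method tries to limit by choosing which transactions to modify.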
Numerical solution of the problems for plates on some complex partial internal supports
Pub Date : 2019-10-31 DOI: 10.15625/1813-9663/35/4/13648
Trương Hà Hải, Vu Vinh Quang, D. Long
In recent works, Dang and Truong proposed an iterative method for solving some problems of plates on one, two, and three line partial internal supports (LPISs) and on a cross internal support. By nature, these are problems with strongly mixed boundary conditions for the biharmonic equation. For this reason, the method combines a domain decomposition technique with a reduction of the order of the equation from four to two. In this study, the method is extended to plates on internal supports of more complex configurations. Namely, we examine the cases of symmetric rectangular and H-shaped supports, where the computational domain, after reduction to the first quadrant of the plate, is divided into three subdomains. We also consider the case of an asymmetric rectangular support, where the computational domain needs to be divided into nine subdomains. The problems under consideration are reduced to sequences of weakly mixed boundary value problems for the Poisson equation, which are solved by a difference method. The numerical experiments performed show the effectiveness of the iterative method.
Citations: 0
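The order reduction mentioned in the plate abstract above follows a standard pattern: the fourth-order biharmonic equation is split into two second-order Poisson equations through an auxiliary unknown. The sketch below shows only this splitting. The symbols u (deflection), f (load), v (auxiliary function), and \Omega (plate domain) are notation assumed here, the sign convention for v may differ from the paper's, and the boundary and interface conditions that the iterative method must supply are omitted.

```latex
\begin{align*}
  \Delta^2 u &= f \quad \text{in } \Omega
  \qquad \Longleftrightarrow \qquad
  \begin{cases}
    \Delta v = f, \\
    \Delta u = v,
  \end{cases}
  \quad \text{in } \Omega, \qquad v := \Delta u .
\end{align*}
```

The difficulty lies in the fact that boundary values of v are not given directly on all parts of the boundary, which is why an iteration over the missing boundary data, combined with domain decomposition, is needed.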
EXTENDING RELATIONAL DATABASE MODEL FOR UNCERTAIN INFORMATION
Pub Date : 2019-10-31 DOI: 10.15625/1813-9663/35/4/13907
Hoa Nguyen
In this paper, we propose a new probabilistic relational database model, denoted PRDB, as an extension of the classical relational database model in which the uncertainty of relational attribute values and of tuples is represented by finite sets and probability intervals, respectively. A probabilistic interpretation of binary relations on finite sets is proposed for computing their probability measures. Combination strategies on probability intervals are employed to combine attribute values and to compute the uncertain membership degrees of tuples in a relation. The fundamental concepts of the classical relational database model are extended and generalized for PRDB. The probabilistic relational algebraic operations are then formally defined in PRDB. In addition, a set of properties of the algebraic operations in this new model is formulated and proven.
Citations: 2
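The phrase "combination strategies on probability intervals" in the PRDB abstract above can be made concrete with a sketch of four strategies that are common in the probabilistic database literature: conjunction and disjunction under independence and under ignorance (no assumption about how the events are related). Whether PRDB uses exactly these strategies is an assumption; the abstract does not list them. The function names are hypothetical, and an interval is represented as a `(lower, upper)` tuple.

```python
from typing import Tuple

Interval = Tuple[float, float]  # (lower bound, upper bound) of a probability


def conj_independence(a: Interval, b: Interval) -> Interval:
    """P(A and B) when A and B are independent."""
    return (a[0] * b[0], a[1] * b[1])


def conj_ignorance(a: Interval, b: Interval) -> Interval:
    """P(A and B) when nothing is known about the dependence of A and B."""
    return (max(0.0, a[0] + b[0] - 1.0), min(a[1], b[1]))


def disj_independence(a: Interval, b: Interval) -> Interval:
    """P(A or B) when A and B are independent."""
    return (a[0] + b[0] - a[0] * b[0], a[1] + b[1] - a[1] * b[1])


def disj_ignorance(a: Interval, b: Interval) -> Interval:
    """P(A or B) when nothing is known about the dependence of A and B."""
    return (max(a[0], b[0]), min(1.0, a[1] + b[1]))


p, q = (0.5, 0.75), (0.5, 1.0)
print(conj_independence(p, q))
print(conj_ignorance(p, q))
print(disj_independence(p, q))
print(disj_ignorance(p, q))
```

The ignorance strategies always produce intervals at least as wide as the independence ones, reflecting the weaker assumption. In a model of this kind, such a strategy would be applied, for example, when a selection condition joins two uncertain attribute conditions on the same tuple.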
A HEDGE ALGEBRAS BASED CLASSIFICATION REASONING METHOD WITH MULTI-GRANULARITY FUZZY PARTITIONING
Pub Date : 2019-10-31 DOI: 10.15625/1813-9663/35/4/14348
Phạm Đình Phong, Nguyen Duc Du, N. Thuy, Hoàng Văn Thông
In recent years, many fuzzy rule-based classifier (FRBC) design methods have been proposed to improve the classification accuracy and interpretability of classification models. Most of them follow the fuzzy set theory approach, in which fuzzy classification rules are generated from grid partitions combined with fuzzy partitions pre-designed using fuzzy sets. Some mechanisms, such as discretization and granular computing, have been studied to generate fuzzy partitions automatically from data. Even so, linguistic terms are assigned to fuzzy sets intuitively because there is no formalism that links the inherent semantics of linguistic terms to fuzzy sets. In view of that trend, genetic design methods for the linguistic terms of FRBCs, along with their (triangular and trapezoidal) fuzzy set-based semantics, have been proposed using hedge algebras as the mathematical formalism. Those hedge algebra-based design methods use the semantically quantifying mapping values of linguistic terms to generate their fuzzy set-based semantics, so that the fuzzy set-based classification reasoning methods proposed in fuzzy set theoretic design methods can be used for data classification. If there exists a classification reasoning method based solely on the semantic parameters of hedge algebras, the fuzzy set-based semantics of the linguistic terms in fuzzy classification rule bases can be replaced by hedge algebra-based semantics. This paper presents an FRBC design method based on the hedge algebra approach by introducing a hedge algebra-based classification reasoning method with multi-granularity fuzzy partitioning for data classification, so that the semantics of linguistic terms in rule bases can be hedge algebra-based semantics. Experimental results on 17 real-world datasets are compared with those of existing hedge algebra-based methods and state-of-the-art fuzzy set theoretic approaches, showing that the proposed FRBC is an effective classifier and produces good results.
Citations: 3
FUZZY COMMON SEQUENTIAL RULES MINING IN QUANTITATIVE SEQUENCE DATABASES
Pub Date : 2019-08-15 DOI: 10.15625/1813-9663/35/3/13277
Thanh Do Van, Phuong Truong Duc
Pages : 217-232
Citations: 0
FUZZY COMMON SEQUENTIAL RULES MINING IN QUANTITATIVE SEQUENCE DATABASES
Pub Date : 2019-08-15 DOI: 10.15625/1813-9663/0/0/13277
Thanh Do Van, Phuong Truong Duc
Common sequential rules express a relationship between unordered itemsets in which the items in the antecedent must appear before those in the consequent. The algorithms proposed so far for finding such rules apply only to transactional sequence databases, not to quantitative sequence databases. The goal of this paper is to propose a new algorithm for finding fuzzy common sequential (FCS) rules in quantitative sequence databases. The proposed algorithm is an improvement based on the ERMiner algorithm, which is currently considered the most effective algorithm for finding common sequential rules in transactional sequence databases. FCS rules are more general than classical fuzzy sequential rules and are useful in marketing, market analysis, and medical diagnosis and treatment.
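As a rough illustration of the rule type described in this abstract, the sketch below checks a crisp (non-fuzzy) common sequential rule X => Y against a small transactional sequence database and computes its support and confidence. It is neither the paper's FCS algorithm nor ERMiner: it assumes the usual definition that a rule occurs in a sequence when all antecedent items appear in the itemsets up to some position k and all consequent items appear in the itemsets after k. The function names and the toy database are hypothetical.

```python
from typing import List, Set, Tuple

Sequence = List[Set[str]]  # a sequence is an ordered list of itemsets


def rule_occurs(sequence: Sequence, antecedent: Set[str], consequent: Set[str]) -> bool:
    """Return True if some split point k puts every antecedent item in
    itemsets 1..k and every consequent item in itemsets k+1..n.
    Item order inside each side is ignored (the itemsets are unordered)."""
    for k in range(1, len(sequence)):
        before = set().union(*sequence[:k])
        after = set().union(*sequence[k:])
        if antecedent <= before and consequent <= after:
            return True
    return False


def support_and_confidence(
    database: List[Sequence], antecedent: Set[str], consequent: Set[str]
) -> Tuple[float, float]:
    """Support = share of sequences in which the rule occurs.
    Confidence = rule occurrences / sequences containing the whole antecedent."""
    rule_count = sum(rule_occurs(s, antecedent, consequent) for s in database)
    antecedent_count = sum(antecedent <= set().union(*s) for s in database)
    support = rule_count / len(database)
    confidence = rule_count / antecedent_count if antecedent_count else 0.0
    return support, confidence


# Hypothetical toy database of four sequences (illustrative data only).
toy_db: List[Sequence] = [
    [{"a"}, {"b"}, {"c"}],
    [{"b"}, {"a"}, {"c"}],
    [{"c"}, {"a", "b"}],
    [{"a"}, {"c"}],
]

support, confidence = support_and_confidence(toy_db, {"a", "b"}, {"c"})
print(support, confidence)
```

The first two sequences both satisfy the rule {a, b} => {c} even though a and b occur in different orders, which is what distinguishes a common sequential rule from a rule over strictly ordered patterns. A fuzzy variant for quantitative databases would replace the set-membership tests with membership degrees, which this sketch does not attempt.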
{"title":"FUZZY COMMON SEQUENTIAL RULES MINING IN QUANTITATIVE SEQUENCE DATABASES","authors":"Thanh Do Van, Phuong Truong Duc","doi":"10.15625/1813-9663/0/0/13277","DOIUrl":"https://doi.org/10.15625/1813-9663/0/0/13277","url":null,"abstract":"Common Sequential Rules present a relationship between unordered itemsets in which the items in antecedents have to appear before ones in consequents. The algorithms proposed to find the such rules so far are only applied for transactional sequence databases, not applied for quantitative sequence databases.The goal of this paper is to propose a new algorithm for finding the fuzzy common sequential (FCS for short) rules in quantitative sequence databases. The proposed algorithm is improved by basing on the ERMiner algorithm. It is considered to be the most effective today compared to other algorithms for finding common sequential rules in transactional sequence database. FCS rules are more general than classical fuzzy sequential rules and are useful in marketing, market analysis, medical diagnosis and treatment","PeriodicalId":15444,"journal":{"name":"Journal of Computer Science and Cybernetics","volume":"66 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89345564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0