Nguyễn Tấn Thuận, Tran Thi, Thuy Trinh, Doan Van Ban, Truong Ngoc, Chau, Nguyen Thi Anh, Phuong, Nguyen Truong Thang
{"title":"Some new fuzzy query processing methods based on similarity measurement and fuzzy data clustering","authors":"Nguyễn Tấn Thuận, Tran Thi, Thuy Trinh, Doan Van Ban, Truong Ngoc, Chau, Nguyen Thi Anh, Phuong, Nguyen Truong Thang","doi":"10.15625/2525-2518/18222","DOIUrl":null,"url":null,"abstract":"In relational and object-oriented database systems there is always data that is naturally fuzzy or uncertain. However, to deal with complex data types with fuzzy nature, these systems have many limitations. Therefore, in order to represent and manage fuzzy data, it is necessary to have a fuzzy interrogation system to facilitate non-expert users. To solve this challenge, the paper proposes two different approaches to increase the flexibility of the fuzzy interrogation system. Firstly, based on similarity measures and fuzzy logic, we develop three fuzzy query processing algorithms for single-condition and multi-condition cases such as FQSIMSC (Fuzzy Query Sim Single Condition), FQSIMMC (Fuzzy Query Sim Multi-Condition) and FQSEM (Fuzzy Query SEM). Secondly, we combine the fuzzy clustering algorithm EMC (Expectation maximization Coefficient) and the query processing algorithm that is based on fuzzy partitions FQINTERVAL (Fuzzy Query Interval). With this approach, we not only improve query processing cost but also support applications and devices equipped with intelligent interactive function that easily interacts with the fuzzy query system. The results of our theoretical and experimental analysis, it can be seen that both the proposed methods significantly reduce the processing time and memory space for a data set (extracted from UCI) that has a fuzzy and incomplete natural element with the resulting data size being optimal","PeriodicalId":23553,"journal":{"name":"Vietnam Journal of Science and Technology","volume":"34 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vietnam Journal of Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15625/2525-2518/18222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In relational and object-oriented database systems there is always data that is naturally fuzzy or uncertain. However, to deal with complex data types with fuzzy nature, these systems have many limitations. Therefore, in order to represent and manage fuzzy data, it is necessary to have a fuzzy interrogation system to facilitate non-expert users. To solve this challenge, the paper proposes two different approaches to increase the flexibility of the fuzzy interrogation system. Firstly, based on similarity measures and fuzzy logic, we develop three fuzzy query processing algorithms for single-condition and multi-condition cases such as FQSIMSC (Fuzzy Query Sim Single Condition), FQSIMMC (Fuzzy Query Sim Multi-Condition) and FQSEM (Fuzzy Query SEM). Secondly, we combine the fuzzy clustering algorithm EMC (Expectation maximization Coefficient) and the query processing algorithm that is based on fuzzy partitions FQINTERVAL (Fuzzy Query Interval). With this approach, we not only improve query processing cost but also support applications and devices equipped with intelligent interactive function that easily interacts with the fuzzy query system. The results of our theoretical and experimental analysis, it can be seen that both the proposed methods significantly reduce the processing time and memory space for a data set (extracted from UCI) that has a fuzzy and incomplete natural element with the resulting data size being optimal