Using Genetic Algorithm to Optimize Weights in Data Mining Task

Sci. J. Riga Tech. Univ. Ser. Comput. Sci. Pub Date : 1900-01-01 DOI:10.2478/v10143-010-0017-7

I. Provorova, S. Parshutin, S. Provorovs

{"title":"Using Genetic Algorithm to Optimize Weights in Data Mining Task","authors":"I. Provorova, S. Parshutin, S. Provorovs","doi":"10.2478/v10143-010-0017-7","DOIUrl":null,"url":null,"abstract":"Using Genetic Algorithm to Optimize Weights in Data Mining Task This paper considers an application of genetic algorithm (GA) to optimize weights in data mining task. Data mining tasks usually have datasets containing a large number of records and features that will be processed using, for example, created classification rules. As a result, by using classical method to classify a large number of records and features, a high classification error value will be obtained. To solve this problem, the genetic algorithm was applied to find for each feature the weight that would reduce classification error value. As a classical method, the k-nearest neighbour (KNN) classifier was chosen and the modified genetic algorithm was applied to optimize the weight. Based on the joint application of genetic and k-nearest neighbour algorithms, the GA/KNN hybrid algorithm was developed. As a result, the developed hybrid algorithm provides a stable classification error reducing regardless of the number of records and features, and also of the chosen number of neighbours. In the GA block the modified crossover and mutation works in each generation with identical intensity and cannot provide debasing of the individual.","PeriodicalId":211660,"journal":{"name":"Sci. J. Riga Tech. Univ. Ser. Comput. Sci.","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sci. J. Riga Tech. Univ. Ser. Comput. Sci.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/v10143-010-0017-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Using Genetic Algorithm to Optimize Weights in Data Mining Task This paper considers an application of genetic algorithm (GA) to optimize weights in data mining task. Data mining tasks usually have datasets containing a large number of records and features that will be processed using, for example, created classification rules. As a result, by using classical method to classify a large number of records and features, a high classification error value will be obtained. To solve this problem, the genetic algorithm was applied to find for each feature the weight that would reduce classification error value. As a classical method, the k-nearest neighbour (KNN) classifier was chosen and the modified genetic algorithm was applied to optimize the weight. Based on the joint application of genetic and k-nearest neighbour algorithms, the GA/KNN hybrid algorithm was developed. As a result, the developed hybrid algorithm provides a stable classification error reducing regardless of the number of records and features, and also of the chosen number of neighbours. In the GA block the modified crossover and mutation works in each generation with identical intensity and cannot provide debasing of the individual.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于遗传算法的数据挖掘任务权值优化

利用遗传算法优化数据挖掘任务中的权值，研究了遗传算法在数据挖掘任务中权值优化中的应用。数据挖掘任务通常具有包含大量记录和特征的数据集，这些记录和特征将使用例如创建的分类规则进行处理。因此，用经典方法对大量的记录和特征进行分类，会得到较高的分类误差值。为了解决这一问题，应用遗传算法为每个特征寻找减少分类误差值的权重。作为经典方法，选择k近邻(KNN)分类器，并采用改进的遗传算法对权重进行优化。基于遗传算法和k近邻算法的联合应用，提出了GA/KNN混合算法。因此，所开发的混合算法提供了一个稳定的分类误差，无论记录和特征的数量，以及所选择的邻居数量如何。在遗传块中，改进的交叉和突变在每一代中以相同的强度起作用，不会造成个体的贬低。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Sci. J. Riga Tech. Univ. Ser. Comput. Sci.

自引率

0.00%

发文量