Using Genetic Algorithm to Optimize Weights in Data Mining Task

I. Provorova, S. Parshutin, S. Provorovs
{"title":"Using Genetic Algorithm to Optimize Weights in Data Mining Task","authors":"I. Provorova, S. Parshutin, S. Provorovs","doi":"10.2478/v10143-010-0017-7","DOIUrl":null,"url":null,"abstract":"Using Genetic Algorithm to Optimize Weights in Data Mining Task This paper considers an application of genetic algorithm (GA) to optimize weights in data mining task. Data mining tasks usually have datasets containing a large number of records and features that will be processed using, for example, created classification rules. As a result, by using classical method to classify a large number of records and features, a high classification error value will be obtained. To solve this problem, the genetic algorithm was applied to find for each feature the weight that would reduce classification error value. As a classical method, the k-nearest neighbour (KNN) classifier was chosen and the modified genetic algorithm was applied to optimize the weight. Based on the joint application of genetic and k-nearest neighbour algorithms, the GA/KNN hybrid algorithm was developed. As a result, the developed hybrid algorithm provides a stable classification error reducing regardless of the number of records and features, and also of the chosen number of neighbours. In the GA block the modified crossover and mutation works in each generation with identical intensity and cannot provide debasing of the individual.","PeriodicalId":211660,"journal":{"name":"Sci. J. Riga Tech. Univ. Ser. Comput. Sci.","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sci. J. Riga Tech. Univ. Ser. Comput. Sci.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/v10143-010-0017-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Using Genetic Algorithm to Optimize Weights in Data Mining Task This paper considers an application of genetic algorithm (GA) to optimize weights in data mining task. Data mining tasks usually have datasets containing a large number of records and features that will be processed using, for example, created classification rules. As a result, by using classical method to classify a large number of records and features, a high classification error value will be obtained. To solve this problem, the genetic algorithm was applied to find for each feature the weight that would reduce classification error value. As a classical method, the k-nearest neighbour (KNN) classifier was chosen and the modified genetic algorithm was applied to optimize the weight. Based on the joint application of genetic and k-nearest neighbour algorithms, the GA/KNN hybrid algorithm was developed. As a result, the developed hybrid algorithm provides a stable classification error reducing regardless of the number of records and features, and also of the chosen number of neighbours. In the GA block the modified crossover and mutation works in each generation with identical intensity and cannot provide debasing of the individual.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于遗传算法的数据挖掘任务权值优化
利用遗传算法优化数据挖掘任务中的权值,研究了遗传算法在数据挖掘任务中权值优化中的应用。数据挖掘任务通常具有包含大量记录和特征的数据集,这些记录和特征将使用例如创建的分类规则进行处理。因此,用经典方法对大量的记录和特征进行分类,会得到较高的分类误差值。为了解决这一问题,应用遗传算法为每个特征寻找减少分类误差值的权重。作为经典方法,选择k近邻(KNN)分类器,并采用改进的遗传算法对权重进行优化。基于遗传算法和k近邻算法的联合应用,提出了GA/KNN混合算法。因此,所开发的混合算法提供了一个稳定的分类误差,无论记录和特征的数量,以及所选择的邻居数量如何。在遗传块中,改进的交叉和突变在每一代中以相同的强度起作用,不会造成个体的贬低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Inventory Management in Multi Echelon Supply Chain using Sample Average Approximation Hybridisation of Evolutionary Algorithms for Solving Multi-Objective Simulation Optimisation Problems Simulation-Based Analysis of Fitness Landscape in Optimisation Multi-Agent Geosimulation of Urban Dynamics within the V-Devs Framework The Baseline Configuration of Project Management Information System
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1