Proposing new method to improve gravitational fixed nearest neighbor algorithm for imbalanced data classification

Bahareh Nikpour, Mahin Shabani, H. Nezamabadi-pour
{"title":"Proposing new method to improve gravitational fixed nearest neighbor algorithm for imbalanced data classification","authors":"Bahareh Nikpour, Mahin Shabani, H. Nezamabadi-pour","doi":"10.1109/CSIEC.2017.7940167","DOIUrl":null,"url":null,"abstract":"Classification of imbalanced data sets is one of the basic challenges in the field of machine learning and data mining. There have been many proposed methods for classification of such data sets up to now. In algorithmic level methods, new algorithms are created which are adapted to the nature of imbalanced data sets. Gravitational fixed radius nearest neighbor algorithm (GFRNN) is an algorithmic level method developed with the aim of enhancing k nearest neighbor classifier to acquire the ability of dealing with imbalanced data sets. This algorithm, utilizes the sum of gravitational forces on a query sample from its nearest neighbors in a fixed radius to determine its label. Simplicity and no need for parameter setting during the run of algorithm are the main advantages of this method. In this paper, a method is proposed for improving the performance of GFRNN algorithm in which mass assigning of each training sample is done based on the sum of gravitational forces from other training samples on it. The results obtained from applying the proposed method on 10 data sets prove the superiority of it compared with 5 other algorithms.","PeriodicalId":166046,"journal":{"name":"2017 2nd Conference on Swarm Intelligence and Evolutionary Computation (CSIEC)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 2nd Conference on Swarm Intelligence and Evolutionary Computation (CSIEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSIEC.2017.7940167","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

Classification of imbalanced data sets is one of the basic challenges in the field of machine learning and data mining. There have been many proposed methods for classification of such data sets up to now. In algorithmic level methods, new algorithms are created which are adapted to the nature of imbalanced data sets. Gravitational fixed radius nearest neighbor algorithm (GFRNN) is an algorithmic level method developed with the aim of enhancing k nearest neighbor classifier to acquire the ability of dealing with imbalanced data sets. This algorithm, utilizes the sum of gravitational forces on a query sample from its nearest neighbors in a fixed radius to determine its label. Simplicity and no need for parameter setting during the run of algorithm are the main advantages of this method. In this paper, a method is proposed for improving the performance of GFRNN algorithm in which mass assigning of each training sample is done based on the sum of gravitational forces from other training samples on it. The results obtained from applying the proposed method on 10 data sets prove the superiority of it compared with 5 other algorithms.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
提出了一种改进引力固定近邻算法的不平衡数据分类新方法
不平衡数据集的分类是机器学习和数据挖掘领域的基本挑战之一。到目前为止,已经提出了许多对此类数据集进行分类的方法。在算法级方法中,创建了适应不平衡数据集性质的新算法。重力固定半径最近邻算法(GFRNN)是一种算法级方法,旨在增强k最近邻分类器以获得处理不平衡数据集的能力。该算法利用查询样本在固定半径内的最近邻的引力之和来确定其标签。该方法的主要优点是操作简单,在算法运行过程中不需要设置参数。本文提出了一种改进GFRNN算法性能的方法,该方法根据其他训练样本对其施加的引力之和对每个训练样本进行质量分配。在10个数据集上的应用结果证明了该方法与其他5种算法相比的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
EEG-based multi-class motor imagery classification using variable sized filter bank and enhanced One Versus One classifier MOCSA: A Multi-Objective Crow Search Algorithm for Multi-Objective optimization A genetic approach in procedural content generation for platformer games level creation Using Recurrence quantification analysis and Generalized Hurst Exponents of ECG for human authentication Improved particle swarm optimization through orthogonal experimental design
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1