{"title":"Balancing of the Imbalance data classification using Data Intrinsic characteristics","authors":"Harpreet Singh Bror","doi":"10.1109/IEMCON.2018.8614861","DOIUrl":null,"url":null,"abstract":"Data mining is the current buzz of the data processing world. There are large number of applications which are producing the large amount of data. This data need to be processed for the extraction of the analyzed data. So that this analysis can be used in the decision making purpose in relevant organizations. But the data collected from the row sources are of imbalance in nature. This data needs to be processed to generate the balancing dataset. Because processing imbalance data will be highly in efficient process. There are various types of discrepancies lies into the dataset. In current research paper these different types of discrepancies are to be neutralized. So that processing and analysis of the dataset will become easy and more efficient. These discrepancies lies into the dataset are like small disjuncts, lack of density in data, overlapping between the classes, impact of noisy data, the borderline etc. using different techniques these problems into the dataset are being neutralized. So that balanced data can be inputted for the data mining purpose.","PeriodicalId":368939,"journal":{"name":"2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMCON.2018.8614861","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Data mining is the current buzz of the data processing world. There are large number of applications which are producing the large amount of data. This data need to be processed for the extraction of the analyzed data. So that this analysis can be used in the decision making purpose in relevant organizations. But the data collected from the row sources are of imbalance in nature. This data needs to be processed to generate the balancing dataset. Because processing imbalance data will be highly in efficient process. There are various types of discrepancies lies into the dataset. In current research paper these different types of discrepancies are to be neutralized. So that processing and analysis of the dataset will become easy and more efficient. These discrepancies lies into the dataset are like small disjuncts, lack of density in data, overlapping between the classes, impact of noisy data, the borderline etc. using different techniques these problems into the dataset are being neutralized. So that balanced data can be inputted for the data mining purpose.