{"title":"Heuristic model to improve Feature Selection based on Machine Learning in Data Mining","authors":"Jahin Majumdar, Anwesha Mal, Shruti Gupta","doi":"10.1109/CONFLUENCE.2016.7508050","DOIUrl":null,"url":null,"abstract":"Data Mining and Machine Learning is one of the most popular research areas in computer science that is relevant in today's world of unfathomable data. To keep up with the rising size of data, there arises a need to quickly extract knowledge from data sources to aid data analysis research and improve industry and market needs. Primary Data Mining algorithms like k-means, Apriori, PageRank etc. are used today, but Machine Learning techniques can enhance the same by learning from the complex patterns. This paper focuses on the various existing approaches where Machine Learning algorithms have been used to improve data classification and pattern recognition in Data Mining especially for Feature Selection. It compares and contrasts the existing techniques and finds out the best one among them. Further, the paper proposes a heuristic approach to theoretically overcome most of the limitations in existing algorithms.","PeriodicalId":299044,"journal":{"name":"2016 6th International Conference - Cloud System and Big Data Engineering (Confluence)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 6th International Conference - Cloud System and Big Data Engineering (Confluence)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONFLUENCE.2016.7508050","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Data Mining and Machine Learning is one of the most popular research areas in computer science that is relevant in today's world of unfathomable data. To keep up with the rising size of data, there arises a need to quickly extract knowledge from data sources to aid data analysis research and improve industry and market needs. Primary Data Mining algorithms like k-means, Apriori, PageRank etc. are used today, but Machine Learning techniques can enhance the same by learning from the complex patterns. This paper focuses on the various existing approaches where Machine Learning algorithms have been used to improve data classification and pattern recognition in Data Mining especially for Feature Selection. It compares and contrasts the existing techniques and finds out the best one among them. Further, the paper proposes a heuristic approach to theoretically overcome most of the limitations in existing algorithms.