A Novel HOSFS Algorithm for Online Streaming Feature Selection
S. Sandhiya, U. Palani
2020 International Conference on System, Computation, Automation and Networking (ICSCAN), pp. 1-6
Published: 2020-07-03
DOI: 10.1109/ICSCAN49426.2020.9262401 (https://doi.org/10.1109/ICSCAN49426.2020.9262401)
Citations: 2
Abstract
Data stream mining has become important for many real-time and IoT-based applications. Online feature selection is a major topic within data stream mining and has attracted intense research interest. It reduces the dimensionality of streaming features by excluding irrelevant and redundant features. Researchers have proposed many online feature selection algorithms for streaming features, such as Grafting, Alpha-investing, OSFS, OGFS, and SAOLA. According to these studies, the existing algorithms are limited in prediction accuracy and select a large number of features. To overcome these limitations, we propose an online feature selection algorithm for streaming features called Heuristic Online Streaming Feature Selection (HOSFS), which selects features from the stream and omits irrelevant and redundant features in real time by using a self-adaptive sliding window protocol and a heuristic function. The HOSFS algorithm assigns a heuristic value to each feature using the trained heuristic function and selects the features with higher heuristic values; the remaining features are treated as irrelevant. The proposed technique yields a reduced set of strongly related features and achieves higher prediction accuracy with an optimal feature set. The efficiency of HOSFS was tested on three different healthcare datasets using MOA tools. In the experiments, HOSFS achieved higher prediction accuracy and selected fewer features than Alpha-investing, OSFS, and SAOLA.
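The general idea of scoring streaming features and keeping only the relevant, non-redundant ones can be illustrated with a minimal sketch. This is not the HOSFS algorithm itself, since the abstract does not specify its heuristic function or how the window adapts. The sketch makes the following assumptions: the absolute Pearson correlation with the label stands in for the trained heuristic function, a fixed-size buffer stands in for the self-adaptive sliding window, and the names `select_streaming_features`, `relevance_threshold`, `redundancy_threshold`, and `window_size` are hypothetical.

```python
from math import sqrt
from typing import Dict, Iterable, List, Sequence, Tuple


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; returns 0.0 when either input is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def select_streaming_features(
    stream: Iterable[Tuple[str, Sequence[float]]],
    labels: Sequence[float],
    relevance_threshold: float = 0.3,   # assumed cutoff, not from the paper
    redundancy_threshold: float = 0.9,  # assumed cutoff, not from the paper
    window_size: int = 5,               # fixed here; the paper's window adapts
) -> List[str]:
    """Select features that arrive one at a time as (name, values) pairs."""
    selected: Dict[str, List[float]] = {}
    window: List[Tuple[float, str, List[float]]] = []

    def flush() -> None:
        # Visit buffered features from highest to lowest heuristic value.
        for score, name, values in sorted(window, key=lambda item: -item[0]):
            if score < relevance_threshold:
                continue  # treated as irrelevant
            if any(abs(pearson(values, kept)) >= redundancy_threshold
                   for kept in selected.values()):
                continue  # treated as redundant with an already selected feature
            selected[name] = values
        window.clear()

    for name, values in stream:
        # Stand-in heuristic value: |correlation| between feature and label.
        window.append((abs(pearson(values, labels)), name, list(values)))
        if len(window) >= window_size:
            flush()
    flush()  # process whatever remains when the stream ends
    return list(selected)
```

Calling `select_streaming_features` with an iterable of `(name, values)` pairs and a label vector returns the names of the retained features in the order they were selected. Buffering features in a window lets the stronger candidates be considered first, so a weaker near-duplicate that arrives earlier in the same window does not displace a stronger one.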