{"title":"Online extreme learning machine for handling concept drift and class imbalance problem","authors":"Dr. B. Vinayagasundaram, R. J. Aarthi, N. Abirami","doi":"10.1109/ICSCN.2017.8085690","DOIUrl":null,"url":null,"abstract":"The key interest of machine learning is conventionally training the machine from data that have underlying distribution such as data should have predetermined distribution. Such a constraint on the problem area leads to the technique for development of learning algorithms with notionally verifiable performance accuracy. However, real-world problems are not able to fit smartly into such restricted model. Class imbalance problem can occur due to tilted distribution of class data. Data streaming from non-stationary distribution with more uncertainty in real-time applications, resulting in the concept drift problem. In this methodology, it is proposed to extend the Extreme Learning Machine (ELM) algorithm for effectively handling the class imbalance problem and concept drift in datasets. This proposal has higher level of prediction accuracy and performance compared to Support Vector Machine (SVM) and Support Vector Data Description.","PeriodicalId":383458,"journal":{"name":"2017 Fourth International Conference on Signal Processing, Communication and Networking (ICSCN)","volume":"2021 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Fourth International Conference on Signal Processing, Communication and Networking (ICSCN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSCN.2017.8085690","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The key interest of machine learning is conventionally training the machine from data that have underlying distribution such as data should have predetermined distribution. Such a constraint on the problem area leads to the technique for development of learning algorithms with notionally verifiable performance accuracy. However, real-world problems are not able to fit smartly into such restricted model. Class imbalance problem can occur due to tilted distribution of class data. Data streaming from non-stationary distribution with more uncertainty in real-time applications, resulting in the concept drift problem. In this methodology, it is proposed to extend the Extreme Learning Machine (ELM) algorithm for effectively handling the class imbalance problem and concept drift in datasets. This proposal has higher level of prediction accuracy and performance compared to Support Vector Machine (SVM) and Support Vector Data Description.
机器学习的主要兴趣是传统地从具有底层分布的数据中训练机器,例如应该具有预定分布的数据。这种对问题区域的约束导致了具有概念上可验证性能准确性的学习算法的开发技术。然而,现实世界的问题并不能很好地适应这种受限制的模型。班级数据的倾斜分布会导致班级不平衡问题。数据流是非平稳分布,在实时应用中具有更多的不确定性,导致概念漂移问题。在该方法中,提出了扩展极限学习机(ELM)算法以有效处理数据集中的类不平衡问题和概念漂移问题。与支持向量机(SVM)和支持向量数据描述(Support Vector Data Description)相比,该方法具有更高的预测精度和性能。