Attribute Pattern Weights (APW): A Scale to Detect Concept Drift in Data Stream Mining Models

B. Ramakrishna, S. K. M. Rao
{"title":"Attribute Pattern Weights (APW): A Scale to Detect Concept Drift in Data Stream Mining Models","authors":"B. Ramakrishna, S. K. M. Rao","doi":"10.1109/ICCCI.2018.8441513","DOIUrl":null,"url":null,"abstract":"Extracting data structures from dynamic real-time data records is gaining prominence across industries. The need for massive mining of data sequences is increasingly observed in a wide range of user applications including social network platforms, banking sector, genomics, telecom sector, e-commerce and other sectors. To analyse multiple streams of data that is, for understanding rapid sequences of data flowing at continuous intervals, researchers are focusing on continuous improvements in data stream mining. Application of data mining models (like classifiers) in data streaming scenario mandates accurate detection of data distribution. Further, the model should adapt quickly to any variations in the distribution patterns to ensure the sustained performance of model predictability. Referred to as drift detection, the process can be gradual or abrupt. Extensive research has been made, designing several algorithms to accurately detect the type of drift and to adapt to shifts drift approaches. However, even the most reputed concept drift models have limited ability to adapt to both types of drift. The relationship between the adaptability and predictor variables is based on data distribution features and its sensitivity to in-built parameters. In this context, concept drift detection using attribute pattern weight (APW) is proposed here in this manuscript. Unlike the many of existing models, the proposed model is not dependent of any of the process targeted to apply on streaming data. The other significance of the proposed model is to detect the both types of concept drift that is gradual or abrupt. The experimental study that carried is evincing the scalability and robustness, and significance of the proposed model.","PeriodicalId":141663,"journal":{"name":"2018 International Conference on Computer Communication and Informatics (ICCCI)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Computer Communication and Informatics (ICCCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCI.2018.8441513","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Extracting data structures from dynamic real-time data records is gaining prominence across industries. The need for massive mining of data sequences is increasingly observed in a wide range of user applications including social network platforms, banking sector, genomics, telecom sector, e-commerce and other sectors. To analyse multiple streams of data that is, for understanding rapid sequences of data flowing at continuous intervals, researchers are focusing on continuous improvements in data stream mining. Application of data mining models (like classifiers) in data streaming scenario mandates accurate detection of data distribution. Further, the model should adapt quickly to any variations in the distribution patterns to ensure the sustained performance of model predictability. Referred to as drift detection, the process can be gradual or abrupt. Extensive research has been made, designing several algorithms to accurately detect the type of drift and to adapt to shifts drift approaches. However, even the most reputed concept drift models have limited ability to adapt to both types of drift. The relationship between the adaptability and predictor variables is based on data distribution features and its sensitivity to in-built parameters. In this context, concept drift detection using attribute pattern weight (APW) is proposed here in this manuscript. Unlike the many of existing models, the proposed model is not dependent of any of the process targeted to apply on streaming data. The other significance of the proposed model is to detect the both types of concept drift that is gradual or abrupt. The experimental study that carried is evincing the scalability and robustness, and significance of the proposed model.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
属性模式权重(APW):一种检测数据流挖掘模型中概念漂移的尺度
从动态实时数据记录中提取数据结构在各行各业越来越受重视。在社交网络平台、银行业、基因组学、电信、电子商务等广泛的用户应用中,对数据序列的大规模挖掘需求越来越大。为了分析多个数据流,即为了理解以连续间隔流动的快速数据序列,研究人员正在关注数据流挖掘的持续改进。数据流场景中数据挖掘模型(如分类器)的应用要求对数据分布进行准确的检测。此外,模型应该快速适应分布模式中的任何变化,以确保模型可预测性的持续性能。这个过程被称为漂移检测,可以是渐进的,也可以是突然的。已经进行了广泛的研究,设计了几种算法来准确地检测漂移类型并适应移位漂移方法。然而,即使是最著名的概念漂移模型,适应这两种类型漂移的能力也有限。自适应与预测变量之间的关系基于数据分布特征及其对内置参数的敏感性。在此背景下,本文提出了使用属性模式权重(APW)进行概念漂移检测。与许多现有模型不同,所提出的模型不依赖于任何针对流数据的流程。提出的模型的另一个意义是检测两种类型的概念漂移是渐进的或突然的。实验结果表明,该模型具有良好的可扩展性和鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Critical review of machine learning approaches to apply big data analytics in DDoS forensics Detection of the effect of exercise on APG signals Categorisation of security threats for smart home appliances Rotation-based LTE downlink resource scheduling using queue status monitoring Design and Analysis of Booth Multiplier with Optimised Power Delay Product
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1