Rare association rule mining for data stream

2021 RIVF International Conference on Computing and Communication Technologies (RIVF) Pub Date : 2014-12-01 DOI:10.1109/ICCCT2.2014.7066696

Sunitha Vanamala, L. P. Sree, S. Bhavani

{"title":"Rare association rule mining for data stream","authors":"Sunitha Vanamala, L. P. Sree, S. Bhavani","doi":"10.1109/ICCCT2.2014.7066696","DOIUrl":null,"url":null,"abstract":"The immense volumes of data is populated into repositories from various applications. More over data arrives into the repositories continuously i.e. stream of data that cannot be stored into repository due to its varying characteristics. Frequent itemset mining is thoroughly studied by many researchers but important rare items are not discovered by these algorithms. In many cases, the contradictions or exceptions also offers useful associations. In the recent past the researchers started to focus on the discovery of such kind of associations called rare associations. Rare itemsets can be obtained by setting low support but generates huge number of rules. The rare association rule mining is a challenging area of research on data streams. In this paper we proposed an idea to analyze the data stream to identify interesting rare association rules. Rare association rule mining is the process of identifying associations that are having low support but occurs with high confidence. The rare association rules are useful for many applications such as fraudulent credit card usage, anomaly detection in networks, detection of network failures, educational data, medical diagnosis etc. The proposed rare association rule mining algorithm for data stream is implemented using sliding window technique over a stream of data, data is represented in vertical bit sequence format. The advantage of proposed algorithm is that it requires single scan to discover all rare associations. The proposed algorithm outperforms both in terms of memory and time.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"28 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCT2.2014.7066696","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

The immense volumes of data is populated into repositories from various applications. More over data arrives into the repositories continuously i.e. stream of data that cannot be stored into repository due to its varying characteristics. Frequent itemset mining is thoroughly studied by many researchers but important rare items are not discovered by these algorithms. In many cases, the contradictions or exceptions also offers useful associations. In the recent past the researchers started to focus on the discovery of such kind of associations called rare associations. Rare itemsets can be obtained by setting low support but generates huge number of rules. The rare association rule mining is a challenging area of research on data streams. In this paper we proposed an idea to analyze the data stream to identify interesting rare association rules. Rare association rule mining is the process of identifying associations that are having low support but occurs with high confidence. The rare association rules are useful for many applications such as fraudulent credit card usage, anomaly detection in networks, detection of network failures, educational data, medical diagnosis etc. The proposed rare association rule mining algorithm for data stream is implemented using sliding window technique over a stream of data, data is represented in vertical bit sequence format. The advantage of proposed algorithm is that it requires single scan to discover all rare associations. The proposed algorithm outperforms both in terms of memory and time.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

数据流稀有关联规则挖掘

海量的数据被从不同的应用程序填充到存储库中。越来越多的数据连续地进入存储库，即由于其不同的特征而无法存储到存储库中的数据流。频繁项集挖掘已经被许多研究者进行了深入的研究，但这些算法并没有发现重要的稀有项。在许多情况下，矛盾或例外也提供了有用的关联。在最近的过去，研究人员开始专注于发现这种被称为罕见关联的关联。稀有道具集可以通过设置低支持而获得，但会生成大量规则。罕见的关联规则挖掘是数据流研究的一个具有挑战性的领域。本文提出了一种分析数据流以识别有趣的稀有关联规则的方法。稀有关联规则挖掘是识别支持度低但置信度高的关联的过程。罕见的关联规则在信用卡欺诈使用、网络异常检测、网络故障检测、教育数据、医疗诊断等应用中都很有用。提出的数据流稀有关联规则挖掘算法在数据流上使用滑动窗口技术实现，数据以垂直位序列格式表示。该算法的优点是只需一次扫描即可发现所有罕见关联。该算法在内存和时间方面都优于传统算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2021 RIVF International Conference on Computing and Communication Technologies (RIVF)

自引率

0.00%

发文量