{"title":"Efficiency and Consistency Study on Carma","authors":"Yuan Huang, Xing Wang, B. Shia","doi":"10.1109/NCM.2009.241","DOIUrl":null,"url":null,"abstract":"Carma is a type of online association algorithm, designed to facilitate association rule with online data flow and successively changing support thresholds. In this paper we study the factors that contribute to the efficiency of Carma and how data flow distribution give effects on the performance of Carma. We design several experiments with two kinds of data. In fixed support threshold situations, we compare Carma with that of Apriori. We find the sets generated by Carma are subsets of those generated by Apriori. We find that if the support threshold is reasonably defined, these two algorithms reach the same results. On the other hand, as the support threshold increases, Phase Ι generates less items and the number of deleted sets from Phase II first increases and then declines. Carma behaves consistently towards changing support. We notice the earlier the items enter into a lattice, the more accurate the estimations are. If base stone elements show up early in the transaction, the performance of Phase II is mainly influenced by the late-entered item sets. Based on the discussion with Carma, we propose a new procedure to improve Carma. Simulations reveal that the modified algorithm works well.","PeriodicalId":119669,"journal":{"name":"2009 Fifth International Joint Conference on INC, IMS and IDC","volume":"148 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Fifth International Joint Conference on INC, IMS and IDC","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCM.2009.241","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Carma is a type of online association algorithm, designed to facilitate association rule with online data flow and successively changing support thresholds. In this paper we study the factors that contribute to the efficiency of Carma and how data flow distribution give effects on the performance of Carma. We design several experiments with two kinds of data. In fixed support threshold situations, we compare Carma with that of Apriori. We find the sets generated by Carma are subsets of those generated by Apriori. We find that if the support threshold is reasonably defined, these two algorithms reach the same results. On the other hand, as the support threshold increases, Phase Ι generates less items and the number of deleted sets from Phase II first increases and then declines. Carma behaves consistently towards changing support. We notice the earlier the items enter into a lattice, the more accurate the estimations are. If base stone elements show up early in the transaction, the performance of Phase II is mainly influenced by the late-entered item sets. Based on the discussion with Carma, we propose a new procedure to improve Carma. Simulations reveal that the modified algorithm works well.