CLoPAR: Classification based on Predictive Association Rules

M. N. Dehkordi, M. H. Shenassa
{"title":"CLoPAR: Classification based on Predictive Association Rules","authors":"M. N. Dehkordi, M. H. Shenassa","doi":"10.1109/IS.2006.348467","DOIUrl":null,"url":null,"abstract":"Recent studies in data mining have proposed a new classification approach, called associative classification, which, according to several reports, such as Liu, B. et al (1998), achieves higher classification accuracy than traditional classification approaches such as C4.S However, the approach also suffers from two major deficiencies: (1) it generates a very large number of association rules, which leads to high processing overhead; and (2) its confidence-based rule evaluation measure may lead to overfitting. In comparison with associative classification, traditional rule-based classifiers, such as C4.5, FOIL and RIPPER, are substantially faster but their accuracy, in most cases, may not be as high. In this paper, we propose a new classification approach, CLoPAR (Classification based on Predictive Association Rules), which combines the advantages of both associative classification and traditional rule-based classification. Instead of generating a large number of candidate rules as in associative classification, CLoPAR adopts a greedy algorithm to generate rules directly from training data. Moreover, CLoPAR generates and tests more rules than traditional rule-based classifiers to avoid missing important rules. To avoid overfitting, CLoPAR uses expected accuracy to evaluate each rule and uses the best k rules in prediction","PeriodicalId":116809,"journal":{"name":"2006 3rd International IEEE Conference Intelligent Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 3rd International IEEE Conference Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IS.2006.348467","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

Recent studies in data mining have proposed a new classification approach, called associative classification, which, according to several reports, such as Liu, B. et al (1998), achieves higher classification accuracy than traditional classification approaches such as C4.S However, the approach also suffers from two major deficiencies: (1) it generates a very large number of association rules, which leads to high processing overhead; and (2) its confidence-based rule evaluation measure may lead to overfitting. In comparison with associative classification, traditional rule-based classifiers, such as C4.5, FOIL and RIPPER, are substantially faster but their accuracy, in most cases, may not be as high. In this paper, we propose a new classification approach, CLoPAR (Classification based on Predictive Association Rules), which combines the advantages of both associative classification and traditional rule-based classification. Instead of generating a large number of candidate rules as in associative classification, CLoPAR adopts a greedy algorithm to generate rules directly from training data. Moreover, CLoPAR generates and tests more rules than traditional rule-based classifiers to avoid missing important rules. To avoid overfitting, CLoPAR uses expected accuracy to evaluate each rule and uses the best k rules in prediction
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
CLoPAR:基于预测关联规则的分类
最近的数据挖掘研究提出了一种新的分类方法,称为关联分类,根据一些报道,如Liu, B. et al(1998),它比传统的分类方法(如C4)实现了更高的分类精度。然而,该方法也存在两个主要缺陷:(1)生成大量关联规则,导致处理开销高;(2)基于置信度的规则评价方法可能导致过拟合。与关联分类相比,传统的基于规则的分类器,如C4.5、FOIL和RIPPER,速度要快得多,但在大多数情况下,它们的准确率可能没有那么高。本文提出了一种新的基于预测关联规则的分类方法CLoPAR (classification based on Predictive Association Rules),它结合了关联分类和传统基于规则的分类的优点。CLoPAR不像关联分类那样生成大量的候选规则,而是采用贪心算法直接从训练数据中生成规则。此外,与传统的基于规则的分类器相比,CLoPAR生成和测试的规则更多,从而避免遗漏重要的规则。为了避免过拟合,CLoPAR使用预期精度来评估每个规则,并在预测中使用最佳的k条规则
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Neurofuzzy Adaptive Kalman Filter Artificial Intelligence Technique for Gene Expression Profiling of Urinary Bladder Cancer Evolutionary Support Vector Machines for Diabetes Mellitus Diagnosis IGUANA: Individuation of Global Unsafe ANomalies and Alarm activation Smart Data Analysis Services
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1