Mining Web Log Sequential Patterns with Layer Coded Breadth-First Linked WAP-Tree

Lizhi Liu, Jun Liu
{"title":"Mining Web Log Sequential Patterns with Layer Coded Breadth-First Linked WAP-Tree","authors":"Lizhi Liu, Jun Liu","doi":"10.1109/ISME.2010.271","DOIUrl":null,"url":null,"abstract":"Sequential mining is the process of applying data mining techniques to a sequential database for the purposes of discovering the correlation relationships that exist among an ordered list of events. An important application of sequential mining techniques is web usage mining, for mining web log accesses, which the sequences of web page accesses made by different web users over a period of time, through a server, are recorded. Web access pattern tree (WAP-tree) mining is a sequential pattern mining technique for web log access sequences. This paper proposes a more efficient approach for using the BFWAP-tree to mine frequent sequences, which reflects ancestor-descendant relationship of nodes in BFWAP tree directly and efficiently. The proposed algorithm builds the frequent header node links of the original WAP-tree in a Breadth-First fashion and uses the layer code of each node to identify the ancestor-descendant relationships between nodes of the tree. It then, finds each frequent sequential pattern, through progressive Breadth-First sequence search, starting with its first Breadth-First subsequence event. Experiments show huge performance gain over the WAP-tree technique.","PeriodicalId":348878,"journal":{"name":"2010 International Conference of Information Science and Management Engineering","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference of Information Science and Management Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISME.2010.271","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

Sequential mining is the process of applying data mining techniques to a sequential database for the purposes of discovering the correlation relationships that exist among an ordered list of events. An important application of sequential mining techniques is web usage mining, for mining web log accesses, which the sequences of web page accesses made by different web users over a period of time, through a server, are recorded. Web access pattern tree (WAP-tree) mining is a sequential pattern mining technique for web log access sequences. This paper proposes a more efficient approach for using the BFWAP-tree to mine frequent sequences, which reflects ancestor-descendant relationship of nodes in BFWAP tree directly and efficiently. The proposed algorithm builds the frequent header node links of the original WAP-tree in a Breadth-First fashion and uses the layer code of each node to identify the ancestor-descendant relationships between nodes of the tree. It then, finds each frequent sequential pattern, through progressive Breadth-First sequence search, starting with its first Breadth-First subsequence event. Experiments show huge performance gain over the WAP-tree technique.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用层编码宽度优先链接wap树挖掘Web日志序列模式
顺序挖掘是将数据挖掘技术应用于顺序数据库的过程,目的是发现有序事件列表之间存在的相关关系。顺序挖掘技术的一个重要应用是web使用情况挖掘,即挖掘web日志访问,记录不同web用户在一段时间内通过服务器访问web页面的顺序。Web访问模式树(WAP-tree)挖掘是一种针对Web日志访问序列的顺序模式挖掘技术。本文提出了一种更有效的利用BFWAP树来挖掘频繁序列的方法,该方法直接有效地反映了BFWAP树中节点的祖先-后代关系。该算法以宽度优先的方式构建原始wap树的频繁头节点链接,并使用每个节点的层码来识别树中节点之间的祖先-后代关系。然后,它通过逐步的宽度优先序列搜索,从它的第一个宽度优先子序列事件开始,找到每个频繁的序列模式。实验表明,与wap树技术相比,性能有了巨大的提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Research on Construction Strategy of Enterprise Information Sharing in Supply Chain Mond-Weir Type Duality in Nondifferentiable Fractional Programming with Generalized Convexity A Bin-packing Model Based on File Preservation Problem Comprehensive Evaluation Based on Gray Relation Analysis for Information Security Management Measurement A Model on Customer Satisfaction Degree Evaluation of Third Party Logistics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1