{"title":"A Mediator Exploiting Approach for Mining Indirect Associations from Web Data Streams","authors":"Wen-Yang Lin, Yi-Ching Chen","doi":"10.1109/IBICA.2011.50","DOIUrl":null,"url":null,"abstract":"Recently, the concept of indirect associations, a new type of infrequent patterns that indirectly connect two rarely co-occurred items via a frequent item set called ¡§mediator¡¨, has been shown its power in capturing interesting information over web usage data. Most contemporary indirect association mining algorithms are developed for static dataset. Our previous work has proposed an algorithm, MIA-LM, tailored to streaming data. In this paper, we propose a new efficient algorithm, namely EMIA-LM, for mining indirect associations over web data streams. EMIA-LM employs a mediator-exploiting search strategy, which reduce the search space as well as computation cost for generating indirect associations. Besides, EMIA-LM adopts a compact data structure, alleviating unnecessary data transforming processes and consuming far less memory storage. Preliminary experiments conducted on real Web streaming datasets show that EMIA-LM is superior to the leading HI-mine* algorithm for static data and MIA-LM both in computation speed and memory consumption.","PeriodicalId":158080,"journal":{"name":"2011 Second International Conference on Innovations in Bio-inspired Computing and Applications","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 Second International Conference on Innovations in Bio-inspired Computing and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBICA.2011.50","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Recently, the concept of indirect associations, a new type of infrequent patterns that indirectly connect two rarely co-occurred items via a frequent item set called ¡§mediator¡¨, has been shown its power in capturing interesting information over web usage data. Most contemporary indirect association mining algorithms are developed for static dataset. Our previous work has proposed an algorithm, MIA-LM, tailored to streaming data. In this paper, we propose a new efficient algorithm, namely EMIA-LM, for mining indirect associations over web data streams. EMIA-LM employs a mediator-exploiting search strategy, which reduce the search space as well as computation cost for generating indirect associations. Besides, EMIA-LM adopts a compact data structure, alleviating unnecessary data transforming processes and consuming far less memory storage. Preliminary experiments conducted on real Web streaming datasets show that EMIA-LM is superior to the leading HI-mine* algorithm for static data and MIA-LM both in computation speed and memory consumption.