{"title":"基于web日志数据的频繁模式挖掘算法的性能评价","authors":"Yonas Gashaw, Fang Liu","doi":"10.1109/CISP-BMEI.2017.8302317","DOIUrl":null,"url":null,"abstract":"In today's information era, the Internet is a powerful platform as the data repository that plays a great role in storing, sharing, and retrieve information for knowledge discovery. However, as there are countless, dynamic, and significant growth of data, web users face big problems in terms of the relevant information required. Consequently, poor information precision and retrieval are part of the hottest recent research areas in today's world. Despite the voluminous of information resided on the web, valuable informative knowledge could possibly be discovered with the application of advanced data mining techniques. Association rule mining, as a technique in data mining, is one way to discover frequent patterns from various data sources. In this paper, three of the foremost association rule mining algorithms used for frequent pattern discovering namely, Eclat, Apriori, and FP-Growth examined on three sets of transactional databases devised from server access log file. The comparison is made both in execution time and memory usage aspects. Unlike most previous research works, findings, in this paper, reveal that each of the algorithms has their own appropriateness and specificities that can best fit depending on the data size and support parameter thresholds.","PeriodicalId":6474,"journal":{"name":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"43 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Performance evaluation of frequent pattern mining algorithms using web log data for web usage mining\",\"authors\":\"Yonas Gashaw, Fang Liu\",\"doi\":\"10.1109/CISP-BMEI.2017.8302317\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In today's information era, the Internet is a powerful platform as the data repository that plays a great role in storing, sharing, and retrieve information for knowledge discovery. However, as there are countless, dynamic, and significant growth of data, web users face big problems in terms of the relevant information required. Consequently, poor information precision and retrieval are part of the hottest recent research areas in today's world. Despite the voluminous of information resided on the web, valuable informative knowledge could possibly be discovered with the application of advanced data mining techniques. Association rule mining, as a technique in data mining, is one way to discover frequent patterns from various data sources. In this paper, three of the foremost association rule mining algorithms used for frequent pattern discovering namely, Eclat, Apriori, and FP-Growth examined on three sets of transactional databases devised from server access log file. The comparison is made both in execution time and memory usage aspects. Unlike most previous research works, findings, in this paper, reveal that each of the algorithms has their own appropriateness and specificities that can best fit depending on the data size and support parameter thresholds.\",\"PeriodicalId\":6474,\"journal\":{\"name\":\"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)\",\"volume\":\"43 1\",\"pages\":\"1-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CISP-BMEI.2017.8302317\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP-BMEI.2017.8302317","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance evaluation of frequent pattern mining algorithms using web log data for web usage mining
In today's information era, the Internet is a powerful platform as the data repository that plays a great role in storing, sharing, and retrieve information for knowledge discovery. However, as there are countless, dynamic, and significant growth of data, web users face big problems in terms of the relevant information required. Consequently, poor information precision and retrieval are part of the hottest recent research areas in today's world. Despite the voluminous of information resided on the web, valuable informative knowledge could possibly be discovered with the application of advanced data mining techniques. Association rule mining, as a technique in data mining, is one way to discover frequent patterns from various data sources. In this paper, three of the foremost association rule mining algorithms used for frequent pattern discovering namely, Eclat, Apriori, and FP-Growth examined on three sets of transactional databases devised from server access log file. The comparison is made both in execution time and memory usage aspects. Unlike most previous research works, findings, in this paper, reveal that each of the algorithms has their own appropriateness and specificities that can best fit depending on the data size and support parameter thresholds.