{"title":"Accelerating XML mining using graphic processors","authors":"S. Rathi, C. A. Dhote, Vivek Bangera","doi":"10.1109/ICCICCT.2014.6992945","DOIUrl":null,"url":null,"abstract":"Mining of association rules is an important research direction of data mining. Extensive use of XML on web makes it an interesting source for data extraction from large data sets. There is a growing demand for modern tools and technologies which can efficiently handle such large data. This paper proposes a collaborative approach to extract association rules from structured XML data with the help of cost effective and energy efficient Graphic Processors. The serial approach comprises of deserialization using XPath followed by parallel sorting. In the parallel model there is parallel deserialization of XML data with the help of graphic processor followed by sorting the converted XML data with the help of in-built multithreaded structure of GPU. An empirical performance study on synthetic data is given, demonstrating a remarkable speed increase on a GPU as compared with fully optimized CPU implementation.","PeriodicalId":6615,"journal":{"name":"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)","volume":"51 1","pages":"144-148"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCICCT.2014.6992945","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Mining of association rules is an important research direction of data mining. Extensive use of XML on web makes it an interesting source for data extraction from large data sets. There is a growing demand for modern tools and technologies which can efficiently handle such large data. This paper proposes a collaborative approach to extract association rules from structured XML data with the help of cost effective and energy efficient Graphic Processors. The serial approach comprises of deserialization using XPath followed by parallel sorting. In the parallel model there is parallel deserialization of XML data with the help of graphic processor followed by sorting the converted XML data with the help of in-built multithreaded structure of GPU. An empirical performance study on synthetic data is given, demonstrating a remarkable speed increase on a GPU as compared with fully optimized CPU implementation.