{"title":"DNS usage mining and its two applications","authors":"Jun Wu, Xiaodong Li, Xin Wang, Baoping Yan","doi":"10.1109/ICDIM.2011.6093364","DOIUrl":null,"url":null,"abstract":"The principal goal of DNS usage mining is the discovery and analysis of patterns in the query behavior of DNS users. In this paper, we develop a unified framework for DNS usage mining based on Clustering analysis of co-occurrence data derived from DNS server query data. Through transforming the raw query data into co-occurrence matrix, user transaction clustering can be applied to discover the user groups according to their similar query behaviors. Using the aggregate usage profile that represents a user cluster and suitable similarity measure, a specific approach for a domain name recommendation engine is shown. For identifying the latent purpose of a domain name, Probabilistic Latent Semantic Analysis (PLSA) is used, which can automatically discover hidden semantic relationships between users and domain names. We demonstrate the effectiveness of our approaches through experiments performed on real-world data sets.","PeriodicalId":355775,"journal":{"name":"2011 Sixth International Conference on Digital Information Management","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 Sixth International Conference on Digital Information Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDIM.2011.6093364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
The principal goal of DNS usage mining is the discovery and analysis of patterns in the query behavior of DNS users. In this paper, we develop a unified framework for DNS usage mining based on Clustering analysis of co-occurrence data derived from DNS server query data. Through transforming the raw query data into co-occurrence matrix, user transaction clustering can be applied to discover the user groups according to their similar query behaviors. Using the aggregate usage profile that represents a user cluster and suitable similarity measure, a specific approach for a domain name recommendation engine is shown. For identifying the latent purpose of a domain name, Probabilistic Latent Semantic Analysis (PLSA) is used, which can automatically discover hidden semantic relationships between users and domain names. We demonstrate the effectiveness of our approaches through experiments performed on real-world data sets.