{"title":"A Parameter-Adjustable Estimating Method for Change Frequency of Web Pages","authors":"Shuchen Tan, Xuan Zhang","doi":"10.1109/ICEE.2010.352","DOIUrl":null,"url":null,"abstract":"Refreshing on-line information in time is a main task of incremental crawling, so it is very important to predict the change frequency of web pages. We model the change of page as a Poisson process in this paper. And based on it, we propose a parameter-adjustable algorithm after considering the unbiasedness, efficiency and consistency comprehensively. This algorithm can adjust the parameters in order to estimate the change frequency more effective.","PeriodicalId":420284,"journal":{"name":"2010 International Conference on E-Business and E-Government","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on E-Business and E-Government","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEE.2010.352","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Refreshing on-line information in time is a main task of incremental crawling, so it is very important to predict the change frequency of web pages. We model the change of page as a Poisson process in this paper. And based on it, we propose a parameter-adjustable algorithm after considering the unbiasedness, efficiency and consistency comprehensively. This algorithm can adjust the parameters in order to estimate the change frequency more effective.