{"title":"基于综合特征分析的网站资源评价算法","authors":"Baosheng Yin, Longlong Zhang, D. Pei, Yusheng Yan","doi":"10.1109/IUCC/DSCI/SmartCNS.2019.00082","DOIUrl":null,"url":null,"abstract":"Traditional web page sorting algorithms can only find the single web page that is the most relevant to keywords, but can not find the relevant website information source. For tackling the problem, we Propose a website information source evaluation algorithm based on comprehensive feature analysis. This algorithm first obtains multiple web pages corresponding to keywords through Baidu and other search engines, then obtains the contents of corresponding website information sources through crawler program and extracts the features, and finally obtains the sorting results of information sources of relevant websites by calculating relevancy combining BM25 algorithm and cosine distance. At the same time, combined with the implicit feedback behavior of users' browsing time, the sorting results could be dynamically adjusted to make the search results personalized. Experiment results show that this approach could make full use of web features, and improve the quality of web source evaluation algorithm by combining the semantic information of web content.","PeriodicalId":410905,"journal":{"name":"2019 IEEE International Conferences on Ubiquitous Computing & Communications (IUCC) and Data Science and Computational Intelligence (DSCI) and Smart Computing, Networking and Services (SmartCNS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Website Source Evaluation Algorithm Based on Comprehensive Feature Analysis\",\"authors\":\"Baosheng Yin, Longlong Zhang, D. Pei, Yusheng Yan\",\"doi\":\"10.1109/IUCC/DSCI/SmartCNS.2019.00082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional web page sorting algorithms can only find the single web page that is the most relevant to keywords, but can not find the relevant website information source. For tackling the problem, we Propose a website information source evaluation algorithm based on comprehensive feature analysis. This algorithm first obtains multiple web pages corresponding to keywords through Baidu and other search engines, then obtains the contents of corresponding website information sources through crawler program and extracts the features, and finally obtains the sorting results of information sources of relevant websites by calculating relevancy combining BM25 algorithm and cosine distance. At the same time, combined with the implicit feedback behavior of users' browsing time, the sorting results could be dynamically adjusted to make the search results personalized. Experiment results show that this approach could make full use of web features, and improve the quality of web source evaluation algorithm by combining the semantic information of web content.\",\"PeriodicalId\":410905,\"journal\":{\"name\":\"2019 IEEE International Conferences on Ubiquitous Computing & Communications (IUCC) and Data Science and Computational Intelligence (DSCI) and Smart Computing, Networking and Services (SmartCNS)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE International Conferences on Ubiquitous Computing & Communications (IUCC) and Data Science and Computational Intelligence (DSCI) and Smart Computing, Networking and Services (SmartCNS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IUCC/DSCI/SmartCNS.2019.00082\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conferences on Ubiquitous Computing & Communications (IUCC) and Data Science and Computational Intelligence (DSCI) and Smart Computing, Networking and Services (SmartCNS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IUCC/DSCI/SmartCNS.2019.00082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Website Source Evaluation Algorithm Based on Comprehensive Feature Analysis
Traditional web page sorting algorithms can only find the single web page that is the most relevant to keywords, but can not find the relevant website information source. For tackling the problem, we Propose a website information source evaluation algorithm based on comprehensive feature analysis. This algorithm first obtains multiple web pages corresponding to keywords through Baidu and other search engines, then obtains the contents of corresponding website information sources through crawler program and extracts the features, and finally obtains the sorting results of information sources of relevant websites by calculating relevancy combining BM25 algorithm and cosine distance. At the same time, combined with the implicit feedback behavior of users' browsing time, the sorting results could be dynamically adjusted to make the search results personalized. Experiment results show that this approach could make full use of web features, and improve the quality of web source evaluation algorithm by combining the semantic information of web content.