{"title":"基于广义隐马尔可夫模型的Web内容函数检测","authors":"Jinlin Chen, Ping Zhong, Terry Cook","doi":"10.1109/ICMLA.2006.21","DOIUrl":null,"url":null,"abstract":"Web content function indicates authors' intension towards the purpose of the content and therefore plays an important role for Web information processing. In this paper we propose a generalized hidden Markov model which extends traditional hidden Markov model for Web content function detection. By incorporating multiple emission features and detecting state transition sequence based on layout structure, generalized hidden Markov model can effectively make use of Web-specific information and achieve better performance comparing to traditional hidden Markov model. Comparing to previous approaches on function detection, our approach has the advantages of domain-independency and extensibility for other applications. Experiments show promising results with our approach","PeriodicalId":297071,"journal":{"name":"2006 5th International Conference on Machine Learning and Applications (ICMLA'06)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Detecting Web Content Function Using Generalized Hidden Markov Model\",\"authors\":\"Jinlin Chen, Ping Zhong, Terry Cook\",\"doi\":\"10.1109/ICMLA.2006.21\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Web content function indicates authors' intension towards the purpose of the content and therefore plays an important role for Web information processing. In this paper we propose a generalized hidden Markov model which extends traditional hidden Markov model for Web content function detection. By incorporating multiple emission features and detecting state transition sequence based on layout structure, generalized hidden Markov model can effectively make use of Web-specific information and achieve better performance comparing to traditional hidden Markov model. Comparing to previous approaches on function detection, our approach has the advantages of domain-independency and extensibility for other applications. Experiments show promising results with our approach\",\"PeriodicalId\":297071,\"journal\":{\"name\":\"2006 5th International Conference on Machine Learning and Applications (ICMLA'06)\",\"volume\":\"54 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-12-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 5th International Conference on Machine Learning and Applications (ICMLA'06)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA.2006.21\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 5th International Conference on Machine Learning and Applications (ICMLA'06)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2006.21","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Detecting Web Content Function Using Generalized Hidden Markov Model
Web content function indicates authors' intension towards the purpose of the content and therefore plays an important role for Web information processing. In this paper we propose a generalized hidden Markov model which extends traditional hidden Markov model for Web content function detection. By incorporating multiple emission features and detecting state transition sequence based on layout structure, generalized hidden Markov model can effectively make use of Web-specific information and achieve better performance comparing to traditional hidden Markov model. Comparing to previous approaches on function detection, our approach has the advantages of domain-independency and extensibility for other applications. Experiments show promising results with our approach