{"title":"A Method of Field Recognizing Based on Association Strength","authors":"Yang Liu, Lingyu Xu, Jie Yu, Yunlan Xue, Han Dong","doi":"10.1109/DASC.2013.90","DOIUrl":null,"url":null,"abstract":"The knowledge on the internet is uncertain, quickly updated and from multi-sources. The information we extract from a field in webpages is usually one-sided and contains incorrect data. This paper proposed an algorithm to extract data from internet web pages or text documents by using the association strength. The algorithm combines knowledge extraction method of the text mining with the technology of the intelligence analysis to the data. It uses ontology theory to describe the knowledge, and automatically extract the knowledge from the web pages or text documents which is returned by the search engine.","PeriodicalId":179557,"journal":{"name":"2013 IEEE 11th International Conference on Dependable, Autonomic and Secure Computing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE 11th International Conference on Dependable, Autonomic and Secure Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DASC.2013.90","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The knowledge on the internet is uncertain, quickly updated and from multi-sources. The information we extract from a field in webpages is usually one-sided and contains incorrect data. This paper proposed an algorithm to extract data from internet web pages or text documents by using the association strength. The algorithm combines knowledge extraction method of the text mining with the technology of the intelligence analysis to the data. It uses ontology theory to describe the knowledge, and automatically extract the knowledge from the web pages or text documents which is returned by the search engine.