{"title":"Enhancing search engines through utilization of visually emphasized terms","authors":"H.L. Larsen","doi":"10.1109/NAFIPS.2002.1018117","DOIUrl":null,"url":null,"abstract":"We present an approach to weighted indexing of documents in information retrieval systems and search engines utilizing visual emphasizing applied in the document texts. The significance of a term in characterizing the topic of a document depends both on the number of occurrences of the term in the page, and on the amount of visual emphasizing applied in the occurrences. We argue that the document discrimination degree of a term, as measured by the inverse document frequency, should be applied as the default importance of the term in a query. The approach was evaluated using a real world case set showing good performance and sensitivity to parameters as expected.","PeriodicalId":348314,"journal":{"name":"2002 Annual Meeting of the North American Fuzzy Information Processing Society Proceedings. NAFIPS-FLINT 2002 (Cat. No. 02TH8622)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2002 Annual Meeting of the North American Fuzzy Information Processing Society Proceedings. NAFIPS-FLINT 2002 (Cat. No. 02TH8622)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NAFIPS.2002.1018117","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We present an approach to weighted indexing of documents in information retrieval systems and search engines utilizing visual emphasizing applied in the document texts. The significance of a term in characterizing the topic of a document depends both on the number of occurrences of the term in the page, and on the amount of visual emphasizing applied in the occurrences. We argue that the document discrimination degree of a term, as measured by the inverse document frequency, should be applied as the default importance of the term in a query. The approach was evaluated using a real world case set showing good performance and sensitivity to parameters as expected.