{"title":"Decision Support for Indexing and Retrieval of Information in Hypertext Systems","authors":"Wenlie Zhu, M. Lehto","doi":"10.1207/S15327590IJHC1104_5","DOIUrl":null,"url":null,"abstract":"This study introduces and evaluates the performance of two statistical models intended to support the automatic creation of a subject-based index containing links to hypertext documents. The fuzzy Bayes model makes strong dependence assumptions, and only considers the strongest evidence presented by single words occurring in a document, whereas the classic Bayes model makes strong independence assumptions and attempts to aggregate all the evidence. The links or index terms suggested by both indexing models overlapped greatly with those assigned by a human indexer. However, the probabilities calculated using the classic Bayes model were unstable because of data sparseness and severe violations of the independence assumptions. Subsequent analysis therefore focused on the fuzzy Bayes model. The latter analysis revealed that human experts rated index terms suggested by the model significantly higher than randomly selected index terms. When the index terms assigned by the fuzzy Bayes model were implemented as ...","PeriodicalId":208962,"journal":{"name":"Int. J. Hum. Comput. Interact.","volume":"88 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Hum. Comput. Interact.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1207/S15327590IJHC1104_5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
This study introduces and evaluates the performance of two statistical models intended to support the automatic creation of a subject-based index containing links to hypertext documents. The fuzzy Bayes model makes strong dependence assumptions, and only considers the strongest evidence presented by single words occurring in a document, whereas the classic Bayes model makes strong independence assumptions and attempts to aggregate all the evidence. The links or index terms suggested by both indexing models overlapped greatly with those assigned by a human indexer. However, the probabilities calculated using the classic Bayes model were unstable because of data sparseness and severe violations of the independence assumptions. Subsequent analysis therefore focused on the fuzzy Bayes model. The latter analysis revealed that human experts rated index terms suggested by the model significantly higher than randomly selected index terms. When the index terms assigned by the fuzzy Bayes model were implemented as ...