Further experiments with hierarchic clustering in document retrieval
C.J. van Rijsbergen
Information Storage and Retrieval, vol. 10, no. 1, pp. 1-14, 1974
DOI: 10.1016/0020-0271(74)90038-2
https://www.sciencedirect.com/science/article/pii/0020027174900382
Citation count: 44
Abstract
The purpose of this paper is to report the results of experiments in document clustering using three well-known test collections. Automatic classification is briefly introduced. The hypothesis underlying the use of clustering is discussed. A framework for the evaluation of cluster-based retrieval strategies is constructed. These strategies are shown to be dependent on the method of cluster representation (cluster profile) adopted. Finally, a particular cluster-based strategy together with a cluster representation method associated with it is examined and evaluated in detail.