{"title":"Scholarly article retrieval from Web of Science, Scopus and Dimensions: A comparative analysis of retrieval quality","authors":"Prashasti Singh, V. K. Singh, Rajesh Piryani","doi":"10.1177/01655515231191351","DOIUrl":null,"url":null,"abstract":"Scholarly databases are now being increasingly used for search and retrieval of research articles in different subject areas. Several previous studies have shown that different databases vary in their coverage of publication sources, and therefore, one may expect that for a given query, they may retrieve different results. However, how do these databases compare in terms of relevance of the retrieved results is relatively unexplored. This study, therefore, attempts to bridge this research gap by carrying out a systematic study of retrieval relevance of the three scholarly databases – Web of Science, Scopus and Dimensions. Five selected queries are used for this purpose. The retrieved results from the three databases for the given queries are first analysed in terms of volume of retrieved records, language of retrieved records, etc. Thereafter, a user-based annotation scheme is used to assess and compare the relevance of retrieved results. The standard measure of normalised discounted cumulative gain (NDCG) and Spearman rank correlation coefficient (SRCC) is computed for the purpose. Results indicate that although the number of retrieved results for the same query differs significantly in the three databases, the databases differ only marginally in retrieval relevance, with Web of Science having a slight edge over other two.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":" ","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1177/01655515231191351","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Scholarly databases are now being increasingly used for search and retrieval of research articles in different subject areas. Several previous studies have shown that different databases vary in their coverage of publication sources, and therefore, one may expect that for a given query, they may retrieve different results. However, how do these databases compare in terms of relevance of the retrieved results is relatively unexplored. This study, therefore, attempts to bridge this research gap by carrying out a systematic study of retrieval relevance of the three scholarly databases – Web of Science, Scopus and Dimensions. Five selected queries are used for this purpose. The retrieved results from the three databases for the given queries are first analysed in terms of volume of retrieved records, language of retrieved records, etc. Thereafter, a user-based annotation scheme is used to assess and compare the relevance of retrieved results. The standard measure of normalised discounted cumulative gain (NDCG) and Spearman rank correlation coefficient (SRCC) is computed for the purpose. Results indicate that although the number of retrieved results for the same query differs significantly in the three databases, the databases differ only marginally in retrieval relevance, with Web of Science having a slight edge over other two.
学术数据库现在越来越多地用于搜索和检索不同学科领域的研究论文。先前的一些研究表明,不同的数据库对出版物来源的覆盖范围各不相同,因此,对于给定的查询,它们可能会检索到不同的结果。然而,这些数据库在检索结果的相关性方面是如何比较的,这是相对未知的。因此,本研究试图通过对Web of Science、Scopus和Dimensions三个学术数据库的检索相关性进行系统研究来弥补这一研究缺口。为此选择了5个查询。首先根据检索记录的数量、检索记录的语言等对三个数据库中给定查询的检索结果进行分析。然后,使用基于用户的注释方案来评估和比较检索结果的相关性。为此,计算了标准化贴现累积增益(NDCG)和斯皮尔曼等级相关系数(SRCC)的标准度量。结果表明,虽然同一查询的检索结果数量在三个数据库中差异很大,但检索相关性差异很小,其中Web of Science比其他两个数据库稍微有优势。
期刊介绍:
The Journal of Information Science is a peer-reviewed international journal of high repute covering topics of interest to all those researching and working in the sciences of information and knowledge management. The Editors welcome material on any aspect of information science theory, policy, application or practice that will advance thinking in the field.