
Journal of data and information science (Warsaw, Poland): Latest Articles

A Tailor-made Data Quality Approach for Higher Educational Data
Pub Date : 2020-07-09 DOI: 10.2478/jdis-2020-0029
C. Daraio, R. Bruni, G. Catalano, Alessandro Daraio, G. Matteucci, M. Scannapieco, Daniel Wagner-Schuster, B. Lepori
Abstract
Purpose: This paper addresses the definition of data quality procedures for knowledge organizations such as Higher Education Institutions. The main purpose is to present the flexible approach developed for monitoring the data quality of the European Tertiary Education Register (ETER) database, illustrating how it works and highlighting the main challenges that remain in this domain.
Design/methodology/approach: The proposed data quality methodology is based on two kinds of checks, one to assess the consistency of cross-sectional data and the other to evaluate the stability of multiannual data. This methodology has an operational and empirical orientation: the proposed checks do not assume any theoretical distribution when determining the threshold parameters that identify potential outliers, inconsistencies, and errors in the data.
Findings: We show that the proposed cross-sectional and multiannual checks help to identify outliers and extreme observations and to detect ontological inconsistencies not described in the available metadata. For this reason, they may be a useful complement to the processing of the available information.
Research limitations: The coverage of the study is limited to European Higher Education Institutions. The cross-sectional and multiannual checks are not yet completely integrated.
Practical implications: Considering the quality of the available data and information is important for data quality-aware empirical investigations, as it highlights problems and areas in which to invest to improve the coverage and interoperability of data in future data collection initiatives.
Originality/value: The data-driven quality checks proposed in this paper may be useful as a reference for building and monitoring the data quality of new databases, or of existing databases available for other countries or systems characterized by high heterogeneity and complexity of the units of analysis, without relying on pre-specified theoretical distributions.
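The abstract does not spell out how the thresholds are derived, so the following is only a minimal sketch of what a distribution-free multiannual stability check could look like. It assumes (hypothetically) that each institution has a yearly series for one variable and that year-on-year ratios outside empirical percentile bounds are flagged; the function name, the 5th/95th percentile defaults, and the data layout are illustrative choices, not the ETER procedure.

```python
from statistics import quantiles


def flag_unstable(series_by_unit, lower_pct=5, upper_pct=95):
    """Flag year-on-year ratios outside empirical percentile bounds.

    series_by_unit maps a unit id to {year: value}. The bounds come from
    the observed ratios themselves, so no theoretical distribution is
    assumed. Hypothetical helper, not the method used for ETER.
    """
    ratios = []  # (unit, year, ratio to the previous available year)
    for unit, series in series_by_unit.items():
        years = sorted(series)
        for prev, curr in zip(years, years[1:]):
            if series[prev] > 0:
                ratios.append((unit, curr, series[curr] / series[prev]))

    values = sorted(r for _, _, r in ratios)
    # 99 cut points; cuts[k - 1] is the k-th empirical percentile.
    cuts = quantiles(values, n=100, method="inclusive")
    low, high = cuts[lower_pct - 1], cuts[upper_pct - 1]
    return [(u, y, r) for u, y, r in ratios if r < low or r > high]
```

Flagged values are candidates for manual review rather than confirmed errors, which matches the abstract's wording of "potential" outliers and inconsistencies.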
Volume: 5 1 Pages: 129 - 160 Publication type: Journal Article
Citations: 9
Scientometric Implosion that Leads to Explosion: Case Study of Armenian Journals
Pub Date : 2020-07-03 DOI: 10.2478/jdis-2020-0028
Shushanik A. Sargsyan, E. Gzoyan, A. Mirzoyan, V. Blaginin
Abstract
Purpose: The purpose of this study is to introduce a new concept and term, scientometric implosion, into scientometric discourse and research, and to test the idea on the example of Armenian journals. The article argues that the existence of a compressed scientific area in the country puts pressure on the journals, and that after some time this pressure makes one or several journals explode: they break out of the limited national scientific area and move to the international arena. As soon as one of the local journals breaks through this compressed space and appears at an international level, a further explosion happens, which makes the other journals follow the same path.
Design/methodology/approach: Our research is based on three international scientific databases (WoS, Scopus, and RISC CC), from which we retrieved information about the Armenian journals indexed there and the citations received by those journals, and on one national database, the Armenian Science Citation Index. The Armenian Journal Impact Factor (ArmJIF) was calculated for the local Armenian journals based on the general impact factor formula. Journals were classified according to Glänzel and Schubert (2003).
Findings: Our results show that the science policy developed by the scientific authorities of Armenia and the introduction of ArmJIF have made the Armenian journals comply with international standards and have led some local journals to break out of the national scientific territory and be indexed in the international scientific databases RISC, Scopus, and WoS. Apart from complying with technical requirements, the journals have also started publishing articles in foreign languages. Although nearly half of the local journals are in the fields of social sciences and humanities, only one journal from those fields is indexed in international scientific databases.
Research limitations: One limitation of the study is that it was performed on the example of only one state; a second is that more time needs to pass before the results can be firmly evaluated. However, the introduction of the concept can inspire other similar case studies.
Practical implications: The new term and the related model offered in the article can be used in practice for the development of national journals.
Originality/value: The article proposes a new term and a concept in scientometrics.
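The abstract says ArmJIF follows "the general impact factor formula" without restating it. As a rough illustration, the sketch below implements the familiar journal impact factor ratio: citations received in a given year to items from the preceding years, divided by the number of items published in those years. The two-year window and the function and parameter names are assumptions; the abstract does not say which window ArmJIF uses.

```python
def impact_factor(citations_by_cited_year, items_by_year, year, window=2):
    """Impact factor of a journal for `year`.

    citations_by_cited_year: {publication year of the cited item: number of
        citations received during `year`}.
    items_by_year: {publication year: number of citable items}.
    window: number of preceding years counted (2 is the classic choice and
        an assumption here, not a documented ArmJIF setting).
    """
    prior_years = range(year - window, year)
    cites = sum(citations_by_cited_year.get(y, 0) for y in prior_years)
    items = sum(items_by_year.get(y, 0) for y in prior_years)
    if items == 0:
        return None  # undefined when nothing was published in the window
    return cites / items
```

For example, 50 citations in 2019 to 100 items published in 2017 and 2018 would give an impact factor of 0.5 for 2019.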
Volume: 5 1 Pages: 187 - 196 Publication type: Journal Article
Citations: 2
The Gender Patenting Gap: A Study on the Iberoamerican Countries
Pub Date : 2020-07-03 DOI: 10.2478/jdis-2020-0025
Danilo S. Carvalho, Lydia Bares, Kelyane Silva
Abstract
Purpose: This work presents a study of female involvement in patent applications in all 23 Ibero-American countries that are WIPO members, in order to measure gender inequalities in institutional collaborations and technological fields over time.
Design/methodology/approach: The data used in this paper come from the EPO Worldwide Patent Statistical Database (PATSTAT). PATSTAT contains bibliographical data relating to more than 100 million patent documents from leading industrialized and developing countries, as well as legal event data from more than 40 patent authorities contained in the EPO worldwide legal event data (INPADOC). The extracted subset is composed of 150,863 patent applications with priority years between 2007 and 2016.
Findings: Our observations indicate that even in more dynamic economies such as Portugal and Spain, the participation of women per patent application does not exceed 30%. Additionally, the distribution of female participation among institutional sectors and technological fields is consistent with previous studies in other regions and indicates a socio-cultural divide.
Research limitations: Unisex names were not considered and were counted as gender unknown, and patent applications for which no inventor information was available were discarded; further data analysis may provide more information about gender inequalities.
Practical implications: While patents are imperfect measures of inventive step and should therefore be considered a proxy variable for innovation, our findings may help to guide the implementation of policies for balancing gender participation in innovative activities, as well as instigating research into the issues causing divisive participation along gender lines.
Originality/value: While there is a widespread effort to evaluate and improve the participation of groups recognized as minorities in state-of-the-art activities, research on women's participation in the innovation sector is fragmented due to differing regional characteristics: industrial and academic segmentation, socio-economic disparities, and cultural factors. Thus, localized studies present an opportunity to fill the gaps in knowledge on societal participation in innovation activities.
Volume: 5 1 Pages: 116 - 128 Publication type: Journal Article
Citations: 2
Acknowledgment of Libraries in the Journal Literature: An Exploratory Study
Pub Date : 2020-07-03 DOI: 10.2478/jdis-2020-0023
David E. Hubbard, Sierra Laddusaw
Abstract
Purpose: This study examines acknowledgments to libraries in the journal literature, as well as the efficacy of using Web of Science (WoS) to locate general acknowledgment text.
Design/methodology/approach: This mixed-methods approach quantifies and characterizes acknowledgments to libraries in the journal literature. Using WoS's Funding Text field, the acknowledgments for six peer universities were identified and then characterized. The efficacy of using WoS to locate library acknowledgments was assessed by comparing the WoS Funding Text search results to the actual acknowledgment text found in the articles.
Findings: Acknowledgments to libraries were found in articles at all six peer universities, though the absolute and relative numbers were quite low (< 0.5%). Most of the library acknowledgments were for resources (collections, funding, etc.), and many were concentrated in natural history (e.g. zoology). Examination of Texas A&M University zoology articles found that 91.7% of the funding information came from "acknowledgments" and not specifically from a funding acknowledgment section. The WoS Funding Text search found 56% of the library acknowledgments located by searching the actual acknowledgment text in the articles.
Research limitations: The study was limited to journal publications, used a single truncated search term, and included only six research universities in the United States.
Practical implications: This study examined library acknowledgments, but the same approach could be applied to searches of other keywords, institutions/organizations, individuals, etc. While not specifically designed to search general acknowledgments, WoS's Funding Text field can be used as an exploratory tool to search acknowledgments beyond funding.
Originality/value: A few studies have examined library acknowledgments in the scholarly literature, though to date none of them has examined the efficacy of using the WoS Funding Text field to locate those library acknowledgments within the journal literature.
Volume: 5 1 Pages: 178 - 186 Publication type: Journal Article
Citations: 5
Library and Information Science Papers Discussed on Twitter: A new Network-based Approach for Measuring Public Attention
Pub Date : 2020-07-03 DOI: 10.2478/jdis-2020-0017
R. Haunschild, L. Leydesdorff, L. Bornmann
Abstract
Purpose: In recent years, research evaluation has shown a trend towards measuring the impact of research on society, or the attention that society (beyond science) pays to research. We address the following question: can Twitter be meaningfully used for the mapping of public and scientific discourses?
Design/methodology/approach: Recently, Haunschild et al. (2019) introduced a new network-oriented approach for using Twitter data in research evaluation. Such a procedure can be used to measure the public discussion around a specific field or topic. In this study, we used all papers published in the Web of Science (WoS, Clarivate Analytics) subject category Information Science & Library Science to explore the publicly discussed topics from the area of library and information science (LIS) in comparison to the topics used by scholars in their publications in this area.
Findings: The results show that LIS papers are represented rather well on Twitter. Similar topics appear in the networks of author keywords of all LIS papers, non-tweeted LIS papers, and tweeted LIS papers. The networks of the author keywords of all LIS papers and of non-tweeted LIS papers are most similar to each other.
Research limitations: Only papers published since 2011 with a DOI were analyzed.
Practical implications: Although Twitter data do not seem to be useful for quantitative research evaluation, they can apparently be used in a more qualitative way for mapping public and scientific discourses.
Originality/value: This study explores a rather new methodology for comparing public and scientific discourses.
Volume: 5 1 Pages: 17 - 5 Publication type: Journal Article
Citations: 11
Co-occurrence of Cell Lines, Basal Media and Supplementation in the Biomedical Research Literature
Pub Date : 2020-07-03 DOI: 10.2478/jdis-2020-0016
Jessica Cox, Darin McBeath, Corey A. Harper, Ron Daniel
Abstract
Purpose: The use of in vitro cell culture and experimentation is a cornerstone of biomedical research; however, more attention has recently been given to the potential consequences of using such artificial basal media and undefined supplements. As a first step towards better understanding and measuring the impact these systems have on experimental results, we use text mining to capture typical research practices and trends around cell culture.
Design/methodology/approach: To measure the scale of in vitro cell culture use, we analyzed a corpus of 94,695 research articles that appeared in biomedical research journals published in ScienceDirect from 2000 to 2018. Central to our investigation is the observation that studies using cell culture describe conditions using a typical sentence structure of cell line, basal media, and supplemented compounds. We tag our corpus with a curated list of basal media and the Cellosaurus ontology using the Aho-Corasick algorithm. We also processed the corpus with Stanford CoreNLP to find nouns that follow the basal media, in an attempt to identify the supplements used.
Findings: Interestingly, we find that researchers frequently use DMEM even if a cell line's vendor recommends less concentrated media. We see long-tailed distributions for the usage of media and cell lines, with DMEM and RPMI dominating the media, and HEK293, HEK293T, and HeLa dominating the cell lines used.
Research limitations: Our analysis was restricted to documents in ScienceDirect, and our text mining method achieved high recall but low precision and required manual inspection of many tokens.
Practical implications: Our findings document current cell culture practices in the biomedical research community, which can be used as a resource for future experimental design.
Originality/value: No other work has taken a text mining approach to surveying cell culture practices in biomedical research.
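To make the dictionary-tagging step more concrete, here is a small, self-contained sketch of the Aho-Corasick algorithm, which finds all occurrences of many terms in a single pass over the text. It is an illustration only: the term list below is taken from the names mentioned in the abstract, matching is exact and case-sensitive, and the function names are hypothetical rather than the authors' pipeline.

```python
from collections import deque


def build_automaton(terms):
    """Build an Aho-Corasick automaton (trie plus failure links)."""
    goto = [{}]   # goto[state][char] -> next state
    fail = [0]    # failure link for each state
    out = [[]]    # terms recognised when a state is reached
    for term in terms:
        state = 0
        for ch in term:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        out[state].append(term)

    # Breadth-first pass: depth-1 states keep failure link 0.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            # Inherit terms that end at the longest proper suffix state.
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out


def find_terms(text, automaton):
    """Return (start_index, term) for every dictionary hit in text."""
    goto, fail, out = automaton
    state = 0
    hits = []
    for i, ch in enumerate(text):
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for term in out[state]:
            hits.append((i - len(term) + 1, term))
    return hits
```

Note that a sentence mentioning "HEK293T" also yields a hit for "HEK293", since both are dictionary entries. Overlapping and substring matches of this kind are one plausible reason a pure dictionary tagger can show high recall but low precision and need manual review.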
Volume: 5 1 Pages: 161 - 177 Publication type: Journal Article
Citations: 3
The Compound F2-index and the Compound H-index as Extension of the f2 and h-indexes from a Dynamic Perspective
Pub Date : 2020-07-03 DOI: 10.2478/jdis-2020-0019
Y. Fassin
Abstract Purpose: Elaboration of an indicator to include the dynamic aspect of citations in bibliometric indexes. Design/methodology/approach: A new bibliometric methodology, the f2-index, is applied at the career level and at the level of the recent 5 years to analyze the dynamic aspect of bibliometrics. The method is applied, as an illustration, to the field of corporate governance. Findings: The compound F2-index, as an extension of the f2-index, recognizes past achievements but also values new research work with potential. The method is extended to the h-index and the h2-index. An activity index is defined as the ratio of the recent h’-index to the career h-index. Research limitations: The compound F2- and H-indexes are PAC (probably approximately correct) and depend on the selection and database. Practical implications: The compound F2- and H-indexes allow identifying the rising stars of a field from a dynamic perspective. The activity ratio highlights the contribution of younger researchers. Originality/value: The new methodology demonstrates the underestimated dynamic capacity of bibliometric research.
Journal: Journal of data and information science (Warsaw, Poland); Volume: 5 1; Pages: 71-83; Publication date: 2020-07-03; DOI: https://doi.org/10.2478/jdis-2020-0019
Citations: 4
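The activity index mentioned in the abstract above (the ratio of the recent h'-index to the career h-index) can be illustrated with a minimal Python sketch. This is not the author's implementation: it uses only the standard definition of the h-index (the largest h such that h papers have at least h citations each). The function names, the sample citation counts, the choice of which papers count as "recent", and the convention of returning 0 when the career h-index is 0 are all assumptions made for illustration.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank
        else:
            break
    return h


def activity_index(recent_citations, career_citations):
    """Ratio of the recent h'-index to the career h-index.

    Assumption: the ratio is defined as 0.0 when the career h-index is 0.
    """
    career_h = h_index(career_citations)
    if career_h == 0:
        return 0.0
    return h_index(recent_citations) / career_h


# Hypothetical citation counts, invented purely for illustration.
career = [45, 30, 12, 9, 7, 6, 3, 1]   # all papers of one researcher
recent = [9, 7, 6, 3, 1]               # assumed subset from the recent 5 years

print(h_index(career), h_index(recent), activity_index(recent, career))
```

With these made-up numbers the career h-index is 6 and the recent h'-index is 3, so the activity index is 0.5. How the recent window is delimited would need to be checked against the paper itself.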
Evidence-based Nomenclature and Taxonomy of Research Impact Indicators
Pub Date : 2020-07-03 DOI: 10.2478/jdis-2020-0018
M. Arsalan, Omar Mubin, A. Mahmud
Abstract Purpose: This study aims to classify research impact indicators based on their characteristics and scope. The concept of an evidence-based nomenclature of research impact (RI) indicators is introduced for generalization and transformation of scope. Design/methodology/approach: Literature related to research impact assessment was collected and categorized into conceptual and applied case studies. One hundred and nineteen indicators were selected to prepare the classification and nomenclature. The nomenclature was developed based on the principle that “every indicator is a contextual-function to explain the impact”. Every indicator was decomposed into three parts, i.e. Function, Domain, and Target Areas. Findings: The main functions of research impact indicators express improvement (63%), recognition (23%), and creation/development (14%). The focus of research impact indicators in the literature is mostly on the academic domain (59%), whereas the environment/sustainability domain is least considered (4%). As a result, research impact related to the research aspects is felt the most (29%). Other target areas include system and services, methods and procedures, networking, planning, policy development, economic aspects and commercialisation, etc. Research limitations: This research was applied to 119 research impact indicators; the inclusion of additional indicators may change the result. Practical implications: The plausible effect of the nomenclature is a better organization of indicators with appropriate tags for functions, domains, and target areas. This approach also provides a framework for indicator generalization and transformation. Therefore, similar indicators can be applied in other fields and target areas with modifications. Originality/value: The development of a nomenclature for research impact indicators is a novel approach in scientometrics. It is developed along the same lines as in other scientific disciplines, such as biology and chemistry, where fundamental objects need to be classified according to common standards.
Journal: Journal of data and information science (Warsaw, Poland); Volume: 5 1; Pages: 33-56; Publication date: 2020-07-03; DOI: https://doi.org/10.2478/jdis-2020-0018
Citations: 3
Discipline Impact Factor: Some of Its History, Some of the Author's Experience of Its Application, the Continuing Reasons for Its Use and… Next Beyond
Pub Date : 2020-07-03 DOI: 10.2478/jdis-2020-0015
V. Lazarev
Abstract Purpose: This work considers the role and some of the 42-year history of the discipline impact factor (DIF) in the evaluation of serial publications. An original “symmetric” indicator called the “discipline susceptibility factor” is also presented. Design/methodology/approach: In accordance with the purpose of the work, the methods are analytical interpretation of the scientific literature related to this problem and speculative explanation. The information base of the research is bibliometric publications dealing with impact, impact factor, discipline impact factor, and discipline susceptibility factor. Findings: Examples of the application of the DIF and of modifications of the indicator are given. It is shown why research and university libraries need to use the DIF to evaluate serials when funding for subscriptions to serial publications is scarce, even if open access is available. The role of the DIF for authors of scientific papers who evaluate journals when choosing a suitable journal for submission is also briefly discussed. An original indicator “symmetrical” to the DIF (the “discipline susceptibility factor”) and its differences from the DIF in terms of content and purpose of evaluation are also briefly presented. Research limitations: The selection of publications for the information base of the research did not include those in which the DIF was only mentioned, used partially, or not used for its original purpose. Restrictions on the length of articles submitted to this special issue of the JDIS also caused the exclusion of a number of completely relevant publications. Consideration of the DIF is not placed in the context of describing other derivatives of the Garfield impact factor. Practical implications: An underrated bibliometric indicator, viz. the discipline impact factor, is promoted for practical application. An original indicator “symmetrical” to the DIF has been proposed for finding serial publications that represent external research fields which might fit potential applications of the results of scientific activities obtained within the specific research field represented by the cited specialized journals. Both indicators can be useful to research and university libraries in their endeavors to improve scientific information services. Both can also be used by authors of scientific papers to evaluate journals when choosing where to submit a paper. Originality/value: The article substantiates the need to evaluate scientific serial publications in library activities, even under conditions of access to huge and convenient databases (subscription packages) and open access to a large number of serial publications. It gives a mini-survey of the history of one method of such evaluation and offers an original method for evaluating scientific serial publications.
Journal: Journal of data and information science (Warsaw, Poland); Volume: 5 1; Pages: 197-209; Publication date: 2020-07-03; DOI: https://doi.org/10.2478/jdis-2020-0015
Citations: 5
Historical Bibliometrics Using Google Scholar: The Case of Roman Law, 1727–2016
Pub Date : 2020-07-03 DOI: 10.2478/jdis-2020-0024
Janne Pölönen, Björn Hammarfelt
Abstract Purpose: The purpose of this study is to investigate the historical and linguistic coverage of Google Scholar, using publications in the field of Roman law as an example. Design/methodology/approach: To create a dataset of Roman law publications, we retrieved a total of 21,300 records of publications, published between the years 1500 and 2016, with titles including words denoting “Roman law” in English, French, German, Italian, and Spanish. Findings: We were able to find publications dating back to 1727. The largest number of publications and authors date to the late 19th century, and this peak might be explained by the role of Roman law in French legal education at the time. Furthermore, we found an exceptionally skewed concentration of publications among authors, as well as of citations among publications. We speculate that this could be explained by the long time-frame of the study and the importance of classic works. Research limitations: Major limitations, and potential future work, relate to data quality and cleaning, disambiguation of publications and authors, and comparison of coverage with other data sources. Practical implications: We find Google Scholar to be a promising data source for historical bibliometrics. This approach may help bridge the gap between bibliometrics and the “digital humanities”. Originality/value: Earlier studies have focused mainly on Google Scholar's coverage of publications and citations in general, or in specific fields. The historical coverage has, however, received less attention.
Journal: Journal of data and information science (Warsaw, Poland); Volume: 5 1; Pages: 18-32; Publication date: 2020-07-03; DOI: https://doi.org/10.2478/jdis-2020-0024
Citations: 5