Scientometrics最新文献_第8页

Is Management and Organizational Studies divided into (micro-)tribes? 管理与组织研究是否分为（微）部落？

IF 3.9 3区管理学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Scientometrics

Pub Date : 2024-06-25 DOI: 10.1007/s11192-024-05013-3

Oliver Wieczorek, Olof Hallonsten, Fredrik Åström

Many claims have been made in the past that Management and Organization Studies (MOS) is becoming increasingly fragmented, and that this fragmentation is causing it to drift into self-reference and irrelevance. Despite the weight of this claim, it has not yet been subjected to a systematic empirical test. This paper addresses this research gap using the tribalization approach and diachronic co-citation analyses. Based on 22,430 papers published in 14 MOS journals between 1980 and 2019, we calculate local and global centrality measures and the flow of cited articles between co-citation communities over time. In addition, we use a node-removal strategy to test whether only ritualized citations ensure MOS cohesion. Rather than tribalization, our results suggest a center–periphery structure. Furthermore, more peripheral papers are integrated into the central co-citation communities, but the lion's share of the flow of cited papers occurs over time to only a small number of large clusters. An increase of fragmentation and crowding-out of smaller clusters in MOS in seen in the polycentrically organized core 2014–2019.

过去曾有许多人声称，管理与组织研究（MOS）正变得越来越支离破碎，而这种支离破碎的状况正导致它逐渐陷入自说自话和无关紧要的境地。尽管这种说法很有分量，但它尚未经过系统的实证检验。本文利用部落化方法和非同步共引分析填补了这一研究空白。基于 1980 年至 2019 年间在 14 种 MOS 期刊上发表的 22430 篇论文，我们计算了局部和全局中心度量以及随着时间推移在共引社区之间被引用文章的流动情况。此外，我们还使用节点移除策略来检验是否只有仪式化的引用才能确保 MOS 的凝聚力。我们的结果表明，与其说是部落化，不如说是中心-边缘结构。此外，更多的外围论文被整合到了中心的共同引用群体中，但随着时间的推移，大部分被引用论文只流向了少数大型集群。在 2014-2019 年的多中心组织核心中，MOS 中较小集群的分散和排挤现象有所增加。

引用次数: 0

Predicting citation impact of academic papers across research areas using multiple models and early citations 利用多种模型和早期引文预测各研究领域学术论文的引文影响力

IF 3.9 3区管理学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Scientometrics

Pub Date : 2024-06-25 DOI: 10.1007/s11192-024-05086-0

Fang Zhang, Shengli Wu

As the volume of scientific literature expands rapidly, accurately gauging and predicting the citation impact of academic papers has become increasingly imperative. Citation counts serve as a widely adopted metric for this purpose. While numerous researchers have explored techniques for projecting papers’ citation counts, a prevalent constraint lies in the utilization of a singular model across all papers within a dataset. This universal approach, suitable for small, homogeneous collections, proves less effective for large, heterogeneous collections spanning various research domains, thereby curtailing the practical utility of these methodologies. In this study, we propose a pioneering methodology that deploys multiple models tailored to distinct research domains and integrates early citation data. Our approach encompasses instance-based learning techniques to categorize papers into different research domains and distinct prediction models trained on early citation counts for papers within each domain. We assessed our methodology using two extensive datasets sourced from DBLP and arXiv. Our experimental findings affirm that the proposed classification methodology is both precise and efficient in classifying papers into research domains. Furthermore, the proposed prediction methodology, harnessing multiple domain-specific models and early citations, surpasses four state-of-the-art baseline methods in most instances, substantially enhancing the accuracy of citation impact predictions for diverse collections of academic papers.

随着科学文献数量的迅速增长，准确衡量和预测学术论文的引文影响力变得日益重要。在这方面，引用次数是一个被广泛采用的指标。虽然许多研究人员都探索过预测论文引用次数的技术，但一个普遍的制约因素是在数据集中的所有论文中使用单一模型。这种通用方法适用于小型同质数据集，但对于横跨不同研究领域的大型异质数据集而言，其效果却大打折扣，从而削弱了这些方法的实用性。在本研究中，我们提出了一种开创性的方法，该方法部署了针对不同研究领域的多种模型，并整合了早期引文数据。我们的方法包括基于实例的学习技术，将论文归类到不同的研究领域，以及根据每个领域内论文的早期引用次数训练出的不同预测模型。我们使用来自 DBLP 和 arXiv 的两个广泛数据集对我们的方法进行了评估。我们的实验结果证实，所提出的分类方法在将论文分类到研究领域方面既精确又高效。此外，所提出的预测方法利用了多个特定领域模型和早期引文，在大多数情况下都超越了四种最先进的基线方法，大大提高了对不同学术论文集进行引文影响预测的准确性。

{"title":"Predicting citation impact of academic papers across research areas using multiple models and early citations","authors":"Fang Zhang, Shengli Wu","doi":"10.1007/s11192-024-05086-0","DOIUrl":"https://doi.org/10.1007/s11192-024-05086-0","url":null,"abstract":"As the volume of scientific literature expands rapidly, accurately gauging and predicting the citation impact of academic papers has become increasingly imperative. Citation counts serve as a widely adopted metric for this purpose. While numerous researchers have explored techniques for projecting papers’ citation counts, a prevalent constraint lies in the utilization of a singular model across all papers within a dataset. This universal approach, suitable for small, homogeneous collections, proves less effective for large, heterogeneous collections spanning various research domains, thereby curtailing the practical utility of these methodologies. In this study, we propose a pioneering methodology that deploys multiple models tailored to distinct research domains and integrates early citation data. Our approach encompasses instance-based learning techniques to categorize papers into different research domains and distinct prediction models trained on early citation counts for papers within each domain. We assessed our methodology using two extensive datasets sourced from DBLP and arXiv. Our experimental findings affirm that the proposed classification methodology is both precise and efficient in classifying papers into research domains. Furthermore, the proposed prediction methodology, harnessing multiple domain-specific models and early citations, surpasses four state-of-the-art baseline methods in most instances, substantially enhancing the accuracy of citation impact predictions for diverse collections of academic papers.","PeriodicalId":21755,"journal":{"name":"Scientometrics","volume":"149 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141504465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Investigating the application of work–energy metaphor in interdisciplinary citation analysis 跨学科引文分析中工作能量隐喻的应用研究

IF 3.9 3区管理学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Scientometrics

Pub Date : 2024-06-24 DOI: 10.1007/s11192-024-05019-x

Guoyang Rong, Changling Li, Zhijian Zhang, Shuaipu Chen, Yuxing Qian

Metaphors play a crucial role in facilitating the comprehension and analysis of knowledge. “Knowledge as energy” is a well-established metaphorical framework that provides unique benefits for comprehending the dissemination of knowledge and enabling its quantification. Nevertheless, empirical studies employing this framework are limited, especially in the area of the work–energy metaphor, which primarily remains theoretical. This paper proposes an application scheme for the work– energy metaphor in interdisciplinary citation analysis. In this scheme, disciplines are considered entities; various factors that drive the progress of a discipline are considered forces; energy is considered the knowledge produced or transferred in the citations. Building upon the work–energy theorem in physics, this study developed indicators reflecting citation quality and velocity to assess interdisciplinary research progression. An empirical investigation was carried out, utilizing these indicators to evaluate the influence of interdisciplinary citations on disciplines. In the experiments, we used Library and Information Science (LIS) from 2012 to 2021 as an example to analyze the impact of interdisciplinary citations from LIS on other disciplines over two time periods. The experiments demonstrated the feasibility of the work–energy metaphorical framework proposed in this paper. It was also found that Computer Science, Management, and Business experienced the highest impact from LIS interdisciplinary citations and exhibited steady growth over a 10-year period. Environmental Science has substantial potential for the future.

隐喻在促进对知识的理解和分析方面发挥着至关重要的作用。"知识即能量 "是一个成熟的隐喻框架，为理解知识的传播和量化知识提供了独特的好处。然而，运用这一框架进行的实证研究非常有限，尤其是在工作--能量隐喻领域，主要还是停留在理论层面。本文提出了跨学科引文分析中工作-能量隐喻的应用方案。在这一方案中，学科被视为实体；推动学科进步的各种因素被视为力；能量被视为引文中产生或转移的知识。本研究以物理学中的工能定理为基础，制定了反映引文质量和速度的指标，以评估跨学科研究的进展情况。我们利用这些指标开展了一项实证调查，以评估跨学科引文对学科的影响。在实验中，我们以2012年至2021年的图书馆与信息科学（LIS）为例，分析了两个时间段内图书馆与信息科学的跨学科引文对其他学科的影响。实验证明了本文提出的工作能量隐喻框架的可行性。实验还发现，计算机科学、管理学和商学受到 LIS 跨学科引文的影响最大，并在 10 年内呈现出稳步增长的态势。环境科学在未来具有巨大的潜力。

{"title":"Investigating the application of work–energy metaphor in interdisciplinary citation analysis","authors":"Guoyang Rong, Changling Li, Zhijian Zhang, Shuaipu Chen, Yuxing Qian","doi":"10.1007/s11192-024-05019-x","DOIUrl":"https://doi.org/10.1007/s11192-024-05019-x","url":null,"abstract":"Metaphors play a crucial role in facilitating the comprehension and analysis of knowledge. “Knowledge as energy” is a well-established metaphorical framework that provides unique benefits for comprehending the dissemination of knowledge and enabling its quantification. Nevertheless, empirical studies employing this framework are limited, especially in the area of the work–energy metaphor, which primarily remains theoretical. This paper proposes an application scheme for the work– energy metaphor in interdisciplinary citation analysis. In this scheme, disciplines are considered entities; various factors that drive the progress of a discipline are considered forces; energy is considered the knowledge produced or transferred in the citations. Building upon the work–energy theorem in physics, this study developed indicators reflecting citation quality and velocity to assess interdisciplinary research progression. An empirical investigation was carried out, utilizing these indicators to evaluate the influence of interdisciplinary citations on disciplines. In the experiments, we used Library and Information Science (LIS) from 2012 to 2021 as an example to analyze the impact of interdisciplinary citations from LIS on other disciplines over two time periods. The experiments demonstrated the feasibility of the work–energy metaphorical framework proposed in this paper. It was also found that Computer Science, Management, and Business experienced the highest impact from LIS interdisciplinary citations and exhibited steady growth over a 10-year period. Environmental Science has substantial potential for the future.","PeriodicalId":21755,"journal":{"name":"Scientometrics","volume":"139 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141524267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A citation analysis examining geographical specificity in article titles 对文章标题中地域特异性的引文分析

IF 3.9 3区管理学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Scientometrics

Pub Date : 2024-06-21 DOI: 10.1007/s11192-024-05075-3

C. Sean Burns, Md. Anwarul Islam

This investigation explores the impact of geographical names within article titles on citation frequency across a corpus of literature within the field of library and information science, spanning from 2018 to 2020, and encompassing 56 journal titles. We hypothesized that the presence of geographical names of nations in article titles would negatively correlate with citation counts. Our primary analysis of 1330 articles with geographical names in titles versus 8702 without, revealed a statistically significant, albeit small, difference in median citations, favoring articles without geographical names (mdn = 7) over those with geographical names (mdn = 6). Contrary to our secondary hypothesis, a proximity analysis demonstrated a weak, positive correlation between the position of geographical names near the title end and citation counts. Our examination found little evidence supporting differential citation frequency based on the Human Development Index (HDI) of the nations mentioned in titles. However, although a journal’s impact score strongly predicted citation counts for articles, we found that these counts were depressed when articles in those journals contained a geographic name. We found a negative correlation between the frequency of geographical names in article titles and the journals’ impact scores, yet this was weakly, statistically significant. Our data also suggested a vague positional preference for nations within titles, unrelated to HDI. Furthermore, the likelihood of journals publishing articles mentioning nations of varying HDI was found to be statistically insignificant. This study sheds light on the nuanced influence of title specificity, through geographical names, on scholarly communication and citation impact, indicating a slight preference for broader title phrasing in garnering citations.

这项调查探讨了文章标题中的地名对图书馆与信息科学领域文献库中引用频率的影响，时间跨度为 2018 年至 2020 年，涵盖 56 种期刊标题。我们假设，文章标题中出现国家地名将与引用次数负相关。我们对 1330 篇标题中包含地名的文章和 8702 篇标题中不包含地名的文章进行了初步分析，结果显示，尽管差异较小，但在引用中位数方面存在显著的统计学差异，不包含地名的文章（mdn = 7）优于包含地名的文章（mdn = 6）。与我们的次要假设相反，近似性分析表明，地名靠近标题末尾的位置与引用次数之间存在微弱的正相关。我们的研究发现，几乎没有证据支持根据标题中提到的国家的人类发展指数（HDI）来区分引用频率。不过，尽管期刊的影响分值对文章的引用次数有很大的预测作用，但我们发现，当这些期刊的文章中包含地名时，引用次数就会下降。我们发现，文章标题中出现地理名称的频率与期刊的影响分值之间存在负相关，但在统计学上意义微弱。我们的数据还表明，标题中对国家的位置偏好是模糊的，与人类发展指数无关。此外，我们还发现，期刊发表文章提及不同人类发展指数国家的可能性在统计学上并不显著。这项研究揭示了通过地名实现的标题特异性对学术交流和引文影响的微妙影响，表明在获得引文方面，人们略微偏好更宽泛的标题措辞。

{"title":"A citation analysis examining geographical specificity in article titles","authors":"C. Sean Burns, Md. Anwarul Islam","doi":"10.1007/s11192-024-05075-3","DOIUrl":"https://doi.org/10.1007/s11192-024-05075-3","url":null,"abstract":"This investigation explores the impact of geographical names within article titles on citation frequency across a corpus of literature within the field of library and information science, spanning from 2018 to 2020, and encompassing 56 journal titles. We hypothesized that the presence of geographical names of nations in article titles would negatively correlate with citation counts. Our primary analysis of 1330 articles with geographical names in titles versus 8702 without, revealed a statistically significant, albeit small, difference in median citations, favoring articles without geographical names (mdn = 7) over those with geographical names (mdn = 6). Contrary to our secondary hypothesis, a proximity analysis demonstrated a weak, positive correlation between the position of geographical names near the title end and citation counts. Our examination found little evidence supporting differential citation frequency based on the Human Development Index (HDI) of the nations mentioned in titles. However, although a journal’s impact score strongly predicted citation counts for articles, we found that these counts were depressed when articles in those journals contained a geographic name. We found a negative correlation between the frequency of geographical names in article titles and the journals’ impact scores, yet this was weakly, statistically significant. Our data also suggested a vague positional preference for nations within titles, unrelated to HDI. Furthermore, the likelihood of journals publishing articles mentioning nations of varying HDI was found to be statistically insignificant. This study sheds light on the nuanced influence of title specificity, through geographical names, on scholarly communication and citation impact, indicating a slight preference for broader title phrasing in garnering citations.","PeriodicalId":21755,"journal":{"name":"Scientometrics","volume":"75 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141524271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Heterogeneous hypergraph learning for literature retrieval based on citation intents 基于引用意图的异构超图学习用于文献检索

IF 3.9 3区管理学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Scientometrics

Pub Date : 2024-06-21 DOI: 10.1007/s11192-024-05066-4

Kaiwen Shi, Kan Liu, Xinyan He

Literature retrieval helps scientists find previous work that is relative to their own research or even get new research ideas. However, the discrepancy between retrieval results and the ultimate intention of citation is neglected by most literature retrieval models. Citation intent refers to the researcher’s motivation for citing a paper. A citation intent graph with homogeneous nodes and heterogeneous hyperedges can represent different types of citation intents. By leveraging the citation intent information included in a hypergraph, a retrieval model can guide researchers on where to cite its retrieval result by understanding the citation behaviour in the graph. We present a ranking model called CitenGL (Citation Intent Graph Learning) that aims to extract citation intent information and textual matching signals. The proposed model consists of a heterogeneous hypergraph encoder and a lightweight deep fusion unit for efficiency trade-offs. Compared to traditional literature retrieval, our model fills the gap between retrieval results and citation intention and yields an understandable graph-structured output. We evaluated our model on publicly available full-text paper datasets. Experimental results show that CitenGL outperforms most existing neural ranking models that only consider textual information, which illustrates the effectiveness of integrating citation intent information with textual information. Further ablation analyses show how citation intent information complements text-matching signals and citation networks.

文献检索可以帮助科学家找到与自己研究相关的前人工作，甚至获得新的研究思路。然而，大多数文献检索模型都忽略了检索结果与最终引用意图之间的差异。引用意图是指研究人员引用论文的动机。具有同质节点和异质超边的引用意图图可以代表不同类型的引用意图。通过利用超图中的引用意图信息，检索模型可以通过了解图中的引用行为，指导研究人员将检索结果引用到何处。我们提出了一种名为 CitenGL（引文意图图学习）的排序模型，旨在提取引文意图信息和文本匹配信号。该模型由一个异构超图编码器和一个轻量级深度融合单元组成，以实现效率权衡。与传统的文献检索相比，我们的模型填补了检索结果与引文意图之间的空白，并产生了可理解的图结构输出。我们在公开的全文论文数据集上评估了我们的模型。实验结果表明，CitenGL 优于大多数只考虑文本信息的现有神经排名模型，这说明了将引文意图信息与文本信息相结合的有效性。进一步的消融分析表明了引文意图信息是如何对文本匹配信号和引文网络进行补充的。

{"title":"Heterogeneous hypergraph learning for literature retrieval based on citation intents","authors":"Kaiwen Shi, Kan Liu, Xinyan He","doi":"10.1007/s11192-024-05066-4","DOIUrl":"https://doi.org/10.1007/s11192-024-05066-4","url":null,"abstract":"Literature retrieval helps scientists find previous work that is relative to their own research or even get new research ideas. However, the discrepancy between retrieval results and the ultimate intention of citation is neglected by most literature retrieval models. Citation intent refers to the researcher’s motivation for citing a paper. A citation intent graph with homogeneous nodes and heterogeneous hyperedges can represent different types of citation intents. By leveraging the citation intent information included in a hypergraph, a retrieval model can guide researchers on where to cite its retrieval result by understanding the citation behaviour in the graph. We present a ranking model called CitenGL (Citation Intent Graph Learning) that aims to extract citation intent information and textual matching signals. The proposed model consists of a heterogeneous hypergraph encoder and a lightweight deep fusion unit for efficiency trade-offs. Compared to traditional literature retrieval, our model fills the gap between retrieval results and citation intention and yields an understandable graph-structured output. We evaluated our model on publicly available full-text paper datasets. Experimental results show that CitenGL outperforms most existing neural ranking models that only consider textual information, which illustrates the effectiveness of integrating citation intent information with textual information. Further ablation analyses show how citation intent information complements text-matching signals and citation networks.","PeriodicalId":21755,"journal":{"name":"Scientometrics","volume":"44 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141524270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Gender assignment in doctoral theses: revisiting Teseo with a method based on cultural consensus theory 博士论文中的性别分配：用基于文化共识理论的方法重新审视 Teseo

IF 3.9 3区管理学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Scientometrics

Pub Date : 2024-06-20 DOI: 10.1007/s11192-024-05079-z

Nataly Matias-Rayme, Iuliana Botezan, Mari Carmen Suárez-Figueroa, Rodrigo Sánchez-Jiménez

This study critically evaluates gender assignment methods within academic contexts, employing a comparative analysis of diverse techniques, including a SVM classifier, gender-guesser, genderize.io, and a Cultural Consensus Theory based classifier. Emphasizing the significance of transparency, data sources, and methodological considerations, the research introduces nomquamgender, a cultural consensus-based method, and applies it to Teseo, a Spanish dissertation database. The results reveal a substantial reduction in the number of individuals with unknown gender compared to traditional methods relying on INE data. The nuanced differences in gender distribution underscore the importance of methodological choices in gender studies, urging for transparent, comprehensive, and freely accessible methods to enhance the accuracy and reliability of gender assignment in academic research. After reevaluating the problem of gender imbalances in the doctoral system we can conclude that it’s still evident although the trend is clearly set for its reduction. Finaly, specific problems related to some disciplines, including STEM fields and seniority roles are found to be worth of attention in the near future.

本研究通过对 SVM 分类器、gender-guesser、genderize.io 和基于文化共识理论的分类器等不同技术的比较分析，对学术背景下的性别分配方法进行了批判性评估。研究强调了透明度、数据来源和方法考虑的重要性，引入了基于文化共识的方法 nomquamgender，并将其应用于西班牙论文数据库 Teseo。结果显示，与依赖国家统计学会数据的传统方法相比，性别未知的人数大幅减少。性别分布的细微差别强调了性别研究中方法选择的重要性，呼吁采用透明、全面和可免费获取的方法，以提高学术研究中性别分配的准确性和可靠性。在重新评估了博士生制度中的性别失衡问题后，我们可以得出结论：尽管减少性别失衡的趋势已经形成，但这一问题依然明显。最后，与某些学科有关的具体问题，包括 STEM 领域和资历角色，在不久的将来值得关注。

{"title":"Gender assignment in doctoral theses: revisiting Teseo with a method based on cultural consensus theory","authors":"Nataly Matias-Rayme, Iuliana Botezan, Mari Carmen Suárez-Figueroa, Rodrigo Sánchez-Jiménez","doi":"10.1007/s11192-024-05079-z","DOIUrl":"https://doi.org/10.1007/s11192-024-05079-z","url":null,"abstract":"This study critically evaluates gender assignment methods within academic contexts, employing a comparative analysis of diverse techniques, including a SVM classifier, gender-guesser, genderize.io, and a Cultural Consensus Theory based classifier. Emphasizing the significance of transparency, data sources, and methodological considerations, the research introduces nomquamgender, a cultural consensus-based method, and applies it to Teseo, a Spanish dissertation database. The results reveal a substantial reduction in the number of individuals with unknown gender compared to traditional methods relying on INE data. The nuanced differences in gender distribution underscore the importance of methodological choices in gender studies, urging for transparent, comprehensive, and freely accessible methods to enhance the accuracy and reliability of gender assignment in academic research. After reevaluating the problem of gender imbalances in the doctoral system we can conclude that it’s still evident although the trend is clearly set for its reduction. Finaly, specific problems related to some disciplines, including STEM fields and seniority roles are found to be worth of attention in the near future.","PeriodicalId":21755,"journal":{"name":"Scientometrics","volume":"58 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141504466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Are the confidence scores of reviewers consistent with the review content? Evidence from top conference proceedings in AI 审稿人的信心分数与审稿内容一致吗？来自人工智能顶级会议论文集的证据

IF 3.9 3区管理学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Scientometrics

Pub Date : 2024-06-20 DOI: 10.1007/s11192-024-05070-8

Wenqing Wu, Haixu Xi, Chengzhi Zhang

Peer review is a critical process used in academia to assess the quality and validity of research articles. Top-tier conferences in the field of artificial intelligence (e.g. ICLR and ACL et al.) require reviewers to provide confidence scores to ensure the reliability of their review reports. However, existing studies on confidence scores have neglected to measure the consistency between the comment text and the confidence score in a more refined way, which may overlook more detailed details (such as aspects) in the text, leading to incomplete understanding of the results and insufficient objective analysis of the results. In this work, we propose assessing the consistency between the textual content of the review reports and the assigned scores at a fine-grained level, including word, sentence and aspect levels. The data used in this paper is derived from the peer review comments of conferences in the fields of deep learning and natural language processing. We employed deep learning models to detect hedge sentences and their corresponding aspects. Furthermore, we conducted statistical analyses of the length of review reports, frequency of hedge word usage, number of hedge sentences, frequency of aspect mentions, and their associated sentiment to assess the consistency between the textual content and confidence scores. Finally, we performed correlation analysis, significance tests and regression analysis on the data to examine the impact of confidence scores on the outcomes of the papers. The results indicate that textual content of the review reports and their confidence scores have high level of consistency at the word, sentence, and aspect levels. The regression results reveal a negative correlation between confidence scores and paper outcomes, indicating that higher confidence scores given by reviewers were associated with paper rejection. This indicates that current overall assessment of the paper’s content and quality by the experts is reliable, making the transparency and fairness of the peer review process convincing. We release our data and associated codes at https://github.com/njust-winchy/confidence_score.

同行评议是学术界用来评估研究文章质量和有效性的重要程序。人工智能领域的顶级会议（如 ICLR 和 ACL 等）都要求审稿人提供置信度分数，以确保审稿报告的可靠性。然而，现有关于置信度评分的研究忽略了以更精细的方式衡量评论文本与置信度评分之间的一致性，这可能会忽略文本中更详细的细节（如方面），导致对结果的理解不全面，对结果的分析不够客观。在这项工作中，我们建议在细粒度层面（包括单词、句子和方面层面）评估综述报告的文本内容与指定分数之间的一致性。本文使用的数据来自深度学习和自然语言处理领域会议的同行评审意见。我们采用深度学习模型来检测对冲句子及其相应方面。此外，我们还对评论报告的长度、对冲词的使用频率、对冲句子的数量、方面的提及频率及其相关情感进行了统计分析，以评估文本内容与置信度得分之间的一致性。最后，我们对数据进行了相关性分析、显著性检验和回归分析，以研究置信度得分对论文结果的影响。结果表明，综述报告的文本内容与置信度得分在词、句和方面层面上具有高度一致性。回归结果显示，可信度得分与论文结果之间呈负相关，表明审稿人给出的可信度得分越高，论文被拒的可能性越大。这表明目前专家对论文内容和质量的总体评价是可靠的，从而使同行评审过程的透明度和公平性令人信服。我们在 https://github.com/njust-winchy/confidence_score 上发布了我们的数据和相关代码。

{"title":"Are the confidence scores of reviewers consistent with the review content? Evidence from top conference proceedings in AI","authors":"Wenqing Wu, Haixu Xi, Chengzhi Zhang","doi":"10.1007/s11192-024-05070-8","DOIUrl":"https://doi.org/10.1007/s11192-024-05070-8","url":null,"abstract":"Peer review is a critical process used in academia to assess the quality and validity of research articles. Top-tier conferences in the field of artificial intelligence (e.g. ICLR and ACL et al.) require reviewers to provide confidence scores to ensure the reliability of their review reports. However, existing studies on confidence scores have neglected to measure the consistency between the comment text and the confidence score in a more refined way, which may overlook more detailed details (such as aspects) in the text, leading to incomplete understanding of the results and insufficient objective analysis of the results. In this work, we propose assessing the consistency between the textual content of the review reports and the assigned scores at a fine-grained level, including word, sentence and aspect levels. The data used in this paper is derived from the peer review comments of conferences in the fields of deep learning and natural language processing. We employed deep learning models to detect hedge sentences and their corresponding aspects. Furthermore, we conducted statistical analyses of the length of review reports, frequency of hedge word usage, number of hedge sentences, frequency of aspect mentions, and their associated sentiment to assess the consistency between the textual content and confidence scores. Finally, we performed correlation analysis, significance tests and regression analysis on the data to examine the impact of confidence scores on the outcomes of the papers. The results indicate that textual content of the review reports and their confidence scores have high level of consistency at the word, sentence, and aspect levels. The regression results reveal a negative correlation between confidence scores and paper outcomes, indicating that higher confidence scores given by reviewers were associated with paper rejection. This indicates that current overall assessment of the paper’s content and quality by the experts is reliable, making the transparency and fairness of the peer review process convincing. We release our data and associated codes at https://github.com/njust-winchy/confidence_score.","PeriodicalId":21755,"journal":{"name":"Scientometrics","volume":"62 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141524315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Closer in time and higher correlation: disclosing the relationship between citation similarity and citation interval 时间更近，相关性更高：揭示引文相似性与引文间隔之间的关系

IF 3.9 3区管理学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Scientometrics

Pub Date : 2024-06-20 DOI: 10.1007/s11192-024-05080-6

Wei Cheng, Dejun Zheng, Shaoxiong Fu, Jingfeng Cui

Investigating the intricate relationship between citation similarity and the citation interval offers vital insights for refining citation recommendation systems and enhancing citation evaluation models. This is also a new perspective for understanding citation patterns. In this study, we used the Library and Information Science (LIS) field as an example to determine and discuss the correlation between citation similarity and the citation interval. Using the methods of data collection, paper title preprocessing, text vectorization based on simCSE, calculation of citation similarity and the citation interval, and calculation of the index per citing paper, this study found the following LIS domain-based results: (i) there is a significant negative correlation between citation similarity and the citation interval, but the correlation coefficient is low. (ii) The citation intervals of the least relevant series of cited papers exhibit a more pronounced susceptibility to citation similarity than the most relevant series of cited papers. (iii) The citation intervals of the most relevant cited papers are more concentrated within 12 years and more likely to be published within the average citation interval, typically from the newer half of the cited paper list and published later within 5 years of the citation half-life. This study concludes that researchers usually pay more attention to the latest and most cutting-edge and strongly relevant existing research than to weakly relevant existing research. Continuous attention and timely incorporation of knowledge into the research direction will promote a more rapid and specialized diffusion of knowledge. These findings are influenced by the accelerated dissemination of information via Internet, heightened academic competition, and the concentration of research endeavors in specialized disciplines. This study not only contributes to the scholarly discussion of citation analysis but also lays the foundation for future exploration and understanding of citation patterns.

研究引文相似性与引文时间间隔之间错综复杂的关系为完善引文推荐系统和改进引文评价模型提供了重要启示。这也是理解引文模式的一个新视角。在本研究中，我们以图书馆与信息科学（LIS）领域为例，确定并讨论了引文相似度与引文间隔之间的相关性。通过数据收集、论文标题预处理、基于 simCSE 的文本矢量化、计算引文相似度和引文区间、计算每篇引用论文的索引等方法，本研究发现了以下基于 LIS 领域的结果：(i) 引文相似度和引文区间之间存在显著的负相关，但相关系数较低。(ii) 与相关性最高的论文系列相比，相关性最低的论文系列的引文区间更容易受到引文相似性的影响。(iii) 最相关被引论文的引文区间更集中在 12 年内，更有可能在平均引文区间内发表，通常来自被引论文列表中较新的一半，并在引文半衰期的 5 年内较晚发表。这项研究的结论是，研究人员通常更关注最新、最前沿、相关性强的现有研究，而不是相关性弱的现有研究。持续关注并及时将知识纳入研究方向，将促进知识更快速、更专业化的传播。这些发现受到互联网信息传播速度加快、学术竞争加剧以及研究工作向专业学科集中等因素的影响。本研究不仅为引文分析的学术讨论做出了贡献，还为今后探索和理解引文模式奠定了基础。

{"title":"Closer in time and higher correlation: disclosing the relationship between citation similarity and citation interval","authors":"Wei Cheng, Dejun Zheng, Shaoxiong Fu, Jingfeng Cui","doi":"10.1007/s11192-024-05080-6","DOIUrl":"https://doi.org/10.1007/s11192-024-05080-6","url":null,"abstract":"Investigating the intricate relationship between citation similarity and the citation interval offers vital insights for refining citation recommendation systems and enhancing citation evaluation models. This is also a new perspective for understanding citation patterns. In this study, we used the Library and Information Science (LIS) field as an example to determine and discuss the correlation between citation similarity and the citation interval. Using the methods of data collection, paper title preprocessing, text vectorization based on simCSE, calculation of citation similarity and the citation interval, and calculation of the index per citing paper, this study found the following LIS domain-based results: (i) there is a significant negative correlation between citation similarity and the citation interval, but the correlation coefficient is low. (ii) The citation intervals of the least relevant series of cited papers exhibit a more pronounced susceptibility to citation similarity than the most relevant series of cited papers. (iii) The citation intervals of the most relevant cited papers are more concentrated within 12 years and more likely to be published within the average citation interval, typically from the newer half of the cited paper list and published later within 5 years of the citation half-life. This study concludes that researchers usually pay more attention to the latest and most cutting-edge and strongly relevant existing research than to weakly relevant existing research. Continuous attention and timely incorporation of knowledge into the research direction will promote a more rapid and specialized diffusion of knowledge. These findings are influenced by the accelerated dissemination of information via Internet, heightened academic competition, and the concentration of research endeavors in specialized disciplines. This study not only contributes to the scholarly discussion of citation analysis but also lays the foundation for future exploration and understanding of citation patterns.","PeriodicalId":21755,"journal":{"name":"Scientometrics","volume":"111 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141524269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The many publics of science: using altmetrics to identify common communication channels by scientific field 科学的众多公众：利用 Altmetrics 按科学领域确定共同的传播渠道

IF 3.9 3区管理学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Scientometrics

Pub Date : 2024-06-20 DOI: 10.1007/s11192-024-05077-1

Daniel Torres-Salinas, Domingo Docampo, Wenceslao Arroyo-Machado, Nicolas Robinson-Garcia

Altmetrics have led to new quantitative studies of science through social media interactions. However, there are no models of science communication that respond to the multiplicity of non-academic channels. Using the 3653 authors with the highest volume of altmetrics mentions from the main channels (Twitter, News, Facebook, Wikipedia, Blog, Policy documents, and Peer reviews) to their publications (2016-2020), it has been analyzed where the audiences of each discipline are located. The results evidence the generalities and specificities of these new communication models and the differences between areas. These findings are useful for the development of science communication policies and strategies.

Altmetrics 通过社交媒体互动对科学进行了新的定量研究。然而，目前还没有针对非学术渠道多样性的科学传播模型。利用主要渠道（推特、新闻、脸书、维基百科、博客、政策文件和同行评议）中 Altmetrics 提及量最高的 3653 位作者的出版物（2016-2020 年），分析了各学科受众的位置。结果证明了这些新传播模式的普遍性和特殊性，以及不同领域之间的差异。这些研究结果有助于制定科学传播政策和战略。

引用次数: 0

Collaboration at the phylum level: coauthorship and acknowledgment patterns in the world of the water bears (phylum Tardigrada) 动物门一级的合作：水熊（动物门）世界中的共同著作和鸣谢模式

IF 3.9 3区管理学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Scientometrics

Pub Date : 2024-06-20 DOI: 10.1007/s11192-024-05036-w

Katherine W. McCain

Coauthor and acknowledgment data were captured for 1384 research articles published between 1980 and June, 2023 that focused on tardigrades. Articles indexed in Web of Science or an archives of tardigrade literature were downloaded and thoroughly examined for personal acknowledgment data. Annual publication counts and coauthor maps for four successive time periods (1980–1999, 2000–2008, 2009–2017, 2018-June 2023) showed growth in the literature and increased research activity (more researchers, more complex networks, more international collaboration), beginning in 2000. A two-level Personal Acknowledgments Classification (PAC), was used to code types of acknowledgments. The majority of articles focused on field studies and/or descriptions of new species of tardigrades. This was reflected in rankings of acknowledgment categories and additions to the PAC. Ranked lists of frequently-thanked acknowledgees (all tardigrade researchers) were produced for each period. Acknowledgment profiles of four frequently-thanked researchers identified three different roles that researchers might play in tardigrade studies—”informal academic editorial consultant,” “taxonomic gatekeeper,” and “all-rounder.” Acknowledgments honoring people by naming a new species after them were only found in the species description, not in the formal acknowledgment section.

我们收集了 1980 年至 2023 年 6 月间发表的 1384 篇有关沙蜥的研究文章的共同作者和致谢数据。我们下载了在 Web of Science 或沙蜥文献档案中索引的文章，并对个人致谢数据进行了全面检查。从 2000 年开始，连续四个时间段（1980-1999 年、2000-2008 年、2009-2017 年、2018 年至 2023 年 6 月）的年度发表论文数量和合著者分布图显示了文献的增长和研究活动的增加（更多的研究人员、更复杂的网络、更多的国际合作）。在对致谢类型进行编码时，采用了两级个人致谢分类法（PAC）。大多数文章都集中于实地研究和/或对新种沙蜥的描述。这反映在致谢类别的排名和 PAC 的添加上。每个时期都会产生一份经常被致谢者（所有沙蜥研究人员）的排名表。四位经常致谢的研究人员的致谢简介确定了研究人员在游仆虫研究中可能扮演的三种不同角色--"非正式学术编辑顾问"、"分类学把关人 "和 "全能选手"。通过以其名字命名新物种的致谢只出现在物种描述中，而不是正式致谢部分。

{"title":"Collaboration at the phylum level: coauthorship and acknowledgment patterns in the world of the water bears (phylum Tardigrada)","authors":"Katherine W. McCain","doi":"10.1007/s11192-024-05036-w","DOIUrl":"https://doi.org/10.1007/s11192-024-05036-w","url":null,"abstract":"Coauthor and acknowledgment data were captured for 1384 research articles published between 1980 and June, 2023 that focused on tardigrades. Articles indexed in Web of Science or an archives of tardigrade literature were downloaded and thoroughly examined for personal acknowledgment data. Annual publication counts and coauthor maps for four successive time periods (1980–1999, 2000–2008, 2009–2017, 2018-June 2023) showed growth in the literature and increased research activity (more researchers, more complex networks, more international collaboration), beginning in 2000. A two-level Personal Acknowledgments Classification (PAC), was used to code types of acknowledgments. The majority of articles focused on field studies and/or descriptions of new species of tardigrades. This was reflected in rankings of acknowledgment categories and additions to the PAC. Ranked lists of frequently-thanked acknowledgees (all tardigrade researchers) were produced for each period. Acknowledgment profiles of four frequently-thanked researchers identified three different roles that researchers might play in tardigrade studies—”informal academic editorial consultant,” “taxonomic gatekeeper,” and “all-rounder.” Acknowledgments honoring people by naming a new species after them were only found in the species description, not in the formal acknowledgment section.","PeriodicalId":21755,"journal":{"name":"Scientometrics","volume":"63 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141524268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0