Pub Date : 2024-06-08DOI: 10.1016/j.joi.2024.101547
Zhuanlan Sun , Dongjin He , Yiwei Li
The practice of uploading preprints of scientific manuscripts prior to journal submission has become increasingly popular. As such, it is essential to understand the impact of the preprint version of a manuscript on the peer review process to facilitate the development of open peer review practices. In the current research, we analyze a dataset comprising 1,078 biomedical papers published in Nature Communications and eLife in 2019, along with their manuscript information posted on preprint servers and their peer review histories. Our investigation focuses on the relationship between the readability of manuscript before journal submission, as represented by preprints, and the sentimental features expressed by reviewers. Based on empirical analysis utilizing a linear regression model, it has been found that reviewers are inclined to express positive sentiments towards preprints characterized by technical language, as indicated by low value on the readability indices. Additional subgroup analysis suggests that this positive effect is more pronounced in papers with lower social and scientific impact, as indicated by online attention scores and scholarly views after publication, respectively. Overall, results of our analysis reveals that the utilization of technical language characterized by lower readability level in academic papers does not seem to hinder the peer review process in biomedical science, which has significant implications for the open peer review practice.
{"title":"How the readability of manuscript before journal submission advantages peer review process: Evidence from biomedical scientific publications","authors":"Zhuanlan Sun , Dongjin He , Yiwei Li","doi":"10.1016/j.joi.2024.101547","DOIUrl":"https://doi.org/10.1016/j.joi.2024.101547","url":null,"abstract":"<div><p>The practice of uploading preprints of scientific manuscripts prior to journal submission has become increasingly popular. As such, it is essential to understand the impact of the preprint version of a manuscript on the peer review process to facilitate the development of open peer review practices. In the current research, we analyze a dataset comprising 1,078 biomedical papers published in <em>Nature Communications</em> and <em>eLife</em> in 2019, along with their manuscript information posted on preprint servers and their peer review histories. Our investigation focuses on the relationship between the readability of manuscript before journal submission, as represented by preprints, and the sentimental features expressed by reviewers. Based on empirical analysis utilizing a linear regression model, it has been found that reviewers are inclined to express positive sentiments towards preprints characterized by technical language, as indicated by low value on the readability indices. Additional subgroup analysis suggests that this positive effect is more pronounced in papers with lower social and scientific impact, as indicated by online attention scores and scholarly views after publication, respectively. Overall, results of our analysis reveals that the utilization of technical language characterized by lower readability level in academic papers does not seem to hinder the peer review process in biomedical science, which has significant implications for the open peer review practice.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":null,"pages":null},"PeriodicalIF":3.7,"publicationDate":"2024-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141294435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-08DOI: 10.1016/j.joi.2024.101550
Yuzhuo Wang , Yi Xiang , Chengzhi Zhang
With the formation of the fourth paradigm of scientific research, algorithms have become increasingly important in scientific research. In academic papers, algorithms may be mentioned by scholars with various motivations, using, comparing, or improving algorithms to solve complex research tasks. Identifying these motivations can help scholars discover the relationships between algorithms and further assess their roles and values. Therefore, taking the field of natural language processing (NLP) as an example, this article proposes a complete method to conduct the identification, distribution, and evolution of motivations for mentioning algorithms at the sentence level. Specifically, using manual annotation and machine learning methods, we identify algorithm entities and sentences in the full text of papers, classify motivations for mentioning algorithms by pre-training models and data augmentation techniques, and finally analyze the distribution and evolution of motivations. The results show that the deep learning models trained with the augmented data outperform the traditional machine learning models in the classification task. In academic papers, more than half of the sentences show the direct use of algorithms, while the lowest percentage of motivations are improving algorithms, and the diversity of motivations has been increasing with time. For specific algorithms, grammatical algorithms are mentioned more by the motivation of “description,” while more motivations of “use” are found in the machine learning algorithms category. As time passed, the “use” motivations gradually replaced the “description” motivations for different algorithms, and the number of motivation types decreased significantly. Our research explores the identification, distribution, and evolution of authors’ motivations for mentioning algorithm entities, which could provide a basis for future algorithm relationship identification and influence evaluation using motivations.
{"title":"Exploring motivations for algorithm mention in the domain of natural language processing: A deep learning approach","authors":"Yuzhuo Wang , Yi Xiang , Chengzhi Zhang","doi":"10.1016/j.joi.2024.101550","DOIUrl":"https://doi.org/10.1016/j.joi.2024.101550","url":null,"abstract":"<div><p>With the formation of the fourth paradigm of scientific research, algorithms have become increasingly important in scientific research. In academic papers, algorithms may be mentioned by scholars with various motivations, using, comparing, or improving algorithms to solve complex research tasks. Identifying these motivations can help scholars discover the relationships between algorithms and further assess their roles and values. Therefore, taking the field of natural language processing (NLP) as an example, this article proposes a complete method to conduct the identification, distribution, and evolution of motivations for mentioning algorithms at the sentence level. Specifically, using manual annotation and machine learning methods, we identify algorithm entities and sentences in the full text of papers, classify motivations for mentioning algorithms by pre-training models and data augmentation techniques, and finally analyze the distribution and evolution of motivations. The results show that the deep learning models trained with the augmented data outperform the traditional machine learning models in the classification task. In academic papers, more than half of the sentences show the direct use of algorithms, while the lowest percentage of motivations are improving algorithms, and the diversity of motivations has been increasing with time. For specific algorithms, grammatical algorithms are mentioned more by the motivation of “description,” while more motivations of “use” are found in the machine learning algorithms category. As time passed, the “use” motivations gradually replaced the “description” motivations for different algorithms, and the number of motivation types decreased significantly. Our research explores the identification, distribution, and evolution of authors’ motivations for mentioning algorithm entities, which could provide a basis for future algorithm relationship identification and influence evaluation using motivations.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":null,"pages":null},"PeriodicalIF":3.7,"publicationDate":"2024-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141291971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-08DOI: 10.1016/j.joi.2024.101551
Dejian Yu, Bo Xiang
Existing studies on the detection of emerging scientific topics (ESTs) overemphasize the newness and neglect content innovation of knowledge. Moreover, they also ignore the lag existing in knowledge diffusion. In this paper, we propose a four-stage detection framework for ESTs that maps emerging attributes from paper entities to scientific topics. Empirical studies based on two significantly different disciplinary datasets, IS-LS, and AI, which contain 73,601 and 255,620 publications, respectively, are employed to validate our approach. First, we generate 29 and 47 candidate scientific topics based on topic modeling, respectively. Second, we represent the novelty of paper entities based on pre-trained language models, which is mapped to scientific topic entities along with knowledge distributions to obtain topic emerging attributes: topic novelty, relative share and growth. Third, we propose to predict future trends of these attributes with Neural Prophet, which outperforms four baseline models in , and . Finally, combining future values of candidate scientific topics, they are grouped into 8 clusters containing two ESTs types through strategic market theory and clustering model. From the correlation and feature distribution analysis of emerging attributes, we discover the existence of resilience and scale advantage in the diffusion of scientific knowledge. There also exists significant uncertainty in previous citation-based scientific topic evaluation patterns caused by the complexity of citation behavior. Overall, this research enriches theoretical knowledge and detection frameworks of ESTs, and provides detailed insights into comprehensive assessment and dissemination of scientific topics.
现有关于新兴科学课题(EST)检测的研究过于强调知识的新颖性,而忽视了知识内容的创新性。此外,它们还忽视了知识传播中存在的滞后性。在本文中,我们提出了一个四阶段 EST 检测框架,该框架将论文实体的新兴属性映射到科学主题。为了验证我们的方法,我们采用了基于 IS-LS 和 AI 这两个明显不同的学科数据集的实证研究,这两个数据集分别包含 73,601 篇和 255,620 篇论文。首先,我们基于主题建模分别生成了 29 个和 47 个候选科学主题。其次,我们基于预先训练好的语言模型来表示论文实体的新颖性,并将其与知识分布一起映射到科学主题实体上,从而得到主题的新兴属性:主题新颖性、相对份额和增长。第三,我们建议使用神经先知预测这些属性的未来趋势,该模型在 R2、MAE 和 RMSE 方面优于四个基线模型。最后,结合候选科学主题的未来价值,通过战略市场理论和聚类模型,将其分为包含两种 EST 类型的 8 个聚类。从新兴属性的相关性和特征分布分析中,我们发现科学知识的传播存在弹性和规模优势。同时,由于引文行为的复杂性,以往基于引文的科学主题评价模式也存在很大的不确定性。总之,本研究丰富了EST的理论知识和检测框架,为科学主题的综合评估和传播提供了详尽的见解。
{"title":"An ESTs detection research based on paper entity mapping: Combining scientific text modeling and neural prophet","authors":"Dejian Yu, Bo Xiang","doi":"10.1016/j.joi.2024.101551","DOIUrl":"https://doi.org/10.1016/j.joi.2024.101551","url":null,"abstract":"<div><p>Existing studies on the detection of emerging scientific topics (ESTs) overemphasize the newness and neglect content innovation of knowledge. Moreover, they also ignore the lag existing in knowledge diffusion. In this paper, we propose a four-stage detection framework for ESTs that maps emerging attributes from paper entities to scientific topics. Empirical studies based on two significantly different disciplinary datasets, IS-LS, and AI, which contain 73,601 and 255,620 publications, respectively, are employed to validate our approach. First, we generate 29 and 47 candidate scientific topics based on topic modeling, respectively. Second, we represent the novelty of paper entities based on pre-trained language models, which is mapped to scientific topic entities along with knowledge distributions to obtain topic emerging attributes: topic novelty, relative share and growth. Third, we propose to predict future trends of these attributes with Neural Prophet, which outperforms four baseline models in <span><math><msup><mrow><mi>R</mi></mrow><mn>2</mn></msup></math></span>, <span><math><mrow><mi>M</mi><mi>A</mi><mi>E</mi></mrow></math></span> and <span><math><mrow><mi>R</mi><mi>M</mi><mi>S</mi><mi>E</mi></mrow></math></span>. Finally, combining future values of candidate scientific topics, they are grouped into 8 clusters containing two ESTs types through strategic market theory and clustering model. From the correlation and feature distribution analysis of emerging attributes, we discover the existence of resilience and scale advantage in the diffusion of scientific knowledge. There also exists significant uncertainty in previous citation-based scientific topic evaluation patterns caused by the complexity of citation behavior. Overall, this research enriches theoretical knowledge and detection frameworks of ESTs, and provides detailed insights into comprehensive assessment and dissemination of scientific topics.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":null,"pages":null},"PeriodicalIF":3.7,"publicationDate":"2024-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141291912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-05DOI: 10.1016/j.joi.2024.101552
Bryan Mathis , Ryosuke L. Ohniwa
While advantages of global homogenization and standardization have been discussed on scientific and technological research activities, specific discussion on the disadvantages to generate scientific innovation has been limited. In this study, we aim to clarify the impact of globalization to generate emerging topics in life science and medicine by applying the emerging keywords (EKs) and highly successful emerging keywords (HS-EKs) methodology, which represent scientometric elements of emerging topics and high-impact emerging topics, respectively, as indicators. We analyzed all paper output from 53 countries using PubMed and found a global increase in paper output and EK generation in line with economic growth in the past 50 years. However, the efficiency to generate scientific innovation, reflected in HS-EKs, was significantly reduced and this effect was independent of country-level economic output. We also reported homogenization in research topics over the study period as a possible factor for the observed decrease in HS-EK generation efficiency. Finally, we discuss the foundational issues that gave rise to homogenized science, the impact on scientific innovation, and what policies might be necessary to repair the scientific innovation-generating engine.
虽然全球同质化和标准化对科技研究活动的优势已经得到了讨论,但对其产生科学创新的劣势的具体讨论却很有限。在本研究中,我们采用新兴关键词(EKs)和高成功新兴关键词(HS-EKs)方法,分别代表新兴课题和高影响力新兴课题的科学计量要素,并以此为指标,旨在阐明全球化对生命科学和医学新兴课题产生的影响。我们利用 PubMed 对 53 个国家的所有论文产出进行了分析,发现在过去 50 年中,全球论文产出和 EK 生成量与经济增长同步增长。然而,HS-EK 所反映的科学创新效率却显著下降,而且这种影响与国家层面的经济产出无关。我们还报告了研究期间研究课题的同质化,这可能是导致所观察到的HS-EK生成效率下降的一个因素。最后,我们讨论了导致科学同质化的基本问题、对科学创新的影响以及修复科学创新引擎可能需要的政策。
{"title":"Trends in emerging topics generation across countries in life science and medicine","authors":"Bryan Mathis , Ryosuke L. Ohniwa","doi":"10.1016/j.joi.2024.101552","DOIUrl":"https://doi.org/10.1016/j.joi.2024.101552","url":null,"abstract":"<div><p>While advantages of global homogenization and standardization have been discussed on scientific and technological research activities, specific discussion on the disadvantages to generate scientific innovation has been limited. In this study, we aim to clarify the impact of globalization to generate emerging topics in life science and medicine by applying the emerging keywords (EKs) and highly successful emerging keywords (HS-EKs) methodology, which represent scientometric elements of emerging topics and high-impact emerging topics, respectively, as indicators. We analyzed all paper output from 53 countries using PubMed and found a global increase in paper output and EK generation in line with economic growth in the past 50 years. However, the efficiency to generate scientific innovation, reflected in HS-EKs, was significantly reduced and this effect was independent of country-level economic output. We also reported homogenization in research topics over the study period as a possible factor for the observed decrease in HS-EK generation efficiency. Finally, we discuss the foundational issues that gave rise to homogenized science, the impact on scientific innovation, and what policies might be necessary to repair the scientific innovation-generating engine.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":null,"pages":null},"PeriodicalIF":3.7,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141263985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-30DOI: 10.1016/j.joi.2024.101546
Daniel Torres-Salinas , Enrique Orduña-Malea , Ángel Delgado-Vázquez , Juan Gorraiz , Wenceslao Arroyo-Machado
The document 'Foundations of Narrative Bibliometrics' provides an analysis of the evolution of scientific assessment, highlighting the influence of manifestos such as DORA and CoARA in shaping ethical and responsible practices in academia, as well as their assimilation by Spanish scientific policies. It connects this context with the contributions of evaluative bibliometrics, emphasising the transition towards a more integrative approach that advocates for a balance between quantitative and qualitative methods in research evaluation. Furthermore, it underscores how the Narrative Curriculum has emerged as one of the fundamental tools in new evaluation processes, as it allows for the description of the complexity and context of academic achievements. Narrative Bibliometrics is proposed, defined as the use of bibliometric indicators to generate stories that support the defence and exposition of a scientific curriculum and/or its individual contributions within the framework of a scientific evaluation process, which demands narratives. To introduce the reader, it presents, in a non-exhaustive manner, sources, indicators, and practical cases for effectively applying Narrative Bibliometrics in various scientific evaluation contexts. Hence, this document contributes to the responsible use of bibliometric indicators, serving as a tool for evaluators and researchers.
叙事文献计量学的基础 "文件分析了科学评估的演变过程,强调了 DORA 和 CoARA 等宣言在塑造学术界道德和责任实践方面的影响,以及西班牙科学政策对这些宣言的吸收。报告将这一背景与文献计量学评估的贡献联系起来,强调了向更加综合的方法过渡,主张在研究评估中兼顾定量和定性方法。此外,它还强调了叙事课程是如何成为新评价过程中的基本工具之一的,因为它可以描述学术成就的复杂性和背景。书目统计叙事学被定义为利用书目统计指标生成故事,以支持在需要叙事的科学评价过程框架内对科学课程和/或其个人贡献进行辩护和阐述。为了向读者介绍,本文件以非详尽的方式介绍了在各种科学评价背景下有效应用文献计量学叙事的来源、指标和实际案例。因此,本文件有助于负责任地使用文献计量学指标,为评估人员和研究人员提供工具。
{"title":"Foundations of Narrative Bibliometrics","authors":"Daniel Torres-Salinas , Enrique Orduña-Malea , Ángel Delgado-Vázquez , Juan Gorraiz , Wenceslao Arroyo-Machado","doi":"10.1016/j.joi.2024.101546","DOIUrl":"10.1016/j.joi.2024.101546","url":null,"abstract":"<div><p>The document 'Foundations of Narrative Bibliometrics' provides an analysis of the evolution of scientific assessment, highlighting the influence of manifestos such as DORA and CoARA in shaping ethical and responsible practices in academia, as well as their assimilation by Spanish scientific policies. It connects this context with the contributions of evaluative bibliometrics, emphasising the transition towards a more integrative approach that advocates for a balance between quantitative and qualitative methods in research evaluation. Furthermore, it underscores how the Narrative Curriculum has emerged as one of the fundamental tools in new evaluation processes, as it allows for the description of the complexity and context of academic achievements. Narrative Bibliometrics is proposed, defined as the use of bibliometric indicators to generate stories that support the defence and exposition of a scientific curriculum and/or its individual contributions within the framework of a scientific evaluation process, which demands narratives. To introduce the reader, it presents, in a non-exhaustive manner, sources, indicators, and practical cases for effectively applying Narrative Bibliometrics in various scientific evaluation contexts. Hence, this document contributes to the responsible use of bibliometric indicators, serving as a tool for evaluators and researchers.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":null,"pages":null},"PeriodicalIF":3.7,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1751157724000592/pdfft?md5=7fbc6cfc08d9d3423ead6951c148e6b5&pid=1-s2.0-S1751157724000592-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141190650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-28DOI: 10.1016/j.joi.2024.101541
Li Tang , Defang Yang , Mingxing Wang , Ying Guo
The highly skewed nature of research influence has been widely acknowledged. Among extant studies examining contributing factors, most focus on the hard sciences in developed economies with very few examining the social sciences in emerging powers. The impact of citation scope is likewise left largely underexplored. In this paper, we develop two novel measures of citation scope using geography and research field as metrics and explore their role in boosting academic impact. Our results support geography as a citation scope serving an important pathway through which international collaboration affects academic impact. Such effect increases in prominence in later years. We do not find evidence indicating the mediating effect of research field citation scope on scholarly recognition.
{"title":"The mediating impact of citation scope: Evidence from China's ESI publications","authors":"Li Tang , Defang Yang , Mingxing Wang , Ying Guo","doi":"10.1016/j.joi.2024.101541","DOIUrl":"10.1016/j.joi.2024.101541","url":null,"abstract":"<div><p>The highly skewed nature of research influence has been widely acknowledged. Among extant studies examining contributing factors, most focus on the hard sciences in developed economies with very few examining the social sciences in emerging powers. The impact of citation scope is likewise left largely underexplored. In this paper, we develop two novel measures of citation scope using geography and research field as metrics and explore their role in boosting academic impact. Our results support geography as a citation scope serving an important pathway through which international collaboration affects academic impact. Such effect increases in prominence in later years. We do not find evidence indicating the mediating effect of research field citation scope on scholarly recognition.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":null,"pages":null},"PeriodicalIF":3.7,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141190583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-27DOI: 10.1016/j.joi.2024.101545
Wei Zhang , Juyang Cao , Manuel Sebastian Mariani , Zhen-Zhen Wang , Mingyang Zhou , Wei Chen , Hao Liao
Methods to rank documents in large-scale citation data are increasingly assessed in terms of their ability to identify small sets of expert-selected papers. Here, we propose an algorithm for the accurate identification of milestone papers from citation networks. The algorithm combines an influence propagation process with game theory concepts. It outperforms state-of-the-art metrics in the identification of milestone papers in aggregate citation network data, while potentially mitigating the ranking's temporal bias compared with metrics that have similar milestone identification performance. The proposed method sheds light on the interplay between ranking accuracy and temporal bias.
{"title":"Uncovering milestone papers: A network diffusion and game theory approach","authors":"Wei Zhang , Juyang Cao , Manuel Sebastian Mariani , Zhen-Zhen Wang , Mingyang Zhou , Wei Chen , Hao Liao","doi":"10.1016/j.joi.2024.101545","DOIUrl":"10.1016/j.joi.2024.101545","url":null,"abstract":"<div><p>Methods to rank documents in large-scale citation data are increasingly assessed in terms of their ability to identify small sets of expert-selected papers. Here, we propose an algorithm for the accurate identification of milestone papers from citation networks. The algorithm combines an influence propagation process with game theory concepts. It outperforms state-of-the-art metrics in the identification of milestone papers in aggregate citation network data, while potentially mitigating the ranking's temporal bias compared with metrics that have similar milestone identification performance. The proposed method sheds light on the interplay between ranking accuracy and temporal bias.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":null,"pages":null},"PeriodicalIF":3.7,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141191079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-18DOI: 10.1016/j.joi.2024.101543
Xianzhe Peng, Huixin Xu, Jin Shi
The growth of excellent scholars provides paradigmatic career paths leading to research success, as their research capabilities ultimately manifest as fluctuations in bibliometric indexes. Examining the commonalities in the trajectories of these bibliometric indexes displays the universal characteristics of their growth process, and furtherly shows exemplary routes to scientific success. In this study, we examine 287 excellent scholars elected as ACM Fellows in the field of computer science from 2016s to 2020s. Based on their changes in productivity, impact, and comprehensive abilities, we categorize them into three categories, four categories, and six categories, respectively. Most of these scholars experience continuous growth in productivity during the early development stages, maintaining a prolonged period of high productivity in the mid-later maturity stages. Their impact rises smoothly and consistently, while the growth of their comprehensive abilities is relatively gradual, remaining at above-average levels in the mid-later maturity stages. Furthermore, the level of recognition within the scientific research community varies for different categories of scholars, and there are also differences in the growth patterns between scholars from Asia and those from Western regions.
{"title":"Are the bibliometric growth patterns of excellent scholars similar? From the analysis of ACM Fellows","authors":"Xianzhe Peng, Huixin Xu, Jin Shi","doi":"10.1016/j.joi.2024.101543","DOIUrl":"10.1016/j.joi.2024.101543","url":null,"abstract":"<div><p>The growth of excellent scholars provides paradigmatic career paths leading to research success, as their research capabilities ultimately manifest as fluctuations in bibliometric indexes. Examining the commonalities in the trajectories of these bibliometric indexes displays the universal characteristics of their growth process, and furtherly shows exemplary routes to scientific success. In this study, we examine 287 excellent scholars elected as ACM Fellows in the field of computer science from 2016s to 2020s. Based on their changes in productivity, impact, and comprehensive abilities, we categorize them into three categories, four categories, and six categories, respectively. Most of these scholars experience continuous growth in productivity during the early development stages, maintaining a prolonged period of high productivity in the mid-later maturity stages. Their impact rises smoothly and consistently, while the growth of their comprehensive abilities is relatively gradual, remaining at above-average levels in the mid-later maturity stages. Furthermore, the level of recognition within the scientific research community varies for different categories of scholars, and there are also differences in the growth patterns between scholars from Asia and those from Western regions.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":null,"pages":null},"PeriodicalIF":3.7,"publicationDate":"2024-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141063565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-13DOI: 10.1016/j.joi.2024.101542
Ling Kong , Wei Zhang , Haotian Hu , Zhu Liang , Yonggang Han , Dongbo Wang , Min Song
The diffusion of citation knowledge is an important measure of transdisciplinary scientific impact and the diversity of transdisciplinary citation content (sentences). Moreover, combining citation sentiment (CS) and citation aspect (CA) can help researchers identify the attitudes, ideas, or positions reflected in the evolution of scientific elements (e.g., theories, techniques, and methods). This is because of their use by scholars from different disciplines, paving the way toward transdisciplinary penetration and the development of domain knowledge through the proliferation of cited knowledge. However, most studies mainly address citation aspect classification (CAC) and citation sentiment classification (CSC) separately, ignoring their shared features of interactions. In this study, we construct a dataset for transdisciplinary citation content analysis using citations and academic full texts from the Chinese Social Sciences Citation Index (CSSCI), which includes 14,832 manually-annotated citations. Thereafter, we utilized the developed dataset to conduct a transdisciplinary fine-grained citation content analysis by combining CAC and CSC. The objective of the CAC task was to classify transdisciplinary citations into theoretical concepts (TC), methodological techniques (MT), and data information (DI), whereas the CSC task classified citations into positive, negative, and neutral classes. Furthermore, we leveraged a multi-task learning (MTL) model to perform CAC and CSC jointly and then compared its performance to those of several widely-used deep learning models. Our model achieved 83.10 % accuracy for CAC and 80.46 % accuracy for CSC, demonstrating its superiority to single-task systems. This indicates the strong correlation between the CAC and CSC of transdisciplinary citation tasks, benefiting from each other when learned concurrently. This new method can be used as an auxiliary decision support system to extend the analysis of transdisciplinary citation content.
{"title":"Transdisciplinary fine-grained citation content analysis: A multi-task learning perspective for citation aspect and sentiment classification","authors":"Ling Kong , Wei Zhang , Haotian Hu , Zhu Liang , Yonggang Han , Dongbo Wang , Min Song","doi":"10.1016/j.joi.2024.101542","DOIUrl":"https://doi.org/10.1016/j.joi.2024.101542","url":null,"abstract":"<div><p>The diffusion of citation knowledge is an important measure of transdisciplinary scientific impact and the diversity of transdisciplinary citation content (sentences). Moreover, combining citation sentiment (CS) and citation aspect (CA) can help researchers identify the attitudes, ideas, or positions reflected in the evolution of scientific elements (e.g., theories, techniques, and methods). This is because of their use by scholars from different disciplines, paving the way toward transdisciplinary penetration and the development of domain knowledge through the proliferation of cited knowledge. However, most studies mainly address citation aspect classification (CAC) and citation sentiment classification (CSC) separately, ignoring their shared features of interactions. In this study, we construct a dataset for transdisciplinary citation content analysis using citations and academic full texts from the Chinese Social Sciences Citation Index (CSSCI), which includes 14,832 manually-annotated citations. Thereafter, we utilized the developed dataset to conduct a transdisciplinary fine-grained citation content analysis by combining CAC and CSC. The objective of the CAC task was to classify transdisciplinary citations into theoretical concepts (TC), methodological techniques (MT), and data information (DI), whereas the CSC task classified citations into positive, negative, and neutral classes. Furthermore, we leveraged a multi-task learning (MTL) model to perform CAC and CSC jointly and then compared its performance to those of several widely-used deep learning models. Our model achieved 83.10 % accuracy for CAC and 80.46 % accuracy for CSC, demonstrating its superiority to single-task systems. This indicates the strong correlation between the CAC and CSC of transdisciplinary citation tasks, benefiting from each other when learned concurrently. This new method can be used as an auxiliary decision support system to extend the analysis of transdisciplinary citation content.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":null,"pages":null},"PeriodicalIF":3.7,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140918104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-07DOI: 10.1016/j.joi.2024.101531
Xing Wang
Nonlinear field normalization citation counts at the paper level obtained using nonlinear field normalization methods should not be added or averaged. Unfortunately, there are many cases adding or averaging the nonlinear normalized citation counts of individual papers that can be found in the academic literature, indicating that nonlinear field normalization methods have long been misused in academia. In this paper, we performed the following three research works. First, we analyzed why the nonlinear normalized citation counts of individual papers should not be added or averaged from the perspective of theoretical analysis in mathematics: we provided mathematical proofs for the crucial steps of the analysis. Second, we systematically classified the existing main field normalization methods into linear and nonlinear field normalization methods. Third, we used real citation data to explore the error effects caused by adding or averaging the nonlinear normalized citation counts on practical research evaluation results. The above three research works provide a theoretical basis for the proper use of field normalization methods in the future. Furthermore, because our mathematical proof is applicable to all nonlinear data in the entire real number domain, our research works are also meaningful for the whole field of data and information science.
{"title":"The misuse of the nonlinear field normalization method: Nonlinear field normalization citation counts at the paper level should not be added or averaged","authors":"Xing Wang","doi":"10.1016/j.joi.2024.101531","DOIUrl":"https://doi.org/10.1016/j.joi.2024.101531","url":null,"abstract":"<div><p>Nonlinear field normalization citation counts at the paper level obtained using nonlinear field normalization methods should not be added or averaged. Unfortunately, there are many cases adding or averaging the nonlinear normalized citation counts of individual papers that can be found in the academic literature, indicating that nonlinear field normalization methods have long been misused in academia. In this paper, we performed the following three research works. First, we analyzed why the nonlinear normalized citation counts of individual papers should not be added or averaged from the perspective of theoretical analysis in mathematics: we provided mathematical proofs for the crucial steps of the analysis. Second, we systematically classified the existing main field normalization methods into linear and nonlinear field normalization methods. Third, we used real citation data to explore the error effects caused by adding or averaging the nonlinear normalized citation counts on practical research evaluation results. The above three research works provide a theoretical basis for the proper use of field normalization methods in the future. Furthermore, because our mathematical proof is applicable to all nonlinear data in the entire real number domain, our research works are also meaningful for the whole field of data and information science.</p></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":null,"pages":null},"PeriodicalIF":3.7,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1751157724000440/pdfft?md5=2cee2433e7c0ada2b3ebaae74c0e5282&pid=1-s2.0-S1751157724000440-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140878581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}