首页 > 最新文献

Journal of Information Science最新文献

英文 中文
Relevance meets calibration: Triple calibration distance design for neighbour-based recommender systems 相关性与校准:基于邻居的推荐系统的三重校准距离设计
IF 2.4 4区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-07-07 DOI: 10.1177/01655515231182069
Zhuang Chen, Haitao Zou, Hualong Yu, Shang Zheng, Shang Gao
Calibrated recommendations are devoted to revealing the various preferences of users with the appropriate proportions in the recommendation list. Most of the existing calibrated-oriented recommendations take an extra postprocessing step to rerank the initial outputs. However, applying this postprocessing strategy may decrease the recommendation relevance, since the origin accurate outputs have been scattered, and they usually ignore the calibration between pairwise users/items. Instead of reranking the recommendation outputs, this article is dedicated to modifying the criterion of neighbour users’ selection, where we look forward to strengthening the recommendation relevance by calibrating the neighbourhood. We propose the first-order, second-order and the third-order calibration distance based on the motivation that if a user has a similar genre distribution or genre rating schema towards the target user, then his or her suggestions will be more useful for rating prediction. We also provide an equivalent transformation for the original method to speed up the algorithm with solid theoretical proof. Experimental analysis on two publicly available data sets empirically shows that our approaches are better than some of the state-of-the-art methods in terms of recommendation relevance, calibration and efficiency.
校准推荐致力于在推荐列表中以适当的比例揭示用户的各种偏好。大多数现有的面向校准的建议都需要额外的后处理步骤来重新排列初始输出。然而,应用这种后处理策略可能会降低推荐相关性,因为原始准确输出是分散的,并且它们通常忽略了成对用户/项目之间的校准。本文致力于修改邻居用户选择的标准,而不是对推荐输出进行重新排序,我们期待通过校准邻居来增强推荐相关性。基于用户与目标用户具有相似的体裁分布或体裁评分模式的动机,我们提出了一阶、二阶和三阶校准距离,那么他或她的建议对评分预测更有用。我们还对原方法进行了等价变换,以提高算法的速度,并提供了坚实的理论证明。对两个公开数据集的实验分析表明,我们的方法在推荐相关性、校准和效率方面优于一些最先进的方法。
{"title":"Relevance meets calibration: Triple calibration distance design for neighbour-based recommender systems","authors":"Zhuang Chen, Haitao Zou, Hualong Yu, Shang Zheng, Shang Gao","doi":"10.1177/01655515231182069","DOIUrl":"https://doi.org/10.1177/01655515231182069","url":null,"abstract":"Calibrated recommendations are devoted to revealing the various preferences of users with the appropriate proportions in the recommendation list. Most of the existing calibrated-oriented recommendations take an extra postprocessing step to rerank the initial outputs. However, applying this postprocessing strategy may decrease the recommendation relevance, since the origin accurate outputs have been scattered, and they usually ignore the calibration between pairwise users/items. Instead of reranking the recommendation outputs, this article is dedicated to modifying the criterion of neighbour users’ selection, where we look forward to strengthening the recommendation relevance by calibrating the neighbourhood. We propose the first-order, second-order and the third-order calibration distance based on the motivation that if a user has a similar genre distribution or genre rating schema towards the target user, then his or her suggestions will be more useful for rating prediction. We also provide an equivalent transformation for the original method to speed up the algorithm with solid theoretical proof. Experimental analysis on two publicly available data sets empirically shows that our approaches are better than some of the state-of-the-art methods in terms of recommendation relevance, calibration and efficiency.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45064146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring interdisciplinarity of science projects based on the text mining 基于文本挖掘的科学项目跨学科性探索
IF 2.4 4区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-07-03 DOI: 10.1177/01655515231182075
Zhang Xue, Zhiqiang Zhang, Zhengyin Hu
Interdisciplinary research has gradually become one of the main driving forces to promote original innovation of scientific research, and how to measure the interdisciplinarity of science project is becoming an important topic in the science foundation managements. Existing researches mainly using methods, such as academic degree or institutional discipline or discipline category mapping of journals, to measure the interdisciplinarity. This study proposes an approach to mine and capture the different or complementary characteristics of interdisciplinarity of projects by combining text mining and machine learning methods. First, we construct the classification system and extract a raw paper and its discipline matrix according to the discipline category of journals where the references were published in. Second, we cut the matrix to summarise the distribution of key disciplines in each paper and extract the text features in the abstract and title to form a training set. Finally, we compare and analyse the classification effects of Naive Bayesian Model, Support Vector Machine and Bidirectional Encoder Representations from Transformers (BERT) model. Then, the model evaluation indicators show that the best classification effect was achieved by the BERT model. Therefore, the deep pre-trained linguistic model BERT is chosen to predict the discipline distribution of each project. In addition, the different aspects of interdisciplinarity are measured using network coherence and discipline diversity indicators. Besides, experts are invited to evaluate and interpret the results. This proposed approach could be applied to deeply understand the discipline integration from a new perspective.
跨学科研究已逐渐成为推动科学研究原始创新的主要动力之一,如何衡量科学项目的跨学科性正成为科学基金管理中的一个重要课题。现有研究主要采用学位或机构学科或期刊学科类别映射等方法来衡量跨学科性。本研究提出了一种通过结合文本挖掘和机器学习方法来挖掘和捕获项目跨学科性的不同或互补特征的方法。首先,构建分类体系,根据文献所在期刊的学科类别提取原论文及其学科矩阵;其次,对矩阵进行裁剪,总结每篇论文中重点学科的分布,提取摘要和标题中的文本特征,形成训练集;最后,比较分析了朴素贝叶斯模型、支持向量机模型和双向编码器表示的分类效果。然后,模型评价指标表明BERT模型的分类效果最好。因此,选择深度预训练语言模型BERT来预测每个项目的学科分布。此外,使用网络一致性和学科多样性指标来衡量跨学科性的不同方面。此外,还邀请专家对结果进行评价和解释。该方法可以从一个新的视角来深入理解学科整合。
{"title":"Exploring interdisciplinarity of science projects based on the text mining","authors":"Zhang Xue, Zhiqiang Zhang, Zhengyin Hu","doi":"10.1177/01655515231182075","DOIUrl":"https://doi.org/10.1177/01655515231182075","url":null,"abstract":"Interdisciplinary research has gradually become one of the main driving forces to promote original innovation of scientific research, and how to measure the interdisciplinarity of science project is becoming an important topic in the science foundation managements. Existing researches mainly using methods, such as academic degree or institutional discipline or discipline category mapping of journals, to measure the interdisciplinarity. This study proposes an approach to mine and capture the different or complementary characteristics of interdisciplinarity of projects by combining text mining and machine learning methods. First, we construct the classification system and extract a raw paper and its discipline matrix according to the discipline category of journals where the references were published in. Second, we cut the matrix to summarise the distribution of key disciplines in each paper and extract the text features in the abstract and title to form a training set. Finally, we compare and analyse the classification effects of Naive Bayesian Model, Support Vector Machine and Bidirectional Encoder Representations from Transformers (BERT) model. Then, the model evaluation indicators show that the best classification effect was achieved by the BERT model. Therefore, the deep pre-trained linguistic model BERT is chosen to predict the discipline distribution of each project. In addition, the different aspects of interdisciplinarity are measured using network coherence and discipline diversity indicators. Besides, experts are invited to evaluate and interpret the results. This proposed approach could be applied to deeply understand the discipline integration from a new perspective.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42368051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A mathematical analysis of the h-index and study of its correlation with some improved metrics: A conceptual approach h指数的数学分析及其与一些改进度量的相关性研究:一种概念方法
IF 2.4 4区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-07-03 DOI: 10.1177/01655515231184832
Dhrubajyoti Borgohain, B. Lund, M. Verma
This article aims to establish the h-index in different mathematical aspects and measure the correlation of the h-index with other metrics using the bibliographic data of selected journals in the Library and Information Science (LIS) domain. Using the concept of relation and function, the h-index is expressed. Data collected from authors in three major LIS journals, including h-index, g-index, m-index, total citations and total publications, are analysed using correlation and regression analyses. The findings indicate a high level of relationship between h-index and g-index scores and m-index at starting year of publication. Conversely, a lower level of relationship between the h-index, g-index, starting year of publication, the total number of publications and the number of citations. This is an original study and will be of interest to the researchers of LIS who wants to know this performance indicator in different aspects, the dependency/independency of the h-index with other metrics, and the impact/performance of the three journals over time in terms of h-index, m-index, total citations and the number of publications. The study is limited to analysis of performance-and-citation-related measurements h-index, g-index, m-index, total citations, and the number of publications.
本文旨在利用图书馆情报学领域精选期刊的书目数据,从不同的数学方面建立h指数,并测量h指数与其他指标的相关性。利用关系和函数的概念来表示h指数。采用相关分析和回归分析的方法,对美国3种主要期刊的作者数据进行分析,包括h指数、g指数、m指数、总被引数和总发表数。研究结果表明,在出版开始年份,h指数、g指数得分和m指数之间存在高度相关。相反,h指数、g指数、发表起始年份、总发表数和被引次数之间的关系水平较低。这是一项原创研究,对于想要从不同方面了解这一绩效指标、h-index与其他指标的依赖性/独立性,以及三种期刊在h-index、m-index、总被引次数和发表数量方面随时间变化的影响/绩效的LIS研究人员来说将会很感兴趣。本研究仅限于分析与绩效和引用相关的测量指标h指数、g指数、m指数、总引用和出版物数量。
{"title":"A mathematical analysis of the h-index and study of its correlation with some improved metrics: A conceptual approach","authors":"Dhrubajyoti Borgohain, B. Lund, M. Verma","doi":"10.1177/01655515231184832","DOIUrl":"https://doi.org/10.1177/01655515231184832","url":null,"abstract":"This article aims to establish the h-index in different mathematical aspects and measure the correlation of the h-index with other metrics using the bibliographic data of selected journals in the Library and Information Science (LIS) domain. Using the concept of relation and function, the h-index is expressed. Data collected from authors in three major LIS journals, including h-index, g-index, m-index, total citations and total publications, are analysed using correlation and regression analyses. The findings indicate a high level of relationship between h-index and g-index scores and m-index at starting year of publication. Conversely, a lower level of relationship between the h-index, g-index, starting year of publication, the total number of publications and the number of citations. This is an original study and will be of interest to the researchers of LIS who wants to know this performance indicator in different aspects, the dependency/independency of the h-index with other metrics, and the impact/performance of the three journals over time in terms of h-index, m-index, total citations and the number of publications. The study is limited to analysis of performance-and-citation-related measurements h-index, g-index, m-index, total citations, and the number of publications.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41619474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scientometric review of Web 3.0 科学
IF 2.4 4区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-06-30 DOI: 10.1177/01655515231182073
Dhaval Kukreja, Shikha Gupta, Dheeraj Patel, J. Rai
Web 3.0 is a next-generation web architecture that envisions a more decentralised, secure and intelligent Internet. Its implications are vast and could potentially impact various research areas, such as e-commerce, social networking, finance, healthcare and education. It can be seen as a confluence of various technological advancements, including blockchain, artificial intelligence, semantic web and decentralised web technologies, which continue to attract substantial research interest in several dimensions and categories throughout the last few decades. A detailed scientometric analysis was undertaken to obtain concise understanding on development and publication trends of this multi-dimensional field. Corpus of 1154 articles, extracted from Web of Science from 2002 to 2022, were used to identify networks of co-authorship, keywords, subject categories, institutions and countries engaged in publishing on Web 3.0 along with co-citation and cluster analysis. Networks and interactive visualisations created using CiteSpace revealed new research areas where Web 3.0 may be beneficial and potential directions of development for Web 3.0 discipline. We identify Journalism 3.0, Personal Data Stores (PDS), Decentralised File Storages (DFS) and Metaverse as emerging domains Web 3.0 research, seeking overwhelming research attention globally.
Web 3.0是下一代网络架构,它设想了一个更加分散、安全和智能的互联网。它的影响是巨大的,可能会影响到各个研究领域,如电子商务、社交网络、金融、医疗保健和教育。它可以被看作是各种技术进步的融合,包括区块链、人工智能、语义网和分散的网络技术,在过去的几十年里,这些技术在几个维度和类别上继续吸引着大量的研究兴趣。本文进行了详细的科学计量分析,以获得对这一多维领域的发展和出版趋势的简明理解。利用2002 ~ 2022年Web of Science数据库中1154篇文章的语料库,对参与Web 3.0发表的合作作者网络、关键词、主题类别、机构和国家进行了识别,并进行了共被引和聚类分析。使用CiteSpace创建的网络和交互式可视化显示了Web 3.0可能有益的新研究领域和Web 3.0学科的潜在发展方向。我们确定新闻3.0,个人数据存储(PDS),分散文件存储(DFS)和元宇宙作为新兴领域的Web 3.0研究,寻求全球压倒性的研究关注。
{"title":"Scientometric review of Web 3.0","authors":"Dhaval Kukreja, Shikha Gupta, Dheeraj Patel, J. Rai","doi":"10.1177/01655515231182073","DOIUrl":"https://doi.org/10.1177/01655515231182073","url":null,"abstract":"Web 3.0 is a next-generation web architecture that envisions a more decentralised, secure and intelligent Internet. Its implications are vast and could potentially impact various research areas, such as e-commerce, social networking, finance, healthcare and education. It can be seen as a confluence of various technological advancements, including blockchain, artificial intelligence, semantic web and decentralised web technologies, which continue to attract substantial research interest in several dimensions and categories throughout the last few decades. A detailed scientometric analysis was undertaken to obtain concise understanding on development and publication trends of this multi-dimensional field. Corpus of 1154 articles, extracted from Web of Science from 2002 to 2022, were used to identify networks of co-authorship, keywords, subject categories, institutions and countries engaged in publishing on Web 3.0 along with co-citation and cluster analysis. Networks and interactive visualisations created using CiteSpace revealed new research areas where Web 3.0 may be beneficial and potential directions of development for Web 3.0 discipline. We identify Journalism 3.0, Personal Data Stores (PDS), Decentralised File Storages (DFS) and Metaverse as emerging domains Web 3.0 research, seeking overwhelming research attention globally.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48033695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modelling and analysis of misinformation diffusion based on the double intervention mechanism 基于双重干预机制的虚假信息传播建模与分析
IF 2.4 4区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-06-26 DOI: 10.1177/01655515231182076
Cheng Jiang, Yong-tian Yu, Xinyu Zhang
Although official departments attempt to intervene against misinformation, the personal field often conflicts with the goals of these departments. Thus, when rumours spread widely on social media, decision-makers often use a combination of rigid and soft control measures, such as blocking keywords, deleting misinformation, suspending accounts or refuting misinformation, to decrease the diffusion of misinformation. However, existing methods rarely consider the interplay of blocking and rebuttal measures, resulting in an unclear effect of the double intervention mechanism. To address these issues, we propose a novel misinformation diffusion model called SEIRI (susceptible, exposed, infective, removed, and infective) that considers the double intervention mechanism and secondary diffusion characteristics. We analyse the stability of the proposed model, obtain rumour-free and rumour-spread equilibriums, and calculate the basic reproduction number. Furthermore, we conduct numerical simulations to analyse the influence of key parameters through comparative experiments. Finally, we validate the effectiveness of the proposed approach by crawling a real-world data set of COVID-19-related misinformation tweets from Sina Weibo. Our comparison experiments with other similar works show that the SEIRI model provides superior performance in characterising the actual spread of misinformation. Our findings lead to several practical implications for public health policymaking.
尽管官方部门试图对错误信息进行干预,但个人领域往往与这些部门的目标相冲突。因此,当谣言在社交媒体上广泛传播时,决策者往往采用硬软相结合的控制措施,如屏蔽关键词、删除错误信息、封号或驳斥错误信息等,以减少错误信息的传播。然而,现有的方法很少考虑阻断和反驳措施的相互作用,导致双重干预机制的效果不明确。为了解决这些问题,我们提出了一种新的错误信息扩散模型,称为SEIRI(易感、暴露、感染、移除和感染),该模型考虑了双重干预机制和二次扩散特征。我们分析了所提模型的稳定性,得到了无谣言均衡和谣言扩散均衡,并计算了基本复制数。并通过对比实验进行数值模拟,分析关键参数的影响。最后,我们通过从新浪微博上抓取与covid -19相关的错误信息推文的真实数据集来验证所提出方法的有效性。我们与其他类似工作的比较实验表明,SEIRI模型在描述错误信息的实际传播方面提供了优越的性能。我们的研究结果对公共卫生政策制定有一些实际意义。
{"title":"Modelling and analysis of misinformation diffusion based on the double intervention mechanism","authors":"Cheng Jiang, Yong-tian Yu, Xinyu Zhang","doi":"10.1177/01655515231182076","DOIUrl":"https://doi.org/10.1177/01655515231182076","url":null,"abstract":"Although official departments attempt to intervene against misinformation, the personal field often conflicts with the goals of these departments. Thus, when rumours spread widely on social media, decision-makers often use a combination of rigid and soft control measures, such as blocking keywords, deleting misinformation, suspending accounts or refuting misinformation, to decrease the diffusion of misinformation. However, existing methods rarely consider the interplay of blocking and rebuttal measures, resulting in an unclear effect of the double intervention mechanism. To address these issues, we propose a novel misinformation diffusion model called SEIRI (susceptible, exposed, infective, removed, and infective) that considers the double intervention mechanism and secondary diffusion characteristics. We analyse the stability of the proposed model, obtain rumour-free and rumour-spread equilibriums, and calculate the basic reproduction number. Furthermore, we conduct numerical simulations to analyse the influence of key parameters through comparative experiments. Finally, we validate the effectiveness of the proposed approach by crawling a real-world data set of COVID-19-related misinformation tweets from Sina Weibo. Our comparison experiments with other similar works show that the SEIRI model provides superior performance in characterising the actual spread of misinformation. Our findings lead to several practical implications for public health policymaking.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42374534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating the reviewer assignment problem: A systematic literature review 调查审稿人分配问题:一个系统的文献综述
IF 2.4 4区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-06-26 DOI: 10.1177/01655515231176668
Ana Carolina Ribeiro, Amanda Sizo, Luís Paulo Reis
The assignment of appropriate reviewers to academic articles, known as the reviewer assignment problem (RAP), has become a crucial issue in academia. While there has been much research on RAP, there has not yet been a systematic literature review (SLR) examining the various approaches, techniques, algorithms and discoveries related to this topic. To conduct the SLR, we identified and evaluated relevant articles from four databases using defined inclusion and exclusion criteria. We analysed the selected articles and extracted information, and assessed their quality. Our review identified 67 articles on RAP published in conferences and journals up to mid-2022. As one of the main challenges in RAP is acquiring open data, we have studied the data sources used by researchers and found that most studies use real data from conferences, bibliographic databases and online academic search engines. RAP is divided into two main phases: (1) finding/recommending expert reviewers and (2) assigning reviewers to submitted manuscripts. In Phase 1, we have identified that decision support systems, recommendation systems, and machine learning-oriented approaches are more commonly used due to better results. In Phase 2, heuristics and metaheuristics are the approaches that present better results and are consequently more commonly used by researchers. Based on the analysed studies, we have identified potential areas for future research that could lead to improved results. Specifically, we suggest exploring the application of deep neural networks for calculating the degree of correspondence and using the Boolean satisfiability problem to optimise the attribution process.
为学术文章分配合适的审稿人,即审稿人分配问题(RAP),已成为学术界的一个关键问题。虽然对RAP进行了大量研究,但尚未对与该主题相关的各种方法、技术、算法和发现进行系统的文献综述(SLR)。为了进行SLR,我们使用定义的纳入和排除标准从四个数据库中识别和评估了相关文章。我们分析了选定的文章和提取的信息,并评估了它们的质量。我们的综述确定了截至2022年年中在会议和期刊上发表的67篇关于RAP的文章。由于RAP的主要挑战之一是获取开放数据,我们研究了研究人员使用的数据源,发现大多数研究都使用来自会议、书目数据库和在线学术搜索引擎的真实数据。RAP分为两个主要阶段:(1)寻找/推荐专家评审员和(2)为提交的稿件指派评审员。在第1阶段,我们发现决策支持系统、推荐系统和面向机器学习的方法由于效果更好而更常用。在第二阶段,启发式和元启发式是呈现更好结果的方法,因此更常被研究人员使用。根据分析的研究,我们确定了未来研究的潜在领域,这些领域可能会带来更好的结果。具体而言,我们建议探索深度神经网络在计算对应度方面的应用,并使用布尔可满足性问题来优化归因过程。
{"title":"Investigating the reviewer assignment problem: A systematic literature review","authors":"Ana Carolina Ribeiro, Amanda Sizo, Luís Paulo Reis","doi":"10.1177/01655515231176668","DOIUrl":"https://doi.org/10.1177/01655515231176668","url":null,"abstract":"The assignment of appropriate reviewers to academic articles, known as the reviewer assignment problem (RAP), has become a crucial issue in academia. While there has been much research on RAP, there has not yet been a systematic literature review (SLR) examining the various approaches, techniques, algorithms and discoveries related to this topic. To conduct the SLR, we identified and evaluated relevant articles from four databases using defined inclusion and exclusion criteria. We analysed the selected articles and extracted information, and assessed their quality. Our review identified 67 articles on RAP published in conferences and journals up to mid-2022. As one of the main challenges in RAP is acquiring open data, we have studied the data sources used by researchers and found that most studies use real data from conferences, bibliographic databases and online academic search engines. RAP is divided into two main phases: (1) finding/recommending expert reviewers and (2) assigning reviewers to submitted manuscripts. In Phase 1, we have identified that decision support systems, recommendation systems, and machine learning-oriented approaches are more commonly used due to better results. In Phase 2, heuristics and metaheuristics are the approaches that present better results and are consequently more commonly used by researchers. Based on the analysed studies, we have identified potential areas for future research that could lead to improved results. Specifically, we suggest exploring the application of deep neural networks for calculating the degree of correspondence and using the Boolean satisfiability problem to optimise the attribution process.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45537641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Does scientific collaboration variety influence the impact of articles? 科学合作的多样性是否影响文章的影响力?
IF 2.4 4区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-06-22 DOI: 10.1177/01655515231182067
Hou Jianhua, Yuyao He, Conglin Ye, Zhuomalamu, Suolanglamu, Song Haoyang
This study attempts to investigate the relationship between scientific collaboration variety and scientific output in a specific field. The indicators were set from co-author variety (co-author’s academic background variety and co-author’s network structure variety as independent variables) and article impact (academic impact and social impact as dependent variables). Considering other factors affecting the research results, we also set up control variables (the number of co-authors, proportion of high-level authors and ratio of highly productive authors). We used the Scopus database as the data source and collected all articles published in the dental field in 2018 as data. We used multiple linear regression analyses to examine the impact of co-authors’ variety on the article impact. The results demonstrate that the relationship between scientific collaboration variety and article impact is complicated, which depends on the type of variety of the cooperative scientist. Conversely, the same variety indicator presents the same results as the correlation analysis of academic and social impact articles. The findings indicate that authors can improve their scientific output by collaborating with similar authors of academic backgrounds or stable groups of authors, which provides guidance for scientific cooperation.
本研究试图探讨特定领域的科学协作多样性与科学产出之间的关系。指标由合著者的多样性(以合著者的学术背景多样性和网络结构多样性为自变量)和文章影响(以学术影响和社会影响为因变量)设定。考虑到影响研究结果的其他因素,我们还设置了控制变量(合著者数量、高水平作者比例和高产作者比例)。我们使用Scopus数据库作为数据源,并收集了2018年在牙科领域发表的所有文章作为数据。我们使用多元线性回归分析来检验合著者的多样性对文章影响的影响。结果表明,科学合作品种与文章影响之间的关系是复杂的,这取决于合作科学家的品种类型。相反,相同的多样性指标呈现出与学术和社会影响文章的相关性分析相同的结果。研究结果表明,作者可以通过与学术背景相似的作者或稳定的作者群体合作来提高他们的科学产出,这为科学合作提供了指导。
{"title":"Does scientific collaboration variety influence the impact of articles?","authors":"Hou Jianhua, Yuyao He, Conglin Ye, Zhuomalamu, Suolanglamu, Song Haoyang","doi":"10.1177/01655515231182067","DOIUrl":"https://doi.org/10.1177/01655515231182067","url":null,"abstract":"This study attempts to investigate the relationship between scientific collaboration variety and scientific output in a specific field. The indicators were set from co-author variety (co-author’s academic background variety and co-author’s network structure variety as independent variables) and article impact (academic impact and social impact as dependent variables). Considering other factors affecting the research results, we also set up control variables (the number of co-authors, proportion of high-level authors and ratio of highly productive authors). We used the Scopus database as the data source and collected all articles published in the dental field in 2018 as data. We used multiple linear regression analyses to examine the impact of co-authors’ variety on the article impact. The results demonstrate that the relationship between scientific collaboration variety and article impact is complicated, which depends on the type of variety of the cooperative scientist. Conversely, the same variety indicator presents the same results as the correlation analysis of academic and social impact articles. The findings indicate that authors can improve their scientific output by collaborating with similar authors of academic backgrounds or stable groups of authors, which provides guidance for scientific cooperation.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43142108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis and prediction of the formation of new technical phrases for inventive ideation 创新思维新技术短语形成的分析与预测
IF 2.4 4区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-06-08 DOI: 10.1177/01655515231171090
Haiying Ren, Lu Zhang, Chao Wang
Despite the fast pace of technological development, the process of inventive ideation remains fuzzy. Meanwhile, improving innovation efficiency has become critical for research and development (R&D) teams because of the fierce competition. This study claimed that new technical phrases (NTPs) were important carriers of novel inventive ideas, and their formation was key to understanding and improving ideation processes. Therefore, this article proposed a methodology to analyse and predict the formation of NTPs. First, based on the recombinant search theory and link prediction, four variables in the prior co-word network of a phrase that may influence its formation were collected. Thereafter, logistic regression and a classification tree were employed on patent data to explore the effects of these variables on NTPs. Moreover, various machine learning methods were used for developing NTP prediction models, and procedures for applying the prediction models in real-world R&D settings were designed. Finally, a case study was conducted using the proposed methodology for its demonstration and validation in neural network technology. The case study revealed that all the four variables posed significant impact on the formation of NTPs, and the prediction models yielded the highest prediction accuracy of 78.6% on the test set. The proposed methodology would shed light on the ideation process in innovation theory and provide R&D teams with practical tools for generating new technical ideas.
尽管技术发展速度很快,但创造性思维的过程仍然很模糊。同时,由于竞争激烈,提高创新效率已成为研发团队的关键。本研究认为,新技术短语是创新思想的重要载体,其形成是理解和改进思维过程的关键。因此,本文提出了一种分析和预测NTPs形成的方法。首先,基于重组搜索理论和链接预测,收集了短语先前共词网络中可能影响其形成的四个变量。然后,对专利数据进行逻辑回归和分类树,以探讨这些变量对NTPs的影响。此外,还使用了各种机器学习方法来开发NTP预测模型,并设计了在真实世界的研发环境中应用预测模型的程序。最后,利用所提出的方法在神经网络技术中进行了实例验证。案例研究表明,所有四个变量都对NTP的形成产生了显著影响,预测模型在测试集上的预测准确率最高,为78.6%。所提出的方法将阐明创新理论中的构思过程,并为研发团队提供产生新技术想法的实用工具。
{"title":"Analysis and prediction of the formation of new technical phrases for inventive ideation","authors":"Haiying Ren, Lu Zhang, Chao Wang","doi":"10.1177/01655515231171090","DOIUrl":"https://doi.org/10.1177/01655515231171090","url":null,"abstract":"Despite the fast pace of technological development, the process of inventive ideation remains fuzzy. Meanwhile, improving innovation efficiency has become critical for research and development (R&D) teams because of the fierce competition. This study claimed that new technical phrases (NTPs) were important carriers of novel inventive ideas, and their formation was key to understanding and improving ideation processes. Therefore, this article proposed a methodology to analyse and predict the formation of NTPs. First, based on the recombinant search theory and link prediction, four variables in the prior co-word network of a phrase that may influence its formation were collected. Thereafter, logistic regression and a classification tree were employed on patent data to explore the effects of these variables on NTPs. Moreover, various machine learning methods were used for developing NTP prediction models, and procedures for applying the prediction models in real-world R&D settings were designed. Finally, a case study was conducted using the proposed methodology for its demonstration and validation in neural network technology. The case study revealed that all the four variables posed significant impact on the formation of NTPs, and the prediction models yielded the highest prediction accuracy of 78.6% on the test set. The proposed methodology would shed light on the ideation process in innovation theory and provide R&D teams with practical tools for generating new technical ideas.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46199024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid approach for text categorization: A case study with Bangla news article 文本分类的混合方法——以孟加拉语新闻文章为例
IF 2.4 4区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-06-01 DOI: 10.1177/01655515211027770
Ankita Dhar, Himadri Mukherjee, K. Roy, K. Santosh, N. Dash
The incredible expansion of online texts due to the Internet has intensified and revived the interest of sorting, managing and categorising the documents into their respective domains. This shows the pressing need for automatic text categorization system to assign a document into its appropriate domain. In this article, the focus is on showcasing the effectiveness of a hybrid approach that works elegantly by combining text-based and graph-based features. The hybrid approach was applied on 14,373 Bangla articles with 57,22,569 tokens collected from various online news corpora covering nine categories. This article also presents the individual application of both the features to explicate how they generally work. For classification purposes, the feature sets were passed through the Bayesian classification methods which yield satisfactory results with 98.73% accuracy for Naïve Bayes Multinomial (NBM). Also, to test the robustness and language independency of the system, the experiments were performed on two popular English datasets as well.
由于因特网,在线文本的不可思议的扩展已经加强和恢复了对分类、管理和将文档分类到各自领域的兴趣。这表明了对文本自动分类系统将文档分配到相应领域的迫切需要。在本文中,重点是展示一种混合方法的有效性,这种方法通过结合基于文本和基于图形的特性而优雅地工作。混合方法应用于14,373篇孟加拉文文章,从各种在线新闻语料库中收集了57,22,569个代币,涵盖9个类别。本文还介绍了这两个特性的单独应用程序,以说明它们的一般工作原理。在分类方面,通过贝叶斯分类方法对特征集进行分类,对Naïve贝叶斯多项式(NBM)的分类准确率达到98.73%,结果令人满意。此外,为了测试系统的鲁棒性和语言独立性,还在两个流行的英语数据集上进行了实验。
{"title":"Hybrid approach for text categorization: A case study with Bangla news article","authors":"Ankita Dhar, Himadri Mukherjee, K. Roy, K. Santosh, N. Dash","doi":"10.1177/01655515211027770","DOIUrl":"https://doi.org/10.1177/01655515211027770","url":null,"abstract":"The incredible expansion of online texts due to the Internet has intensified and revived the interest of sorting, managing and categorising the documents into their respective domains. This shows the pressing need for automatic text categorization system to assign a document into its appropriate domain. In this article, the focus is on showcasing the effectiveness of a hybrid approach that works elegantly by combining text-based and graph-based features. The hybrid approach was applied on 14,373 Bangla articles with 57,22,569 tokens collected from various online news corpora covering nine categories. This article also presents the individual application of both the features to explicate how they generally work. For classification purposes, the feature sets were passed through the Bayesian classification methods which yield satisfactory results with 98.73% accuracy for Naïve Bayes Multinomial (NBM). Also, to test the robustness and language independency of the system, the experiments were performed on two popular English datasets as well.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":"49 1","pages":"762 - 777"},"PeriodicalIF":2.4,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46025373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Relevance feedback for building pooled test collections 用于构建池测试集合的相关反馈
IF 2.4 4区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-26 DOI: 10.1177/01655515231171085
David Otero, Javier Parapar, Álvaro Barreiro
Offline evaluation of information retrieval systems depends on test collections. These datasets provide the researchers with a corpus of documents, topics and relevance judgements indicating which documents are relevant for each topic. Gathering the latter is costly, requiring human assessors to judge the documents. Therefore, experts usually judge only a portion of the corpus. The most common approach for selecting that subset is pooling. By intelligently choosing which documents to assess, it is possible to optimise the number of positive labels for a given budget. For this reason, much work has focused on developing techniques to better select which documents from the corpus merit human assessments. In this article, we propose using relevance feedback to prioritise the documents when building new pooled test collections. We explore several state-of-the-art statistical feedback methods for prioritising the documents the algorithm presents to the assessors. A thorough comparison on eight Text Retrieval Conference (TREC) datasets against strong baselines shows that, among other results, our proposals improve in retrieving relevant documents with lower assessment effort than other state-of-the-art adjudicating methods without harming the reliability, fairness and reusability.
信息检索系统的离线评估依赖于测试集合。这些数据集为研究人员提供了一个文档、主题和相关性判断的语料库,指示哪些文档与每个主题相关。收集后者成本高昂,需要人工评估人员对文件进行判断。因此,专家通常只判断语料库的一部分。选择该子集最常见的方法是池化。通过智能地选择要评估的文档,可以优化给定预算的正面标签数量。因此,许多工作都集中在开发技术上,以更好地从语料库中选择哪些文档值得人类评估。在本文中,我们建议在构建新的池测试集合时使用相关性反馈来对文档进行优先级排序。我们探索了几种最先进的统计反馈方法,用于对算法呈现给评估人员的文档进行优先级排序。将八个文本检索会议(TREC)数据集与强基线进行彻底比较表明,除其他结果外,与其他最先进的裁决方法相比,我们的提案在检索相关文件方面有所改进,评估工作量更低,而不会损害可靠性、公平性和可重用性。
{"title":"Relevance feedback for building pooled test collections","authors":"David Otero, Javier Parapar, Álvaro Barreiro","doi":"10.1177/01655515231171085","DOIUrl":"https://doi.org/10.1177/01655515231171085","url":null,"abstract":"Offline evaluation of information retrieval systems depends on test collections. These datasets provide the researchers with a corpus of documents, topics and relevance judgements indicating which documents are relevant for each topic. Gathering the latter is costly, requiring human assessors to judge the documents. Therefore, experts usually judge only a portion of the corpus. The most common approach for selecting that subset is pooling. By intelligently choosing which documents to assess, it is possible to optimise the number of positive labels for a given budget. For this reason, much work has focused on developing techniques to better select which documents from the corpus merit human assessments. In this article, we propose using relevance feedback to prioritise the documents when building new pooled test collections. We explore several state-of-the-art statistical feedback methods for prioritising the documents the algorithm presents to the assessors. A thorough comparison on eight Text Retrieval Conference (TREC) datasets against strong baselines shows that, among other results, our proposals improve in retrieving relevant documents with lower assessment effort than other state-of-the-art adjudicating methods without harming the reliability, fairness and reusability.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41335507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Information Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1