首页 > 最新文献

Journal of the Association for Information Science and Technology最新文献

英文 中文
A study of search result aggregation approaches for the digital humanities 数字人文学科搜索结果聚合方法研究
IF 4.3 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-07-21 DOI: 10.1002/asi.70006
Milad Momeni, Orland Hoeber

Searching across diverse information platforms, such as digital humanities archives, academic digital libraries, and encyclopedias, poses challenges in managing the queries issued to each platform and synthesizing the resources discovered. While search result aggregation interfaces address this problem, how best to present the search results from different platforms in the search engine results page remains an open question. In this research, we implemented three common approaches and developed a new technique for aggregating search results across three platforms: Europeana, our University's academic library, and Wikipedia. The three common approaches (1) use tabs to switch between the platforms, (2) interleave results from each platform producing a single list, and (3) use a bento box approach to group results from each platform. The new technique organizes the search results into thematic clusters irrespective of their source platform. We designed a controlled laboratory study using a within-subjects design and exploratory search tasks conducted in the context of digital humanities searching. We collected data from 32 student participants, focusing on utility, perceived value, and diversity of saved resources. This study provides evidence that thematic clustering can be a beneficial aggregation approach, opening opportunities for studying different ways of representing and visualizing aggregated search results.

跨各种信息平台(如数字人文档案、学术数字图书馆和百科全书)进行搜索,在管理向每个平台发出的查询和综合发现的资源方面提出了挑战。虽然搜索结果聚合接口解决了这个问题,但如何在搜索引擎结果页面中最好地呈现来自不同平台的搜索结果仍然是一个悬而未决的问题。在这项研究中,我们实现了三种常见的方法,并开发了一种新技术,用于跨三个平台聚合搜索结果:Europeana、我们大学的学术图书馆和维基百科。三种常见的方法(1)使用选项卡在平台之间切换,(2)将来自每个平台的结果交织在一起生成单个列表,以及(3)使用便当盒方法对来自每个平台的结果进行分组。新技术将搜索结果组织成主题集群,而不考虑其来源平台。我们设计了一个受控的实验室研究,使用主题内设计和探索性搜索任务,在数字人文搜索的背景下进行。我们收集了32名学生参与者的数据,重点关注效用、感知价值和节省资源的多样性。本研究提供的证据表明,主题聚类可以是一种有益的聚合方法,为研究不同的方式表示和可视化聚合搜索结果提供了机会。
{"title":"A study of search result aggregation approaches for the digital humanities","authors":"Milad Momeni,&nbsp;Orland Hoeber","doi":"10.1002/asi.70006","DOIUrl":"https://doi.org/10.1002/asi.70006","url":null,"abstract":"<p>Searching across diverse information platforms, such as digital humanities archives, academic digital libraries, and encyclopedias, poses challenges in managing the queries issued to each platform and synthesizing the resources discovered. While search result aggregation interfaces address this problem, how best to present the search results from different platforms in the search engine results page remains an open question. In this research, we implemented three common approaches and developed a new technique for aggregating search results across three platforms: Europeana, our University's academic library, and Wikipedia. The three common approaches (1) use tabs to switch between the platforms, (2) interleave results from each platform producing a single list, and (3) use a bento box approach to group results from each platform. The new technique organizes the search results into thematic clusters irrespective of their source platform. We designed a controlled laboratory study using a within-subjects design and exploratory search tasks conducted in the context of digital humanities searching. We collected data from 32 student participants, focusing on utility, perceived value, and diversity of saved resources. This study provides evidence that thematic clustering can be a beneficial aggregation approach, opening opportunities for studying different ways of representing and visualizing aggregated search results.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 11","pages":"1488-1507"},"PeriodicalIF":4.3,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://asistdl.onlinelibrary.wiley.com/doi/epdf/10.1002/asi.70006","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145371889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scaling research aim identification: Language models for classifying scientific and societal-oriented studies 尺度研究目标识别:科学与社会研究分类的语言模型
IF 4.3 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-07-17 DOI: 10.1002/asi.70004
Mengjia Wu, Gunnar Sivertsen, Lin Zhang, Fan Qi, Yi Zhang

The classification of research according to its aims has been a longstanding focus in the fields of quantitative science studies and R&D statistics. Since 1963, the Organization for Economic Co-operation and Development (OECD) has employed a classical distinction among basic, applied, and experimental research. Building on this framework, our previous work highlighted the utility of differentiating between scientific and societal progress as two primary research objectives. This distinction enabled the quantitative analysis of scientific publication abstracts and the development of an automated method for large-scale classification. In the current study, we systematically evaluate text classification techniques, including traditional text mining models, classification tools, BERT-based language models, and decoder-only large language models (LLMs) such as ChatGPT. Our findings show that the fine-tuned GPT-4o-mini model performs the best among single-model approaches. However, traditional and BERT-based models outperform in certain fine-grained classification tasks. Leveraging majority voting strategies to incorporate their strengths yields performance comparable to closed-source GPT models. A case study on 10 biomedical journals further validates the method, demonstrating strong alignment between journal scopes, model predictions, and outputs generated by the fine-tuned GPT-4o-mini model. These results highlight the robustness and practical effectiveness of the proposed methodology for nuanced research aim classification.

根据研究目的对研究进行分类一直是定量科学研究和研发统计领域长期关注的焦点。自1963年以来,经济合作与发展组织(OECD)在基础研究、应用研究和实验研究之间采用了经典的区分方法。在这个框架的基础上,我们之前的工作强调了区分科学进步和社会进步作为两个主要研究目标的效用。这种区别使科学出版物摘要的定量分析和大规模分类的自动化方法的发展成为可能。在当前的研究中,我们系统地评估了文本分类技术,包括传统的文本挖掘模型、分类工具、基于bert的语言模型和仅解码的大型语言模型(llm),如ChatGPT。我们的研究结果表明,在单模型方法中,微调后的gpt - 40 -mini模型表现最好。然而,传统的和基于bert的模型在某些细粒度的分类任务中表现更好。利用多数投票策略来结合它们的优势,可以产生与闭源GPT模型相当的性能。对10种生物医学期刊的案例研究进一步验证了该方法,表明期刊范围、模型预测和经过微调的gpt - 40 -mini模型产生的输出之间具有很强的一致性。这些结果突出了所提出的方法对细微研究目标分类的鲁棒性和实际有效性。
{"title":"Scaling research aim identification: Language models for classifying scientific and societal-oriented studies","authors":"Mengjia Wu,&nbsp;Gunnar Sivertsen,&nbsp;Lin Zhang,&nbsp;Fan Qi,&nbsp;Yi Zhang","doi":"10.1002/asi.70004","DOIUrl":"https://doi.org/10.1002/asi.70004","url":null,"abstract":"<p>The classification of research according to its aims has been a longstanding focus in the fields of quantitative science studies and R&amp;D statistics. Since 1963, the Organization for Economic Co-operation and Development (OECD) has employed a classical distinction among basic, applied, and experimental research. Building on this framework, our previous work highlighted the utility of differentiating between scientific and societal progress as two primary research objectives. This distinction enabled the quantitative analysis of scientific publication abstracts and the development of an automated method for large-scale classification. In the current study, we systematically evaluate text classification techniques, including traditional text mining models, classification tools, BERT-based language models, and decoder-only large language models (LLMs) such as ChatGPT. Our findings show that the fine-tuned GPT-4o-mini model performs the best among single-model approaches. However, traditional and BERT-based models outperform in certain fine-grained classification tasks. Leveraging majority voting strategies to incorporate their strengths yields performance comparable to closed-source GPT models. A case study on 10 biomedical journals further validates the method, demonstrating strong alignment between journal scopes, model predictions, and outputs generated by the fine-tuned GPT-4o-mini model. These results highlight the robustness and practical effectiveness of the proposed methodology for nuanced research aim classification.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 11","pages":"1470-1487"},"PeriodicalIF":4.3,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://asistdl.onlinelibrary.wiley.com/doi/epdf/10.1002/asi.70004","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145371850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated novelty evaluation of academic paper: A collaborative approach integrating human and large language model knowledge 学术论文的自动新颖性评估:一种整合人类和大型语言模型知识的协作方法
IF 4.3 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-07-15 DOI: 10.1002/asi.70005
Wenqing Wu, Chengzhi Zhang, Yi Zhao

Novelty is a crucial criterion in the peer-review process for evaluating academic papers. Traditionally, it is judged by experts or measured by unique reference combinations. Both methods have limitations: experts have limited knowledge, and the effectiveness of the combination method is uncertain. Moreover, it is unclear if unique citations truly measure novelty. The large language model (LLM) possesses a wealth of knowledge, while human experts possess judgment abilities that the LLM does not possess. Therefore, our research integrates the knowledge and abilities of LLM and human experts to address the limitations of novelty assessment. One of the most common types of novelty in academic papers is the introduction of new methods. In this paper, we propose leveraging human knowledge and LLM to assist pre-trained language models (PLMs, e.g., BERT, etc.) in predicting the method novelty of papers. Specifically, we extract sentences related to the novelty of the academic paper from peer-review reports and use LLM to summarize the methodology section of the academic paper, which are then used to fine-tune PLMs. In addition, we have designed a text-guided fusion module with novel Sparse-Attention to better integrate human and LLM knowledge. We compared the method we proposed with a large number of baselines. Extensive experiments demonstrate that our method achieves superior performance.

在同行评议过程中,新颖性是评估学术论文的一个重要标准。传统上,它是由专家判断或通过独特的参考组合来衡量的。两种方法都有局限性:专家知识有限,组合方法的有效性不确定。此外,目前尚不清楚唯一引用是否真的能衡量新颖性。大型语言模型(LLM)拥有丰富的知识,而人类专家拥有LLM所不具备的判断能力。因此,我们的研究整合了法学硕士和人类专家的知识和能力,以解决新颖性评估的局限性。学术论文中最常见的新颖类型之一是引入新方法。在本文中,我们建议利用人类知识和法学硕士来协助预训练的语言模型(plm,例如BERT等)预测论文的方法新颖性。具体来说,我们从同行评审报告中提取与学术论文新颖性相关的句子,并使用LLM来总结学术论文的方法论部分,然后用于微调plm。此外,我们还设计了一个具有新颖的稀疏注意力的文本引导融合模块,以更好地整合人类和法学硕士知识。我们将提出的方法与大量的基线进行了比较。大量的实验表明,我们的方法取得了优异的性能。
{"title":"Automated novelty evaluation of academic paper: A collaborative approach integrating human and large language model knowledge","authors":"Wenqing Wu,&nbsp;Chengzhi Zhang,&nbsp;Yi Zhao","doi":"10.1002/asi.70005","DOIUrl":"https://doi.org/10.1002/asi.70005","url":null,"abstract":"<p>Novelty is a crucial criterion in the peer-review process for evaluating academic papers. Traditionally, it is judged by experts or measured by unique reference combinations. Both methods have limitations: experts have limited knowledge, and the effectiveness of the combination method is uncertain. Moreover, it is unclear if unique citations truly measure novelty. The large language model (LLM) possesses a wealth of knowledge, while human experts possess judgment abilities that the LLM does not possess. Therefore, our research integrates the knowledge and abilities of LLM and human experts to address the limitations of novelty assessment. One of the most common types of novelty in academic papers is the introduction of new methods. In this paper, we propose leveraging human knowledge and LLM to assist pre-trained language models (PLMs, e.g., BERT, etc.) in predicting the method novelty of papers. Specifically, we extract sentences related to the novelty of the academic paper from peer-review reports and use LLM to summarize the methodology section of the academic paper, which are then used to fine-tune PLMs. In addition, we have designed a text-guided fusion module with novel Sparse-Attention to better integrate human and LLM knowledge. We compared the method we proposed with a large number of baselines. Extensive experiments demonstrate that our method achieves superior performance.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 11","pages":"1452-1469"},"PeriodicalIF":4.3,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145371887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How do authors perceive the way their work is cited? Findings from a large-scale survey on quotation accuracy 作者如何看待他们的作品被引用的方式?一项关于报价准确性的大规模调查结果
IF 4.3 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-07-08 DOI: 10.1002/asi.70000
Simon Wakeling, Monica Lestari Paramita, Stephen Pinfield

It has long been recognized that there are issues with the appropriateness of citations in the academic literature. Citations of sources that do not support the statement they are cited against are known as quotation errors, and there have been many previous studies of their prevalence. The vast majority of these studies rely on researchers evaluating the accuracy of citations in a small sample of the literature, and show large variation in quotation error rates. In this article we report a novel approach to assessing quotation accuracy via an online survey in which 2648 corresponding authors of articles evaluated a real-world citation of their work. Respondents were also asked to categorize the perceived purpose of the citation, and what action, if any, they take when encountering inaccurate citations of their work. We found a quotation error rate of 16.6%, with no significant difference across academic disciplines, suggesting that variation in previous studies may be a result of methodological differences. Only 11.3% of respondents indicated they had taken action after encountering an inaccurate citation of their work. This work reveals reasons contributing to inaccurate quotations and issues with citation practices, and offers suggestions of areas for future research.

人们早就认识到,学术文献中引文的适当性存在问题。引用不支持他们所引用的声明的来源被称为引用错误,之前有许多关于其普遍性的研究。这些研究绝大多数依赖于研究人员对一小部分文献样本中引文的准确性进行评估,并显示出引文错误率的巨大差异。在本文中,我们报告了一种通过在线调查评估引文准确性的新方法,其中2648篇文章的通讯作者评估了他们工作的真实引用。受访者还被要求对引用的感知目的进行分类,以及当遇到对其工作的不准确引用时,他们会采取什么行动。我们发现引文错误率为16.6%,不同学科之间没有显著差异,这表明以往研究的差异可能是方法差异的结果。只有11.3%的受访者表示,他们在遇到不准确的引用后采取了行动。本文揭示了导致引文不准确的原因和引文实践中存在的问题,并对今后的研究提出了建议。
{"title":"How do authors perceive the way their work is cited? Findings from a large-scale survey on quotation accuracy","authors":"Simon Wakeling,&nbsp;Monica Lestari Paramita,&nbsp;Stephen Pinfield","doi":"10.1002/asi.70000","DOIUrl":"10.1002/asi.70000","url":null,"abstract":"<p>It has long been recognized that there are issues with the appropriateness of citations in the academic literature. Citations of sources that do not support the statement they are cited against are known as quotation errors, and there have been many previous studies of their prevalence. The vast majority of these studies rely on researchers evaluating the accuracy of citations in a small sample of the literature, and show large variation in quotation error rates. In this article we report a novel approach to assessing quotation accuracy via an online survey in which 2648 corresponding authors of articles evaluated a real-world citation of their work. Respondents were also asked to categorize the perceived purpose of the citation, and what action, if any, they take when encountering inaccurate citations of their work. We found a quotation error rate of 16.6%, with no significant difference across academic disciplines, suggesting that variation in previous studies may be a result of methodological differences. Only 11.3% of respondents indicated they had taken action after encountering an inaccurate citation of their work. This work reveals reasons contributing to inaccurate quotations and issues with citation practices, and offers suggestions of areas for future research.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 10","pages":"1396-1410"},"PeriodicalIF":4.3,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://asistdl.onlinelibrary.wiley.com/doi/epdf/10.1002/asi.70000","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145062682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Tower of Babel in science communication on social media: An analysis of linguistic diversity in Twitter mentions of scientific publications 社交媒体上科学传播的巴别塔:Twitter上提及科学出版物的语言多样性分析
IF 4.3 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-06-20 DOI: 10.1002/asi.70002
Yanqing Zhang, Zhichao Fang

To unravel the linguistic dynamics of science communication on social media, this study presents a large-scale, cross-disciplinary analysis of language use in over 21 million Twitter mentions of 6.7 million scientific publications. While English dominates—accounting for 90.8% of all mentions and serving as a bridging language for the international dissemination of research—90 non-English languages contribute to a rich and diverse multilingual ecosystem. A strong alignment is observed between the language of non-English publications and their corresponding Twitter mentions, particularly for languages such as Japanese and Spanish, reflecting linguistic proximity and regional engagement. Importantly, non-English tweets achieve user engagement levels comparable to those written in English, whereas tweets lacking meaningful textual content consistently receive lower interaction. These findings highlight the inherently multilingual nature of science communication on Twitter and underscore the importance of incorporating non-English activities into altmetric analyses to ensure a more inclusive and equitable understanding of global scientific discourse.

为了揭示社交媒体上科学传播的语言动态,本研究对Twitter上超过2100万次提及的670万份科学出版物的语言使用进行了大规模的跨学科分析。虽然英语占主导地位,占所有提及的90.8%,并作为国际研究传播的桥梁语言,但90种非英语语言为丰富多样的多语言生态系统做出了贡献。非英语出版物的语言与其相应的Twitter提及之间存在强烈的一致性,特别是对于日语和西班牙语等语言,反映了语言的接近性和区域参与度。重要的是,非英语推文的用户参与度与英语推文相当,而缺乏有意义的文本内容的推文的交互性一直较低。这些发现突出了Twitter上科学传播固有的多语言性质,并强调了将非英语活动纳入替代分析的重要性,以确保对全球科学话语的更包容和公平的理解。
{"title":"The Tower of Babel in science communication on social media: An analysis of linguistic diversity in Twitter mentions of scientific publications","authors":"Yanqing Zhang,&nbsp;Zhichao Fang","doi":"10.1002/asi.70002","DOIUrl":"https://doi.org/10.1002/asi.70002","url":null,"abstract":"<p>To unravel the linguistic dynamics of science communication on social media, this study presents a large-scale, cross-disciplinary analysis of language use in over 21 million Twitter mentions of 6.7 million scientific publications. While English dominates—accounting for 90.8% of all mentions and serving as a bridging language for the international dissemination of research—90 non-English languages contribute to a rich and diverse multilingual ecosystem. A strong alignment is observed between the language of non-English publications and their corresponding Twitter mentions, particularly for languages such as Japanese and Spanish, reflecting linguistic proximity and regional engagement. Importantly, non-English tweets achieve user engagement levels comparable to those written in English, whereas tweets lacking meaningful textual content consistently receive lower interaction. These findings highlight the inherently multilingual nature of science communication on Twitter and underscore the importance of incorporating non-English activities into altmetric analyses to ensure a more inclusive and equitable understanding of global scientific discourse.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 11","pages":"1431-1451"},"PeriodicalIF":4.3,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://asistdl.onlinelibrary.wiley.com/doi/epdf/10.1002/asi.70002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145371811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The visual, the textual, and the one-dimensional: An exploration of the visual elements of bibliographic classification schemes 视觉、文本和一维:书目分类方案视觉元素的探索
IF 4.3 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-06-20 DOI: 10.1002/asi.70001
Deborah Lee

Classification schemes are a key way of organizing bibliographic knowledge, yet the way that classification schemes communicate their information to classifiers receives little attention. This article takes a novel approach by exploring the visual aspects contained within classification schemes. The research uses a classification scheme analysis methodology. Three different classification scheme phenomena are discussed in terms of their visualization: hierarchy, notation, and notes. Indentation is found to be a significant—and implicit—method of communicating hierarchy to classifiers and offers intriguing solutions to the issues of transmuting from two dimensions into one. The visual elements of notation reveal a strong separation between notation and class, while the visual elements of notes illuminate a varying narrative around the position of notes in the classification scheme. A categorization system for visual elements in classification schemes is presented. Model 1 proffers visual elements as a fourth plane of classification, which extends and remodels Ranganathan's Three Planes of Work. Model 2 shows how visual elements could fit into classification scheme versioning. Ultimately, looking at visual aspects of classification schemes is a novel way of thinking about knowledge organization and can help us to better understand—and ultimately, to better use—classification schemes.

分类方案是组织书目知识的一种关键方式,但分类方案向分类器传递信息的方式却很少受到关注。本文通过探索分类方案中包含的视觉方面采取了一种新颖的方法。本研究采用分类方案分析方法。从可视化的角度讨论了三种不同的分类方案现象:层次、符号和注释。缩进被认为是一种将层次结构传递给分类器的重要且隐含的方法,并为从二维转换到一维的问题提供了有趣的解决方案。符号的视觉元素揭示了符号和类别之间的强烈分离,而音符的视觉元素则阐明了围绕音符在分类方案中的位置的不同叙述。提出了一种分类方案中视觉元素的分类系统。模型1提供了视觉元素作为分类的第四个层面,它扩展和重塑了Ranganathan的三个工作层面。模型2显示了可视化元素如何适应分类方案版本控制。最终,研究分类方案的可视化方面是一种思考知识组织的新颖方式,可以帮助我们更好地理解分类方案,并最终更好地使用分类方案。
{"title":"The visual, the textual, and the one-dimensional: An exploration of the visual elements of bibliographic classification schemes","authors":"Deborah Lee","doi":"10.1002/asi.70001","DOIUrl":"10.1002/asi.70001","url":null,"abstract":"<p>Classification schemes are a key way of organizing bibliographic knowledge, yet the way that classification schemes communicate their information to classifiers receives little attention. This article takes a novel approach by exploring the visual aspects contained within classification schemes. The research uses a classification scheme analysis methodology. Three different classification scheme phenomena are discussed in terms of their visualization: hierarchy, notation, and notes. Indentation is found to be a significant—and implicit—method of communicating hierarchy to classifiers and offers intriguing solutions to the issues of transmuting from two dimensions into one. The visual elements of notation reveal a strong separation between notation and class, while the visual elements of notes illuminate a varying narrative around the position of notes in <i>the classification scheme</i>. A categorization system for visual elements in classification schemes is presented. Model 1 proffers visual elements as a fourth plane of classification, which extends and remodels Ranganathan's <i>Three Planes of Work</i>. Model 2 shows how visual elements could fit into classification scheme versioning. Ultimately, looking at visual aspects of classification schemes is a novel way of thinking about knowledge organization and can help us to better understand—and ultimately, to better use—classification schemes.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 10","pages":"1411-1427"},"PeriodicalIF":4.3,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://asistdl.onlinelibrary.wiley.com/doi/epdf/10.1002/asi.70001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145062742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge integration and diffusion structures of interdisciplinary research: A large-scale analysis based on propensity score matching 跨学科研究的知识整合与扩散结构:基于倾向得分匹配的大规模分析
IF 4.3 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-06-04 DOI: 10.1002/asi.25014
Jiawei Xu, Zhihan Zheng, Chao Min, Win-bin Huang, Yi Bu

While facilitating science, interdisciplinary research (IDR) has a heavier cognitive burden for researchers compared to unidisciplinary research (UDR). Yet, little has been known about patterns of knowledge integration and diffusion structures of IDR. Here we adopt a causal inference strategy, namely propensity score matching, with all journal publications in 2005 in Microsoft Academic Graph to better understand the IDR effect in various research fields. We use the diversity of reference fields of one paper as the proxy of the paper's interdisciplinarity and estimate the effect of a research article being IDR on its knowledge integration and diffusion measured by its high-order citation/reference cascade. We find that, in disciplines where IDR articles are less popular, such as mathematics, physics, and chemistry, IDR needs a more extensive knowledge base than UDR to gain a similar number of citations. In disciplines where IDR articles are more popular, for example, psychology, geology, biology, and economics, a small knowledge base is enough for a high-impact IDR article. As to knowledge diffusion, no matter whether IDR or UDR, a more extensive knowledge base leads to stronger knowledge diffusion ability. Findings imply potential drawbacks of pure interdisciplinarity-oriented research policy; rather, the establishment of policies may vary across disciplines.

与单一学科研究相比,跨学科研究在促进科学发展的同时,给研究人员带来了更重的认知负担。然而,对知识集成和知识扩散结构的研究却很少。为了更好地理解各个研究领域的IDR效应,我们在Microsoft Academic Graph中对2005年的所有期刊出版物采用因果推理策略,即倾向得分匹配。我们使用一篇论文的参考领域多样性作为该论文的跨学科性的代表,并通过其高阶引用/参考级联来估计研究论文被IDR对其知识整合和扩散的影响。我们发现,在IDR文章不太受欢迎的学科中,如数学、物理和化学,IDR需要比UDR更广泛的知识库来获得相似的引用数量。在IDR文章更受欢迎的学科中,例如心理学、地质学、生物学和经济学,一个小的知识库就足以发表高影响力的IDR文章。在知识扩散方面,无论是IDR还是UDR,知识基础越广泛,知识扩散能力越强。研究结果暗示了纯跨学科导向研究政策的潜在弊端;相反,政策的制定可能因学科而异。
{"title":"Knowledge integration and diffusion structures of interdisciplinary research: A large-scale analysis based on propensity score matching","authors":"Jiawei Xu,&nbsp;Zhihan Zheng,&nbsp;Chao Min,&nbsp;Win-bin Huang,&nbsp;Yi Bu","doi":"10.1002/asi.25014","DOIUrl":"10.1002/asi.25014","url":null,"abstract":"<p>While facilitating science, interdisciplinary research (IDR) has a heavier cognitive burden for researchers compared to unidisciplinary research (UDR). Yet, little has been known about patterns of knowledge integration and diffusion structures of IDR. Here we adopt a causal inference strategy, namely propensity score matching, with all journal publications in 2005 in Microsoft Academic Graph to better understand the IDR effect in various research fields. We use the diversity of reference fields of one paper as the proxy of the paper's interdisciplinarity and estimate the effect of a research article being IDR on its knowledge integration and diffusion measured by its high-order citation/reference cascade. We find that, in disciplines where IDR articles are less popular, such as mathematics, physics, and chemistry, IDR needs a more extensive knowledge base than UDR to gain a similar number of citations. In disciplines where IDR articles are more popular, for example, psychology, geology, biology, and economics, a small knowledge base is enough for a high-impact IDR article. As to knowledge diffusion, no matter whether IDR or UDR, a more extensive knowledge base leads to stronger knowledge diffusion ability. Findings imply potential drawbacks of pure interdisciplinarity-oriented research policy; rather, the establishment of policies may vary across disciplines.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 9","pages":"1210-1226"},"PeriodicalIF":4.3,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144923610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic organization for historical maps: Classification, representation, association 历史地图的语义组织:分类、表示、关联
IF 4.3 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-05-26 DOI: 10.1002/asi.25023
Qi Xiaoying, Alton Y. K. Chua, Yang Haiping

Given that historical maps (HM) are represented by a complex network of symbols, their semantics cannot be easily and directly understood. To extract the embedded knowledge, scholars have developed semantic organization for different types of HM. However, the construction of semantic organization for HM is challenging due to problems of semantic clutter, semantic loss, and semantic ambiguity. To resolve these problems, this paper proposes a semantic organization system which includes classification, representation, and association mechanisms for HM. The intent is to achieve semantic ordering, semantic enhancement, and semantic association. As a means to verify the proposed semantic organization system, this paper develops an HM knowledge question and answer (Q&A) system. Experimental results show that the Q&A system outperformed Baidu (Wenxinyiyan) and GPT-4o in terms of precision and recall.

由于历史地图是由一个复杂的符号网络表示的,因此它们的语义不容易直接理解。为了提取嵌入的知识,学者们对不同类型的hmm进行了语义组织。然而,由于语义混乱、语义丢失和语义模糊等问题,语义组织的构建面临挑战。为了解决这些问题,本文提出了一种包含HM分类、表示和关联机制的语义组织体系。其目的是实现语义排序、语义增强和语义关联。为了验证所提出的语义组织系统,本文开发了一个HM知识问答系统。实验结果表明,Q&;A系统在查准率和查全率方面都优于百度(Wenxinyiyan)和gpt - 40。
{"title":"Semantic organization for historical maps: Classification, representation, association","authors":"Qi Xiaoying,&nbsp;Alton Y. K. Chua,&nbsp;Yang Haiping","doi":"10.1002/asi.25023","DOIUrl":"10.1002/asi.25023","url":null,"abstract":"<p>Given that historical maps (HM) are represented by a complex network of symbols, their semantics cannot be easily and directly understood. To extract the embedded knowledge, scholars have developed semantic organization for different types of HM. However, the construction of semantic organization for HM is challenging due to problems of semantic clutter, semantic loss, and semantic ambiguity. To resolve these problems, this paper proposes a semantic organization system which includes classification, representation, and association mechanisms for HM. The intent is to achieve semantic ordering, semantic enhancement, and semantic association. As a means to verify the proposed semantic organization system, this paper develops an HM knowledge question and answer (Q&amp;A) system. Experimental results show that the Q&amp;A system outperformed Baidu (Wenxinyiyan) and GPT-4o in terms of precision and recall.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 10","pages":"1374-1395"},"PeriodicalIF":4.3,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145062563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The landscape of data reuse in interactive information retrieval: Motivations, sources, and evaluation of reusability 交互式信息检索中的数据重用前景:可重用性的动机、来源和评估
IF 4.3 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-05-23 DOI: 10.1002/asi.25020
Tianji Jiang, Wenqi Li, Jiqun Liu

Reusing research data can effectively reduce efforts in data collection and enhance the replicability of evaluation experiments, especially for small laboratories and research teams studying human-centered systems. Building a sustainable data reuse process and culture relies on frameworks that encompass policies, standards, roles, and responsibilities, all of which must address the diverse needs of data providers, curators, and reusers. This study investigated data reuse practices of experienced researchers in Interactive Information Retrieval (IIR), a field where data reuse has been strongly advocated but still remains a challenge. We conducted 21 semi-structured in-depth interviews with IIR researchers from varying demographic backgrounds, institutions, and career stages about their motivations, experiences, and concerns regarding data reuse. We uncovered the rationales, criteria, and strategies they used in reusability assessments, as well as the challenges they faced when attempting to reuse research data in their studies. These empirical findings enrich ongoing discussions about the reusability of user-generated data and research resources and help promote community-level data reuse culture and standards in both traditional and emerging IIR research fields.

研究数据的重用可以有效地减少数据收集的工作量,提高评价实验的可重复性,特别是对于研究以人为中心的系统的小型实验室和研究团队。构建可持续的数据重用流程和文化依赖于包含策略、标准、角色和职责的框架,所有这些都必须满足数据提供者、管理者和重用者的不同需求。本研究调查了交互式信息检索(IIR)中经验丰富的研究人员的数据重用实践,交互式信息检索是一个数据重用被强烈提倡但仍然存在挑战的领域。我们对来自不同人口背景、机构和职业阶段的IIR研究人员进行了21次半结构化深度访谈,了解他们对数据重用的动机、经验和担忧。我们揭示了他们在可重用性评估中使用的基本原理、标准和策略,以及他们在试图在研究中重用研究数据时面临的挑战。这些实证研究结果丰富了正在进行的关于用户生成数据和研究资源可重用性的讨论,并有助于在传统和新兴的IIR研究领域促进社区层面的数据重用文化和标准。
{"title":"The landscape of data reuse in interactive information retrieval: Motivations, sources, and evaluation of reusability","authors":"Tianji Jiang,&nbsp;Wenqi Li,&nbsp;Jiqun Liu","doi":"10.1002/asi.25020","DOIUrl":"10.1002/asi.25020","url":null,"abstract":"<p>Reusing research data can effectively reduce efforts in data collection and enhance the replicability of evaluation experiments, especially for small laboratories and research teams studying human-centered systems. Building a sustainable data reuse process and culture relies on frameworks that encompass policies, standards, roles, and responsibilities, all of which must address the diverse needs of data providers, curators, and reusers. This study investigated data reuse practices of experienced researchers in Interactive Information Retrieval (IIR), a field where data reuse has been strongly advocated but still remains a challenge. We conducted 21 semi-structured in-depth interviews with IIR researchers from varying demographic backgrounds, institutions, and career stages about their motivations, experiences, and concerns regarding data reuse. We uncovered the rationales, criteria, and strategies they used in reusability assessments, as well as the challenges they faced when attempting to reuse research data in their studies. These empirical findings enrich ongoing discussions about the reusability of user-generated data and research resources and help promote community-level data reuse culture and standards in both traditional and emerging IIR research fields.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 9","pages":"1258-1276"},"PeriodicalIF":4.3,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144923781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The spatiotemporal relationship between usage data and topic popularity in scientific literature 科学文献中使用数据与主题流行度的时空关系
IF 4.3 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-05-23 DOI: 10.1002/asi.25019
Xianwen Wang, Wencan Tian, Ruonan Cai, Zhichao Fang

This study explored the spatiotemporal relationship between usage data (measured by PDF downloads and HTML views) and topic popularity (measured by the number of publications) in scientific literature. Using a panel dataset of over 2.3 million papers and 130 million usage records from IEEE Xplore, we develop a theoretical framework grounded in attention economy theory and the competitive exclusion principle. By using fixed effects model, the instrumental variable method, and the spatial Durbin model, we discover that how often a topic is used greatly increases its future popularity, while usage data from related topics have a negative impact. This study provides solid preliminary evidence for using usage data in detecting research hotspots. Additionally, this study innovatively proposes two methods for constructing spatial weight matrices based on topic semantic vectors, offering a concrete pathway for integrating spatial econometrics with spatial scientometrics.

本研究探讨了科学文献的使用数据(以PDF下载和HTML视图衡量)与主题流行度(以出版物数量衡量)之间的时空关系。利用来自IEEE explore的超过230万篇论文和1.3亿条使用记录的面板数据集,我们建立了一个基于注意力经济理论和竞争排斥原则的理论框架。通过固定效应模型、工具变量法和空间Durbin模型,我们发现一个话题的使用频率会大大增加其未来的受欢迎程度,而相关话题的使用数据会产生负面影响。本研究为利用使用数据发现研究热点提供了坚实的初步依据。此外,本研究创新性地提出了两种基于主题语义向量的空间权重矩阵构建方法,为空间计量经济学与空间科学计量学的融合提供了具体途径。
{"title":"The spatiotemporal relationship between usage data and topic popularity in scientific literature","authors":"Xianwen Wang,&nbsp;Wencan Tian,&nbsp;Ruonan Cai,&nbsp;Zhichao Fang","doi":"10.1002/asi.25019","DOIUrl":"10.1002/asi.25019","url":null,"abstract":"<p>This study explored the spatiotemporal relationship between usage data (measured by PDF downloads and HTML views) and topic popularity (measured by the number of publications) in scientific literature. Using a panel dataset of over 2.3 million papers and 130 million usage records from IEEE Xplore, we develop a theoretical framework grounded in attention economy theory and the competitive exclusion principle. By using fixed effects model, the instrumental variable method, and the spatial Durbin model, we discover that how often a topic is used greatly increases its future popularity, while usage data from related topics have a negative impact. This study provides solid preliminary evidence for using usage data in detecting research hotspots. Additionally, this study innovatively proposes two methods for constructing spatial weight matrices based on topic semantic vectors, offering a concrete pathway for integrating spatial econometrics with spatial scientometrics.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 9","pages":"1241-1257"},"PeriodicalIF":4.3,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144923780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the Association for Information Science and Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1