SCINOBO: a novel system classifying scholarly communication in a dynamically constructed hierarchical Field-of-Science taxonomy.

Sotiris Kotitsas, Dimitris Pappas, Natalia Manola, Haris Papageorgiou
{"title":"SCINOBO: a novel system classifying scholarly communication in a dynamically constructed hierarchical Field-of-Science taxonomy.","authors":"Sotiris Kotitsas,&nbsp;Dimitris Pappas,&nbsp;Natalia Manola,&nbsp;Haris Papageorgiou","doi":"10.3389/frma.2023.1149834","DOIUrl":null,"url":null,"abstract":"<p><p>Classifying scientific publications according to Field-of-Science taxonomies is of crucial importance, powering a wealth of relevant applications including Search Engines, Tools for Scientific Literature, Recommendation Systems, and Science Monitoring. Furthermore, it allows funders, publishers, scholars, companies, and other stakeholders to organize scientific literature more effectively, calculate impact indicators along Science Impact pathways and identify emerging topics that can also facilitate Science, Technology, and Innovation policy-making. As a result, existing classification schemes for scientific publications underpin a large area of research evaluation with several classification schemes currently in use. However, many existing schemes are domain-specific, comprised of few levels of granularity, and require continuous manual work, making it hard to follow the rapidly evolving landscape of science as new research topics emerge. Based on our previous work of scinobo, which incorporates metadata and graph-based publication bibliometric information to assign Field-of-Science fields to scientific publications, we propose a novel hybrid approach by further employing Neural Topic Modeling and Community Detection techniques to dynamically construct a Field-of-Science taxonomy used as the backbone in automatic publication-level Field-of-Science classifiers. Our proposed Field-of-Science taxonomy is based on the OECD fields of research and development (FORD) classification, developed in the framework of the Frascati Manual containing knowledge domains in broad (first level(L1), one-digit) and narrower (second level(L2), two-digit) levels. We create a 3-level hierarchical taxonomy by manually linking Field-of-Science fields of the sciencemetrix Journal classification to the OECD/FORD level-2 fields. To facilitate a more fine-grained analysis, we extend the aforementioned Field-of-Science taxonomy to level-4 and level-5 fields by employing a pipeline of AI techniques. We evaluate the coherence and the coverage of the Field-of-Science fields for the two additional levels based on synthesis scientific publications in two case studies, in the knowledge domains of Energy and Artificial Intelligence. Our results showcase that the proposed automatically generated Field-of-Science taxonomy captures the dynamics of the two research areas encompassing the underlying structure and the emerging scientific developments.</p>","PeriodicalId":73104,"journal":{"name":"Frontiers in research metrics and analytics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10192702/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in research metrics and analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frma.2023.1149834","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Classifying scientific publications according to Field-of-Science taxonomies is of crucial importance, powering a wealth of relevant applications including Search Engines, Tools for Scientific Literature, Recommendation Systems, and Science Monitoring. Furthermore, it allows funders, publishers, scholars, companies, and other stakeholders to organize scientific literature more effectively, calculate impact indicators along Science Impact pathways and identify emerging topics that can also facilitate Science, Technology, and Innovation policy-making. As a result, existing classification schemes for scientific publications underpin a large area of research evaluation with several classification schemes currently in use. However, many existing schemes are domain-specific, comprised of few levels of granularity, and require continuous manual work, making it hard to follow the rapidly evolving landscape of science as new research topics emerge. Based on our previous work of scinobo, which incorporates metadata and graph-based publication bibliometric information to assign Field-of-Science fields to scientific publications, we propose a novel hybrid approach by further employing Neural Topic Modeling and Community Detection techniques to dynamically construct a Field-of-Science taxonomy used as the backbone in automatic publication-level Field-of-Science classifiers. Our proposed Field-of-Science taxonomy is based on the OECD fields of research and development (FORD) classification, developed in the framework of the Frascati Manual containing knowledge domains in broad (first level(L1), one-digit) and narrower (second level(L2), two-digit) levels. We create a 3-level hierarchical taxonomy by manually linking Field-of-Science fields of the sciencemetrix Journal classification to the OECD/FORD level-2 fields. To facilitate a more fine-grained analysis, we extend the aforementioned Field-of-Science taxonomy to level-4 and level-5 fields by employing a pipeline of AI techniques. We evaluate the coherence and the coverage of the Field-of-Science fields for the two additional levels based on synthesis scientific publications in two case studies, in the knowledge domains of Energy and Artificial Intelligence. Our results showcase that the proposed automatically generated Field-of-Science taxonomy captures the dynamics of the two research areas encompassing the underlying structure and the emerging scientific developments.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
sciobo:一个用动态构建的分层科学领域分类法对学术交流进行分类的新系统。
根据科学领域分类法对科学出版物进行分类是至关重要的,它为大量相关应用提供动力,包括搜索引擎、科学文献工具、推荐系统和科学监测。此外,它允许资助者、出版商、学者、公司和其他利益相关者更有效地组织科学文献,沿着科学影响路径计算影响指标,并确定可以促进科学、技术和创新决策的新兴主题。因此,现有的科学出版物分类方案为目前使用的几种分类方案的研究评价提供了很大的基础。然而,许多现有的方案是特定于领域的,由很少的粒度级别组成,并且需要持续的手工工作,这使得随着新的研究课题的出现,很难跟上快速发展的科学景观。基于我们之前在sciobo中整合元数据和基于图的出版物文献计量信息来为科学出版物分配科学领域的工作,我们提出了一种新的混合方法,进一步利用神经主题建模和社区检测技术来动态构建科学领域分类,作为自动出版级科学领域分类器的主干。我们提出的科学领域分类法是基于经合组织的研究和发展领域(FORD)分类,在Frascati手册的框架内开发的,包含广泛的(一级(L1),一位数)和较窄的(二级(L2),两位数)水平的知识领域。我们通过将sciencemetrix期刊分类的Field-of-Science字段手动链接到OECD/FORD 2级字段,创建了一个3级分层分类法。为了便于进行更细粒度的分析,我们使用人工智能技术管道将前面提到的Field-of-Science分类法扩展到4级和5级领域。我们基于能源和人工智能知识领域的两个案例研究中的综合科学出版物,评估了两个额外级别的科学领域的一致性和覆盖范围。我们的研究结果表明,提出的自动生成的科学领域分类法捕获了两个研究领域的动态,包括底层结构和新兴科学发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
3.50
自引率
0.00%
发文量
0
审稿时长
14 weeks
期刊最新文献
Navigating algorithm bias in AI: ensuring fairness and trust in Africa. The ethics of knowledge sharing: a feminist examination of intellectual property rights and open-source materials in gender transformative methodologies. Complexity and phase transitions in citation networks: insights from artificial intelligence research. Designing measures of complex collaborations with participatory, evidence-centered design. Patent data-driven analysis of literature associations with changing innovation trends.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1