AIDA: A knowledge graph about research dynamics in academia and industry

IF 4.1 Q1 INFORMATION SCIENCE & LIBRARY SCIENCE Quantitative Science Studies Pub Date : 2021-11-05 DOI:10.1162/qss_a_00162
Simone Angioni, Angelo Salatino, Francesco Osborne, Diego Reforgiato, Recupero, E. Motta
{"title":"AIDA: A knowledge graph about research dynamics in academia and industry","authors":"Simone Angioni, Angelo Salatino, Francesco Osborne, Diego Reforgiato, Recupero, E. Motta","doi":"10.1162/qss_a_00162","DOIUrl":null,"url":null,"abstract":"Abstract Academia and industry share a complex, multifaceted, and symbiotic relationship. Analyzing the knowledge flow between them, understanding which directions have the biggest potential, and discovering the best strategies to harmonize their efforts is a critical task for several stakeholders. Research publications and patents are an ideal medium to analyze this space, but current data sets of scholarly data cannot be used for such a purpose because they lack a high-quality characterization of the relevant research topics and industrial sectors. In this paper, we introduce the Academia/Industry DynAmics (AIDA) Knowledge Graph, which describes 21 million publications and 8 million patents according to the research topics drawn from the Computer Science Ontology. 5.1 million publications and 5.6 million patents are further characterized according to the type of the author’s affiliations and 66 industrial sectors from the proposed Industrial Sectors Ontology (INDUSO). AIDA was generated by an automatic pipeline that integrates data from Microsoft Academic Graph, Dimensions, DBpedia, the Computer Science Ontology, and the Global Research Identifier Database. It is publicly available under CC BY 4.0 and can be downloaded as a dump or queried via a triplestore. We evaluated the different parts of the generation pipeline on a manually crafted gold standard yielding competitive results.","PeriodicalId":34021,"journal":{"name":"Quantitative Science Studies","volume":"2 1","pages":"1356-1398"},"PeriodicalIF":4.1000,"publicationDate":"2021-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Science Studies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1162/qss_a_00162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 22

Abstract

Abstract Academia and industry share a complex, multifaceted, and symbiotic relationship. Analyzing the knowledge flow between them, understanding which directions have the biggest potential, and discovering the best strategies to harmonize their efforts is a critical task for several stakeholders. Research publications and patents are an ideal medium to analyze this space, but current data sets of scholarly data cannot be used for such a purpose because they lack a high-quality characterization of the relevant research topics and industrial sectors. In this paper, we introduce the Academia/Industry DynAmics (AIDA) Knowledge Graph, which describes 21 million publications and 8 million patents according to the research topics drawn from the Computer Science Ontology. 5.1 million publications and 5.6 million patents are further characterized according to the type of the author’s affiliations and 66 industrial sectors from the proposed Industrial Sectors Ontology (INDUSO). AIDA was generated by an automatic pipeline that integrates data from Microsoft Academic Graph, Dimensions, DBpedia, the Computer Science Ontology, and the Global Research Identifier Database. It is publicly available under CC BY 4.0 and can be downloaded as a dump or queried via a triplestore. We evaluated the different parts of the generation pipeline on a manually crafted gold standard yielding competitive results.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
AIDA:关于学术界和工业界研究动态的知识图谱
学术界和产业界有着复杂的、多方面的、共生的关系。分析他们之间的知识流动,了解哪个方向具有最大的潜力,并发现协调他们努力的最佳策略是几个利益相关者的关键任务。研究出版物和专利是分析这一领域的理想媒介,但目前的学术数据集不能用于这一目的,因为它们缺乏对相关研究主题和工业部门的高质量描述。本文引入了学术界/行业动态(AIDA)知识图谱,该图谱根据从计算机科学本体中提取的研究主题描述了2100万份出版物和800万项专利,并根据作者所属单位的类型和提出的工业部门本体(INDUSO)中的66个工业部门进一步描述了510万份出版物和560万项专利。AIDA是由一个自动管道生成的,该管道集成了来自微软学术图、维度、DBpedia、计算机科学本体和全球研究标识数据库的数据。它在CC BY 4.0下公开提供,可以作为转储文件下载或通过triplestore查询。我们在手工制作的黄金标准上评估了生成管道的不同部分,产生了具有竞争力的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Quantitative Science Studies
Quantitative Science Studies INFORMATION SCIENCE & LIBRARY SCIENCE-
CiteScore
12.10
自引率
12.50%
发文量
46
审稿时长
22 weeks
期刊介绍:
期刊最新文献
Technological Impact of Funded Research: A Case Study of Non-Patent References Socio-cultural factors and academic openness of world countries Scope and limitations of library metrics for the assessment of ebook usage: COUNTER R5 and link resolver The rise of responsible metrics as a professional reform movement: A collective action frames account New methodologies for the digital age? How methods (re-)organize research using social media data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1