Extracting academic genealogy trees from the networked digital library of theses and dissertations

W. Dores, Fabrício Benevenuto, Alberto H. F. Laender
{"title":"Extracting academic genealogy trees from the networked digital library of theses and dissertations","authors":"W. Dores, Fabrício Benevenuto, Alberto H. F. Laender","doi":"10.1145/2910896.2910916","DOIUrl":null,"url":null,"abstract":"Along the history, many researchers provided remarkable contributions to science, not only advancing knowledge but also in terms of mentoring new scientists. Currently, identifying and studying the formation of researchers over the years is a challenging task as current repositories of theses and dissertations are cataloged in a decentralized way through many local digital libraries. In this paper, we give a first step towards building a large repository that records the academic genealogy of researchers across fields and countries. We crawled data from the Networked Digital Library of Theses and Dissertations (NDLTD) and develop a framework to extract academic genealogy trees from this data and provide a series of analyses that describe the main properties of the academic genealogy trees. Our effort identified interesting findings related to the structure of academic formation, which highlight the importance of cataloging academic genealogy trees. We hope our initial framework will be the basis of a much larger crowdsourcing system.","PeriodicalId":109613,"journal":{"name":"2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)","volume":"249 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2910896.2910916","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

Abstract

Along the history, many researchers provided remarkable contributions to science, not only advancing knowledge but also in terms of mentoring new scientists. Currently, identifying and studying the formation of researchers over the years is a challenging task as current repositories of theses and dissertations are cataloged in a decentralized way through many local digital libraries. In this paper, we give a first step towards building a large repository that records the academic genealogy of researchers across fields and countries. We crawled data from the Networked Digital Library of Theses and Dissertations (NDLTD) and develop a framework to extract academic genealogy trees from this data and provide a series of analyses that describe the main properties of the academic genealogy trees. Our effort identified interesting findings related to the structure of academic formation, which highlight the importance of cataloging academic genealogy trees. We hope our initial framework will be the basis of a much larger crowdsourcing system.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从论文和学位论文的网络化数字图书馆中提取学术谱系树
在历史上,许多研究人员为科学做出了卓越的贡献,不仅推动了知识的发展,而且还指导了新的科学家。目前,识别和研究多年来研究人员的形成是一项具有挑战性的任务,因为目前的论文和学位论文库是通过许多地方数字图书馆以分散的方式编目的。在本文中,我们向建立一个大型存储库迈出了第一步,该存储库记录了跨领域和国家的研究人员的学术谱系。我们从网络数字论文图书馆(NDLTD)中抓取数据,并开发了一个框架,从这些数据中提取学术谱系树,并提供了一系列描述学术谱系树主要属性的分析。我们的努力确定了与学术形成结构相关的有趣发现,这突出了编目学术谱系树的重要性。我们希望我们最初的框架将成为一个更大的众包系统的基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Joint workshop on bibliometric-enhanced information retrieval and natural language processing for digital libraries (BIRNDL 2016) Panel: Preserving born-digital news ArchiveSpark: Efficient Web archive access, extraction and derivation Desiderata for exploratory search interfaces to Web archives in support of scholarly activities How to identify specialized research communities related to a researcher's changing interests
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1