Discovering research data management trends from job advertisements using a text-mining approach

IF 1.8 4区 管理学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Journal of Information Science Pub Date : 2023-09-15 DOI:10.1177/01655515231193845
Naseema Sheriff, R Sevukan
{"title":"Discovering research data management trends from job advertisements using a text-mining approach","authors":"Naseema Sheriff, R Sevukan","doi":"10.1177/01655515231193845","DOIUrl":null,"url":null,"abstract":"In today’s data-driven culture, research data management (RDM) is essential for the research community. The demand for reusing research datasets is a challenging and diverse process for the scientific community. Despite this, it is essential in RDM to discover trends and themes using text mining, which is scarce. The purpose of this study is to employ text mining to discover insights from job advertisements associated with RDM profiles, which collected 810 advertisements. We found RDM-related patterns using latent Dirichlet allocation (LDA) and identified three key contexts. The first is ‘research services in libraries’, with the topics of research services, research information, research universities, collection processes and library services. The second context is ‘research data’, which includes RDM, business data, university data, research data, health research, science research, social science research, data centres, data services, statistical software, digital scholarship and digital preservation. The third context is ‘workplace environment’, and the topics are leadership, work development and scientific position. Job title normalisation reveals names such as ‘data librarian’, ‘librarian’, ‘director’, ‘data curator’, ‘data manager’, ‘research data librarian’, ‘data specialist’ and ‘data officer’ are frequently employed. Focusing on titles with a single or double occurrence is new and interesting for developing nations. Reputable institutions such as Harvard, Stanford and the Massachusetts Institute of Technology, as well as countries such as the United States, the United Kingdom, Canada and Germany, are the major participants in RDM practises and services. This discovery will assist higher education institutions, RDM stakeholders, which aid in the formulation of curriculum, and job seekers to familiarise themselves with the themes.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":"16 1","pages":"0"},"PeriodicalIF":1.8000,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/01655515231193845","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

In today’s data-driven culture, research data management (RDM) is essential for the research community. The demand for reusing research datasets is a challenging and diverse process for the scientific community. Despite this, it is essential in RDM to discover trends and themes using text mining, which is scarce. The purpose of this study is to employ text mining to discover insights from job advertisements associated with RDM profiles, which collected 810 advertisements. We found RDM-related patterns using latent Dirichlet allocation (LDA) and identified three key contexts. The first is ‘research services in libraries’, with the topics of research services, research information, research universities, collection processes and library services. The second context is ‘research data’, which includes RDM, business data, university data, research data, health research, science research, social science research, data centres, data services, statistical software, digital scholarship and digital preservation. The third context is ‘workplace environment’, and the topics are leadership, work development and scientific position. Job title normalisation reveals names such as ‘data librarian’, ‘librarian’, ‘director’, ‘data curator’, ‘data manager’, ‘research data librarian’, ‘data specialist’ and ‘data officer’ are frequently employed. Focusing on titles with a single or double occurrence is new and interesting for developing nations. Reputable institutions such as Harvard, Stanford and the Massachusetts Institute of Technology, as well as countries such as the United States, the United Kingdom, Canada and Germany, are the major participants in RDM practises and services. This discovery will assist higher education institutions, RDM stakeholders, which aid in the formulation of curriculum, and job seekers to familiarise themselves with the themes.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用文本挖掘方法从招聘广告中发现研究数据管理趋势
在当今数据驱动的文化中,研究数据管理(RDM)对研究社区至关重要。对于科学界来说,重复使用研究数据集的需求是一个具有挑战性和多样化的过程。尽管如此,在RDM中,使用文本挖掘来发现趋势和主题是必要的,而这是稀缺的。本研究的目的是利用文本挖掘来发现与RDM档案相关的招聘广告的见解,该研究收集了810个广告。我们使用潜在狄利克雷分配(LDA)发现了rdm相关的模式,并确定了三个关键上下文。第一个是“图书馆的研究服务”,主题是研究服务、研究信息、研究型大学、馆藏流程和图书馆服务。第二个上下文是“研究数据”,包括RDM、商业数据、大学数据、研究数据、卫生研究、科学研究、社会科学研究、数据中心、数据服务、统计软件、数字奖学金和数字保存。第三个语境是“职场环境”,主题是领导力、工作发展和科学定位。职衔规范化显示,“数据图书管理员”、“图书管理员”、“主管”、“数据馆长”、“数据经理”、“研究数据图书管理员”、“数据专家”和“数据主任”等名称经常被聘用。对于发展中国家来说,专注于出现一次或两次的游戏是一种新颖而有趣的做法。诸如哈佛大学、斯坦福大学和麻省理工学院等著名机构,以及诸如美国、联合王国、加拿大和德国等国家,都是RDM实践和服务的主要参与者。这一发现将有助于高等教育机构、有助于制定课程的RDM利益相关者和求职者熟悉这些主题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Information Science
Journal of Information Science 工程技术-计算机:信息系统
CiteScore
6.80
自引率
8.30%
发文量
121
审稿时长
4 months
期刊介绍: The Journal of Information Science is a peer-reviewed international journal of high repute covering topics of interest to all those researching and working in the sciences of information and knowledge management. The Editors welcome material on any aspect of information science theory, policy, application or practice that will advance thinking in the field.
期刊最新文献
Government chatbot: Empowering smart conversations with enhanced contextual understanding and reasoning Knowing within multispecies families: An information experience study How are global university rankings adjusted for erroneous science, fraud and misconduct? Posterior reduction or adjustment in rankings in response to retractions and invalidation of scientific findings Predicting the technological impact of papers: Exploring optimal models and most important features Cross-domain corpus selection for cold-start context
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1