Web Mining of Online Resources for German Labor Market Research and Education: Finding the Ground Truth?

Knowledge Pub Date : 2024-02-19 DOI:10.3390/knowledge4010003
Andreas Fischer, Jens Dörpinghaus
{"title":"Web Mining of Online Resources for German Labor Market Research and Education: Finding the Ground Truth?","authors":"Andreas Fischer, Jens Dörpinghaus","doi":"10.3390/knowledge4010003","DOIUrl":null,"url":null,"abstract":"The labor market is highly dependent on vocational and academic education, training, retraining, and further education in order to master challenges such as advancing digitalization and sustainability. Further training is a key factor in ensuring a qualified workforce, the employability of all employees, and, thus, national competitiveness and innovation. In the contribution at hand, we explore an innovative way to derive knowledge about learning pathways by connecting the dots from different data sources of the German labor market. In particular, we focus on the web mining of online resources for German labor market research and education, such as online advertisements, information portals, and official government websites. A key question for working with different data sources is how to find the ground truth and common data structures that can be used to make the data interoperable. We discuss how to classify and summarize web data from different platforms and which methods can be used for extracting data, entities and relationships from online resources on the German labor market to build a network of educational pathways. Our proposed solution is based on the classification of occupations (KldB) and related document codes (DKZ), and combines natural language processing and knowledge graph technologies. Our research provides the foundation for further investigation into educational pathways and linked data for labor market research. While our work focuses on German data, it is also useful for other German-speaking countries and could easily be extended to other languages such as English.","PeriodicalId":510293,"journal":{"name":"Knowledge","volume":"227 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/knowledge4010003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The labor market is highly dependent on vocational and academic education, training, retraining, and further education in order to master challenges such as advancing digitalization and sustainability. Further training is a key factor in ensuring a qualified workforce, the employability of all employees, and, thus, national competitiveness and innovation. In the contribution at hand, we explore an innovative way to derive knowledge about learning pathways by connecting the dots from different data sources of the German labor market. In particular, we focus on the web mining of online resources for German labor market research and education, such as online advertisements, information portals, and official government websites. A key question for working with different data sources is how to find the ground truth and common data structures that can be used to make the data interoperable. We discuss how to classify and summarize web data from different platforms and which methods can be used for extracting data, entities and relationships from online resources on the German labor market to build a network of educational pathways. Our proposed solution is based on the classification of occupations (KldB) and related document codes (DKZ), and combines natural language processing and knowledge graph technologies. Our research provides the foundation for further investigation into educational pathways and linked data for labor market research. While our work focuses on German data, it is also useful for other German-speaking countries and could easily be extended to other languages such as English.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
德国劳动力市场研究与教育在线资源的网络挖掘:寻找真相?
劳动力市场高度依赖职业和学术教育、培训、再培训和继续教育,以应对诸如推进数字化和可持续性等挑战。继续培训是确保合格劳动力、所有员工就业能力的关键因素,因此也是国家竞争力和创新能力的关键因素。在本文中,我们探索了一种创新方法,通过连接德国劳动力市场的不同数据源来获取有关学习途径的知识。我们尤其关注对德国劳动力市场研究和教育在线资源的网络挖掘,如在线广告、信息门户和政府官方网站。处理不同数据源的一个关键问题是如何找到基本事实和通用数据结构,从而使数据具有互操作性。我们将讨论如何对来自不同平台的网络数据进行分类和汇总,以及哪些方法可用于从德国劳动力市场的在线资源中提取数据、实体和关系,以构建教育路径网络。我们提出的解决方案基于职业分类(KldB)和相关文档代码(DKZ),并结合了自然语言处理和知识图谱技术。我们的研究为进一步研究教育路径和劳动力市场研究的链接数据奠定了基础。虽然我们的工作侧重于德语数据,但它对其他德语国家也很有用,而且很容易扩展到英语等其他语言。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
sBERT: Parameter-Efficient Transformer-Based Deep Learning Model for Scientific Literature Classification SmartLabAirgap: Helping Electrical Machines Air Gap Field Learning Gesture Recognition of Filipino Sign Language Using Convolutional and Long Short-Term Memory Deep Neural Networks Shannon Holes, Black Holes, and Knowledge: The Essential Tension for Autonomous Human–Machine Teams Facing Uncertainty Academic Performance of Excellence: The Impact of Self-Regulated Learning and Academic Time Management Planning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1