{"title":"IPOD:大规模工业和专业职业数据集","authors":"Junhua Liu, Yung Chuen Ng, Kwan Hui Lim","doi":"10.1145/3406865.3418329","DOIUrl":null,"url":null,"abstract":"In today's job market, occupational data mining and analysis is growing in importance as it enables companies to predict employee turnover, model career trajectories, screen through resumes and perform other human resource tasks. As such, there has been growing interest in utilizing occupational data mining and analysis, and a key requirement to facilitate these tasks is the need for an occupation-related dataset. However, most research use proprietary datasets or do not make their dataset publicly available, thus impeding development in this area. To solve this issue, we present the Industrial and Professional Occupation Dataset (IPOD), which comprises 475,073 job titles belonging to 192,295 Linkedin users. In addition to making IPOD publicly available, we also: (i) manually annotate each job title with its associated level of seniority, domain of work and location; and (ii) provide embedding for job titles and discuss various use cases. This dataset is publicly available at https://github.com/junhua/ipod.","PeriodicalId":93424,"journal":{"name":"CSCW '20 Companion : conference companion publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing : October 17-21, 2020, Virtual Event, USA. Conference on Computer-Supported Cooperative Work and So...","volume":"54 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"IPOD: A Large-scale Industrial and Professional Occupation Dataset\",\"authors\":\"Junhua Liu, Yung Chuen Ng, Kwan Hui Lim\",\"doi\":\"10.1145/3406865.3418329\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In today's job market, occupational data mining and analysis is growing in importance as it enables companies to predict employee turnover, model career trajectories, screen through resumes and perform other human resource tasks. As such, there has been growing interest in utilizing occupational data mining and analysis, and a key requirement to facilitate these tasks is the need for an occupation-related dataset. However, most research use proprietary datasets or do not make their dataset publicly available, thus impeding development in this area. To solve this issue, we present the Industrial and Professional Occupation Dataset (IPOD), which comprises 475,073 job titles belonging to 192,295 Linkedin users. In addition to making IPOD publicly available, we also: (i) manually annotate each job title with its associated level of seniority, domain of work and location; and (ii) provide embedding for job titles and discuss various use cases. This dataset is publicly available at https://github.com/junhua/ipod.\",\"PeriodicalId\":93424,\"journal\":{\"name\":\"CSCW '20 Companion : conference companion publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing : October 17-21, 2020, Virtual Event, USA. Conference on Computer-Supported Cooperative Work and So...\",\"volume\":\"54 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CSCW '20 Companion : conference companion publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing : October 17-21, 2020, Virtual Event, USA. Conference on Computer-Supported Cooperative Work and So...\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3406865.3418329\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CSCW '20 Companion : conference companion publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing : October 17-21, 2020, Virtual Event, USA. Conference on Computer-Supported Cooperative Work and So...","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3406865.3418329","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

摘要

在当今的就业市场,职业数据挖掘和分析变得越来越重要,因为它使公司能够预测员工流动率,建立职业轨迹模型,筛选简历和执行其他人力资源任务。因此,人们对利用职业数据挖掘和分析越来越感兴趣,而促进这些任务的一个关键要求是需要一个与职业相关的数据集。然而,大多数研究使用专有数据集或不公开其数据集,从而阻碍了该领域的发展。为了解决这个问题,我们提供了工业和专业职业数据集(IPOD),其中包括属于192,295名Linkedin用户的475,073个职位。除了公开IPOD之外,我们还:(i)手动标注每个职位的相关资历、工作领域和工作地点;(ii)为职位名称提供嵌入并讨论各种用例。该数据集可在https://github.com/junhua/ipod上公开获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
IPOD: A Large-scale Industrial and Professional Occupation Dataset
In today's job market, occupational data mining and analysis is growing in importance as it enables companies to predict employee turnover, model career trajectories, screen through resumes and perform other human resource tasks. As such, there has been growing interest in utilizing occupational data mining and analysis, and a key requirement to facilitate these tasks is the need for an occupation-related dataset. However, most research use proprietary datasets or do not make their dataset publicly available, thus impeding development in this area. To solve this issue, we present the Industrial and Professional Occupation Dataset (IPOD), which comprises 475,073 job titles belonging to 192,295 Linkedin users. In addition to making IPOD publicly available, we also: (i) manually annotate each job title with its associated level of seniority, domain of work and location; and (ii) provide embedding for job titles and discuss various use cases. This dataset is publicly available at https://github.com/junhua/ipod.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Affective Affordance of Message Balloon Animations: An Early Exploration of AniBalloons Rethinking Trust Repair in Human-Robot Interaction Computer Supported Cooperative Work and Social Computing: 17th CCF Conference, ChineseCSCW 2022, Taiyuan, China, November 25–27, 2022, Revised Selected Papers, Part II Computer Supported Cooperative Work and Social Computing: 17th CCF Conference, ChineseCSCW 2022, Taiyuan, China, November 25–27, 2022, Revised Selected Papers, Part I Computer Supported Cooperative Work and Social Computing: 16th CCF Conference, ChineseCSCW 2021, Xiangtan, China, November 26–28, 2021, Revised Selected Papers, Part II
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1