P. Pedraza, Stefano Visintin, K. Tijdens, G. Kismihók
{"title":"Survey vs Scraped Data: Comparing Time Series Properties of Web and Survey Vacancy Data","authors":"P. Pedraza, Stefano Visintin, K. Tijdens, G. Kismihók","doi":"10.2478/izajole-2019-0004","DOIUrl":null,"url":null,"abstract":"Abstract This paper studies the relationship between a vacancy population obtained from web crawling and vacancies in the economy inferred by a National Statistics Office (NSO) using a traditional method. We compare the time series properties of samples obtained between 2007 and 2014 by Statistics Netherlands and by a web scraping company. We find that the web and NSO vacancy data present similar time series properties, suggesting that both time series are generated by the same underlying phenomenon: the real number of new vacancies in the economy. We conclude that, in our case study, web-sourced data are able to capture aggregate economic activity in the labor market.","PeriodicalId":37841,"journal":{"name":"IZA Journal of Labor Economics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IZA Journal of Labor Economics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/izajole-2019-0004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Business, Management and Accounting","Score":null,"Total":0}
引用次数: 12
Abstract
Abstract This paper studies the relationship between a vacancy population obtained from web crawling and vacancies in the economy inferred by a National Statistics Office (NSO) using a traditional method. We compare the time series properties of samples obtained between 2007 and 2014 by Statistics Netherlands and by a web scraping company. We find that the web and NSO vacancy data present similar time series properties, suggesting that both time series are generated by the same underlying phenomenon: the real number of new vacancies in the economy. We conclude that, in our case study, web-sourced data are able to capture aggregate economic activity in the labor market.
期刊介绍:
As of March 31, 2019, the IZA Open Access Journal Series will transfer to Sciendo. Please use the Springer Editorial Manager system for all submissions until February 28. During the transfer period in March 2019 you may direct your submissions to journals@iza.org. The IZA Journal of Labor Economics publishes scientific articles in all areas of labor economics. This refers to original high-quality theoretical and applied contributions on both microeconomic and macroeconomic labor-related topics. In particular, the IZA Journal of Labor Economics encourages submissions in subject areas that are closely linked to the various IZA Program Areas, ranging from education, family and environment to mobility, behavioral and personnel economics, and labor market institutions, among others. The IZA Journal of Labor Economics is part of IZA’s mission of contributing to social and economic discourse, enabling political decision-making to be based on the best available scientific knowledge. We want to stimulate research to close knowledge gaps. Hence, the IZA Journal of Labor Economics particularly welcomes contributions that provide scientifically sound answers to open and relevant questions of modern labor economics.