{"title":"BLP-LASSO for Aggregate Discrete Choice Models with Rich Covariates","authors":"B. Gillen, Sergio Montero, H. Moon, M. Shum","doi":"10.2139/ssrn.2700775","DOIUrl":null,"url":null,"abstract":"We introduce the BLP-LASSO model, which augments the classic BLP (Berry, Levinsohn, and Pakes, 1995) random-coefficients logit model to allow for data-driven selection among a high- dimensional set of control variables. Economists often study consumers’ aggregate behavior across markets choosing from a menu of differentiated products. In this analysis, local demo- graphic characteristics can serve as controls for market-specific preference heterogeneity. Given rich demographic data, implementing these models requires specifying which variables to include in the analysis, an ad hoc process typically guided primarily by a researcher’s intuition. We pro- pose a data-driven approach to estimate these models applying penalized estimation algorithms imported from the machine learning literature that are known to be valid for uniform inferences with respect to variable selection. Our application explores the effect of campaign spending on vote shares in data from Mexican elections.","PeriodicalId":165362,"journal":{"name":"ERN: Discrete Regression & Qualitative Choice Models (Single) (Topic)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Discrete Regression & Qualitative Choice Models (Single) (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.2700775","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
We introduce the BLP-LASSO model, which augments the classic BLP (Berry, Levinsohn, and Pakes, 1995) random-coefficients logit model to allow for data-driven selection among a high- dimensional set of control variables. Economists often study consumers’ aggregate behavior across markets choosing from a menu of differentiated products. In this analysis, local demo- graphic characteristics can serve as controls for market-specific preference heterogeneity. Given rich demographic data, implementing these models requires specifying which variables to include in the analysis, an ad hoc process typically guided primarily by a researcher’s intuition. We pro- pose a data-driven approach to estimate these models applying penalized estimation algorithms imported from the machine learning literature that are known to be valid for uniform inferences with respect to variable selection. Our application explores the effect of campaign spending on vote shares in data from Mexican elections.