{"title":"Linear regression with many controls of limited explanatory power","authors":"Chenchuan Li, Ulrich K. Müller","doi":"10.3982/QE1577","DOIUrl":null,"url":null,"abstract":"We consider inference about a scalar coefficient in a linear regression model. One previously considered approach to dealing with many controls imposes sparsity, that is, it is assumed known that nearly all control coefficients are (very nearly) zero. We instead impose a bound on the quadratic mean of the controls' effect on the dependent variable, which also has an interpretation as an R 2‐type bound on the explanatory power of the controls. We develop a simple inference procedure that exploits this additional information in general heteroskedastic models. We study its asymptotic efficiency properties and compare it to a sparsity‐based approach in a Monte Carlo study. The method is illustrated in three empirical applications.","PeriodicalId":46811,"journal":{"name":"Quantitative Economics","volume":"1 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Economics","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.3982/QE1577","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 11
Abstract
We consider inference about a scalar coefficient in a linear regression model. One previously considered approach to dealing with many controls imposes sparsity, that is, it is assumed known that nearly all control coefficients are (very nearly) zero. We instead impose a bound on the quadratic mean of the controls' effect on the dependent variable, which also has an interpretation as an R 2‐type bound on the explanatory power of the controls. We develop a simple inference procedure that exploits this additional information in general heteroskedastic models. We study its asymptotic efficiency properties and compare it to a sparsity‐based approach in a Monte Carlo study. The method is illustrated in three empirical applications.