首页 > 最新文献

Journal of Econometrics最新文献

英文 中文
Nonparametric treatment effect identification in school choice 择校中的非参数治疗效果识别
IF 4 3区 经济学 Q1 ECONOMICS Pub Date : 2026-01-01 Epub Date: 2026-01-05 DOI: 10.1016/j.jeconom.2025.106172
Jiafeng Chen
This paper studies nonparametric identification and estimation of causal effects in centralized school assignment. In many centralized assignment algorithms, students face both lottery-driven variation and regression discontinuity- (RD) driven variation. We characterize the full set of identified atomic treatment effects (aTEs), defined as the conditional average treatment effect between a pair of schools given student characteristics. Atomic treatment effects are the building blocks of more aggregated treatment contrasts, and common approaches to estimating aTE aggregations can mask important heterogeneity. In particular, many aggregations of aTEs put zero weight on aTEs driven by RD variation, and estimators of such aggregations put asymptotically vanishing weight on the RD-driven aTEs. We provide a diagnostic and recommend new aggregation schemes. Lastly, we provide estimators and asymptotic results for inference on these aggregations.
本文研究了集中式学校分配中因果效应的非参数辨识与估计。在许多集中式作业算法中,学生既面临彩票驱动的变异,也面临回归不连续(RD)驱动的变异。我们描述了整套确定的原子处理效果(aTEs),定义为一对给定学生特征的学校之间的条件平均处理效果。原子处理效果是更多聚合处理对比的构建块,估计aTE聚合的常用方法可以掩盖重要的异质性。特别是,许多由RD变化驱动的aTEs的聚集将权重为零,并且这种聚集的估计将权重渐近消失放在RD驱动的aTEs上。我们提供了一个诊断和推荐新的聚合方案。最后,我们给出了这些集合的估计量和渐近推断结果。
{"title":"Nonparametric treatment effect identification in school choice","authors":"Jiafeng Chen","doi":"10.1016/j.jeconom.2025.106172","DOIUrl":"10.1016/j.jeconom.2025.106172","url":null,"abstract":"<div><div>This paper studies nonparametric identification and estimation of causal effects in centralized school assignment. In many centralized assignment algorithms, students face both lottery-driven variation and regression discontinuity- (RD) driven variation. We characterize the full set of identified <em>atomic treatment effects</em> (aTEs), defined as the conditional average treatment effect between a pair of schools given student characteristics. Atomic treatment effects are the building blocks of more aggregated treatment contrasts, and common approaches to estimating aTE aggregations can mask important heterogeneity. In particular, many aggregations of aTEs put zero weight on aTEs driven by RD variation, and estimators of such aggregations put asymptotically vanishing weight on the RD-driven aTEs. We provide a diagnostic and recommend new aggregation schemes. Lastly, we provide estimators and asymptotic results for inference on these aggregations.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"253 ","pages":"Article 106172"},"PeriodicalIF":4.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145938447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GLS estimation of local projections: Trading robustness for efficiency 局部预测的GLS估计:以鲁棒性换取效率
IF 4 3区 经济学 Q1 ECONOMICS Pub Date : 2026-01-01 Epub Date: 2026-01-19 DOI: 10.1016/j.jeconom.2026.106182
Ignace De Vos , Gerdie Everaert
Local projections (LPs) are widely used for estimating impulse responses (IRs) as they are considered more robust to model misspecification than forward-iterated IRs from dynamic models such as VARs. However, this robustness comes at the cost of higher variance, particularly at longer horizons. To mitigate this trade-off, several GLS transformations of LPs have been proposed. This paper analyzes two broad strands of GLS-type LP estimators: those that condition on residuals from an auxiliary VAR, and those that condition on residuals from previous-horizon LPs. We show that the former impose a VAR structure, which leads them to align with VAR IRs, while the latter preserve the unrestricted nature of LPs but end up replicating LP OLS estimates. Consequently, the intended efficiency gains are either not achieved or come at the expense of the very robustness that motivates the use of LPs.
局部投影(lp)被广泛用于估计脉冲响应(IRs),因为它们被认为比动态模型(如var)的前向迭代IRs更具鲁棒性。然而,这种稳健性是以更高的方差为代价的,尤其是在更长的时间跨度内。为了减轻这种权衡,已经提出了几种lp的GLS转换。本文分析了两大类gls型LP估计量:以辅助VAR残差为条件的估计量和以前水平LP残差为条件的估计量。我们表明,前者施加了一个VAR结构,这导致它们与VAR IRs保持一致,而后者保留了LP的不受限制的性质,但最终复制了LP OLS估计。因此,预期的效率收益要么没有实现,要么是以牺牲鲁棒性为代价的,而这正是使用lp的原因。
{"title":"GLS estimation of local projections: Trading robustness for efficiency","authors":"Ignace De Vos ,&nbsp;Gerdie Everaert","doi":"10.1016/j.jeconom.2026.106182","DOIUrl":"10.1016/j.jeconom.2026.106182","url":null,"abstract":"<div><div>Local projections (LPs) are widely used for estimating impulse responses (IRs) as they are considered more robust to model misspecification than forward-iterated IRs from dynamic models such as VARs. However, this robustness comes at the cost of higher variance, particularly at longer horizons. To mitigate this trade-off, several GLS transformations of LPs have been proposed. This paper analyzes two broad strands of GLS-type LP estimators: those that condition on residuals from an auxiliary VAR, and those that condition on residuals from previous-horizon LPs. We show that the former impose a VAR structure, which leads them to align with VAR IRs, while the latter preserve the unrestricted nature of LPs but end up replicating LP OLS estimates. Consequently, the intended efficiency gains are either not achieved or come at the expense of the very robustness that motivates the use of LPs.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"253 ","pages":"Article 106182"},"PeriodicalIF":4.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146034688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Empirical welfare maximization with constraints 有约束的经验福利最大化
IF 4 3区 经济学 Q1 ECONOMICS Pub Date : 2026-01-01 Epub Date: 2025-12-24 DOI: 10.1016/j.jeconom.2025.106169
Liyang Sun
Empirical Welfare Maximization (EWM) is a framework that can be used to select welfare program eligibility policies based on data. This paper extends EWM by allowing for uncertainty in estimating the budget needed to implement the selected policy, in addition to its welfare. Due to the additional estimation error, I show there exist no rules that achieve the highest welfare possible while satisfying a budget constraint uniformly over a wide range of DGPs. This differs from the setting without a budget constraint where uniformity is achievable. I propose an alternative trade-off rule and illustrate it with Medicaid expansion, a setting with imperfect take-up and varying program costs.
实证福利最大化(Empirical Welfare Maximization, EWM)是一个基于数据选择福利项目资格政策的框架。本文通过考虑实施所选政策所需预算的不确定性以及其福利来扩展EWM。由于额外的估计误差,我表明不存在能够在满足预算约束的同时,在广泛的dpp范围内实现最高福利的规则。这与没有预算限制的情况不同,在这种情况下,一致性是可以实现的。我提出了另一种权衡规则,并以医疗补助计划的扩张为例进行了说明,这是一个不完美的占用和不同项目成本的设置。
{"title":"Empirical welfare maximization with constraints","authors":"Liyang Sun","doi":"10.1016/j.jeconom.2025.106169","DOIUrl":"10.1016/j.jeconom.2025.106169","url":null,"abstract":"<div><div>Empirical Welfare Maximization (EWM) is a framework that can be used to select welfare program eligibility policies based on data. This paper extends EWM by allowing for uncertainty in estimating the budget needed to implement the selected policy, in addition to its welfare. Due to the additional estimation error, I show there exist no rules that achieve the highest welfare possible while satisfying a budget constraint uniformly over a wide range of DGPs. This differs from the setting without a budget constraint where uniformity is achievable. I propose an alternative trade-off rule and illustrate it with Medicaid expansion, a setting with imperfect take-up and varying program costs.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"253 ","pages":"Article 106169"},"PeriodicalIF":4.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145836614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation and inference for causal functions with multi-way clustered data 基于多路聚类数据的因果函数估计与推理
IF 4 3区 经济学 Q1 ECONOMICS Pub Date : 2026-01-01 Epub Date: 2026-01-07 DOI: 10.1016/j.jeconom.2025.106178
Nan Liu , Yanbo Liu , Yuya Sasaki
We propose methods for estimation and uniform inference for a broad class of causal functions, such as conditional average treatment effects and continuous treatment effects, under multi-way clustering. The causal function is identified as the conditional expectation of a Neyman-orthogonal signal that depends on high-dimensional nuisance parameters. We introduce a two-step procedure: the first step uses machine learning to estimate the nuisance parameters, and the second step projects the estimated Neyman-orthogonal signal onto a dictionary of basis functions whose dimension grows with the sample size. We consider both full-sample and multi-way cross-fitting approaches to this procedure and derive a functional limit theory for the resulting estimators. For uniform inference, we develop a novel resampling method, the multi-way cluster-robust sieve score bootstrap, which extends the sieve score bootstrap of Chen and Christensen (2018) to settings with multi-way clustering. Extensive simulations demonstrate that the proposed methods exhibit favorable finite-sample performance. We apply our approach to study the causal relationship between mistrust levels in Africa and historical exposure to the slave trade. Accounting for the two-way clustering by ethnicity and region, our inference method rejects the null hypothesis of uniformly zero effects and uncover heterogeneous treatment effects, with particularly strong impacts in regions with high historical trade intensity.
我们提出了在多向聚类条件平均处理效应和连续处理效应等大类因果函数的估计和一致推理方法。因果函数被确定为依赖于高维干扰参数的内曼正交信号的条件期望。我们引入了一个两步的过程:第一步使用机器学习来估计干扰参数,第二步将估计的内曼正交信号投影到一个基函数字典上,该基函数的维数随着样本量的增长而增长。我们考虑了这一过程的全样本和多路交叉拟合方法,并推导了结果估计量的泛函极限理论。为了均匀推理,我们开发了一种新的重采样方法,即多路聚类-鲁棒筛分bootstrap,它将Chen和Christensen(2018)的筛分bootstrap扩展到多路聚类的设置。大量的仿真表明,所提出的方法具有良好的有限样本性能。我们运用我们的方法来研究非洲的不信任程度与历史上对奴隶贸易的暴露之间的因果关系。考虑到种族和地区的双向聚类,我们的推理方法拒绝了均匀零效应的零假设,并揭示了异质性的治疗效应,在历史贸易强度高的地区影响特别强。
{"title":"Estimation and inference for causal functions with multi-way clustered data","authors":"Nan Liu ,&nbsp;Yanbo Liu ,&nbsp;Yuya Sasaki","doi":"10.1016/j.jeconom.2025.106178","DOIUrl":"10.1016/j.jeconom.2025.106178","url":null,"abstract":"<div><div>We propose methods for estimation and uniform inference for a broad class of causal functions, such as conditional average treatment effects and continuous treatment effects, under multi-way clustering. The causal function is identified as the conditional expectation of a Neyman-orthogonal signal that depends on high-dimensional nuisance parameters. We introduce a two-step procedure: the first step uses machine learning to estimate the nuisance parameters, and the second step projects the estimated Neyman-orthogonal signal onto a dictionary of basis functions whose dimension grows with the sample size. We consider both full-sample and multi-way cross-fitting approaches to this procedure and derive a functional limit theory for the resulting estimators. For uniform inference, we develop a novel resampling method, <em>the multi-way cluster-robust sieve score bootstrap</em>, which extends the sieve score bootstrap of Chen and Christensen (2018) to settings with multi-way clustering. Extensive simulations demonstrate that the proposed methods exhibit favorable finite-sample performance. We apply our approach to study the causal relationship between mistrust levels in Africa and historical exposure to the slave trade. Accounting for the two-way clustering by ethnicity and region, our inference method rejects the null hypothesis of uniformly zero effects and uncover heterogeneous treatment effects, with particularly strong impacts in regions with high historical trade intensity.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"253 ","pages":"Article 106178"},"PeriodicalIF":4.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145938446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quasi-Bayesian estimation and inference with control functions 带控制函数的拟贝叶斯估计与推理
IF 4 3区 经济学 Q1 ECONOMICS Pub Date : 2026-01-01 Epub Date: 2025-11-13 DOI: 10.1016/j.jeconom.2025.106126
Ruixuan Liu , Zhengfei Yu
This paper introduces a quasi-Bayesian method that integrates frequentist nonparametric estimation with Bayesian inference in a two-stage process. Applied to an endogenous discrete choice model, the approach first uses kernel or sieve estimators to estimate the control function nonparametrically, followed by Bayesian methods to estimate the structural parameters. This combination leverages the advantages of both frequentist tractability for nonparametric estimation and Bayesian computational efficiency for complicated structural models. We analyze the asymptotic properties of the resulting quasi-posterior distribution, finding that its mean provides a consistent estimator for the parameters of interest, although its quantiles do not yield valid confidence intervals. However, bootstrapping the quasi-posterior mean accounts for the estimation uncertainty from the first stage, thereby producing asymptotically valid confidence intervals
本文介绍了一种拟贝叶斯方法,该方法将频率非参数估计与贝叶斯推理集成为两阶段过程。该方法应用于内源性离散选择模型,首先使用核或筛估计器非参数估计控制函数,然后使用贝叶斯方法估计结构参数。这种组合利用了非参数估计的频率可追溯性和复杂结构模型的贝叶斯计算效率的优点。我们分析了所得到的准后验分布的渐近性质,发现它的平均值为感兴趣的参数提供了一致的估计量,尽管它的分位数没有产生有效的置信区间。然而,自举准后验均值解释了第一阶段的估计不确定性,从而产生渐近有效的置信区间
{"title":"Quasi-Bayesian estimation and inference with control functions","authors":"Ruixuan Liu ,&nbsp;Zhengfei Yu","doi":"10.1016/j.jeconom.2025.106126","DOIUrl":"10.1016/j.jeconom.2025.106126","url":null,"abstract":"<div><div>This paper introduces a quasi-Bayesian method that integrates frequentist nonparametric estimation with Bayesian inference in a two-stage process. Applied to an endogenous discrete choice model, the approach first uses kernel or sieve estimators to estimate the control function nonparametrically, followed by Bayesian methods to estimate the structural parameters. This combination leverages the advantages of both frequentist tractability for nonparametric estimation and Bayesian computational efficiency for complicated structural models. We analyze the asymptotic properties of the resulting quasi-posterior distribution, finding that its mean provides a consistent estimator for the parameters of interest, although its quantiles do not yield valid confidence intervals. However, bootstrapping the quasi-posterior mean accounts for the estimation uncertainty from the first stage, thereby producing asymptotically valid confidence intervals</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"253 ","pages":"Article 106126"},"PeriodicalIF":4.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145527584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multivariate kernel regression in vector and product metric spaces 向量和积度量空间中的多元核回归
IF 4 3区 经济学 Q1 ECONOMICS Pub Date : 2026-01-01 Epub Date: 2025-12-22 DOI: 10.1016/j.jeconom.2025.106168
Marcia Schafgans , Victoria Zinde-Walsh
This paper derives limit properties of nonparametric kernel regression estimators without requiring existence of density for regressors in Rq. In functional regression limit properties are established for multivariate functional regression. The rate and asymptotic normality for the Nadaraya–Watson (NW) estimator is established for distributions of regressors in Rq that allow for mass points, factor structure, multicollinearity and nonlinear dependence, as well as fractal distribution; when bounded density exists we provide statistical guarantees for the standard rate and the asymptotic normality without requiring smoothness. We demonstrate faster convergence associated with dimension reducing types of singularity, such as a fractal distribution or a factor structure in the regressors. The paper extends asymptotic normality of kernel functional regression to multivariate regression over a product of any number of metric spaces. Finite sample evidence confirms rate improvement due to singularity in regression over Rq. For functional regression the simulations underline the importance of accounting for multiple functional regressors. We demonstrate the applicability and advantages of the NW estimator in our empirical study, which reexamines the job training program evaluation based on the LaLonde data.
本文导出了非参数核回归估计量的极限性质,而不要求回归量在Rq中存在密度。在泛函回归中,建立了多元泛函回归的极限性质。对于Rq中考虑质量点、因子结构、多重共线性和非线性依赖以及分形分布的回归量分布,建立了Nadaraya-Watson (NW)估计量的速率和渐近正态性;当有界密度存在时,我们提供了标准率和渐近正态性的统计保证,而不要求平滑性。我们展示了与降维奇点类型相关的更快收敛,例如回归量中的分形分布或因子结构。本文将核泛函回归的渐近正态性推广到任意数量度量空间积上的多元回归。有限样本证据证实了由于Rq上回归的奇异性而导致的速率提高。对于函数回归,模拟强调了考虑多个函数回归量的重要性。通过对LaLonde数据的实证研究,验证了NW估计器的适用性和优越性。
{"title":"Multivariate kernel regression in vector and product metric spaces","authors":"Marcia Schafgans ,&nbsp;Victoria Zinde-Walsh","doi":"10.1016/j.jeconom.2025.106168","DOIUrl":"10.1016/j.jeconom.2025.106168","url":null,"abstract":"<div><div>This paper derives limit properties of nonparametric kernel regression estimators without requiring existence of density for regressors in <span><math><msup><mi>R</mi><mi>q</mi></msup></math></span>. In functional regression limit properties are established for multivariate functional regression. The rate and asymptotic normality for the Nadaraya–Watson (NW) estimator is established for distributions of regressors in <span><math><msup><mi>R</mi><mi>q</mi></msup></math></span> that allow for mass points, factor structure, multicollinearity and nonlinear dependence, as well as fractal distribution; when bounded density exists we provide statistical guarantees for the standard rate and the asymptotic normality without requiring smoothness. We demonstrate faster convergence associated with dimension reducing types of singularity, such as a fractal distribution or a factor structure in the regressors. The paper extends asymptotic normality of kernel functional regression to multivariate regression over a product of any number of metric spaces. Finite sample evidence confirms rate improvement due to singularity in regression over <span><math><msup><mi>R</mi><mi>q</mi></msup></math></span>. For functional regression the simulations underline the importance of accounting for multiple functional regressors. We demonstrate the applicability and advantages of the NW estimator in our empirical study, which reexamines the job training program evaluation based on the LaLonde data.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"253 ","pages":"Article 106168"},"PeriodicalIF":4.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145836615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Doubly-robust inference for conditional average treatment effects with high-dimensional controls 高维控制条件平均处理效果的双鲁棒推理
IF 4 3区 经济学 Q1 ECONOMICS Pub Date : 2026-01-01 Epub Date: 2026-01-14 DOI: 10.1016/j.jeconom.2026.106180
Adam Baybutt , Manu Navjeevan
Plausible identification of conditional average treatment effects (CATEs) can rely on controlling for a large number of variables to account for confounding factors. In these high-dimensional settings, estimation of the CATE requires estimating first-stage models whose consistency relies on correctly specifying their parametric forms. While doubly-robust estimators of the CATE exist, inference procedures based on the second-stage CATE estimator are not doubly-robust. Using the popular augmented inverse propensity weighting signal, we propose an estimator for the CATE whose resulting Wald-type confidence intervals are doubly-robust. We assume a logistic model for the propensity score and a linear model for the outcome regression, and estimate the parameters of these models using an ℓ1 (Lasso) penalty to address the high-dimensional covariates. Inference based on this estimator remains valid even if one of the logistic propensity score or linear outcome regression models are misspecified.
条件平均治疗效果(CATEs)的合理识别可以依赖于控制大量变量来解释混杂因素。在这些高维设置中,CATE的估计需要估计第一阶段模型,其一致性依赖于正确指定其参数形式。虽然存在CATE的双鲁棒估计,但基于第二阶段CATE估计的推理过程不是双鲁棒的。利用广受欢迎的增广逆倾向加权信号,我们提出了一个估计量,其结果wald型置信区间是双鲁棒的。我们假设倾向得分为逻辑模型,结果回归为线性模型,并使用l_1 (Lasso)惩罚来估计这些模型的参数,以解决高维协变量。即使逻辑倾向评分或线性结果回归模型中的一个被错误指定,基于该估计量的推断仍然有效。
{"title":"Doubly-robust inference for conditional average treatment effects with high-dimensional controls","authors":"Adam Baybutt ,&nbsp;Manu Navjeevan","doi":"10.1016/j.jeconom.2026.106180","DOIUrl":"10.1016/j.jeconom.2026.106180","url":null,"abstract":"<div><div>Plausible identification of conditional average treatment effects (CATEs) can rely on controlling for a large number of variables to account for confounding factors. In these high-dimensional settings, estimation of the CATE requires estimating first-stage models whose consistency relies on correctly specifying their parametric forms. While doubly-robust estimators of the CATE exist, inference procedures based on the second-stage CATE estimator are not doubly-robust. Using the popular augmented inverse propensity weighting signal, we propose an estimator for the CATE whose resulting Wald-type confidence intervals are doubly-robust. We assume a logistic model for the propensity score and a linear model for the outcome regression, and estimate the parameters of these models using an ℓ<sub>1</sub> (Lasso) penalty to address the high-dimensional covariates. Inference based on this estimator remains valid even if one of the logistic propensity score or linear outcome regression models are misspecified.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"253 ","pages":"Article 106180"},"PeriodicalIF":4.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inference for two-stage experiments under covariate-adaptive randomization 协变量自适应随机化下两阶段实验的推理
IF 4 3区 经济学 Q1 ECONOMICS Pub Date : 2026-01-01 Epub Date: 2026-01-18 DOI: 10.1016/j.jeconom.2026.106189
Jizhou Liu
This paper studies inference in two-stage randomized experiments under covariate-adaptive randomization. In the initial stage of this experimental design, clusters (e.g., households, schools, or graph partitions) are stratified and randomly assigned to control or treatment groups based on cluster-level covariates. Subsequently, an independent second-stage design is carried out, wherein units within each treated cluster are further stratified and randomly assigned to either control or treatment groups, based on individual-level covariates. Under the homogeneous partial interference assumption, I establish conditions under which the proposed difference-in-“average of averages” estimators are consistent and asymptotically normal for the corresponding average primary and spillover effects and develop consistent estimators of their asymptotic variances. Combining these results establishes the asymptotic validity of tests based on these estimators. My findings suggest that ignoring covariate information in the design stage can result in efficiency loss, and commonly used inference methods that ignore or improperly use covariate information can lead to either conservative or invalid inference. Then, I apply these results to studying optimal use of covariate information under covariate-adaptive randomization in large samples, and demonstrate that a specific generalized matched-pair design achieves minimum asymptotic variance for each proposed estimator. Finally, I discuss covariate adjustment, which incorporates additional baseline covariates not used for treatment assignment. The practical relevance of the theoretical results is illustrated through a simulation study and an empirical application.
本文研究了协变量自适应随机化下两阶段随机实验的推理问题。在这个实验设计的初始阶段,聚类(例如,家庭、学校或图形分区)被分层,并根据聚类水平的协变量随机分配到对照组或治疗组。随后,进行独立的第二阶段设计,其中每个治疗组中的单位进一步分层,并根据个人水平的协变量随机分配到对照组或治疗组。在齐次部分干涉假设下,我建立了对相应的平均初级效应和溢出效应提出的“平均值的平均值”差估计是一致的和渐近正态的条件,并开发了它们的渐近方差的一致估计。结合这些结果,建立了基于这些估计量的检验的渐近有效性。我的研究结果表明,在设计阶段忽略协变量信息会导致效率损失,而通常使用的忽略或不正确使用协变量信息的推理方法可能导致保守或无效的推理。然后,我将这些结果应用于在大样本中研究协变量自适应随机化下协变量信息的最佳使用,并证明了特定的广义匹配对设计对于每个提出的估计量实现了最小的渐近方差。最后,我讨论协变量调整,它包含了额外的基线协变量,而不是用于治疗分配。通过仿真研究和实证应用说明了理论结果的实际意义。
{"title":"Inference for two-stage experiments under covariate-adaptive randomization","authors":"Jizhou Liu","doi":"10.1016/j.jeconom.2026.106189","DOIUrl":"10.1016/j.jeconom.2026.106189","url":null,"abstract":"<div><div>This paper studies inference in two-stage randomized experiments under covariate-adaptive randomization. In the initial stage of this experimental design, clusters (e.g., households, schools, or graph partitions) are stratified and randomly assigned to control or treatment groups based on cluster-level covariates. Subsequently, an independent second-stage design is carried out, wherein units within each treated cluster are further stratified and randomly assigned to either control or treatment groups, based on individual-level covariates. Under the homogeneous partial interference assumption, I establish conditions under which the proposed difference-in-“average of averages” estimators are consistent and asymptotically normal for the corresponding average primary and spillover effects and develop consistent estimators of their asymptotic variances. Combining these results establishes the asymptotic validity of tests based on these estimators. My findings suggest that ignoring covariate information in the design stage can result in efficiency loss, and commonly used inference methods that ignore or improperly use covariate information can lead to either conservative or invalid inference. Then, I apply these results to studying optimal use of covariate information under covariate-adaptive randomization in large samples, and demonstrate that a specific generalized matched-pair design achieves minimum asymptotic variance for each proposed estimator. Finally, I discuss covariate adjustment, which incorporates additional baseline covariates not used for treatment assignment. The practical relevance of the theoretical results is illustrated through a simulation study and an empirical application.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"253 ","pages":"Article 106189"},"PeriodicalIF":4.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146034687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven policy learning for continuous treatments 数据驱动的连续治疗策略学习
IF 4 3区 经济学 Q1 ECONOMICS Pub Date : 2026-01-01 Epub Date: 2025-12-25 DOI: 10.1016/j.jeconom.2025.106170
Chunrong Ai , Yue Fang , Haitian Xie
This paper studies policy learning for continuous treatments from observational data. Continuous treatments present more significant challenges than discrete ones because population welfare may need nonparametric estimation, and policy space may be infinite-dimensional and may satisfy shape restrictions. We propose to approximate the policy space with a sequence of finite-dimensional spaces and, for any given policy, obtain the empirical welfare by applying the kernel method. We consider two cases: known and unknown propensity scores. In the latter case, we allow for machine learning of the propensity score and modify the empirical welfare to account for the effect of machine learning. The learned policy maximizes the empirical welfare or the modified empirical welfare over the approximating space. In both cases, we modify the penalty algorithm proposed in Mbakop and Tabord-Meehan (2021) to data-automate the tuning parameters (i.e., bandwidth and dimension of the approximating space) and establish an oracle inequality for the welfare regret.
本文研究了基于观测数据的连续治疗策略学习。由于人口福利可能需要非参数估计,并且政策空间可能是无限维的,并且可能满足形状限制,因此连续处理比离散处理面临更大的挑战。我们提出用有限维空间序列来近似策略空间,并对任意给定的策略,应用核方法获得经验福利。我们考虑两种情况:已知和未知的倾向得分。在后一种情况下,我们允许倾向得分的机器学习,并修改经验福利来解释机器学习的影响。学习策略使经验福利或修正经验福利在近似空间上最大化。在这两种情况下,我们修改了Mbakop和Tabord-Meehan(2021)提出的惩罚算法,使调优参数(即近似空间的带宽和维度)数据自动化,并建立了福利后悔的oracle不等式。
{"title":"Data-driven policy learning for continuous treatments","authors":"Chunrong Ai ,&nbsp;Yue Fang ,&nbsp;Haitian Xie","doi":"10.1016/j.jeconom.2025.106170","DOIUrl":"10.1016/j.jeconom.2025.106170","url":null,"abstract":"<div><div>This paper studies policy learning for continuous treatments from observational data. Continuous treatments present more significant challenges than discrete ones because population welfare may need nonparametric estimation, and policy space may be infinite-dimensional and may satisfy shape restrictions. We propose to approximate the policy space with a sequence of finite-dimensional spaces and, for any given policy, obtain the empirical welfare by applying the kernel method. We consider two cases: known and unknown propensity scores. In the latter case, we allow for machine learning of the propensity score and modify the empirical welfare to account for the effect of machine learning. The learned policy maximizes the empirical welfare or the modified empirical welfare over the approximating space. In both cases, we modify the penalty algorithm proposed in Mbakop and Tabord-Meehan (2021) to data-automate the tuning parameters (i.e., bandwidth and dimension of the approximating space) and establish an oracle inequality for the welfare regret.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"253 ","pages":"Article 106170"},"PeriodicalIF":4.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145836613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decomposition and interpretation of treatment effects in settings with delayed outcomes 结果延迟的情况下治疗效果的分解和解释
IF 4 3区 经济学 Q1 ECONOMICS Pub Date : 2026-01-01 Epub Date: 2025-12-05 DOI: 10.1016/j.jeconom.2025.106160
Federico A. Bugni , Ivan A. Canay , Steve McBride
This paper studies settings where the analyst is interested in identifying and estimating the average direct causal effect of a binary treatment on an outcome. We consider a setup in which the outcome realization does not get immediately realized after the treatment assignment, a feature that is ubiquitous in empirical settings. The period between the treatment and the realization of the outcome allows other observed actions to occur and affect the outcome. In this context, we study several regression-based estimands routinely used in empirical work to capture the average treatment effect and shed light on interpreting them in terms of ceteris paribus effects, indirect causal effects, and selection terms. We obtain three main and related takeaways under a common set of assumptions. First, the three most popular estimands do not generally satisfy what we call strong sign preservation, in the sense that these estimands may be negative even when the treatment positively affects the outcome conditional on any possible combination of other actions. Second, the most popular regression that includes the other actions as controls satisfies strong sign preservation if and only if these actions are mutually exclusive binary variables. Finally, we show that a linear regression that fully stratifies the other actions leads to estimands that satisfy strong sign preservation.
本文研究分析员对识别和估计二元治疗对结果的平均直接因果效应感兴趣的设置。我们考虑一种设置,其中结果实现在处理分配后不会立即实现,这是在经验设置中普遍存在的特征。治疗和实现结果之间的这段时间允许其他观察到的行为发生并影响结果。在这种背景下,我们研究了几个基于回归的估计,这些估计通常用于实证工作,以捕获平均治疗效果,并阐明了根据其他条件效应、间接因果效应和选择条款来解释它们。在一组共同的假设下,我们得到了三个主要的和相关的结论。首先,三种最流行的估计通常不满足我们所说的强符号保存,也就是说,即使在治疗对结果产生积极影响的情况下,这些估计也可能是负的,条件是任何可能的其他行动的组合。其次,当且仅当这些操作是互斥的二进制变量时,最流行的包括其他操作作为控制的回归满足强符号保存。最后,我们证明了将其他行为完全分层的线性回归导致满足强符号保持的估计。
{"title":"Decomposition and interpretation of treatment effects in settings with delayed outcomes","authors":"Federico A. Bugni ,&nbsp;Ivan A. Canay ,&nbsp;Steve McBride","doi":"10.1016/j.jeconom.2025.106160","DOIUrl":"10.1016/j.jeconom.2025.106160","url":null,"abstract":"<div><div>This paper studies settings where the analyst is interested in identifying and estimating the average <em>direct</em> causal effect of a binary treatment on an outcome. We consider a setup in which the outcome realization does not get immediately realized after the treatment assignment, a feature that is ubiquitous in empirical settings. The period between the treatment and the realization of the outcome allows other observed actions to occur and affect the outcome. In this context, we study several regression-based estimands routinely used in empirical work to capture the average treatment effect and shed light on interpreting them in terms of ceteris paribus effects, indirect causal effects, and selection terms. We obtain three main and related takeaways under a common set of assumptions. First, the three most popular estimands do not generally satisfy what we call <em>strong sign preservation</em>, in the sense that these estimands may be negative even when the treatment positively affects the outcome conditional on any possible combination of other actions. Second, the most popular regression that includes the other actions as controls satisfies strong sign preservation <em>if and only if</em> these actions are mutually exclusive binary variables. Finally, we show that a linear regression that fully stratifies the other actions leads to estimands that satisfy strong sign preservation.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"253 ","pages":"Article 106160"},"PeriodicalIF":4.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145690430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Econometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1