首页 > 最新文献

Journal of Business & Economic Statistics最新文献

英文 中文
Specification Testing of Regression Models with Mixed Discrete and Continuous Predictors 离散和连续混合预测回归模型的规格检验
IF 3 2区 数学 Q1 ECONOMICS Pub Date : 2022-08-08 DOI: 10.1080/07350015.2022.2110879
Xuehu Zhu, Qiming Zhang, Lixing Zhu, Jun Zhang, Luoyao Yu
Abstract This article proposes a nonparametric projection-based adaptive-to-model specification test for regressions with discrete and continuous predictors. The test statistic is asymptotically normal under the null hypothesis and omnibus against alternative hypotheses. The test behaves like a locally smoothing test as if the number of continuous predictors was one and can detect the local alternative hypotheses distinct from the null hypothesis at the rate that can be achieved by existing locally smoothing tests for regressions with only one continuous predictor. Because of the model adaptation property, the test can fully use the model structure under the null hypothesis so that the dimensionality problem can be significantly alleviated. A discretization-expectation ordinary least squares estimation approach for partial central subspace in sufficient dimension reduction is developed as a by-product in the test construction. We suggest a residual-based wild bootstrap method to give an approximation by fully using the null model and thus closer to the limiting null distribution than existing bootstrap approximations. We conduct simulation studies to compare it with existing tests and two real data examples for illustration.
摘要本文提出了一种基于非参数投影的离散和连续预测回归自适应模型规范检验方法。检验统计量在零假设下是渐近正态的,对备择假设是综合的。该检验的行为类似于局部平滑检验,就好像连续预测因子的数量是一个一样,并且可以检测到与零假设不同的局部替代假设,其速度与只有一个连续预测因子的回归的现有局部平滑检验所能达到的速度相同。由于模型的自适应特性,该检验可以充分利用零假设下的模型结构,从而显著缓解维数问题。作为试验构造的副产品,提出了一种充分降维的部分中心子空间的离散化期望普通最小二乘估计方法。我们提出了一种基于残差的野生自举方法,通过充分利用零模型来给出近似,从而比现有的自举近似更接近极限零分布。我们进行了模拟研究,将其与现有测试和两个真实数据示例进行比较。
{"title":"Specification Testing of Regression Models with Mixed Discrete and Continuous Predictors","authors":"Xuehu Zhu, Qiming Zhang, Lixing Zhu, Jun Zhang, Luoyao Yu","doi":"10.1080/07350015.2022.2110879","DOIUrl":"https://doi.org/10.1080/07350015.2022.2110879","url":null,"abstract":"Abstract This article proposes a nonparametric projection-based adaptive-to-model specification test for regressions with discrete and continuous predictors. The test statistic is asymptotically normal under the null hypothesis and omnibus against alternative hypotheses. The test behaves like a locally smoothing test as if the number of continuous predictors was one and can detect the local alternative hypotheses distinct from the null hypothesis at the rate that can be achieved by existing locally smoothing tests for regressions with only one continuous predictor. Because of the model adaptation property, the test can fully use the model structure under the null hypothesis so that the dimensionality problem can be significantly alleviated. A discretization-expectation ordinary least squares estimation approach for partial central subspace in sufficient dimension reduction is developed as a by-product in the test construction. We suggest a residual-based wild bootstrap method to give an approximation by fully using the null model and thus closer to the limiting null distribution than existing bootstrap approximations. We conduct simulation studies to compare it with existing tests and two real data examples for illustration.","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"41 1","pages":"1101 - 1115"},"PeriodicalIF":3.0,"publicationDate":"2022-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48057158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
LASSO for Stochastic Frontier Models with Many Efficient Firms 具有多个有效企业的随机前沿模型的LASSO
IF 3 2区 数学 Q1 ECONOMICS Pub Date : 2022-08-08 DOI: 10.1080/07350015.2022.2110881
W. Horrace, Hyunseok Jung, Yoonseok Lee
Abstract We apply the adaptive LASSO to select a set of maximally efficient firms in the panel fixed-effect stochastic frontier model. The adaptively weighted L 1 penalty with sign restrictions allows simultaneous selection of a group of maximally efficient firms and estimation of firm-level inefficiency parameters with a faster rate of convergence than least squares dummy variable estimators. Our estimator possesses the oracle property. We propose a tuning parameter selection criterion and an efficient optimization algorithm based on coordinate descent. We apply the method to estimate a group of efficient police officers who are best at detecting contraband in motor vehicle stops (i.e., search efficiency) in Syracuse, NY.
摘要我们应用自适应LASSO在面板固定效应随机前沿模型中选择一组最大有效企业。具有符号限制的自适应加权L1惩罚允许同时选择一组最大有效的企业,并以比最小二乘伪变量估计器更快的收敛速度估计企业级低效率参数。我们的估计量具有预言性质。提出了一种基于坐标下降的调谐参数选择准则和一种有效的优化算法。我们应用该方法来估计纽约州锡拉丘兹市一组最擅长在机动车停车场发现违禁品的高效警察(即搜索效率)。
{"title":"LASSO for Stochastic Frontier Models with Many Efficient Firms","authors":"W. Horrace, Hyunseok Jung, Yoonseok Lee","doi":"10.1080/07350015.2022.2110881","DOIUrl":"https://doi.org/10.1080/07350015.2022.2110881","url":null,"abstract":"Abstract We apply the adaptive LASSO to select a set of maximally efficient firms in the panel fixed-effect stochastic frontier model. The adaptively weighted L 1 penalty with sign restrictions allows simultaneous selection of a group of maximally efficient firms and estimation of firm-level inefficiency parameters with a faster rate of convergence than least squares dummy variable estimators. Our estimator possesses the oracle property. We propose a tuning parameter selection criterion and an efficient optimization algorithm based on coordinate descent. We apply the method to estimate a group of efficient police officers who are best at detecting contraband in motor vehicle stops (i.e., search efficiency) in Syracuse, NY.","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"41 1","pages":"1132 - 1142"},"PeriodicalIF":3.0,"publicationDate":"2022-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48059493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Nonparametric, Stochastic Frontier Models with Multiple Inputs and Outputs 多输入和多输出的非参数随机前沿模型
IF 3 2区 数学 Q1 ECONOMICS Pub Date : 2022-08-08 DOI: 10.1080/07350015.2022.2110882
L. Simar, P. W. Wilson
Abstract Stochastic frontier models along the lines of Aigner et al. are widely used to benchmark firms’ performances in terms of efficiency. The models are typically fully parametric, with functional form specifications for the frontier as well as both the noise and the inefficiency processes. Studies such as Kumbhakar et al. have attempted to relax some of the restrictions in parametric models, but so far all such approaches are limited to a univariate response variable. Some (e.g., Simar and Zelenyuk; Kuosmanen and Johnson) have proposed nonparametric estimation of directional distance functions to handle multiple inputs and outputs, raising issues of endogeneity that are either ignored or addressed by imposing restrictive and implausible assumptions. This article extends nonparametric methods developed by Simar et al. and Hafner et al. to allow multiple inputs and outputs in an almost fully nonparametric framework while avoiding endogeneity problems. We discuss properties of the resulting estimators, and examine their finite-sample performance through Monte Carlo experiments. Practical implementation of the method is illustrated using data on U.S. commercial banks.
摘要Aigner等人的随机前沿模型被广泛用于衡量企业的效率表现。模型通常是全参数的,具有边界以及噪声和低效过程的函数形式规范。Kumbhakar等人的研究试图放松参数模型中的一些限制,但到目前为止,所有这些方法都局限于单变量响应变量。一些人(例如,Simar和Zelenyuk;Kuosmann和Johnson)提出了方向距离函数的非参数估计,以处理多个输入和输出,提出了内生性问题,这些问题要么被忽视,要么通过强加限制性和不可信的假设来解决。本文扩展了Simar等人和Hafner等人开发的非参数方法。在几乎完全非参数的框架中允许多个输入和输出,同时避免内生性问题。我们讨论了所得估计量的性质,并通过蒙特卡罗实验检验了它们的有限样本性能。利用美国商业银行的数据说明了该方法的实际实施。
{"title":"Nonparametric, Stochastic Frontier Models with Multiple Inputs and Outputs","authors":"L. Simar, P. W. Wilson","doi":"10.1080/07350015.2022.2110882","DOIUrl":"https://doi.org/10.1080/07350015.2022.2110882","url":null,"abstract":"Abstract Stochastic frontier models along the lines of Aigner et al. are widely used to benchmark firms’ performances in terms of efficiency. The models are typically fully parametric, with functional form specifications for the frontier as well as both the noise and the inefficiency processes. Studies such as Kumbhakar et al. have attempted to relax some of the restrictions in parametric models, but so far all such approaches are limited to a univariate response variable. Some (e.g., Simar and Zelenyuk; Kuosmanen and Johnson) have proposed nonparametric estimation of directional distance functions to handle multiple inputs and outputs, raising issues of endogeneity that are either ignored or addressed by imposing restrictive and implausible assumptions. This article extends nonparametric methods developed by Simar et al. and Hafner et al. to allow multiple inputs and outputs in an almost fully nonparametric framework while avoiding endogeneity problems. We discuss properties of the resulting estimators, and examine their finite-sample performance through Monte Carlo experiments. Practical implementation of the method is illustrated using data on U.S. commercial banks.","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"41 1","pages":"1391 - 1403"},"PeriodicalIF":3.0,"publicationDate":"2022-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48001020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Bagged Pretested Portfolio Selection 袋装预测试投资组合选择
IF 3 2区 数学 Q1 ECONOMICS Pub Date : 2022-08-08 DOI: 10.1080/07350015.2022.2110880
Ekaterina Kazak, W. Pohlmeier
Abstract This article exploits the idea of combining pretesting and bagging to choose between competing portfolio strategies. We propose an estimator for the portfolio weight vector, which optimally trades off Type I against Type II errors when choosing the best investment strategy. Furthermore, we accommodate the idea of bagging in the portfolio testing problem, which helps to avoid sharp thresholding and reduces turnover costs substantially. Our Bagged Pretested Portfolio Selection (BPPS) approach borrows from both the shrinkage and the forecast combination literature. The portfolio weights of our strategy are weighted averages of the portfolio weights from a set of stand-alone strategies. More specifically, the weights are generated from pseudo-out-of-sample portfolio pretesting, such that they reflect the probability that a given strategy will be overall best performing. The resulting strategy allows for a flexible and smooth switch between the underlying strategies and outperforms the corresponding stand-alone strategies. Besides yielding high point estimates of the portfolio performance measures, the BPPS approach performs exceptionally well in terms of precision and is robust against outliers resulting from the choice of the asset space.
摘要本文利用前测和套袋相结合的思想,在竞争组合策略之间进行选择。我们提出了一个投资组合权重向量的估计器,它在选择最佳投资策略时最优地权衡了类型I和类型II的错误。此外,我们在组合测试问题中容纳了打包的想法,这有助于避免尖锐的阈值,并大大减少周转成本。我们的套袋预测试投资组合选择(BPPS)方法借鉴了收缩和预测组合的文献。我们策略的投资组合权重是一组独立策略的投资组合权重的加权平均值。更具体地说,权重是由伪样本外投资组合预测试生成的,这样它们就反映了给定策略整体表现最佳的概率。由此产生的策略允许在基础策略之间灵活而平稳的切换,并且优于相应的独立策略。除了产生投资组合绩效指标的高点估计外,BPPS方法在精度方面表现得非常好,并且对资产空间选择产生的异常值具有鲁棒性。
{"title":"Bagged Pretested Portfolio Selection","authors":"Ekaterina Kazak, W. Pohlmeier","doi":"10.1080/07350015.2022.2110880","DOIUrl":"https://doi.org/10.1080/07350015.2022.2110880","url":null,"abstract":"Abstract This article exploits the idea of combining pretesting and bagging to choose between competing portfolio strategies. We propose an estimator for the portfolio weight vector, which optimally trades off Type I against Type II errors when choosing the best investment strategy. Furthermore, we accommodate the idea of bagging in the portfolio testing problem, which helps to avoid sharp thresholding and reduces turnover costs substantially. Our Bagged Pretested Portfolio Selection (BPPS) approach borrows from both the shrinkage and the forecast combination literature. The portfolio weights of our strategy are weighted averages of the portfolio weights from a set of stand-alone strategies. More specifically, the weights are generated from pseudo-out-of-sample portfolio pretesting, such that they reflect the probability that a given strategy will be overall best performing. The resulting strategy allows for a flexible and smooth switch between the underlying strategies and outperforms the corresponding stand-alone strategies. Besides yielding high point estimates of the portfolio performance measures, the BPPS approach performs exceptionally well in terms of precision and is robust against outliers resulting from the choice of the asset space.","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"41 1","pages":"1116 - 1131"},"PeriodicalIF":3.0,"publicationDate":"2022-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41418356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of SVAR Models by Combining Sign Restrictions With External Instruments 符号约束与外部仪器相结合的SVAR模型识别
IF 3 2区 数学 Q1 ECONOMICS Pub Date : 2022-07-29 DOI: 10.1080/07350015.2022.2104857
R. Braun, R. Brüggemann
ABSTRACT We discuss combining sign restrictions with information in external instruments (proxy variables) to identify structural vector autoregressive (SVAR) models. In one setting, we assume the availability of valid external instruments. Sign restrictions may then be used to identify further orthogonal shocks, or as an additional piece of information to pin down the shocks identified by the external instruments more precisely. In a second setting, we assume that proxy variables are only “plausibly exogenous” and suggest various types of inequality restrictions to bound the relation between structural shocks and the external variable. This can be combined with conventional sign restrictions to further narrow down the set of admissible models. Within a proxy-augmented SVAR, we conduct Bayesian inference and discuss computation of Bayes factors. They can be useful to test either the sign- or IV restrictions as overidentifying. We illustrate the usefulness of our methodology in estimating the effects of oil supply and monetary policy shocks.
摘要我们讨论了将符号限制与外部工具(代理变量)中的信息相结合来识别结构向量自回归(SVAR)模型。在一种情况下,我们假设有效的外部仪器可用。然后,可以使用符号限制来识别进一步的正交冲击,或者作为一条附加信息来更准确地确定外部仪器识别的冲击。在第二种情况下,我们假设代理变量只是“看似外生的”,并提出了各种类型的不平等限制,以约束结构性冲击和外部变量之间的关系。这可以与传统的符号限制相结合,以进一步缩小可接受模型的范围。在代理增强SVAR中,我们进行贝叶斯推理,并讨论贝叶斯因子的计算。它们可以用来测试体征或IV限制是否为过度识别。我们说明了我们的方法在估计石油供应和货币政策冲击的影响方面的有用性。
{"title":"Identification of SVAR Models by Combining Sign Restrictions With External Instruments","authors":"R. Braun, R. Brüggemann","doi":"10.1080/07350015.2022.2104857","DOIUrl":"https://doi.org/10.1080/07350015.2022.2104857","url":null,"abstract":"ABSTRACT We discuss combining sign restrictions with information in external instruments (proxy variables) to identify structural vector autoregressive (SVAR) models. In one setting, we assume the availability of valid external instruments. Sign restrictions may then be used to identify further orthogonal shocks, or as an additional piece of information to pin down the shocks identified by the external instruments more precisely. In a second setting, we assume that proxy variables are only “plausibly exogenous” and suggest various types of inequality restrictions to bound the relation between structural shocks and the external variable. This can be combined with conventional sign restrictions to further narrow down the set of admissible models. Within a proxy-augmented SVAR, we conduct Bayesian inference and discuss computation of Bayes factors. They can be useful to test either the sign- or IV restrictions as overidentifying. We illustrate the usefulness of our methodology in estimating the effects of oil supply and monetary policy shocks.","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"41 1","pages":"1077 - 1089"},"PeriodicalIF":3.0,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44798055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Trends in Earnings Volatility Using Linked Administrative and Survey Data 使用关联管理和调查数据的收益波动趋势
IF 3 2区 数学 Q1 ECONOMICS Pub Date : 2022-07-15 DOI: 10.1080/07350015.2022.2102023
James P. Ziliak, Charles M. Hokayem, C. Bollinger
Abstract We document trends in earnings volatility separately by gender using unique linked survey data from the CPS ASEC and Social Security earnings records for the tax years spanning 1995–2015. The exact data link permits us to focus on differences in measured volatility from earnings nonresponse, survey attrition, and measurement between survey and administrative earnings data reports, while holding constant the sampling frame. Our results for both men and women suggest that the level and trend in volatility is similar in the survey and administrative data, showing substantial business-cycle sensitivity among men but no overall trend among continuous workers, while women demonstrate no change in earnings volatility over the business cycle but a declining trend. A substantive difference emerges with the inclusion of imputed earnings among survey nonrespondents, suggesting that users of the ASEC drop earnings nonrespondents.
我们使用来自CPS ASEC和社会保障收入记录的1995-2015纳税年度的独特关联调查数据,按性别分别记录了收入波动的趋势。准确的数据链接使我们能够专注于从盈余无反应、调查减员以及调查和行政盈余数据报告之间的测量中测量的波动性差异,同时保持抽样框架不变。我们对男性和女性的研究结果表明,在调查和管理数据中,波动性的水平和趋势是相似的,在男性中显示出实质性的商业周期敏感性,但在连续工作者中没有总体趋势,而女性在商业周期中表现出收入波动性没有变化,而是呈下降趋势。在非受访者中纳入估算收入后,出现了实质性差异,这表明ASEC的用户减少了非受访者的收入。
{"title":"Trends in Earnings Volatility Using Linked Administrative and Survey Data","authors":"James P. Ziliak, Charles M. Hokayem, C. Bollinger","doi":"10.1080/07350015.2022.2102023","DOIUrl":"https://doi.org/10.1080/07350015.2022.2102023","url":null,"abstract":"Abstract We document trends in earnings volatility separately by gender using unique linked survey data from the CPS ASEC and Social Security earnings records for the tax years spanning 1995–2015. The exact data link permits us to focus on differences in measured volatility from earnings nonresponse, survey attrition, and measurement between survey and administrative earnings data reports, while holding constant the sampling frame. Our results for both men and women suggest that the level and trend in volatility is similar in the survey and administrative data, showing substantial business-cycle sensitivity among men but no overall trend among continuous workers, while women demonstrate no change in earnings volatility over the business cycle but a declining trend. A substantive difference emerges with the inclusion of imputed earnings among survey nonrespondents, suggesting that users of the ASEC drop earnings nonrespondents.","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"41 1","pages":"12 - 19"},"PeriodicalIF":3.0,"publicationDate":"2022-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42002407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Large-Scale Generalized Linear Models for Longitudinal Data with Grouped Patterns of Unobserved Heterogeneity 具有未观测异质性分组模式的纵向数据的大规模广义线性模型
IF 3 2区 数学 Q1 ECONOMICS Pub Date : 2022-07-06 DOI: 10.1080/07350015.2022.2097913
T. Ando, Jushan Bai
ABSTRACT This article provides methods for flexibly capturing unobservable heterogeneity from longitudinal data in the context of an exponential family of distributions. The group memberships of individual units are left unspecified, and their heterogeneity is influenced by group-specific unobservable factor structures. The model includes, as special cases, probit, logit, and Poisson regressions with interactive fixed effects along with unknown group membership. We discuss a computationally efficient estimation method and derive the corresponding asymptotic theory. Uniform consistency of the estimated group membership is established. To test heterogeneous regression coefficients within groups, we propose a Swamy-type test that allows for unobserved heterogeneity. We apply the proposed method to the study of market structure of the taxi industry in New York City. Our method unveils interesting and important insights from large-scale longitudinal data that consist of over 450 million data points.
摘要本文提供了在指数分布族的背景下,从纵向数据中灵活捕捉不可观测异质性的方法。个体单元的群体成员身份未明确,其异质性受到群体特定的不可观察因素结构的影响。作为特殊情况,该模型包括具有交互固定效应的probit、logit和Poisson回归以及未知的群成员关系。我们讨论了一种计算有效的估计方法,并导出了相应的渐近理论。建立了估计组成员的一致性。为了测试组内的异质回归系数,我们提出了一种允许未观察到的异质性的Swamy型检验。我们将所提出的方法应用于纽约市出租车行业的市场结构研究。我们的方法从由超过4.5亿个数据点组成的大规模纵向数据中揭示了有趣而重要的见解。
{"title":"Large-Scale Generalized Linear Models for Longitudinal Data with Grouped Patterns of Unobserved Heterogeneity","authors":"T. Ando, Jushan Bai","doi":"10.1080/07350015.2022.2097913","DOIUrl":"https://doi.org/10.1080/07350015.2022.2097913","url":null,"abstract":"ABSTRACT This article provides methods for flexibly capturing unobservable heterogeneity from longitudinal data in the context of an exponential family of distributions. The group memberships of individual units are left unspecified, and their heterogeneity is influenced by group-specific unobservable factor structures. The model includes, as special cases, probit, logit, and Poisson regressions with interactive fixed effects along with unknown group membership. We discuss a computationally efficient estimation method and derive the corresponding asymptotic theory. Uniform consistency of the estimated group membership is established. To test heterogeneous regression coefficients within groups, we propose a Swamy-type test that allows for unobserved heterogeneity. We apply the proposed method to the study of market structure of the taxi industry in New York City. Our method unveils interesting and important insights from large-scale longitudinal data that consist of over 450 million data points.","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"41 1","pages":"983 - 994"},"PeriodicalIF":3.0,"publicationDate":"2022-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48151446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Signal Recovery for High-Dimensional Linear Log-Contrast Models with Compositional Covariates 具有组合协变量的高维线性对数对比度模型的鲁棒信号恢复
IF 3 2区 数学 Q1 ECONOMICS Pub Date : 2022-07-06 DOI: 10.1080/07350015.2022.2097911
Dongxiao Han, Jian Huang, Yuanyuan Lin, Lei Liu, Lianqiang Qu, Liuquan Sun
Abstract In this article, we propose a robust signal recovery method for high-dimensional linear log-contrast models, when the error distribution could be heavy-tailed and asymmetric. The proposed method is built on the Huber loss with penalization. We establish the and consistency for the resulting estimator. Under conditions analogous to the irrepresentability condition and the minimum signal strength condition, we prove that the signed support of the slope parameter vector can be recovered with high probability. The finite-sample behavior of the proposed method is evaluated through simulation studies, and applications to a GDP satisfaction dataset an HIV microbiome dataset are provided.
摘要在本文中,当误差分布可能是重尾和不对称时,我们为高维线性对数对比度模型提出了一种稳健的信号恢复方法。该方法建立在Huber损失的基础上,并进行了惩罚。我们建立了结果估计量的和一致性。在类似于不可表示性条件和最小信号强度条件的条件下,我们证明了斜率参数向量的符号支持可以高概率地恢复。通过模拟研究评估了所提出方法的有限样本行为,并将其应用于GDP满意度数据集和HIV微生物组数据集。
{"title":"Robust Signal Recovery for High-Dimensional Linear Log-Contrast Models with Compositional Covariates","authors":"Dongxiao Han, Jian Huang, Yuanyuan Lin, Lei Liu, Lianqiang Qu, Liuquan Sun","doi":"10.1080/07350015.2022.2097911","DOIUrl":"https://doi.org/10.1080/07350015.2022.2097911","url":null,"abstract":"Abstract In this article, we propose a robust signal recovery method for high-dimensional linear log-contrast models, when the error distribution could be heavy-tailed and asymmetric. The proposed method is built on the Huber loss with penalization. We establish the and consistency for the resulting estimator. Under conditions analogous to the irrepresentability condition and the minimum signal strength condition, we prove that the signed support of the slope parameter vector can be recovered with high probability. The finite-sample behavior of the proposed method is evaluated through simulation studies, and applications to a GDP satisfaction dataset an HIV microbiome dataset are provided.","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"41 1","pages":"957 - 967"},"PeriodicalIF":3.0,"publicationDate":"2022-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49373682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Covariate-Assisted Community Detection in Multi-Layer Networks 多层网络中的协变量辅助社区检测
IF 3 2区 数学 Q1 ECONOMICS Pub Date : 2022-06-02 DOI: 10.1080/07350015.2022.2085726
Shi Xu, Yao Zhen, Junhui Wang
ABSTRACT Communities in multi-layer networks consist of nodes with similar connectivity patterns across all layers. This article proposes a tensor-based community detection method in multi-layer networks, which leverages available node-wise covariates to improve community detection accuracy. This is motivated by the network homophily principle, which suggests that nodes with similar covariates tend to reside in the same community. To take advantage of the node-wise covariates, the proposed method augments the multi-layer network with an additional layer constructed from the node similarity matrix with proper scaling, and conducts a Tucker decomposition of the augmented multi-layer network, yielding the spectral embedding vector of each node for community detection. Asymptotic consistencies of the proposed method in terms of community detection are established, which are also supported by numerical experiments on various synthetic networks and two real-life multi-layer networks.
摘要多层网络中的社区由跨所有层具有相似连接模式的节点组成。本文提出了一种多层网络中基于张量的社区检测方法,该方法利用可用的节点协变量来提高社区检测的准确性。这是由网络同质性原理引起的,该原理表明具有相似协变量的节点往往位于同一社区中。为了利用逐节点协变量,所提出的方法用由节点相似性矩阵构建的具有适当比例的附加层来扩充多层网络,并对扩充的多层网络进行Tucker分解,产生用于社区检测的每个节点的谱嵌入向量。建立了所提出的方法在社区检测方面的渐近一致性,并在各种合成网络和两个真实的多层网络上进行了数值实验。
{"title":"Covariate-Assisted Community Detection in Multi-Layer Networks","authors":"Shi Xu, Yao Zhen, Junhui Wang","doi":"10.1080/07350015.2022.2085726","DOIUrl":"https://doi.org/10.1080/07350015.2022.2085726","url":null,"abstract":"ABSTRACT Communities in multi-layer networks consist of nodes with similar connectivity patterns across all layers. This article proposes a tensor-based community detection method in multi-layer networks, which leverages available node-wise covariates to improve community detection accuracy. This is motivated by the network homophily principle, which suggests that nodes with similar covariates tend to reside in the same community. To take advantage of the node-wise covariates, the proposed method augments the multi-layer network with an additional layer constructed from the node similarity matrix with proper scaling, and conducts a Tucker decomposition of the augmented multi-layer network, yielding the spectral embedding vector of each node for community detection. Asymptotic consistencies of the proposed method in terms of community detection are established, which are also supported by numerical experiments on various synthetic networks and two real-life multi-layer networks.","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"41 1","pages":"915 - 926"},"PeriodicalIF":3.0,"publicationDate":"2022-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47121634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Detection of Multiple Structural Breaks in Large Covariance Matrices 大协方差矩阵中多个结构断裂的检测
IF 3 2区 数学 Q1 ECONOMICS Pub Date : 2022-05-18 DOI: 10.1080/07350015.2022.2076686
Yu-Ning Li, Degui Li, P. Fryzlewicz
ABSTRACT This article studies multiple structural breaks in large contemporaneous covariance matrices of high-dimensional time series satisfying an approximate factor model. The breaks in the second-order moment structure of the common components are due to sudden changes in either factor loadings or covariance of latent factors, requiring appropriate transformation of the factor models to facilitate estimation of the (transformed) common factors and factor loadings via the classical principal component analysis. With the estimated factors and idiosyncratic errors, an easy-to-implement CUSUM-based detection technique is introduced to consistently estimate the location and number of breaks and correctly identify whether they originate in the common or idiosyncratic error components. The algorithms of Wild Binary Segmentation for Covariance (WBS-Cov) and Wild Sparsified Binary Segmentation for Covariance (WSBS-Cov) are used to estimate breaks in the common and idiosyncratic error components, respectively. Under some technical conditions, the asymptotic properties of the proposed methodology are derived with near-optimal rates (up to a logarithmic factor) achieved for the estimated breaks. Monte Carlo simulation studies are conducted to examine the finite-sample performance of the developed method and its comparison with other existing approaches. We finally apply our method to study the contemporaneous covariance structure of daily returns of S&P 500 constituents and identify a few breaks including those occurring during the 2007–2008 financial crisis and the recent coronavirus (COVID-19) outbreak. An package “ ” is provided to implement the proposed algorithms.
摘要本文研究了满足近似因子模型的高维时间序列的大同期协方差矩阵中的多个结构断裂。公共分量的二阶矩结构的中断是由于因子载荷或潜在因子协方差的突然变化,需要对因子模型进行适当的变换,以便于通过经典主分量分析估计(变换的)公共因子和因子载荷。利用估计的因素和特殊误差,引入了一种易于实现的基于CUSUM的检测技术,以一致地估计中断的位置和数量,并正确识别它们是起源于常见还是特殊误差分量。分别使用协方差的野生二进制分割(WBS Cov)和协方差的野生稀疏二进制分割(WSBS Cov)算法来估计常见和特殊误差分量的中断。在某些技术条件下,推导出了所提出方法的渐近性质,估计断裂达到了接近最优的速率(高达对数因子)。进行了蒙特卡罗模拟研究,以检验所开发的方法的有限样本性能,并将其与其他现有方法进行比较。最后,我们应用我们的方法来研究标准普尔500指数成分股每日收益的同期协方差结构,并确定一些中断,包括2007-2008年金融危机和最近冠状病毒(新冠肺炎)爆发期间发生的中断。提供了一个包“”来实现所提出的算法。
{"title":"Detection of Multiple Structural Breaks in Large Covariance Matrices","authors":"Yu-Ning Li, Degui Li, P. Fryzlewicz","doi":"10.1080/07350015.2022.2076686","DOIUrl":"https://doi.org/10.1080/07350015.2022.2076686","url":null,"abstract":"ABSTRACT This article studies multiple structural breaks in large contemporaneous covariance matrices of high-dimensional time series satisfying an approximate factor model. The breaks in the second-order moment structure of the common components are due to sudden changes in either factor loadings or covariance of latent factors, requiring appropriate transformation of the factor models to facilitate estimation of the (transformed) common factors and factor loadings via the classical principal component analysis. With the estimated factors and idiosyncratic errors, an easy-to-implement CUSUM-based detection technique is introduced to consistently estimate the location and number of breaks and correctly identify whether they originate in the common or idiosyncratic error components. The algorithms of Wild Binary Segmentation for Covariance (WBS-Cov) and Wild Sparsified Binary Segmentation for Covariance (WSBS-Cov) are used to estimate breaks in the common and idiosyncratic error components, respectively. Under some technical conditions, the asymptotic properties of the proposed methodology are derived with near-optimal rates (up to a logarithmic factor) achieved for the estimated breaks. Monte Carlo simulation studies are conducted to examine the finite-sample performance of the developed method and its comparison with other existing approaches. We finally apply our method to study the contemporaneous covariance structure of daily returns of S&P 500 constituents and identify a few breaks including those occurring during the 2007–2008 financial crisis and the recent coronavirus (COVID-19) outbreak. An package “ ” is provided to implement the proposed algorithms.","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"41 1","pages":"846 - 861"},"PeriodicalIF":3.0,"publicationDate":"2022-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45166385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Journal of Business & Economic Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1