首页 > 最新文献

Asta-Advances in Statistical Analysis最新文献

英文 中文
Hedonic pricing modelling with unstructured predictors: an application to Italian Fashion Industry 具有非结构化预测因子的Hedonic定价模型在意大利时装业中的应用
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-12-13 DOI: 10.1007/s10182-022-00465-5
Federico Crescenzi

This study proposes a comparison of hedonic pricing models that use attributes obtained by featurizing text. We collected prices of items sold on the websites of five famous fashion producers in order to estimate hedonic pricing models that leverage the information contained in product descriptions. We mapped product descriptions to a high-dimensional feature space and compared predictive accuracy and variable selection properties of some statistical estimators that leverage sparse modelling, topic modelling and aggregated predictors, to test whether better predictive accuracy comes with an empirically consistent selection of attributes. We call this approach Hedonic Text-Regression modelling. Its novelty is that by using attributes obtained by text-mining of product descriptions, we obtain an estimate of the implicit price of the words contained therein. Empirically, all the proposed models outperformed the traditional hedonic pricing model in terms of predictive accuracy, while also providing consistent variable selection.

本研究建议对使用通过特征化文本获得的属性的享乐定价模型进行比较。我们收集了五家著名时装生产商网站上销售商品的价格,以估算利用产品描述中包含的信息的享乐定价模型。我们将产品描述映射到一个高维特征空间,并比较了一些利用稀疏建模、主题建模和聚合预测因子的统计估计器的预测准确性和变量选择特性,以检验更好的预测准确性是否来自于经验上一致的属性选择。我们称这种方法为 "河东文本回归建模"。它的新颖之处在于,通过使用对产品描述进行文本挖掘所获得的属性,我们可以估算出其中包含的词语的隐含价格。根据经验,所有建议的模型在预测准确性方面都优于传统的对冲定价模型,同时还提供了一致的变量选择。
{"title":"Hedonic pricing modelling with unstructured predictors: an application to Italian Fashion Industry","authors":"Federico Crescenzi","doi":"10.1007/s10182-022-00465-5","DOIUrl":"10.1007/s10182-022-00465-5","url":null,"abstract":"<div><p>This study proposes a comparison of hedonic pricing models that use attributes obtained by featurizing text. We collected prices of items sold on the websites of five famous fashion producers in order to estimate hedonic pricing models that leverage the information contained in product descriptions. We mapped product descriptions to a high-dimensional feature space and compared predictive accuracy and variable selection properties of some statistical estimators that leverage sparse modelling, topic modelling and aggregated predictors, to test whether better predictive accuracy comes with an empirically consistent selection of attributes. We call this approach Hedonic Text-Regression modelling. Its novelty is that by using attributes obtained by text-mining of product descriptions, we obtain an estimate of the implicit price of the words contained therein. Empirically, all the proposed models outperformed the traditional hedonic pricing model in terms of predictive accuracy, while also providing consistent variable selection.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 4","pages":"733 - 753"},"PeriodicalIF":1.4,"publicationDate":"2022-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44348728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating the Impact of Medical Care Usage on Work Absenteeism by a Trivariate Probit Model with Two Binary Endogenous Explanatory Variables 用二元内生解释变量的三元Probit模型估计医疗服务使用对工作缺勤的影响
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-10-18 DOI: 10.1007/s10182-022-00464-6
Panagiota Filippou, Giampiero Marra, Rosalba Radice, David Zimmer

The aim of this paper is to estimate the effects of seeking medical care on missing work. Specifically, our case study explores the question: Does visiting a medical provider cause an employee to miss work? To address this, we employ a model that can consistently estimate the impacts of two endogenous binary regressors. The model is based on three equations connected via a multivariate Gaussian distribution, which makes it possible to model the correlations among the equations, hence accounting for unobserved heterogeneity. Parameter estimation is reliably carried out via a trust region algorithm with analytical derivative information. We find that, observationally, having a curative visit associates with a nearly 80% increase in the probability of missing work, while having a preventive visit correlates with a smaller 13% increase in the likelihood of missing work. However, after addressing potential endogeneity, neither type of visit appears to significantly relate to missing work. That finding also applies to visits that occur during the previous year. Therefore, we conclude that the observed links between medical usage and absenteeism derive from unobserved heterogeneity, rather than direct causal channels. The modeling framework is available through the R package GJRM.

本文旨在估算就医对缺勤的影响。具体来说,我们的案例研究探讨了以下问题:就医是否会导致员工缺勤?为了解决这个问题,我们采用了一个模型,该模型可以持续估计两个内生二元回归因子的影响。该模型基于通过多元高斯分布连接起来的三个方程,这使得方程之间的相关性建模成为可能,从而考虑了未观察到的异质性。参数估计通过具有分析导数信息的信任区域算法可靠地进行。我们发现,从观察结果来看,治疗性就诊会使缺勤概率增加近 80%,而预防性就诊则会使缺勤概率增加 13%。然而,在解决了潜在的内生性问题后,这两种就诊类型似乎都与缺勤没有显著关系。这一结论也适用于上一年的就诊。因此,我们得出结论,观察到的医疗使用和缺勤之间的联系来自于未观察到的异质性,而不是直接的因果渠道。建模框架可通过 R 软件包 GJRM 获取。
{"title":"Estimating the Impact of Medical Care Usage on Work Absenteeism by a Trivariate Probit Model with Two Binary Endogenous Explanatory Variables","authors":"Panagiota Filippou,&nbsp;Giampiero Marra,&nbsp;Rosalba Radice,&nbsp;David Zimmer","doi":"10.1007/s10182-022-00464-6","DOIUrl":"10.1007/s10182-022-00464-6","url":null,"abstract":"<div><p>The aim of this paper is to estimate the effects of seeking medical care on missing work. Specifically, our case study explores the question: Does visiting a medical provider cause an employee to miss work? To address this, we employ a model that can consistently estimate the impacts of two endogenous binary regressors. The model is based on three equations connected via a multivariate Gaussian distribution, which makes it possible to model the correlations among the equations, hence accounting for unobserved heterogeneity. Parameter estimation is reliably carried out via a trust region algorithm with analytical derivative information. We find that, observationally, having a curative visit associates with a nearly 80% increase in the probability of missing work, while having a preventive visit correlates with a smaller 13% increase in the likelihood of missing work. However, after addressing potential endogeneity, neither type of visit appears to significantly relate to missing work. That finding also applies to visits that occur during the previous year. Therefore, we conclude that the observed links between medical usage and absenteeism derive from unobserved heterogeneity, rather than direct causal channels. The modeling framework is available through the <span>R</span> package <span>GJRM</span>.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 4","pages":"713 - 731"},"PeriodicalIF":1.4,"publicationDate":"2022-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42881312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Control charts for measurement error models 测量误差模型控制图
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-10-05 DOI: 10.1007/s10182-022-00462-8
Vasyl Golosnoy, Benno Hildebrandt, Steffen Köhler, Wolfgang Schmid, Miriam Isabel Seifert

We consider a linear measurement error model (MEM) with AR(1) process in the state equation which is widely used in applied research. This MEM could be equivalently re-written as ARMA(1,1) process, where the MA(1) parameter is related to the variance of measurement errors. As the MA(1) parameter is of essential importance for these linear MEMs, it is of much relevance to provide instruments for online monitoring in order to detect its possible changes. In this paper we develop control charts for online detection of such changes, i.e., from AR(1) to ARMA(1,1) and vice versa, as soon as they occur. For this purpose, we elaborate on both cumulative sum (CUSUM) and exponentially weighted moving average (EWMA) control charts and investigate their performance in a Monte Carlo simulation study. The empirical illustration of our approach is conducted based on time series of daily realized volatilities.

我们考虑的是状态方程中含有 AR(1) 过程的线性测量误差模型 (MEM),该模型在应用研究中被广泛使用。这种 MEM 可以等价地改写为 ARMA(1,1) 过程,其中 MA(1) 参数与测量误差的方差有关。由于 MA(1) 参数对这些线性 MEM 至关重要,因此提供在线监测仪器以检测其可能的变化具有重要意义。在本文中,我们开发了在线检测这种变化的控制图,即从 AR(1) 到 ARMA(1,1) 以及反之亦然。为此,我们详细阐述了累积和(CUSUM)和指数加权移动平均(EWMA)控制图,并在蒙特卡罗模拟研究中调查了它们的性能。我们根据每日已实现波动率的时间序列对我们的方法进行了实证说明。
{"title":"Control charts for measurement error models","authors":"Vasyl Golosnoy,&nbsp;Benno Hildebrandt,&nbsp;Steffen Köhler,&nbsp;Wolfgang Schmid,&nbsp;Miriam Isabel Seifert","doi":"10.1007/s10182-022-00462-8","DOIUrl":"10.1007/s10182-022-00462-8","url":null,"abstract":"<div><p>We consider a linear measurement error model (MEM) with AR(1) process in the state equation which is widely used in applied research. This MEM could be equivalently re-written as ARMA(1,1) process, where the MA(1) parameter is related to the variance of measurement errors. As the MA(1) parameter is of essential importance for these linear MEMs, it is of much relevance to provide instruments for online monitoring in order to detect its possible changes. In this paper we develop control charts for online detection of such changes, i.e., from AR(1) to ARMA(1,1) and vice versa, as soon as they occur. For this purpose, we elaborate on both cumulative sum (CUSUM) and exponentially weighted moving average (EWMA) control charts and investigate their performance in a Monte Carlo simulation study. The empirical illustration of our approach is conducted based on time series of daily realized volatilities.\u0000</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 4","pages":"693 - 712"},"PeriodicalIF":1.4,"publicationDate":"2022-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9533293/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33498201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sieve bootstrapping the memory parameter in long-range dependent stationary functional time series 筛网自举的记忆参数在长期依赖平稳函数时间序列
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-10-01 DOI: 10.1007/s10182-022-00463-7
Han Lin Shang

We consider a sieve bootstrap procedure to quantify the estimation uncertainty of long-memory parameters in stationary functional time series. We use a semiparametric local Whittle estimator to estimate the long-memory parameter. In the local Whittle estimator, discrete Fourier transform and periodogram are constructed from the first set of principal component scores via a functional principal component analysis. The sieve bootstrap procedure uses a general vector autoregressive representation of the estimated principal component scores. It generates bootstrap replicates that adequately mimic the dependence structure of the underlying stationary process. We first compute the estimated first set of principal component scores for each bootstrap replicate and then apply the semiparametric local Whittle estimator to estimate the memory parameter. By taking quantiles of the estimated memory parameters from these bootstrap replicates, we can nonparametrically construct confidence intervals of the long-memory parameter. As measured by coverage probability differences between the empirical and nominal coverage probabilities at three levels of significance, we demonstrate the advantage of using the sieve bootstrap compared to the asymptotic confidence intervals based on normality.

我们考虑了一种筛子自举法来量化平稳函数时间序列中长记忆参数估计的不确定性。我们使用半参数局部Whittle估计来估计长记忆参数。在局部Whittle估计中,通过泛函主成分分析,从第一组主成分分数构造离散傅里叶变换和周期图。筛选引导过程使用估计主成分分数的一般向量自回归表示。它产生的自举复制,充分模仿基础平稳过程的依赖结构。我们首先计算估计的每个bootstrap复制的第一组主成分分数,然后应用半参数局部Whittle估计器估计内存参数。通过从这些自举重复中取估计的记忆参数的分位数,我们可以非参数地构造长记忆参数的置信区间。通过在三个显著性水平上测量经验和名义覆盖概率之间的覆盖概率差异,我们证明了与基于正态性的渐近置信区间相比,使用筛选自举法的优势。
{"title":"Sieve bootstrapping the memory parameter in long-range dependent stationary functional time series","authors":"Han Lin Shang","doi":"10.1007/s10182-022-00463-7","DOIUrl":"10.1007/s10182-022-00463-7","url":null,"abstract":"<div><p>We consider a sieve bootstrap procedure to quantify the estimation uncertainty of long-memory parameters in stationary functional time series. We use a semiparametric local Whittle estimator to estimate the long-memory parameter. In the local Whittle estimator, discrete Fourier transform and periodogram are constructed from the first set of principal component scores via a functional principal component analysis. The sieve bootstrap procedure uses a general vector autoregressive representation of the estimated principal component scores. It generates bootstrap replicates that adequately mimic the dependence structure of the underlying stationary process. We first compute the estimated first set of principal component scores for each bootstrap replicate and then apply the semiparametric local Whittle estimator to estimate the memory parameter. By taking quantiles of the estimated memory parameters from these bootstrap replicates, we can nonparametrically construct confidence intervals of the long-memory parameter. As measured by coverage probability differences between the empirical and nominal coverage probabilities at three levels of significance, we demonstrate the advantage of using the sieve bootstrap compared to the asymptotic confidence intervals based on normality.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 3","pages":"421 - 441"},"PeriodicalIF":1.4,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-022-00463-7.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46807934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Distributional properties of continuous time processes: from CIR to bates 连续时间过程的分布性质:从CIR到bates
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-08-25 DOI: 10.1007/s10182-022-00459-3
Ostap Okhrin, Michael Rockinger, Manuel Schmid

In this paper, we compute closed-form expressions of moments and comoments for the CIR process which allows us to provide a new construction of the transition probability density based on a moment argument that differs from the historic approach. For Bates’ model with stochastic volatility and jumps, we show that finite difference approximations of higher moments such as the skewness and the kurtosis are unstable and, as a remedy, provide exact analytic formulas for log-returns. Our approach does not assume a constant mean for log-price differentials but correctly incorporates volatility resulting from Ito’s lemma. We also provide R, MATLAB, and Mathematica modules with exact implementations of the theoretical conditional and unconditional moments. These modules should prove useful for empirical research.

在本文中,我们计算了CIR过程的矩和共轭矩的闭合形式表达式,这使我们能够基于不同于历史方法的矩自变量来提供过渡概率密度的新构造。对于具有随机波动性和跳跃性的Bates模型,我们证明了偏度和峰度等高阶矩的有限差分近似是不稳定的,并且作为补救,我们提供了对数收益的精确分析公式。我们的方法没有假设对数价差的平均值不变,而是正确地结合了伊藤引理产生的波动性。我们还为R、MATLAB和Mathematica模块提供了理论条件矩和无条件矩的精确实现。这些模块应被证明对实证研究有用。
{"title":"Distributional properties of continuous time processes: from CIR to bates","authors":"Ostap Okhrin,&nbsp;Michael Rockinger,&nbsp;Manuel Schmid","doi":"10.1007/s10182-022-00459-3","DOIUrl":"10.1007/s10182-022-00459-3","url":null,"abstract":"<div><p>In this paper, we compute closed-form expressions of moments and comoments for the CIR process which allows us to provide a new construction of the transition probability density based on a moment argument that differs from the historic approach. For Bates’ model with stochastic volatility and jumps, we show that finite difference approximations of higher moments such as the skewness and the kurtosis are unstable and, as a remedy, provide exact analytic formulas for log-returns. Our approach does not assume a constant mean for log-price differentials but correctly incorporates volatility resulting from Ito’s lemma. We also provide R, MATLAB, and Mathematica modules with exact implementations of the theoretical conditional and unconditional moments. These modules should prove useful for empirical research.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 3","pages":"397 - 419"},"PeriodicalIF":1.4,"publicationDate":"2022-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-022-00459-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50046215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Hierarchical disjoint principal component analysis 层次不相交主成分分析
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-08-24 DOI: 10.1007/s10182-022-00458-4
Carlo Cavicchia, Maurizio Vichi, Giorgia Zaccaria

Dimension reduction, by means of Principal Component Analysis (PCA), is often employed to obtain a reduced set of components preserving the largest possible part of the total variance of the observed variables. Several methodologies have been proposed either to improve the interpretation of PCA results (e.g., by means of orthogonal, oblique rotations, shrinkage methods), or to model oblique components or factors with a hierarchical structure, such as in Bi-factor and High-Order Factor analyses. In this paper, we propose a new methodology, called Hierarchical Disjoint Principal Component Analysis (HierDPCA), that aims at building a hierarchy of disjoint principal components of maximum variance associated with disjoint groups of observed variables, from Q up to a unique, general one. HierDPCA also allows choosing the type of the relationship among disjoint principal components of two sequential levels, from the lowest upwards, by testing the component correlation per level and changing from a reflective to a formative approach when this correlation turns out to be not statistically significant. The methodology is formulated in a semi-parametric least-squares framework and a coordinate descent algorithm is proposed to estimate the model parameters. A simulation study and two real applications are illustrated to highlight the empirical properties of the proposed methodology.

通常采用主成分分析(PCA)的降维方法来获得保留观测变量总方差的最大可能部分的降维分量集。已经提出了几种方法来改进PCA结果的解释(例如,通过正交、倾斜旋转、收缩方法),或者用层次结构来模拟倾斜成分或因素,例如在双因素和高阶因素分析中。在本文中,我们提出了一种新的方法,称为层次不相交主成分分析(HierDPCA),旨在建立与观察变量的不相交组相关的最大方差的不相交主成分的层次,从Q到唯一的,一般的。HierDPCA还允许在两个连续水平的不相交主成分之间选择关系的类型,从最低向上,通过测试每个水平的成分相关性,当这种相关性在统计上不显著时,从反射方法转变为形成方法。该方法采用半参数最小二乘框架,并提出了一种坐标下降算法来估计模型参数。模拟研究和两个实际应用说明,以突出所提出的方法的经验性质。
{"title":"Hierarchical disjoint principal component analysis","authors":"Carlo Cavicchia,&nbsp;Maurizio Vichi,&nbsp;Giorgia Zaccaria","doi":"10.1007/s10182-022-00458-4","DOIUrl":"10.1007/s10182-022-00458-4","url":null,"abstract":"<div><p>Dimension reduction, by means of Principal Component Analysis (PCA), is often employed to obtain a reduced set of components preserving the largest possible part of the total variance of the observed variables. Several methodologies have been proposed either to improve the interpretation of PCA results (e.g., by means of orthogonal, oblique rotations, shrinkage methods), or to model oblique components or factors with a hierarchical structure, such as in Bi-factor and High-Order Factor analyses. In this paper, we propose a new methodology, called Hierarchical Disjoint Principal Component Analysis (HierDPCA), that aims at building a hierarchy of disjoint principal components of maximum variance associated with disjoint groups of observed variables, from <i>Q</i> up to a unique, general one. HierDPCA also allows choosing the type of the relationship among disjoint principal components of two sequential levels, from the lowest upwards, by testing the component correlation per level and changing from a reflective to a formative approach when this correlation turns out to be not statistically significant. The methodology is formulated in a semi-parametric least-squares framework and a coordinate descent algorithm is proposed to estimate the model parameters. A simulation study and two real applications are illustrated to highlight the empirical properties of the proposed methodology.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 3","pages":"537 - 574"},"PeriodicalIF":1.4,"publicationDate":"2022-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-022-00458-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42994839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Multiple imputation of ordinal missing not at random data 非随机数据序号缺失的多重插补
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-08-22 DOI: 10.1007/s10182-022-00461-9
Angelina Hammon

We introduce a selection model-based imputation approach to be used within the Fully Conditional Specification (FCS) framework for the Multiple Imputation (MI) of incomplete ordinal variables that are supposed to be Missing Not at Random (MNAR). Thereby, we generalise previous work on this topic which involved binary single-level and multilevel data to ordinal variables. We apply an ordered probit model with sample selection as base of our imputation algorithm. The applied model involves two equations that are modelled jointly where the first one describes the missing-data mechanism and the second one specifies the variable to be imputed. In addition, we develop a version for hierarchical data by incorporating random intercept terms in both equations. To fit this multilevel imputation model we use quadrature techniques. Two simulation studies validate the overall good performance of our single-level and multilevel imputation methods. In addition, we show its applicability to empirical data by applying it to a common research topic in educational science using data of the National Educational Panel Study (NEPS) and conducting a short sensitivity analysis. Our approach is designed to be used within the R software package mice which makes it easy to access and apply.

我们介绍了一种基于选择模型的估算方法,该方法可用于全条件规范(FCS)框架内的多重估算(MI),用于估算非随机缺失(MNAR)的不完整序数变量。因此,我们将以往涉及二进制单层次和多层次数据的工作推广到了序数变量。我们在估算算法的基础上,采用了带有样本选择功能的有序概率模型。应用的模型包括两个共同建模的等式,第一个等式描述数据缺失机制,第二个等式指定需要估算的变量。此外,通过在两个方程中加入随机截距项,我们还开发了一个适用于分层数据的版本。为了拟合这个多层次估算模型,我们使用了正交技术。两项模拟研究验证了我们的单层次和多层次估算方法的整体良好性能。此外,我们还利用国家教育面板研究(NEPS)的数据,将其应用于教育科学中的一个常见研究课题,并进行了简短的敏感性分析,从而展示了该方法对经验数据的适用性。我们的方法可在 R 软件包 mice 中使用,因此易于访问和应用。
{"title":"Multiple imputation of ordinal missing not at random data","authors":"Angelina Hammon","doi":"10.1007/s10182-022-00461-9","DOIUrl":"10.1007/s10182-022-00461-9","url":null,"abstract":"<div><p>We introduce a selection model-based imputation approach to be used within the Fully Conditional Specification (FCS) framework for the Multiple Imputation (MI) of incomplete ordinal variables that are supposed to be Missing Not at Random (MNAR). Thereby, we generalise previous work on this topic which involved binary single-level and multilevel data to ordinal variables. We apply an ordered probit model with sample selection as base of our imputation algorithm. The applied model involves two equations that are modelled jointly where the first one describes the missing-data mechanism and the second one specifies the variable to be imputed. In addition, we develop a version for hierarchical data by incorporating random intercept terms in both equations. To fit this multilevel imputation model we use quadrature techniques. Two simulation studies validate the overall good performance of our single-level and multilevel imputation methods. In addition, we show its applicability to empirical data by applying it to a common research topic in educational science using data of the National Educational Panel Study (NEPS) and conducting a short sensitivity analysis. Our approach is designed to be used within the <span>R</span> software package <span>mice</span> which makes it easy to access and apply.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 4","pages":"671 - 692"},"PeriodicalIF":1.4,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-022-00461-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46920381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing for the presence of treatment effect under selection on observables 在可观察器上选择下治疗效果的存在性测试
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-08-09 DOI: 10.1007/s10182-022-00454-8
Pier Luigi Conti, Livia De Giovanni

The evaluation of the possible effects of a treatment on an outcome plays a central role in both theoretical and applied statistical and econometrical literature. This paper focuses on nonparametric tests for possible difference in the distribution of potential outcomes due to receiving or not receiving a treatment. The approach is based on weighting observed data on the basis on the estimated propensity score. Kolmogorov–Smirnov type and Wilcoxon–Mann–Whitney type tests are constructed, and their limiting distributions are studied. Rejection regions are obtained by inverting confidence intervals. This involves the study of appropriate estimators of the limiting variance of test statistics. Approximations of quantiles via subsampling are also considered. The merits of the different tests are studied by Monte Carlo simulation. An application to the construction of tests for stochastic dominance is provided.

在理论和应用统计与经济学文献中,评估治疗对结果可能产生的影响起着核心作用。本文的重点是对接受或不接受治疗可能导致的潜在结果分布差异进行非参数检验。该方法以估计的倾向得分为基础对观测数据进行加权。构建了 Kolmogorov-Smirnov 类型和 Wilcoxon-Mann-Whitney 类型检验,并对其极限分布进行了研究。通过倒置置信区间获得拒绝区域。这涉及对检验统计量极限方差的适当估计值的研究。此外,还考虑了通过子抽样对量值进行逼近。通过蒙特卡罗模拟研究了不同检验的优点。还提供了构建随机优势检验的应用。
{"title":"Testing for the presence of treatment effect under selection on observables","authors":"Pier Luigi Conti,&nbsp;Livia De Giovanni","doi":"10.1007/s10182-022-00454-8","DOIUrl":"10.1007/s10182-022-00454-8","url":null,"abstract":"<div><p>The evaluation of the possible effects of a treatment on an outcome plays a central role in both theoretical and applied statistical and econometrical literature. This paper focuses on nonparametric tests for possible difference in the distribution of potential outcomes due to receiving or not receiving a treatment. The approach is based on weighting observed data on the basis on the estimated propensity score. Kolmogorov–Smirnov type and Wilcoxon–Mann–Whitney type tests are constructed, and their limiting distributions are studied. Rejection regions are obtained by inverting confidence intervals. This involves the study of appropriate estimators of the limiting variance of test statistics. Approximations of quantiles <i>via</i> subsampling are also considered. The merits of the different tests are studied by Monte Carlo simulation. An application to the construction of tests for stochastic dominance is provided.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 4","pages":"641 - 669"},"PeriodicalIF":1.4,"publicationDate":"2022-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-022-00454-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44295457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Authors’ response: on the role of data, statistics and decisions in a pandemic 作者的回应:关于数据、统计和决策在大流行中的作用
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-07-30 DOI: 10.1007/s10182-022-00460-w
Beate Jahn, Sarah Friedrich, Joachim Behnke, Joachim Engel, Ursula Garczarek, Ralf Münnich, Markus Pauly, Adalbert Wilhelm, Olaf Wolkenhauer, Markus Zwick, Uwe Siebert, Tim Friede
{"title":"Authors’ response: on the role of data, statistics and decisions in a pandemic","authors":"Beate Jahn,&nbsp;Sarah Friedrich,&nbsp;Joachim Behnke,&nbsp;Joachim Engel,&nbsp;Ursula Garczarek,&nbsp;Ralf Münnich,&nbsp;Markus Pauly,&nbsp;Adalbert Wilhelm,&nbsp;Olaf Wolkenhauer,&nbsp;Markus Zwick,&nbsp;Uwe Siebert,&nbsp;Tim Friede","doi":"10.1007/s10182-022-00460-w","DOIUrl":"10.1007/s10182-022-00460-w","url":null,"abstract":"","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"106 3","pages":"403 - 405"},"PeriodicalIF":1.4,"publicationDate":"2022-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-022-00460-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40612467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A new price index for multi-period and multilateral comparisons 用于多时期和多边比较的新价格指数
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-07-12 DOI: 10.1007/s10182-022-00457-5
Mario Faliva, Consuelo Rubina Nava, Maria Grazia Zoia

Within the stochastic approach, this paper establishes a closed-form solution to the price index problem for an arbitrary number of periods or countries. The index’s reference basket merges the intersections of all couples of baskets in all periods/countries and provides an effective commodity coverage. Under spherical regression errors, the index satisfies the Geary–Khamis equation system and, as such, offers a general and compact representation of the latter as well as the inferential framework as a dowry. Furthermore, by relaxing sphericalness in favor of a more realistic assumption of commodity-dependent variances, a broader result is achieved. The solution to the price index problem thus obtained encompasses the Geary–Khamis formulation and sows the seeds to further advances.

在随机方法中,本文为任意数量的时期或国家建立了价格指数问题的闭式解决方案。该指数的参考篮子合并了所有时期/国家的所有篮子的交叉点,并提供了有效的商品覆盖范围。在球形回归误差条件下,该指数满足 Geary-Khamis 方程系统,因此为后者以及作为嫁妆的推论框架提供了通用而紧凑的表示方法。此外,通过放宽球形性,转而采用更现实的商品依赖性方差假设,还能获得更广泛的结果。由此获得的价格指数问题解决方案包含了 Geary-Khamis 公式,并为进一步的发展播下了种子。
{"title":"A new price index for multi-period and multilateral comparisons","authors":"Mario Faliva,&nbsp;Consuelo Rubina Nava,&nbsp;Maria Grazia Zoia","doi":"10.1007/s10182-022-00457-5","DOIUrl":"10.1007/s10182-022-00457-5","url":null,"abstract":"<div><p>Within the stochastic approach, this paper establishes a closed-form solution to the price index problem for an arbitrary number of periods or countries. The index’s reference basket merges the intersections of all couples of baskets in all periods/countries and provides an effective commodity coverage. Under spherical regression errors, the index satisfies the Geary–Khamis equation system and, as such, offers a general and compact representation of the latter as well as the inferential framework as a dowry. Furthermore, by relaxing sphericalness in favor of a more realistic assumption of commodity-dependent variances, a broader result is achieved. The solution to the price index problem thus obtained encompasses the Geary–Khamis formulation and sows the seeds to further advances.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 4","pages":"621 - 640"},"PeriodicalIF":1.4,"publicationDate":"2022-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-022-00457-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49106042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Asta-Advances in Statistical Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1