首页 > 最新文献

Asta-Advances in Statistical Analysis最新文献

英文 中文
A dynamic causal modeling of the second outbreak of COVID-19 in Italy 意大利 COVID-19 第二次爆发的动态因果模型。
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-02-07 DOI: 10.1007/s10182-023-00469-9
Massimo Bilancia, Domenico Vitale, Fabio Manca, Paola Perchinunno, Luigi Santacroce

While the vaccination campaign against COVID-19 is having its positive impact, we retrospectively analyze the causal impact of some decisions made by the Italian government on the second outbreak of the SARS-CoV-2 pandemic in Italy, when no vaccine was available. First, we analyze the causal impact of reopenings after the first lockdown in 2020. In addition, we also analyze the impact of reopening schools in September 2020. Our results provide an unprecedented opportunity to evaluate the causal relationship between the relaxation of restrictions and the transmission in the community of a highly contagious respiratory virus that causes severe illness in the absence of prophylactic vaccination programs. We present a purely data-analytic approach based on a Bayesian methodology and discuss possible interpretations of the results obtained and implications for policy makers.

在 COVID-19 疫苗接种活动产生积极影响的同时,我们回顾性地分析了意大利政府在第二次 SARS-CoV-2 大流行爆发(当时还没有疫苗)时所做的一些决策的因果影响。首先,我们分析了 2020 年第一次封锁后重新开放的因果影响。此外,我们还分析了 2020 年 9 月重新开放学校的影响。我们的研究结果提供了一个前所未有的机会,可以评估放宽限制与在没有预防性疫苗接种计划的情况下会导致严重疾病的高传染性呼吸道病毒在社区传播之间的因果关系。我们提出了一种基于贝叶斯方法的纯数据分析方法,并讨论了对所获结果的可能解释以及对政策制定者的影响。
{"title":"A dynamic causal modeling of the second outbreak of COVID-19 in Italy","authors":"Massimo Bilancia,&nbsp;Domenico Vitale,&nbsp;Fabio Manca,&nbsp;Paola Perchinunno,&nbsp;Luigi Santacroce","doi":"10.1007/s10182-023-00469-9","DOIUrl":"10.1007/s10182-023-00469-9","url":null,"abstract":"<div><p>While the vaccination campaign against COVID-19 is having its positive impact, we retrospectively analyze the causal impact of some decisions made by the Italian government on the second outbreak of the SARS-CoV-2 pandemic in Italy, when no vaccine was available. First, we analyze the causal impact of reopenings after the first lockdown in 2020. In addition, we also analyze the impact of reopening schools in September 2020. Our results provide an unprecedented opportunity to evaluate the causal relationship between the relaxation of restrictions and the transmission in the community of a highly contagious respiratory virus that causes severe illness in the absence of prophylactic vaccination programs. We present a purely data-analytic approach based on a Bayesian methodology and discuss possible interpretations of the results obtained and implications for policy makers.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"108 1","pages":"1 - 30"},"PeriodicalIF":1.4,"publicationDate":"2023-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9904269/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10712587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Left-truncated health insurance claims data: theoretical review and empirical application 左截断医疗保险理赔数据:理论回顾与实证应用
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-02-02 DOI: 10.1007/s10182-023-00471-1
Rafael Weißbach, Achim Dörre, Dominik Wied, Gabriele Doblhammer, Anne Fink

From the inventory of the health insurer AOK in 2004, we draw a sample of a quarter million people and follow each person’s health claims continuously until 2013. Our aim is to estimate the effect of a stroke on the dementia onset probability for Germans born in the first half of the 20th century. People deceased before 2004 are randomly left-truncated, and especially their number is unknown. Filtrations, modelling the missing data, enable circumventing the unknown number of truncated persons by using a conditional likelihood. Dementia onset after 2013 is a fixed right-censoring event. For each observed health history, Jacod’s formula yields its conditional likelihood contribution. Asymptotic normality of the estimated intensities is derived, related to a sample size definition including the number of truncated people. The standard error results from the asymptotic normality and is easily computable, despite the unknown sample size. The claims data reveal that after a stroke, with time measured in years, the intensity of dementia onset increases from 0.02 to 0.07. Using the independence of the two estimated intensities, a 95% confidence interval for their difference is [0.053, 0.057]. The effect halves when we extend the analysis to an age-inhomogeneous model, but does not change further when we additionally adjust for multi-morbidity.

我们从 2004 年医疗保险公司 AOK 的库存中抽取了 25 万人作为样本,并对每个人的医疗索赔进行了持续跟踪,直至 2013 年。我们的目的是估计中风对 20 世纪上半叶出生的德国人痴呆症发病概率的影响。2004 年之前去世的人被随机左截断,尤其是他们的人数未知。利用条件似然法对缺失数据进行过滤建模,可以规避截断人数未知的问题。2013 年后发病的痴呆症患者是一个固定的右截断事件。对于每个观察到的健康史,Jacod 公式都能得出其条件似然贡献。估计强度的渐近正态性与包括截断人数在内的样本量定义有关。标准误差由渐近正态性得出,尽管样本量未知,但很容易计算。理赔数据显示,中风后,随着时间的推移(以年为单位),痴呆症发病强度从 0.02 增加到 0.07。利用两个估计强度的独立性,它们之间差异的 95% 置信区间为 [0.053, 0.057]。当我们将分析扩展到年龄同质性模型时,该效应减半,但当我们对多病症进行额外调整时,该效应没有进一步变化。
{"title":"Left-truncated health insurance claims data: theoretical review and empirical application","authors":"Rafael Weißbach,&nbsp;Achim Dörre,&nbsp;Dominik Wied,&nbsp;Gabriele Doblhammer,&nbsp;Anne Fink","doi":"10.1007/s10182-023-00471-1","DOIUrl":"10.1007/s10182-023-00471-1","url":null,"abstract":"<div><p>From the inventory of the health insurer AOK in 2004, we draw a sample of a quarter million people and follow each person’s health claims continuously until 2013. Our aim is to estimate the effect of a stroke on the dementia onset probability for Germans born in the first half of the 20th century. People deceased before 2004 are randomly left-truncated, and especially their number is unknown. Filtrations, modelling the missing data, enable circumventing the unknown number of truncated persons by using a conditional likelihood. Dementia onset after 2013 is a fixed right-censoring event. For each observed health history, Jacod’s formula yields its conditional likelihood contribution. Asymptotic normality of the estimated intensities is derived, related to a sample size definition including the number of truncated people. The standard error results from the asymptotic normality and is easily computable, despite the unknown sample size. The claims data reveal that after a stroke, with time measured in years, the intensity of dementia onset increases from 0.02 to 0.07. Using the independence of the two estimated intensities, a 95% confidence interval for their difference is [0.053, 0.057]. The effect halves when we extend the analysis to an age-inhomogeneous model, but does not change further when we additionally adjust for multi-morbidity.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"108 1","pages":"31 - 68"},"PeriodicalIF":1.4,"publicationDate":"2023-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-023-00471-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42282189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical guarantees for sparse deep learning 稀疏深度学习的统计保障
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-24 DOI: 10.1007/s10182-022-00467-3
Johannes Lederer

Neural networks are becoming increasingly popular in applications, but our mathematical understanding of their potential and limitations is still limited. In this paper, we further this understanding by developing statistical guarantees for sparse deep learning. In contrast to previous work, we consider different types of sparsity, such as few active connections, few active nodes, and other norm-based types of sparsity. Moreover, our theories cover important aspects that previous theories have neglected, such as multiple outputs, regularization, and (ell_{2})-loss. The guarantees have a mild dependence on network widths and depths, which means that they support the application of sparse but wide and deep networks from a statistical perspective. Some of the concepts and tools that we use in our derivations are uncommon in deep learning and, hence, might be of additional interest.

神经网络在应用中越来越受欢迎,但我们对其潜力和局限性的数学理解仍然有限。在本文中,我们通过开发稀疏深度学习的统计保证,进一步加深了对这一问题的理解。与之前的工作不同,我们考虑了不同类型的稀疏性,如很少的活动连接、很少的活动节点以及其他基于规范的稀疏性类型。此外,我们的理论还涵盖了以往理论所忽略的重要方面,如多重输出、正则化和(ell_{2})损失。这些保证对网络宽度和深度有温和的依赖性,这意味着它们支持从统计学角度应用稀疏但宽而深的网络。我们在推导中使用的一些概念和工具在深度学习中并不常见,因此可能会引起额外的兴趣。
{"title":"Statistical guarantees for sparse deep learning","authors":"Johannes Lederer","doi":"10.1007/s10182-022-00467-3","DOIUrl":"10.1007/s10182-022-00467-3","url":null,"abstract":"<div><p>Neural networks are becoming increasingly popular in applications, but our mathematical understanding of their potential and limitations is still limited. In this paper, we further this understanding by developing statistical guarantees for sparse deep learning. In contrast to previous work, we consider different types of sparsity, such as few active connections, few active nodes, and other norm-based types of sparsity. Moreover, our theories cover important aspects that previous theories have neglected, such as multiple outputs, regularization, and <span>(ell_{2})</span>-loss. The guarantees have a mild dependence on network widths and depths, which means that they support the application of sparse but wide and deep networks from a statistical perspective. Some of the concepts and tools that we use in our derivations are uncommon in deep learning and, hence, might be of additional interest.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"108 2","pages":"231 - 258"},"PeriodicalIF":1.4,"publicationDate":"2023-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-022-00467-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136118419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Addressing non-normality in multivariate analysis using the t-distribution 利用t分布解决多元分析中的非正态性
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-21 DOI: 10.1007/s10182-022-00468-2
Felipe Osorio, Manuel Galea, Claudio Henríquez, Reinaldo Arellano-Valle

The main aim of this paper is to propose a set of tools for assessing non-normality taking into consideration the class of multivariate t-distributions. Assuming second moment existence, we consider a reparameterized version of the usual t distribution, so that the scale matrix coincides with covariance matrix of the distribution. We use the local influence procedure and the Kullback–Leibler divergence measure to propose quantitative methods to evaluate deviations from the normality assumption. In addition, the possible non-normality due to the presence of both skewness and heavy tails is also explored. Our findings based on two real datasets are complemented by a simulation study to evaluate the performance of the proposed methodology on finite samples.

本文的主要目的是提出一套评估非正态性的工具,同时考虑到多元 t 分布的类别。假定第二矩存在,我们考虑了通常 t 分布的重参数化版本,从而使尺度矩阵与分布的协方差矩阵重合。我们使用局部影响程序和库尔贝克-莱布勒发散度量,提出了评估正态性假设偏差的定量方法。此外,我们还探讨了由于偏斜和重尾的存在而可能导致的非正态性。我们基于两个真实数据集的研究结果通过模拟研究得到了补充,以评估所提出的方法在有限样本上的性能。
{"title":"Addressing non-normality in multivariate analysis using the t-distribution","authors":"Felipe Osorio,&nbsp;Manuel Galea,&nbsp;Claudio Henríquez,&nbsp;Reinaldo Arellano-Valle","doi":"10.1007/s10182-022-00468-2","DOIUrl":"10.1007/s10182-022-00468-2","url":null,"abstract":"<div><p>The main aim of this paper is to propose a set of tools for assessing non-normality taking into consideration the class of multivariate <i>t</i>-distributions. Assuming second moment existence, we consider a reparameterized version of the usual <i>t</i> distribution, so that the scale matrix coincides with covariance matrix of the distribution. We use the local influence procedure and the Kullback–Leibler divergence measure to propose quantitative methods to evaluate deviations from the normality assumption. In addition, the possible non-normality due to the presence of both skewness and heavy tails is also explored. Our findings based on two real datasets are complemented by a simulation study to evaluate the performance of the proposed methodology on finite samples.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 4","pages":"785 - 813"},"PeriodicalIF":1.4,"publicationDate":"2023-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46365758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian ridge regression for survival data based on a vine copula-based prior 基于vine copula先验的生存数据贝叶斯脊回归
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-12-30 DOI: 10.1007/s10182-022-00466-4
Hirofumi Michimae, Takeshi Emura

Ridge regression estimators can be interpreted as a Bayesian posterior mean (or mode) when the regression coefficients follow multivariate normal prior. However, the multivariate normal prior may not give efficient posterior estimates for regression coefficients, especially in the presence of interaction terms. In this paper, the vine copula-based priors are proposed for Bayesian ridge estimators under the Cox proportional hazards model. The semiparametric Cox models are built on the posterior density under two likelihoods: Cox’s partial likelihood and the full likelihood under the gamma process prior. The simulations show that the full likelihood is generally more efficient and stable for estimating regression coefficients than the partial likelihood. We also show via simulations and a data example that the Archimedean copula priors (the Clayton and Gumbel copula) are superior to the multivariate normal prior and the Gaussian copula prior.

当回归系数遵循多元正态先验时,岭回归估计值可解释为贝叶斯后验均值(或模式)。然而,多元正态先验可能无法给出有效的回归系数后验估计值,尤其是在存在交互项的情况下。本文针对 Cox 比例危险模型下的贝叶斯脊估计器提出了基于藤状协方差的先验。半参数 Cox 模型建立在两种似然下的后验密度上:Cox 部分似然和伽玛过程先验下的完全似然。模拟结果表明,在估计回归系数时,完全似然通常比部分似然更有效、更稳定。我们还通过模拟和一个数据示例表明,阿基米德协程先验(克莱顿协程和 Gumbel 协程)优于多元正态先验和高斯协程先验。
{"title":"Bayesian ridge regression for survival data based on a vine copula-based prior","authors":"Hirofumi Michimae,&nbsp;Takeshi Emura","doi":"10.1007/s10182-022-00466-4","DOIUrl":"10.1007/s10182-022-00466-4","url":null,"abstract":"<div><p>Ridge regression estimators can be interpreted as a Bayesian posterior mean (or mode) when the regression coefficients follow multivariate normal prior. However, the multivariate normal prior may not give efficient posterior estimates for regression coefficients, especially in the presence of interaction terms. In this paper, the vine copula-based priors are proposed for Bayesian ridge estimators under the Cox proportional hazards model. The semiparametric Cox models are built on the posterior density under two likelihoods: Cox’s partial likelihood and the full likelihood under the gamma process prior. The simulations show that the full likelihood is generally more efficient and stable for estimating regression coefficients than the partial likelihood. We also show via simulations and a data example that the Archimedean copula priors (the Clayton and Gumbel copula) are superior to the multivariate normal prior and the Gaussian copula prior.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 4","pages":"755 - 784"},"PeriodicalIF":1.4,"publicationDate":"2022-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47123911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hedonic pricing modelling with unstructured predictors: an application to Italian Fashion Industry 具有非结构化预测因子的Hedonic定价模型在意大利时装业中的应用
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-12-13 DOI: 10.1007/s10182-022-00465-5
Federico Crescenzi

This study proposes a comparison of hedonic pricing models that use attributes obtained by featurizing text. We collected prices of items sold on the websites of five famous fashion producers in order to estimate hedonic pricing models that leverage the information contained in product descriptions. We mapped product descriptions to a high-dimensional feature space and compared predictive accuracy and variable selection properties of some statistical estimators that leverage sparse modelling, topic modelling and aggregated predictors, to test whether better predictive accuracy comes with an empirically consistent selection of attributes. We call this approach Hedonic Text-Regression modelling. Its novelty is that by using attributes obtained by text-mining of product descriptions, we obtain an estimate of the implicit price of the words contained therein. Empirically, all the proposed models outperformed the traditional hedonic pricing model in terms of predictive accuracy, while also providing consistent variable selection.

本研究建议对使用通过特征化文本获得的属性的享乐定价模型进行比较。我们收集了五家著名时装生产商网站上销售商品的价格,以估算利用产品描述中包含的信息的享乐定价模型。我们将产品描述映射到一个高维特征空间,并比较了一些利用稀疏建模、主题建模和聚合预测因子的统计估计器的预测准确性和变量选择特性,以检验更好的预测准确性是否来自于经验上一致的属性选择。我们称这种方法为 "河东文本回归建模"。它的新颖之处在于,通过使用对产品描述进行文本挖掘所获得的属性,我们可以估算出其中包含的词语的隐含价格。根据经验,所有建议的模型在预测准确性方面都优于传统的对冲定价模型,同时还提供了一致的变量选择。
{"title":"Hedonic pricing modelling with unstructured predictors: an application to Italian Fashion Industry","authors":"Federico Crescenzi","doi":"10.1007/s10182-022-00465-5","DOIUrl":"10.1007/s10182-022-00465-5","url":null,"abstract":"<div><p>This study proposes a comparison of hedonic pricing models that use attributes obtained by featurizing text. We collected prices of items sold on the websites of five famous fashion producers in order to estimate hedonic pricing models that leverage the information contained in product descriptions. We mapped product descriptions to a high-dimensional feature space and compared predictive accuracy and variable selection properties of some statistical estimators that leverage sparse modelling, topic modelling and aggregated predictors, to test whether better predictive accuracy comes with an empirically consistent selection of attributes. We call this approach Hedonic Text-Regression modelling. Its novelty is that by using attributes obtained by text-mining of product descriptions, we obtain an estimate of the implicit price of the words contained therein. Empirically, all the proposed models outperformed the traditional hedonic pricing model in terms of predictive accuracy, while also providing consistent variable selection.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 4","pages":"733 - 753"},"PeriodicalIF":1.4,"publicationDate":"2022-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44348728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating the Impact of Medical Care Usage on Work Absenteeism by a Trivariate Probit Model with Two Binary Endogenous Explanatory Variables 用二元内生解释变量的三元Probit模型估计医疗服务使用对工作缺勤的影响
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-10-18 DOI: 10.1007/s10182-022-00464-6
Panagiota Filippou, Giampiero Marra, Rosalba Radice, David Zimmer

The aim of this paper is to estimate the effects of seeking medical care on missing work. Specifically, our case study explores the question: Does visiting a medical provider cause an employee to miss work? To address this, we employ a model that can consistently estimate the impacts of two endogenous binary regressors. The model is based on three equations connected via a multivariate Gaussian distribution, which makes it possible to model the correlations among the equations, hence accounting for unobserved heterogeneity. Parameter estimation is reliably carried out via a trust region algorithm with analytical derivative information. We find that, observationally, having a curative visit associates with a nearly 80% increase in the probability of missing work, while having a preventive visit correlates with a smaller 13% increase in the likelihood of missing work. However, after addressing potential endogeneity, neither type of visit appears to significantly relate to missing work. That finding also applies to visits that occur during the previous year. Therefore, we conclude that the observed links between medical usage and absenteeism derive from unobserved heterogeneity, rather than direct causal channels. The modeling framework is available through the R package GJRM.

本文旨在估算就医对缺勤的影响。具体来说,我们的案例研究探讨了以下问题:就医是否会导致员工缺勤?为了解决这个问题,我们采用了一个模型,该模型可以持续估计两个内生二元回归因子的影响。该模型基于通过多元高斯分布连接起来的三个方程,这使得方程之间的相关性建模成为可能,从而考虑了未观察到的异质性。参数估计通过具有分析导数信息的信任区域算法可靠地进行。我们发现,从观察结果来看,治疗性就诊会使缺勤概率增加近 80%,而预防性就诊则会使缺勤概率增加 13%。然而,在解决了潜在的内生性问题后,这两种就诊类型似乎都与缺勤没有显著关系。这一结论也适用于上一年的就诊。因此,我们得出结论,观察到的医疗使用和缺勤之间的联系来自于未观察到的异质性,而不是直接的因果渠道。建模框架可通过 R 软件包 GJRM 获取。
{"title":"Estimating the Impact of Medical Care Usage on Work Absenteeism by a Trivariate Probit Model with Two Binary Endogenous Explanatory Variables","authors":"Panagiota Filippou,&nbsp;Giampiero Marra,&nbsp;Rosalba Radice,&nbsp;David Zimmer","doi":"10.1007/s10182-022-00464-6","DOIUrl":"10.1007/s10182-022-00464-6","url":null,"abstract":"<div><p>The aim of this paper is to estimate the effects of seeking medical care on missing work. Specifically, our case study explores the question: Does visiting a medical provider cause an employee to miss work? To address this, we employ a model that can consistently estimate the impacts of two endogenous binary regressors. The model is based on three equations connected via a multivariate Gaussian distribution, which makes it possible to model the correlations among the equations, hence accounting for unobserved heterogeneity. Parameter estimation is reliably carried out via a trust region algorithm with analytical derivative information. We find that, observationally, having a curative visit associates with a nearly 80% increase in the probability of missing work, while having a preventive visit correlates with a smaller 13% increase in the likelihood of missing work. However, after addressing potential endogeneity, neither type of visit appears to significantly relate to missing work. That finding also applies to visits that occur during the previous year. Therefore, we conclude that the observed links between medical usage and absenteeism derive from unobserved heterogeneity, rather than direct causal channels. The modeling framework is available through the <span>R</span> package <span>GJRM</span>.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 4","pages":"713 - 731"},"PeriodicalIF":1.4,"publicationDate":"2022-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42881312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Control charts for measurement error models 测量误差模型控制图
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-10-05 DOI: 10.1007/s10182-022-00462-8
Vasyl Golosnoy, Benno Hildebrandt, Steffen Köhler, Wolfgang Schmid, Miriam Isabel Seifert

We consider a linear measurement error model (MEM) with AR(1) process in the state equation which is widely used in applied research. This MEM could be equivalently re-written as ARMA(1,1) process, where the MA(1) parameter is related to the variance of measurement errors. As the MA(1) parameter is of essential importance for these linear MEMs, it is of much relevance to provide instruments for online monitoring in order to detect its possible changes. In this paper we develop control charts for online detection of such changes, i.e., from AR(1) to ARMA(1,1) and vice versa, as soon as they occur. For this purpose, we elaborate on both cumulative sum (CUSUM) and exponentially weighted moving average (EWMA) control charts and investigate their performance in a Monte Carlo simulation study. The empirical illustration of our approach is conducted based on time series of daily realized volatilities.

我们考虑的是状态方程中含有 AR(1) 过程的线性测量误差模型 (MEM),该模型在应用研究中被广泛使用。这种 MEM 可以等价地改写为 ARMA(1,1) 过程,其中 MA(1) 参数与测量误差的方差有关。由于 MA(1) 参数对这些线性 MEM 至关重要,因此提供在线监测仪器以检测其可能的变化具有重要意义。在本文中,我们开发了在线检测这种变化的控制图,即从 AR(1) 到 ARMA(1,1) 以及反之亦然。为此,我们详细阐述了累积和(CUSUM)和指数加权移动平均(EWMA)控制图,并在蒙特卡罗模拟研究中调查了它们的性能。我们根据每日已实现波动率的时间序列对我们的方法进行了实证说明。
{"title":"Control charts for measurement error models","authors":"Vasyl Golosnoy,&nbsp;Benno Hildebrandt,&nbsp;Steffen Köhler,&nbsp;Wolfgang Schmid,&nbsp;Miriam Isabel Seifert","doi":"10.1007/s10182-022-00462-8","DOIUrl":"10.1007/s10182-022-00462-8","url":null,"abstract":"<div><p>We consider a linear measurement error model (MEM) with AR(1) process in the state equation which is widely used in applied research. This MEM could be equivalently re-written as ARMA(1,1) process, where the MA(1) parameter is related to the variance of measurement errors. As the MA(1) parameter is of essential importance for these linear MEMs, it is of much relevance to provide instruments for online monitoring in order to detect its possible changes. In this paper we develop control charts for online detection of such changes, i.e., from AR(1) to ARMA(1,1) and vice versa, as soon as they occur. For this purpose, we elaborate on both cumulative sum (CUSUM) and exponentially weighted moving average (EWMA) control charts and investigate their performance in a Monte Carlo simulation study. The empirical illustration of our approach is conducted based on time series of daily realized volatilities.\u0000</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 4","pages":"693 - 712"},"PeriodicalIF":1.4,"publicationDate":"2022-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9533293/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33498201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sieve bootstrapping the memory parameter in long-range dependent stationary functional time series 筛网自举的记忆参数在长期依赖平稳函数时间序列
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-10-01 DOI: 10.1007/s10182-022-00463-7
Han Lin Shang

We consider a sieve bootstrap procedure to quantify the estimation uncertainty of long-memory parameters in stationary functional time series. We use a semiparametric local Whittle estimator to estimate the long-memory parameter. In the local Whittle estimator, discrete Fourier transform and periodogram are constructed from the first set of principal component scores via a functional principal component analysis. The sieve bootstrap procedure uses a general vector autoregressive representation of the estimated principal component scores. It generates bootstrap replicates that adequately mimic the dependence structure of the underlying stationary process. We first compute the estimated first set of principal component scores for each bootstrap replicate and then apply the semiparametric local Whittle estimator to estimate the memory parameter. By taking quantiles of the estimated memory parameters from these bootstrap replicates, we can nonparametrically construct confidence intervals of the long-memory parameter. As measured by coverage probability differences between the empirical and nominal coverage probabilities at three levels of significance, we demonstrate the advantage of using the sieve bootstrap compared to the asymptotic confidence intervals based on normality.

我们考虑了一种筛子自举法来量化平稳函数时间序列中长记忆参数估计的不确定性。我们使用半参数局部Whittle估计来估计长记忆参数。在局部Whittle估计中,通过泛函主成分分析,从第一组主成分分数构造离散傅里叶变换和周期图。筛选引导过程使用估计主成分分数的一般向量自回归表示。它产生的自举复制,充分模仿基础平稳过程的依赖结构。我们首先计算估计的每个bootstrap复制的第一组主成分分数,然后应用半参数局部Whittle估计器估计内存参数。通过从这些自举重复中取估计的记忆参数的分位数,我们可以非参数地构造长记忆参数的置信区间。通过在三个显著性水平上测量经验和名义覆盖概率之间的覆盖概率差异,我们证明了与基于正态性的渐近置信区间相比,使用筛选自举法的优势。
{"title":"Sieve bootstrapping the memory parameter in long-range dependent stationary functional time series","authors":"Han Lin Shang","doi":"10.1007/s10182-022-00463-7","DOIUrl":"10.1007/s10182-022-00463-7","url":null,"abstract":"<div><p>We consider a sieve bootstrap procedure to quantify the estimation uncertainty of long-memory parameters in stationary functional time series. We use a semiparametric local Whittle estimator to estimate the long-memory parameter. In the local Whittle estimator, discrete Fourier transform and periodogram are constructed from the first set of principal component scores via a functional principal component analysis. The sieve bootstrap procedure uses a general vector autoregressive representation of the estimated principal component scores. It generates bootstrap replicates that adequately mimic the dependence structure of the underlying stationary process. We first compute the estimated first set of principal component scores for each bootstrap replicate and then apply the semiparametric local Whittle estimator to estimate the memory parameter. By taking quantiles of the estimated memory parameters from these bootstrap replicates, we can nonparametrically construct confidence intervals of the long-memory parameter. As measured by coverage probability differences between the empirical and nominal coverage probabilities at three levels of significance, we demonstrate the advantage of using the sieve bootstrap compared to the asymptotic confidence intervals based on normality.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 3","pages":"421 - 441"},"PeriodicalIF":1.4,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-022-00463-7.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46807934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Distributional properties of continuous time processes: from CIR to bates 连续时间过程的分布性质:从CIR到bates
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-08-25 DOI: 10.1007/s10182-022-00459-3
Ostap Okhrin, Michael Rockinger, Manuel Schmid

In this paper, we compute closed-form expressions of moments and comoments for the CIR process which allows us to provide a new construction of the transition probability density based on a moment argument that differs from the historic approach. For Bates’ model with stochastic volatility and jumps, we show that finite difference approximations of higher moments such as the skewness and the kurtosis are unstable and, as a remedy, provide exact analytic formulas for log-returns. Our approach does not assume a constant mean for log-price differentials but correctly incorporates volatility resulting from Ito’s lemma. We also provide R, MATLAB, and Mathematica modules with exact implementations of the theoretical conditional and unconditional moments. These modules should prove useful for empirical research.

在本文中,我们计算了CIR过程的矩和共轭矩的闭合形式表达式,这使我们能够基于不同于历史方法的矩自变量来提供过渡概率密度的新构造。对于具有随机波动性和跳跃性的Bates模型,我们证明了偏度和峰度等高阶矩的有限差分近似是不稳定的,并且作为补救,我们提供了对数收益的精确分析公式。我们的方法没有假设对数价差的平均值不变,而是正确地结合了伊藤引理产生的波动性。我们还为R、MATLAB和Mathematica模块提供了理论条件矩和无条件矩的精确实现。这些模块应被证明对实证研究有用。
{"title":"Distributional properties of continuous time processes: from CIR to bates","authors":"Ostap Okhrin,&nbsp;Michael Rockinger,&nbsp;Manuel Schmid","doi":"10.1007/s10182-022-00459-3","DOIUrl":"10.1007/s10182-022-00459-3","url":null,"abstract":"<div><p>In this paper, we compute closed-form expressions of moments and comoments for the CIR process which allows us to provide a new construction of the transition probability density based on a moment argument that differs from the historic approach. For Bates’ model with stochastic volatility and jumps, we show that finite difference approximations of higher moments such as the skewness and the kurtosis are unstable and, as a remedy, provide exact analytic formulas for log-returns. Our approach does not assume a constant mean for log-price differentials but correctly incorporates volatility resulting from Ito’s lemma. We also provide R, MATLAB, and Mathematica modules with exact implementations of the theoretical conditional and unconditional moments. These modules should prove useful for empirical research.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"107 3","pages":"397 - 419"},"PeriodicalIF":1.4,"publicationDate":"2022-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-022-00459-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50046215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Asta-Advances in Statistical Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1