首页 > 最新文献

Asta-Advances in Statistical Analysis最新文献

英文 中文
Sequential estimation of multivariate factor stochastic volatility models 多因素随机波动模型的序贯估计
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-08-01 DOI: 10.1007/s10182-025-00536-3
Christian Mücher, Giorgio Calzolari, Roxana Halbleib

We provide the first “frequentist” method to estimate the parameters of multivariate stochastic volatility models with latent factor structures to capture the time-varying variance–covariance of financial returns. These models alleviate the standard curse of dimensionality, allowing the number of parameters to increase only linearly with the number of series. Although theoretically very appealing, they have only found limited practical application due to huge computational burdens. Our estimation method is simple in implementation as it consists of two steps: first, we estimate the loadings and the unconditional variances by maximum likelihood, and then, we use the efficient method of moments to estimate the parameters of the stochastic volatility structure with the generalised autoregressive conditional heteroskedasticity (GARCH) auxiliary models. In a comprehensive Monte Carlo study, we show the good performance of our method to estimate the parameters of interest accurately. The simulation study and an application to the daily returns on 148 stocks in the cross-sectional dimension provide sound evidence on the computational feasibility of the method proposed and its application.

我们提供了第一个“频率”方法来估计具有潜在因素结构的多变量随机波动模型的参数,以捕获金融收益的时变方差-协方差。这些模型减轻了标准的维数诅咒,允许参数的数量只随序列的数量线性增加。虽然理论上很有吸引力,但由于巨大的计算负担,它们只找到了有限的实际应用。我们的估计方法实现简单,它由两个步骤组成:首先,我们通过极大似然估计负荷和无条件方差,然后,我们使用有效的矩量方法与广义自回归条件异方差(GARCH)辅助模型估计随机波动结构的参数。在一个全面的蒙特卡罗研究中,我们证明了我们的方法在准确估计感兴趣参数方面的良好性能。对148只股票在横截面上的日收益进行了模拟研究和应用,为该方法的计算可行性及其应用提供了有力的证据。
{"title":"Sequential estimation of multivariate factor stochastic volatility models","authors":"Christian Mücher,&nbsp;Giorgio Calzolari,&nbsp;Roxana Halbleib","doi":"10.1007/s10182-025-00536-3","DOIUrl":"10.1007/s10182-025-00536-3","url":null,"abstract":"<div><p>We provide the first “frequentist” method to estimate the parameters of multivariate stochastic volatility models with latent factor structures to capture the time-varying variance–covariance of financial returns. These models alleviate the standard curse of dimensionality, allowing the number of parameters to increase only linearly with the number of series. Although theoretically very appealing, they have only found limited practical application due to huge computational burdens. Our estimation method is simple in implementation as it consists of two steps: first, we estimate the loadings and the unconditional variances by maximum likelihood, and then, we use the efficient method of moments to estimate the parameters of the stochastic volatility structure with the generalised autoregressive conditional heteroskedasticity (GARCH) auxiliary models. In a comprehensive Monte Carlo study, we show the good performance of our method to estimate the parameters of interest accurately. The simulation study and an application to the daily returns on 148 stocks in the cross-sectional dimension provide sound evidence on the computational feasibility of the method proposed and its application.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"110 1","pages":"41 - 63"},"PeriodicalIF":1.4,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-025-00536-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147335964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clustering for ranking multivariate data by Linear Ordered Partitions 基于线性有序分区的多变量数据排序聚类
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-07-21 DOI: 10.1007/s10182-025-00534-5
Mariaelena Bottazzi Schenone, Maurizio Vichi

This paper explores the use of clustering to rank multivariate observations by linking ranking to clustering through the Linear Ordered Partition (LOP) concept. A LOP allows optimal clustering into ordered “equivalence classes”. In fact, unlike simple units’ ordering, cluster ranking identifies classes where units are “incomparable”. The aim is to partition units into clusters with statistically distinct centroids, leading to an optimally ranked total order of clusters, where units within each one are considered “ties”. The proposed model finds the best least-squares (LS) LOP, alongside with a univariate transformation of the observed variables. This is because it identifies the LS LOP by orthogonally projecting multivariate units onto a line, thus creating a composite indicator that summarizes the observed variables. Model’s theoretical properties are discussed, and a large simulation study demonstrates its performance across different scenarios. Three real data applications highlight the method’s potential across different fields.

本文通过线性有序划分(LOP)概念将排序与聚类联系起来,探讨了聚类对多变量观测进行排序的方法。LOP允许最优聚类成有序的“等价类”。事实上,与简单的单位排序不同,聚类排序确定了单位“不可比较”的类。其目的是将单位划分为具有统计上不同质心的簇,从而导致簇的最佳排列总顺序,其中每个簇中的单位被认为是“关系”。提出的模型找到最佳最小二乘(LS) LOP,以及观测变量的单变量变换。这是因为它通过将多变量单元正交投影到一条线上来识别LS LOP,从而创建了一个综合指标,总结了观察到的变量。讨论了模型的理论性质,并进行了大型仿真研究,验证了模型在不同场景下的性能。三个真实的数据应用突出了该方法在不同领域的潜力。
{"title":"Clustering for ranking multivariate data by Linear Ordered Partitions","authors":"Mariaelena Bottazzi Schenone,&nbsp;Maurizio Vichi","doi":"10.1007/s10182-025-00534-5","DOIUrl":"10.1007/s10182-025-00534-5","url":null,"abstract":"<div><p>This paper explores the use of clustering to rank multivariate observations by linking ranking to clustering through the Linear Ordered Partition (LOP) concept. A LOP allows optimal clustering into ordered “<i>equivalence classes</i>”. In fact, unlike simple units’ ordering, cluster ranking identifies classes where units are “<i>incomparable</i>”. The aim is to partition units into clusters with statistically distinct centroids, leading to an optimally ranked total order of clusters, where units within each one are considered “<i>ties</i>”. The proposed model finds the best least-squares (LS) LOP, alongside with a univariate transformation of the observed variables. This is because it identifies the LS LOP by orthogonally projecting multivariate units onto a line, thus creating a composite indicator that summarizes the observed variables. Model’s theoretical properties are discussed, and a large simulation study demonstrates its performance across different scenarios. Three real data applications highlight the method’s potential across different fields.\u0000</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"110 1","pages":"117 - 148"},"PeriodicalIF":1.4,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-025-00534-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147341237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction: Fuzzy group fixed-effects estimation with spatial clustering 修正:空间聚类模糊群固定效应估计
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-07-07 DOI: 10.1007/s10182-025-00532-7
Roy Cerqueti, Pierpaolo D’Urso, Raffaele Mattera
{"title":"Correction: Fuzzy group fixed-effects estimation with spatial clustering","authors":"Roy Cerqueti,&nbsp;Pierpaolo D’Urso,&nbsp;Raffaele Mattera","doi":"10.1007/s10182-025-00532-7","DOIUrl":"10.1007/s10182-025-00532-7","url":null,"abstract":"","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"109 4","pages":"829 - 829"},"PeriodicalIF":1.4,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145915723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Functional data analysis for wearable sensor data: a systematic review 可穿戴传感器数据的功能数据分析:系统综述
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-06-18 DOI: 10.1007/s10182-025-00531-8
Nihan Acar-Denizli, Pedro Delicado

Wearable devices and sensors have recently become a popular way to collect data, especially in the health sciences. The use of sensors allows patients to be monitored over a period of time with a high observation frequency. Due to the continuous-on-time structure of the data, novel statistical methods are recommended for the analysis of sensor data. One of the popular approaches in the analysis of wearable sensor data is functional data analysis. The main objective of this paper is to review functional data analysis methods applied to wearable device data according to the type of sensor. In addition, we introduce several freely available software packages and open databases of wearable device data to facilitate access to sensor data in different fields.

可穿戴设备和传感器最近已经成为一种流行的数据收集方式,特别是在健康科学领域。传感器的使用使患者能够在一段时间内以高观察频率进行监测。由于数据的连续准时结构,建议采用新的统计方法对传感器数据进行分析。分析可穿戴传感器数据的常用方法之一是功能数据分析。本文的主要目的是根据传感器的类型,综述应用于可穿戴设备数据的功能数据分析方法。此外,我们引入了几个免费的软件包和开放的可穿戴设备数据数据库,以方便访问不同领域的传感器数据。
{"title":"Functional data analysis for wearable sensor data: a systematic review","authors":"Nihan Acar-Denizli,&nbsp;Pedro Delicado","doi":"10.1007/s10182-025-00531-8","DOIUrl":"10.1007/s10182-025-00531-8","url":null,"abstract":"<div><p>Wearable devices and sensors have recently become a popular way to collect data, especially in the health sciences. The use of sensors allows patients to be monitored over a period of time with a high observation frequency. Due to the continuous-on-time structure of the data, novel statistical methods are recommended for the analysis of sensor data. One of the popular approaches in the analysis of wearable sensor data is functional data analysis. The main objective of this paper is to review functional data analysis methods applied to wearable device data according to the type of sensor. In addition, we introduce several freely available software packages and open databases of wearable device data to facilitate access to sensor data in different fields.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"109 3","pages":"591 - 631"},"PeriodicalIF":1.4,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-025-00531-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145384842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Anisotropic local covariance matrices for spatial blind source separation 空间盲源分离的各向异性局部协方差矩阵
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-06-10 DOI: 10.1007/s10182-025-00529-2
Christoph Muehlmann, Claudia Cappello, Sandra De Iaco, Klaus Nordhausen

This paper aims to introduce a novel approach to spatial blind source separation (SBSS) that addresses the limitations of existing methods. Current SBSS techniques rely on the joint diagonalization of multiple local covariance functions, all of which assume isotropy. To overcome this constraint, anisotropic local covariance matrices that relax the isotropy assumption are proposed. A simulation study and an application on real-world data demonstrate the performance improvement obtained by incorporating these anisotropic covariance matrices into the SBSS framework and highlight the potential of this new approach for more accurate and flexible source separation in spatial data analysis.

本文旨在介绍一种新的空间盲源分离方法,以解决现有方法的局限性。当前的SBSS技术依赖于多个局部协方差函数的联合对角化,这些协方差函数都假设各向同性。为了克服这一限制,提出了放宽各向异性假设的各向异性局部协方差矩阵。仿真研究和在实际数据中的应用表明,将这些各向异性协方差矩阵纳入SBSS框架可以提高性能,并突出了这种新方法在空间数据分析中更加准确和灵活的源分离的潜力。
{"title":"Anisotropic local covariance matrices for spatial blind source separation","authors":"Christoph Muehlmann,&nbsp;Claudia Cappello,&nbsp;Sandra De Iaco,&nbsp;Klaus Nordhausen","doi":"10.1007/s10182-025-00529-2","DOIUrl":"10.1007/s10182-025-00529-2","url":null,"abstract":"<div><p>This paper aims to introduce a novel approach to spatial blind source separation (SBSS) that addresses the limitations of existing methods. Current SBSS techniques rely on the joint diagonalization of multiple local covariance functions, all of which assume isotropy. To overcome this constraint, anisotropic local covariance matrices that relax the isotropy assumption are proposed. A simulation study and an application on real-world data demonstrate the performance improvement obtained by incorporating these anisotropic covariance matrices into the SBSS framework and highlight the potential of this new approach for more accurate and flexible source separation in spatial data analysis.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"109 4","pages":"753 - 770"},"PeriodicalIF":1.4,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-025-00529-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145915721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-dimensional confounding adjustment in causal inference 因果推理中的高维混淆调整
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-06-05 DOI: 10.1007/s10182-025-00528-3
Sanghun Cha, Joon Jin Song, Kyeong Eun Lee

When estimating treatment effects in observational studies, propensity score analysis (PSA) is commonly used to reduce the arising bias that results from confounders interfering with causal inference. However, propensity score (PS) estimation is unstable if some confounders are densely measured and formed into high-dimensional data, which could eventually result in a biased estimate of the treatment effect. We propose two-stage analytic procedures to mitigate the high-dimensional problem: ridge PSA and functional PSA. In addition, conventional variance estimation of treatment effect estimates in the PSA methods tends to be biased, so we leverage the empirical bootstrap approach to develop a valid variance estimator. In the simulation study, we compare the bias and MSE of treatment effects estimated by ridge PSA and function PSA under the various confounding structures, including more densely measured confounders, and evaluate the performance of bootstrap variance estimators. The proposed methods are applied in the case study of police shootings.

当估计观察性研究中的治疗效果时,倾向评分分析(PSA)通常用于减少因混杂因素干扰因果推理而产生的偏倚。然而,如果一些混杂因素被密集测量并形成高维数据,则倾向得分(PS)估计是不稳定的,最终可能导致对治疗效果的估计有偏。我们提出两阶段的分析程序,以减轻高维问题:脊PSA和功能PSA。此外,PSA方法中治疗效果估计的传统方差估计往往存在偏差,因此我们利用经验自举方法来开发有效的方差估计器。在模拟研究中,我们比较了山脊PSA和函数PSA在各种混杂结构下(包括更密集测量的混杂因素)估计的治疗效果的偏差和MSE,并评估了自举方差估计器的性能。将所提出的方法应用于警察枪击事件的案例研究。
{"title":"High-dimensional confounding adjustment in causal inference","authors":"Sanghun Cha,&nbsp;Joon Jin Song,&nbsp;Kyeong Eun Lee","doi":"10.1007/s10182-025-00528-3","DOIUrl":"10.1007/s10182-025-00528-3","url":null,"abstract":"<div><p>When estimating treatment effects in observational studies, propensity score analysis (PSA) is commonly used to reduce the arising bias that results from confounders interfering with causal inference. However, propensity score (PS) estimation is unstable if some confounders are densely measured and formed into high-dimensional data, which could eventually result in a biased estimate of the treatment effect. We propose two-stage analytic procedures to mitigate the high-dimensional problem: ridge PSA and functional PSA. In addition, conventional variance estimation of treatment effect estimates in the PSA methods tends to be biased, so we leverage the empirical bootstrap approach to develop a valid variance estimator. In the simulation study, we compare the bias and MSE of treatment effects estimated by ridge PSA and function PSA under the various confounding structures, including more densely measured confounders, and evaluate the performance of bootstrap variance estimators. The proposed methods are applied in the case study of police shootings.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"109 3","pages":"463 - 481"},"PeriodicalIF":1.4,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145384847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using penalized-distance likelihood functions to analyze high-dimensional sparse/non-sparse data 使用惩罚距离似然函数分析高维稀疏/非稀疏数据
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-05-30 DOI: 10.1007/s10182-025-00527-4
S. K. Ghoreishi, Jingjing Wu, Qingrun Zhang, Ghazal S. Ghoreishi

In this paper, we define a penalized-distance likelihood function. This function is much more flexible than the available likelihood functions and can be used in many disciplines. Based on this function, we introduce a statistic for hypothesis testing and derive its asymptotic distribution. This statistic can be used to test a partial hypothesis in the parameter space for both non-sparse and sparse high-dimensional data. Relevant Bayesian analysis using the Markov chain Monte Carlo (MCMC) method will be discussed. Finally, we carry out a simulation study and apply our model to a real dataset.

在本文中,我们定义了一个惩罚距离似然函数。此函数比现有的似然函数灵活得多,可用于许多学科。在此基础上,我们引入了一个用于假设检验的统计量,并推导了它的渐近分布。该统计量可用于检验非稀疏和稀疏高维数据在参数空间中的部分假设。将讨论使用马尔可夫链蒙特卡罗(MCMC)方法的相关贝叶斯分析。最后,我们进行了仿真研究,并将我们的模型应用于一个真实的数据集。
{"title":"Using penalized-distance likelihood functions to analyze high-dimensional sparse/non-sparse data","authors":"S. K. Ghoreishi,&nbsp;Jingjing Wu,&nbsp;Qingrun Zhang,&nbsp;Ghazal S. Ghoreishi","doi":"10.1007/s10182-025-00527-4","DOIUrl":"10.1007/s10182-025-00527-4","url":null,"abstract":"<div><p>In this paper, we define a penalized-distance likelihood function. This function is much more flexible than the available likelihood functions and can be used in many disciplines. Based on this function, we introduce a statistic for hypothesis testing and derive its asymptotic distribution. This statistic can be used to test a partial hypothesis in the parameter space for both non-sparse and sparse high-dimensional data. Relevant Bayesian analysis using the Markov chain Monte Carlo (MCMC) method will be discussed. Finally, we carry out a simulation study and apply our model to a real dataset.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"109 3","pages":"509 - 528"},"PeriodicalIF":1.4,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145384843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gradient boosting for Dirichlet regression models Dirichlet回归模型的梯度增强
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-04-28 DOI: 10.1007/s10182-025-00526-5
Michael Balzer, Elisabeth Bergherr, Swen Hutter, Tobias Hepp

In various real-world applications, researchers often work with compositional data which appears as proportions, amounts or rates. As a framework for dealing with the unique nature of compositional data, Dirichlet regression models have been introduced. In this article, we propose a novel model-based gradient boosting approach for Dirichlet regression models embedded in the framework of generalized additive models for location, scale and shape. This approach allows for data-driven variable selection in low- as well as high-dimensional data settings. Moreover, the implementation enables the direct calculation of marginal effects for different predictor variables. Thus, it provides an alternative estimation procedure besides the well-established approach based on the maximum likelihood principle. After conducting detailed simulation studies to evaluate the performance of the estimation procedure regarding prediction accuracy and variable selection in low- and high-dimensional settings, we present a real-world application concerning the changes in election results in the Great Recession utilizing a large-scale European dataset. Using our proposed approach, we investigate the effect of protests on voting proportions of distinct party families while identifying important socioeconomic variables and their effect on those voting proportions via variable selection.

在各种现实世界的应用中,研究人员经常使用组成数据,这些数据显示为比例、数量或速率。Dirichlet回归模型作为处理组合数据的独特性质的框架已经被引入。在本文中,我们提出了一种新的基于模型的梯度增强方法,用于嵌入在位置、规模和形状的广义加性模型框架中的Dirichlet回归模型。这种方法允许在低维和高维数据设置中进行数据驱动的变量选择。此外,该实现可以直接计算不同预测变量的边际效应。因此,除了基于极大似然原理的既定方法之外,它提供了一种替代的估计程序。在进行了详细的模拟研究以评估在低维和高维环境下预测精度和变量选择方面的估计过程的性能之后,我们利用大规模的欧洲数据集提出了一个关于大衰退中选举结果变化的实际应用。使用我们提出的方法,我们研究了抗议对不同政党家庭投票比例的影响,同时通过变量选择确定重要的社会经济变量及其对这些投票比例的影响。
{"title":"Gradient boosting for Dirichlet regression models","authors":"Michael Balzer,&nbsp;Elisabeth Bergherr,&nbsp;Swen Hutter,&nbsp;Tobias Hepp","doi":"10.1007/s10182-025-00526-5","DOIUrl":"10.1007/s10182-025-00526-5","url":null,"abstract":"<div><p>In various real-world applications, researchers often work with compositional data which appears as proportions, amounts or rates. As a framework for dealing with the unique nature of compositional data, Dirichlet regression models have been introduced. In this article, we propose a novel model-based gradient boosting approach for Dirichlet regression models embedded in the framework of generalized additive models for location, scale and shape. This approach allows for data-driven variable selection in low- as well as high-dimensional data settings. Moreover, the implementation enables the direct calculation of marginal effects for different predictor variables. Thus, it provides an alternative estimation procedure besides the well-established approach based on the maximum likelihood principle. After conducting detailed simulation studies to evaluate the performance of the estimation procedure regarding prediction accuracy and variable selection in low- and high-dimensional settings, we present a real-world application concerning the changes in election results in the Great Recession utilizing a large-scale European dataset. Using our proposed approach, we investigate the effect of protests on voting proportions of distinct party families while identifying important socioeconomic variables and their effect on those voting proportions via variable selection.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"110 1","pages":"149 - 189"},"PeriodicalIF":1.4,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-025-00526-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147342295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bias-corrected estimation for (mathcal{G}^0_I) regression with applications 偏差校正估计(mathcal{G}^0_I)回归与应用程序
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-03-31 DOI: 10.1007/s10182-025-00525-6
M. F. S. S. Sousa, J. M. Vasconcelos, A. D. C. Nascimento

Synthetic aperture radar (SAR) systems are highly efficient tools for addressing remote sensing challenges. They offer several advantages, such as operating independently of atmospheric conditions and producing high spatial resolution images. However, SAR images are often contaminated by a type of interference called speckle noise, which complicates their analysis and processing. Therefore, proposing statistical methods, such as regression models, that account for speckle behavior is an important step for users of SAR systems. In the work [ISPRS J. Photogramm. Remote Sens., 213, 1–13, 2024], the ({mathcal{G}^{0}_{I}}) regression model (short for (mathcal{R} {mathcal{G}^{0}_{I}})) was proposed as an interpretable tool to relate SAR intensity features to other physical properties. The authors employed maximum likelihood estimators (MLEs), known for their good asymptotic properties but prone to considerable bias in small and medium sample sizes. In this paper, we propose a matrix expression for the second-order bias of MLEs for (mathcal{R} {mathcal{G}^{0}_{I}}) parameters, based on the Cox and Snell method. This proposal is justified by the necessity of using small and moderate windows when processing SAR images, such as for classification and filtering purposes. We compare bias-corrected MLEs with their counterparts using both Monte Carlo experiments and an application to SAR data from a Brazilian region. Numerical evidence demonstrates the effectiveness of our proposal.

合成孔径雷达(SAR)系统是解决遥感挑战的高效工具。它们有几个优点,如独立于大气条件运行和产生高空间分辨率图像。然而,SAR图像经常受到一种称为散斑噪声的干扰,这使得它们的分析和处理变得复杂。因此,对于SAR系统的用户来说,提出统计方法,如回归模型,来解释散斑行为是重要的一步。在作品中[ISPRS J.摄影]。遥感学报,213,1 - 13,2024],({mathcal{G}^{0}_{I}})回归模型(简称(mathcal{R} {mathcal{G}^{0}_{I}}))被提出作为一种可解释的工具,将SAR强度特征与其他物理性质联系起来。作者采用最大似然估计(MLEs),以其良好的渐近特性而闻名,但在中小型样本量中容易产生相当大的偏差。本文基于Cox和Snell方法,提出了(mathcal{R} {mathcal{G}^{0}_{I}})参数下MLEs二阶偏置的矩阵表达式。这一建议是合理的,因为在处理SAR图像时需要使用小而适中的窗口,例如用于分类和过滤目的。我们使用蒙特卡罗实验和巴西地区SAR数据的应用,比较了偏差校正的MLEs与相应的MLEs。数值证明了该方法的有效性。
{"title":"Bias-corrected estimation for (mathcal{G}^0_I) regression with applications","authors":"M. F. S. S. Sousa,&nbsp;J. M. Vasconcelos,&nbsp;A. D. C. Nascimento","doi":"10.1007/s10182-025-00525-6","DOIUrl":"10.1007/s10182-025-00525-6","url":null,"abstract":"<div><p>Synthetic aperture radar (SAR) systems are highly efficient tools for addressing remote sensing challenges. They offer several advantages, such as operating independently of atmospheric conditions and producing high spatial resolution images. However, SAR images are often contaminated by a type of interference called speckle noise, which complicates their analysis and processing. Therefore, proposing statistical methods, such as regression models, that account for speckle behavior is an important step for users of SAR systems. In the work [<span>ISPRS J. Photogramm. Remote Sens., 213, 1–13, 2024</span>], the <span>({mathcal{G}^{0}_{I}})</span> regression model (short for <span>(mathcal{R} {mathcal{G}^{0}_{I}})</span>) was proposed as an interpretable tool to relate SAR intensity features to other physical properties. The authors employed maximum likelihood estimators (MLEs), known for their good asymptotic properties but prone to considerable bias in small and medium sample sizes. In this paper, we propose a matrix expression for the second-order bias of MLEs for <span>(mathcal{R} {mathcal{G}^{0}_{I}})</span> parameters, based on the Cox and Snell method. This proposal is justified by the necessity of using small and moderate windows when processing SAR images, such as for classification and filtering purposes. We compare bias-corrected MLEs with their counterparts using both Monte Carlo experiments and an application to SAR data from a Brazilian region. Numerical evidence demonstrates the effectiveness of our proposal.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"109 3","pages":"557 - 589"},"PeriodicalIF":1.4,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145384891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A two-part beta regression with mismeasured dependent variable for modeling quasi-formal employment in Europe 对欧洲准正规就业建模的两部分β回归与误测因变量
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-02-22 DOI: 10.1007/s10182-025-00524-7
Maria Felice Arezzo, Giuseppina Guagnano, Domenico Vitale

Quasi-formal employment refers to a situation in which a formal employer and a formal employee agree to declare only part of the wage and the rest being paid in cash to avoid tax liabilities. This phenomenon is detrimental to government budgets and to worker protection. It is therefore crucial to understand its extent and its drivers. The Eurobarometer survey no. 498 provides the information to achieve these objectives. However, several issues in the data need to be addressed with credible solutions in order to rely upon the inferences drawn. These issues are primarily concerned with the reliability of the data, which is compromised due to the phenomenon of social desirability bias. This can be defined as the tendency of respondents to provide false information when answering sensitive questions relating to socially stigmatized behaviors, such as tax evasion. In this work, we present a unified framework for modeling such survey data that overcomes the problems raised by social desirability bias and accommodates the structure of the variables of interest. In particular, we propose a two-part beta regression model where the part one models the participation in quasi-formal employment, whereas the part two models the share of annual gross income earned under the table. We allow the dependent variables of both parts of the model to be mismeasured to handle social desirability bias and generalize the part two using a beta regression framework that is suitable for the limited dependent variable representing the incidence of wage paid in cash. The performance of the estimators is evaluated through a Monte Carlo simulation study and compared with those achieved through a standard procedure that ignores the issues arising from social desirability bias. An application to the Eurobarometer survey no. 498 on quasi-formal employment is provided.

准正式雇佣是指正式雇主和正式雇员为避免纳税义务,约定只申报部分工资,其余部分以现金支付的情况。这种现象不利于政府预算和工人保护。因此,了解其程度及其驱动因素至关重要。欧洲晴雨表调查编号:498提供了实现这些目标的信息。然而,数据中的几个问题需要用可靠的解决方案来解决,以便依赖所得出的推论。这些问题主要与数据的可靠性有关,由于社会可取性偏见现象,数据的可靠性受到损害。这可以定义为受访者在回答与社会污名行为(如逃税)有关的敏感问题时提供虚假信息的倾向。在这项工作中,我们提出了一个统一的框架来模拟这样的调查数据,克服了社会可取性偏见带来的问题,并适应了感兴趣变量的结构。特别地,我们提出了一个两部分的beta回归模型,其中第一部分模拟准正式就业的参与,而第二部分模拟年度总收入的份额。我们允许模型两部分的因变量被错测,以处理社会可取性偏差,并使用适用于代表现金支付工资发生率的有限因变量的beta回归框架概括第二部分。估计器的性能通过蒙特卡罗模拟研究进行评估,并与通过忽略社会可取性偏差引起的问题的标准程序获得的结果进行比较。申请欧洲晴雨表调查编号:498关于准正式就业的资料。
{"title":"A two-part beta regression with mismeasured dependent variable for modeling quasi-formal employment in Europe","authors":"Maria Felice Arezzo,&nbsp;Giuseppina Guagnano,&nbsp;Domenico Vitale","doi":"10.1007/s10182-025-00524-7","DOIUrl":"10.1007/s10182-025-00524-7","url":null,"abstract":"<div><p>Quasi-formal employment refers to a situation in which a formal employer and a formal employee agree to declare only part of the wage and the rest being paid in cash to avoid tax liabilities. This phenomenon is detrimental to government budgets and to worker protection. It is therefore crucial to understand its extent and its drivers. The Eurobarometer survey no. 498 provides the information to achieve these objectives. However, several issues in the data need to be addressed with credible solutions in order to rely upon the inferences drawn. These issues are primarily concerned with the reliability of the data, which is compromised due to the phenomenon of social desirability bias. This can be defined as the tendency of respondents to provide false information when answering sensitive questions relating to socially stigmatized behaviors, such as tax evasion. In this work, we present a unified framework for modeling such survey data that overcomes the problems raised by social desirability bias and accommodates the structure of the variables of interest. In particular, we propose a two-part beta regression model where the part one models the participation in quasi-formal employment, whereas the part two models the share of annual gross income earned under the table. We allow the dependent variables of both parts of the model to be mismeasured to handle social desirability bias and generalize the part two using a beta regression framework that is suitable for the limited dependent variable representing the incidence of wage paid in cash. The performance of the estimators is evaluated through a Monte Carlo simulation study and compared with those achieved through a standard procedure that ignores the issues arising from social desirability bias. An application to the Eurobarometer survey no. 498 on quasi-formal employment is provided.</p></div>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":"110 1","pages":"191 - 218"},"PeriodicalIF":1.4,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10182-025-00524-7.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147341238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Asta-Advances in Statistical Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1