首页 > 最新文献

Statistical Papers最新文献

英文 中文
Minimax weight learning for absorbing MDPs 吸收型 MDP 的最小权重学习
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-03-06 DOI: 10.1007/s00362-023-01491-4
Fengying Li, Yuqiang Li, Xianyi Wu

Reinforcement learning policy evaluation problems are often modeled as finite or discounted/averaged infinite-horizon Markov Decision Processes (MDPs). In this paper, we study undiscounted off-policy evaluation for absorbing MDPs. Given the dataset consisting of i.i.d episodes under a given truncation level, we propose an algorithm (referred to as MWLA in the text) to directly estimate the expected return via the importance ratio of the state-action occupancy measure. The Mean Square Error (MSE) bound of the MWLA method is provided and the dependence of statistical errors on the data size and the truncation level are analyzed. The performance of the algorithm is illustrated by means of computational experiments under an episodic taxi environment

强化学习政策评估问题通常被建模为有限或贴现/平均无限视距马尔可夫决策过程(MDP)。在本文中,我们将研究吸收型 MDP 的未贴现非策略评估。给定截断水平下的数据集由 i.i.d 事件组成,我们提出了一种算法(文中称为 MWLA),通过状态-行动占用度量的重要性比直接估计预期收益。我们提供了 MWLA 方法的均方误差(MSE)边界,并分析了统计误差对数据规模和截断水平的依赖性。通过在偶发出租车环境下的计算实验,说明了该算法的性能。
{"title":"Minimax weight learning for absorbing MDPs","authors":"Fengying Li, Yuqiang Li, Xianyi Wu","doi":"10.1007/s00362-023-01491-4","DOIUrl":"https://doi.org/10.1007/s00362-023-01491-4","url":null,"abstract":"<p>Reinforcement learning policy evaluation problems are often modeled as finite or discounted/averaged infinite-horizon Markov Decision Processes (MDPs). In this paper, we study undiscounted off-policy evaluation for absorbing MDPs. Given the dataset consisting of i.i.d episodes under a given truncation level, we propose an algorithm (referred to as MWLA in the text) to directly estimate the expected return via the importance ratio of the state-action occupancy measure. The Mean Square Error (MSE) bound of the MWLA method is provided and the dependence of statistical errors on the data size and the truncation level are analyzed. The performance of the algorithm is illustrated by means of computational experiments under an episodic taxi environment</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"43 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140045510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Homogeneity tests and interval estimations of risk differences for stratified bilateral and unilateral correlated data 分层双边和单边相关数据的同质性检验和风险差异区间估计
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-03-04 DOI: 10.1007/s00362-024-01532-6
Shuyi Liang, Kai-Tai Fang, Xin-Wei Huang, Yijing Xin, Chang-Xing Ma

In clinical trials studying paired parts of a subject with binary outcomes, it is expected to collect measurements bilaterally. However, there are cases where subjects contribute measurements for only one part. By utilizing combined data, it is possible to gain additional information compared to using bilateral or unilateral data alone. With the combined data, this article investigates homogeneity tests of risk differences with the presence of stratification effects and proposes interval estimations of a common risk difference if stratification does not introduce underlying dissimilarities. Under Dallal’s model (Biometrics 44:253–257, 1988), we propose three test statistics and evaluate their performances regarding type I error controls and powers. Confidence intervals of a common risk difference with satisfactory coverage probabilities and interval length are constructed. Our simulation results show that the score test is the most robust and the profile likelihood confidence interval outperforms other methods proposed. Data from a study of acute otitis media is used to illustrate our proposed procedures.

在对受试者的成对部分进行二元结果研究的临床试验中,预计要收集双侧的测量数据。不过,也有受试者只对一个部位进行测量的情况。与单独使用双侧或单侧数据相比,利用组合数据可以获得更多信息。利用合并数据,本文研究了存在分层效应时风险差异的同质性检验,并提出了在分层不引入潜在差异的情况下共同风险差异的区间估计。根据 Dallal 的模型(Biometrics 44:253-257, 1988),我们提出了三种检验统计量,并评估了它们在 I 型误差控制和幂级数方面的表现。我们构建了具有令人满意的覆盖概率和区间长度的共同风险差异置信区间。我们的模拟结果表明,得分检验是最稳健的,轮廓似然置信区间优于其他方法。我们使用急性中耳炎的研究数据来说明我们提出的程序。
{"title":"Homogeneity tests and interval estimations of risk differences for stratified bilateral and unilateral correlated data","authors":"Shuyi Liang, Kai-Tai Fang, Xin-Wei Huang, Yijing Xin, Chang-Xing Ma","doi":"10.1007/s00362-024-01532-6","DOIUrl":"https://doi.org/10.1007/s00362-024-01532-6","url":null,"abstract":"<p>In clinical trials studying paired parts of a subject with binary outcomes, it is expected to collect measurements bilaterally. However, there are cases where subjects contribute measurements for only one part. By utilizing combined data, it is possible to gain additional information compared to using bilateral or unilateral data alone. With the combined data, this article investigates homogeneity tests of risk differences with the presence of stratification effects and proposes interval estimations of a common risk difference if stratification does not introduce underlying dissimilarities. Under Dallal’s model (Biometrics 44:253–257, 1988), we propose three test statistics and evaluate their performances regarding type I error controls and powers. Confidence intervals of a common risk difference with satisfactory coverage probabilities and interval length are constructed. Our simulation results show that the score test is the most robust and the profile likelihood confidence interval outperforms other methods proposed. Data from a study of acute otitis media is used to illustrate our proposed procedures.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"55 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140033154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Welch’s t test is more sensitive to real world violations of distributional assumptions than student’s t test but logistic regression is more robust than either 韦尔奇 t 检验比学生 t 检验对现实世界中违反分布假设的情况更敏感,但逻辑回归比两者都更稳健。
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-03-04 DOI: 10.1007/s00362-024-01531-7
David Curtis

It has previously been pointed out that Student’s t test, which assumes that samples are drawn from populations with equal standard deviations, can have an inflated Type I error rate if this assumption is violated. Hence it has been recommended that Welch’s t test should be preferred. In the context of carrying out gene-wise weighted burden tests for detecting association of rare variants with psoriasis we observe that Welch’s test performs unsatisfactorily. We show that if the assumption of normality is violated and observations follow a Poisson distribution, then with unequal sample sizes Welch’s t test has an inflated Type I error rate, is systematically biased and is prone to produce extremely low p values. We argue that such data can arise in a variety of real world situations and believe that researchers should be aware of this issue. Student’s t test performs much better in this scenario but a likelihood ratio test based on logistic regression models performs better still and we suggest that this might generally be a preferable method to test for a difference in distributions between two samples.

This research has been conducted using the UK Biobank Resource.

以前曾有人指出,学生 t 检验假定样本来自标准差相等的群体,如果违反了这一假定,I 类错误率就会增大。因此,建议采用韦尔奇 t 检验。在为检测罕见变异体与银屑病的关联而进行基因加权负担测试时,我们发现韦尔奇检验的表现并不令人满意。我们的研究表明,如果违反了正态性假设,观察结果呈泊松分布,那么在样本量不等的情况下,韦尔奇 t 检验的 I 类错误率就会升高,出现系统性偏差,并容易产生极低的 p 值。我们认为,这种数据可能出现在现实世界的各种情况中,研究人员应该意识到这个问题。在这种情况下,学生 t 检验的效果要好得多,但基于逻辑回归模型的似然比检验的效果更好,我们认为这可能是检验两个样本分布差异的较好方法。
{"title":"Welch’s t test is more sensitive to real world violations of distributional assumptions than student’s t test but logistic regression is more robust than either","authors":"David Curtis","doi":"10.1007/s00362-024-01531-7","DOIUrl":"https://doi.org/10.1007/s00362-024-01531-7","url":null,"abstract":"<p>It has previously been pointed out that Student’s <i>t</i> test, which assumes that samples are drawn from populations with equal standard deviations, can have an inflated Type I error rate if this assumption is violated. Hence it has been recommended that Welch’s <i>t</i> test should be preferred. In the context of carrying out gene-wise weighted burden tests for detecting association of rare variants with psoriasis we observe that Welch’s test performs unsatisfactorily. We show that if the assumption of normality is violated and observations follow a Poisson distribution, then with unequal sample sizes Welch’s <i>t</i> test has an inflated Type I error rate, is systematically biased and is prone to produce extremely low <i>p</i> values. We argue that such data can arise in a variety of real world situations and believe that researchers should be aware of this issue. Student’s <i>t</i> test performs much better in this scenario but a likelihood ratio test based on logistic regression models performs better still and we suggest that this might generally be a preferable method to test for a difference in distributions between two samples.</p><p>This research has been conducted using the UK Biobank Resource.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"239 ","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140037982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A scale-invariant test for linear hypothesis of means in high dimensions 高维度均值线性假设的标度不变检验
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-02-29 DOI: 10.1007/s00362-024-01530-8

Abstract

In this paper, we propose a new scale-invariant test for linear hypothesis of mean vectors with heteroscedasticity in high-dimensional settings. Most existing tests impose strong conditions on covariance matrices so that null distributions of their tests are asymptotically normal, which restricts the application of test procedures. However, our proposed test has different null distributions under mild conditions. Additionally, the well-known Welch-Satterthwaite chi-square approximation we adopted can automatically mimic the shapes of the null distributions of the test statistic. The performances of the test are illustrated by simulation and real data in finite samples which show that it has robustness and is more powerful than three competitors.

摘要 本文提出了一种新的规模不变检验方法,用于检验高维环境下具有异方差性的均值向量线性假设。现有的大多数检验都对协方差矩阵施加了强条件,使其检验的空分布为渐近正态分布,这限制了检验程序的应用。然而,我们提出的检验在温和条件下具有不同的空分布。此外,我们采用的著名的韦尔奇-萨特斯韦特卡方近似法可以自动模拟检验统计量的空分布形状。我们通过模拟和有限样本中的真实数据来说明该检验的性能,结果表明它具有稳健性,而且比三个竞争对手更强大。
{"title":"A scale-invariant test for linear hypothesis of means in high dimensions","authors":"","doi":"10.1007/s00362-024-01530-8","DOIUrl":"https://doi.org/10.1007/s00362-024-01530-8","url":null,"abstract":"<h3>Abstract</h3> <p>In this paper, we propose a new scale-invariant test for linear hypothesis of mean vectors with heteroscedasticity in high-dimensional settings. Most existing tests impose strong conditions on covariance matrices so that null distributions of their tests are asymptotically normal, which restricts the application of test procedures. However, our proposed test has different null distributions under mild conditions. Additionally, the well-known Welch-Satterthwaite chi-square approximation we adopted can automatically mimic the shapes of the null distributions of the test statistic. The performances of the test are illustrated by simulation and real data in finite samples which show that it has robustness and is more powerful than three competitors.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"46 22 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140001940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A unified approach to goodness-of-fit testing for spherical and hyperspherical data 球形和超球形数据拟合优度测试的统一方法
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-02-26 DOI: 10.1007/s00362-024-01529-1
Bruno Ebner, Norbert Henze, Simos Meintanis

We propose a general and relatively simple method to construct goodness-of-fit tests on the sphere and the hypersphere. The method is based on the characterization of probability distributions via their characteristic function, and it leads to test criteria that are convenient regarding applications and consistent against arbitrary deviations from the model under test. We emphasize goodness-of-fit tests for spherical distributions due to their importance in applications and the relative scarcity of available methods.

我们提出了一种在球面和超球面上构建拟合优度检验的通用而相对简单的方法。该方法基于通过概率分布的特征函数对概率分布进行表征,得出的检验标准既便于应用,又能与被检验模型的任意偏差保持一致。我们强调球面分布的拟合优度检验,因为它们在应用中非常重要,而且可用的方法相对较少。
{"title":"A unified approach to goodness-of-fit testing for spherical and hyperspherical data","authors":"Bruno Ebner, Norbert Henze, Simos Meintanis","doi":"10.1007/s00362-024-01529-1","DOIUrl":"https://doi.org/10.1007/s00362-024-01529-1","url":null,"abstract":"<p>We propose a general and relatively simple method to construct goodness-of-fit tests on the sphere and the hypersphere. The method is based on the characterization of probability distributions via their characteristic function, and it leads to test criteria that are convenient regarding applications and consistent against arbitrary deviations from the model under test. We emphasize goodness-of-fit tests for spherical distributions due to their importance in applications and the relative scarcity of available methods.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"2 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139969034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Is Fisher inference inferior to Neyman inference for policy analysis? 在政策分析中,费雪推断是否不如奈曼推断?
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-02-20 DOI: 10.1007/s00362-024-01528-2
Rauf Ahmad, Per Johansson, Mårten Schultzberg

The increasing computational power has led to an increasing interest in Fisher’s test in social science. As the Fisher and Neyman inference are based on different principles there is also an increasing interest in understanding the differential features of the two procedures. For example, Young (2018) found that the Fisher test has better size properties than the Neyman test in the situation with influential observations. Ding (2017), on the other hand, showed that the asymptotic variance of the mean-difference estimator (MDE) under Fisher inference is larger than that under Neyman inference, and that the asymptotic Fisher test is less powerful than the t-test even for the simplest case of homogeneous effect. Since MDE plays an important role for policy evaluation, these latter results are a concern for using Fisher’s test as argued in Young (2018). With the aim of providing an understanding of the usefulness of the exact Fisher test for inference to the sample and to the population, this paper clarifies the results in Ding (2017). Using a novel Monte Carlo simulation following the same data generating processes as in Ding (2017), we demonstrate that the Fisher test has no worse power properties than the t-test even with heterogeneous effects.

随着计算能力的不断提高,社会科学界对费雪检验的兴趣与日俱增。由于费雪推断和奈曼推断基于不同的原理,人们也越来越有兴趣了解这两种程序的不同特点。例如,Young(2018)发现,在有影响观测值的情况下,Fisher 检验比 Neyman 检验具有更好的规模属性。而 Ding(2017)的研究则表明,Fisher 推断下均值差估计器(MDE)的渐近方差大于 Neyman 推断下的方差,即使在最简单的同质效应情况下,渐近 Fisher 检验也不如 t 检验有力。由于 MDE 在政策评估中发挥着重要作用,正如 Young(2018)所论证的那样,后面这些结果是使用 Fisher 检验的一个顾虑。为了让人们了解精确费雪检验对样本和总体推断的有用性,本文澄清了 Ding(2017)的结果。通过使用与 Ding(2017)中相同的数据生成过程进行新颖的蒙特卡罗模拟,我们证明了费雪检验的功率特性并不比 t 检验差,即使在异质效应的情况下也是如此。
{"title":"Is Fisher inference inferior to Neyman inference for policy analysis?","authors":"Rauf Ahmad, Per Johansson, Mårten Schultzberg","doi":"10.1007/s00362-024-01528-2","DOIUrl":"https://doi.org/10.1007/s00362-024-01528-2","url":null,"abstract":"<p>The increasing computational power has led to an increasing interest in Fisher’s test in social science. As the Fisher and Neyman inference are based on different principles there is also an increasing interest in understanding the differential features of the two procedures. For example, Young (2018) found that the Fisher test has better size properties than the Neyman test in the situation with influential observations. Ding (2017), on the other hand, showed that the asymptotic variance of the mean-difference estimator (MDE) under Fisher inference is larger than that under Neyman inference, and that the asymptotic Fisher test is less powerful than the <i>t</i>-test even for the simplest case of homogeneous effect. Since MDE plays an important role for policy evaluation, these latter results are a concern for using Fisher’s test as argued in Young (2018). With the aim of providing an understanding of the usefulness of the exact Fisher test for inference to the sample and to the population, this paper clarifies the results in Ding (2017). Using a novel Monte Carlo simulation following the same data generating processes as in Ding (2017), we demonstrate that the Fisher test has no worse power properties than the t-test even with heterogeneous effects.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"70 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139921756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The effect of correlated errors on the performance of local linear estimation of regression function based on random functional design 相关误差对基于随机函数设计的回归函数局部线性估计性能的影响
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-02-14 DOI: 10.1007/s00362-023-01523-z
Karim Benhenni, Ali Hajj Hassan, Yingcai Su

This article considers the problem of nonparametric estimation of the regression function (r) in a functional regression model (Y = r(X) +varepsilon ) with a scalar response Y, a functional explanatory variable X, and a second order stationary error process (varepsilon ). Under some specific criteria, we construct a local linear kernel estimator of (r) from functional random design with correlated errors. The exact rates of convergence of mean squared error of the constructed estimator are established for both short and long range dependent error processes. Simulation studies are conducted on the performance of the proposed simple local linear estimator. Examples of time series data are considered.

本文考虑的问题是在函数回归模型(Y = r(X) +varepsilon )中回归函数(r)的非参数估计,该模型具有标量响应 Y、函数解释变量 X 和二阶静态误差过程 (varepsilon)。在一些特定的标准下,我们从具有相关误差的函数随机设计中构建了一个局部线性核估计器((r))。在短程和长程依赖误差过程中,都建立了所建估计器均方误差的精确收敛率。对所提出的简单局部线性估计器的性能进行了仿真研究。考虑了时间序列数据的实例。
{"title":"The effect of correlated errors on the performance of local linear estimation of regression function based on random functional design","authors":"Karim Benhenni, Ali Hajj Hassan, Yingcai Su","doi":"10.1007/s00362-023-01523-z","DOIUrl":"https://doi.org/10.1007/s00362-023-01523-z","url":null,"abstract":"<p>This article considers the problem of nonparametric estimation of the regression function <span>(r)</span> in a functional regression model <span>(Y = r(X) +varepsilon )</span> with a scalar response <i>Y</i>, a functional explanatory variable <i>X</i>, and a second order stationary error process <span>(varepsilon )</span>. Under some specific criteria, we construct a local linear kernel estimator of <span>(r)</span> from functional random design with correlated errors. The exact rates of convergence of mean squared error of the constructed estimator are established for both short and long range dependent error processes. Simulation studies are conducted on the performance of the proposed simple local linear estimator. Examples of time series data are considered.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"208 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139762372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Strong consistency of tail value-at-risk estimator and corresponding general results under widely orthant dependent samples 广泛正交依存样本下尾部风险价值估计器的强一致性及相应的一般结果
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-17 DOI: 10.1007/s00362-023-01525-x
Jinyu Zhou, Jigao Yan, Dongya Cheng

In this paper, strong consistency of tail value-at-risk (TVaR) estimator under widely orthant dependent (WOD) samples is established, and a numerical simulation is performed to verify the validity of the theoretical results. To reveal the essence of the result, theoretical discussion on complete and complete moment convergence corresponding to the Baum–Katz law, as well as the Marcinkiewicz–Zygmund type strong law of large numbers (MZSLLN) for maximal weighted sums and maximal product sums of widely orthant dependent (WOD) random variables are investigated. The results obtained in the context extend the corresponding ones for independent and some dependent random variables.

本文建立了广泛正交依赖(WOD)样本下尾部风险值(TVaR)估计器的强一致性,并通过数值模拟验证了理论结果的正确性。为了揭示结果的本质,研究了与 Baum-Katz 定律相对应的完全收敛和完全矩收敛,以及广泛正交依存(WOD)随机变量的最大加权和和最大乘积和的 Marcinkiewicz-Zygmund 型强大数定律(MZSLLN)。在此背景下获得的结果扩展了独立随机变量和某些从属随机变量的相应结果。
{"title":"Strong consistency of tail value-at-risk estimator and corresponding general results under widely orthant dependent samples","authors":"Jinyu Zhou, Jigao Yan, Dongya Cheng","doi":"10.1007/s00362-023-01525-x","DOIUrl":"https://doi.org/10.1007/s00362-023-01525-x","url":null,"abstract":"<p>In this paper, strong consistency of tail value-at-risk (TVaR) estimator under widely orthant dependent (WOD) samples is established, and a numerical simulation is performed to verify the validity of the theoretical results. To reveal the essence of the result, theoretical discussion on complete and complete moment convergence corresponding to the Baum–Katz law, as well as the Marcinkiewicz–Zygmund type strong law of large numbers (MZSLLN) for maximal weighted sums and maximal product sums of widely orthant dependent (WOD) random variables are investigated. The results obtained in the context extend the corresponding ones for independent and some dependent random variables.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"1 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139501214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Subgroup analysis with concave pairwise fusion penalty for ordinal response 采用凹面成对融合惩罚对序数反应进行分组分析
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-13 DOI: 10.1007/s00362-023-01526-w
Weirong Li, Wensheng Zhu

The growing popularity of data heterogeneity motivates people to identify homogeneous subgroups with identical parameters. Meanwhile, in many fields of recent data science for some applications, such as personalized education and personalized marketing, the massive data are usually recorded as categorical or ordinal variables, which highlights the importance of performing subgroup analysis on those ordinal outcomes. In this paper, we propose a cumulative link model with subject-specific intercepts to detect and identify homogeneous subgroups through concave pairwise fusion penalty for ordinal response, where heterogeneity arises from some unknown or unobserved latent factors. The concave fusion method can simultaneously determine the number of subgroups, identify the group membership, and estimate the regression coefficients. An alternating direction method of multipliers algorithm with concave penalties for the generalized linear regression model with logit link is developed and its convergence property is studied. We also establish the oracle property of the proposed penalized estimator under some mild conditions. Our simulation studies show that the proposed method could recover the heterogeneous subgroup structure effectively when the response of interest is ordinal. Further, the advantages of our method are illustrated by the analysis on a Mathematics Student Performance Data Set of two public schools from the Alentejo region of Portugal.

数据异质性的日益普及促使人们去识别具有相同参数的同质子群。同时,在近年来数据科学的许多应用领域,如个性化教育和个性化营销,海量数据通常记录为分类或序数变量,这就凸显了对这些序数结果进行亚组分析的重要性。在本文中,我们提出了一种带有特定受试者截距的累积链接模型,通过对序数响应的凹对融合惩罚来检测和识别同质亚组,其中异质性来自一些未知或未观察到的潜在因素。凹对融合法可以同时确定亚组数量、识别组内成员和估计回归系数。针对带有 logit 链接的广义线性回归模型,我们开发了一种带有凹面惩罚的交替方向乘法算法,并对其收敛特性进行了研究。我们还在一些温和的条件下建立了所提出的惩罚估计器的甲骨文特性。我们的模拟研究表明,当感兴趣的响应是序数时,所提出的方法可以有效地恢复异质子群结构。此外,我们还通过对葡萄牙阿连特茹地区两所公立学校的数学学生成绩数据集的分析,说明了我们的方法的优势。
{"title":"Subgroup analysis with concave pairwise fusion penalty for ordinal response","authors":"Weirong Li, Wensheng Zhu","doi":"10.1007/s00362-023-01526-w","DOIUrl":"https://doi.org/10.1007/s00362-023-01526-w","url":null,"abstract":"<p>The growing popularity of data heterogeneity motivates people to identify homogeneous subgroups with identical parameters. Meanwhile, in many fields of recent data science for some applications, such as personalized education and personalized marketing, the massive data are usually recorded as categorical or ordinal variables, which highlights the importance of performing subgroup analysis on those ordinal outcomes. In this paper, we propose a cumulative link model with subject-specific intercepts to detect and identify homogeneous subgroups through concave pairwise fusion penalty for ordinal response, where heterogeneity arises from some unknown or unobserved latent factors. The concave fusion method can simultaneously determine the number of subgroups, identify the group membership, and estimate the regression coefficients. An alternating direction method of multipliers algorithm with concave penalties for the generalized linear regression model with logit link is developed and its convergence property is studied. We also establish the oracle property of the proposed penalized estimator under some mild conditions. Our simulation studies show that the proposed method could recover the heterogeneous subgroup structure effectively when the response of interest is ordinal. Further, the advantages of our method are illustrated by the analysis on a Mathematics Student Performance Data Set of two public schools from the Alentejo region of Portugal.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"46 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139460088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Some additional remarks on statistical properties of Cohen’s d in the presence of covariates 关于存在协变量时 Cohen's d 统计特性的一些补充说明
IF 1.3 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-12 DOI: 10.1007/s00362-023-01527-9
Jürgen Groß, Annette Möller

The size of the effect of the difference in two groups with respect to a variable of interest may be estimated by the classical Cohen’s d. A recently proposed generalized estimator allows conditioning on further independent variables within the framework of a linear regression model. In this note, it is demonstrated how unbiased estimation of the effect size parameter together with a corresponding standard error may be obtained based on the non-central t distribution. The portrayed estimator may be considered as a natural generalization of the unbiased Hedges’ g. In addition, confidence interval estimation for the unknown parameter is demonstrated by applying the so-called inversion confidence interval principle. The regarded properties collapse to already known ones in case of absence of any additional independent variables. The stated remarks are illustrated with a publicly available data set.

最近提出的一种广义估计方法允许在线性回归模型的框架内对更多的独立变量进行调节。在本说明中,我们将展示如何基于非中心 t 分布,对效应大小参数进行无偏估计,并得出相应的标准误差。所描绘的估计器可视为无偏 Hedges' g 的自然概括。此外,通过应用所谓的反转置信区间原理,还演示了未知参数的置信区间估计。在没有任何额外自变量的情况下,所考虑的特性与已知的特性相吻合。上述论述将通过一组公开数据加以说明。
{"title":"Some additional remarks on statistical properties of Cohen’s d in the presence of covariates","authors":"Jürgen Groß, Annette Möller","doi":"10.1007/s00362-023-01527-9","DOIUrl":"https://doi.org/10.1007/s00362-023-01527-9","url":null,"abstract":"<p>The size of the effect of the difference in two groups with respect to a variable of interest may be estimated by the classical Cohen’s <i>d</i>. A recently proposed generalized estimator allows conditioning on further independent variables within the framework of a linear regression model. In this note, it is demonstrated how unbiased estimation of the effect size parameter together with a corresponding standard error may be obtained based on the non-central <i>t</i> distribution. The portrayed estimator may be considered as a natural generalization of the unbiased Hedges’ <i>g</i>. In addition, confidence interval estimation for the unknown parameter is demonstrated by applying the so-called inversion confidence interval principle. The regarded properties collapse to already known ones in case of absence of any additional independent variables. The stated remarks are illustrated with a publicly available data set.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"17 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139460089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistical Papers
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1