首页 > 最新文献

Brazilian Journal of Probability and Statistics最新文献

英文 中文
Neyman’s truncation test for two-sample means under high dimensional setting 高维条件下两样本均值的Neyman截断检验
IF 1 4区 数学 Q3 Mathematics Pub Date : 2022-03-01 DOI: 10.1214/21-bjps519
Ping Dong, Lu Lin
Abstract. Multivariate two-sample testing problems often arise from the statistical analysis for scientific data, especially for bioinformatics data. To detect components with different values between two mean vectors, well-known procedures are to apply Sum-of-Squares type tests, such as Hotelling’s T 2-test. However, such a test is not suitable to high dimensional settings because of singular covariance matrix and accumulated errors. Nowadays, a lot of test methods for high dimensional data are developed, mainly including two types, Sum-of-Squares type and Max type. The Sum-of-Squares type test statistics have poor performance against sparse alternatives. And the Max type test statistic is not powerful enough to deal with non-sparse datasets. In this paper, we propose a Max-Partial-Sum type statistic named Neyman’s Truncation test, which is conducted by maximum partial sums of marginal test statistics. Besides non-sparse datasets, Neyman’s Truncation test also has great power against dense and sparse alternatives. The asymptotic distribution of the test statistic under null hypothesis is obtained and the power of the test is analyzed. To avoid the slow convergence rate of the asymptotic distribution, we realize our method by Bootstrap procedures. Simulation studies and the analysis of leukemia dataset are carried out to verify the numerical performance.
摘要在对科学数据,特别是生物信息学数据进行统计分析时,经常会出现多变量双样本检验问题。为了检测两个平均向量之间具有不同值的分量,众所周知的程序是应用平方和类型测试,例如Hoteling的T2测试。然而,由于奇异协方差矩阵和累积误差,这种测试不适合高维设置。目前,高维数据的测试方法很多,主要有平方和型和最大型两种。平方和类型测试统计相对于稀疏备选方案的性能较差。并且Max类型检验统计量的功能不足以处理非稀疏数据集。本文提出了一种最大偏和型统计量Neyman截断检验,它是由边际检验统计量的最大偏和进行的。除了非稀疏数据集,Neyman的截断测试对密集和稀疏的替代方案也有很大的优势。得到了零假设下检验统计量的渐近分布,并分析了检验的幂。为了避免渐近分布的收敛速度慢,我们通过Bootstrap程序实现了我们的方法。对白血病数据集进行了仿真研究和分析,验证了数值性能。
{"title":"Neyman’s truncation test for two-sample means under high dimensional setting","authors":"Ping Dong, Lu Lin","doi":"10.1214/21-bjps519","DOIUrl":"https://doi.org/10.1214/21-bjps519","url":null,"abstract":"Abstract. Multivariate two-sample testing problems often arise from the statistical analysis for scientific data, especially for bioinformatics data. To detect components with different values between two mean vectors, well-known procedures are to apply Sum-of-Squares type tests, such as Hotelling’s T 2-test. However, such a test is not suitable to high dimensional settings because of singular covariance matrix and accumulated errors. Nowadays, a lot of test methods for high dimensional data are developed, mainly including two types, Sum-of-Squares type and Max type. The Sum-of-Squares type test statistics have poor performance against sparse alternatives. And the Max type test statistic is not powerful enough to deal with non-sparse datasets. In this paper, we propose a Max-Partial-Sum type statistic named Neyman’s Truncation test, which is conducted by maximum partial sums of marginal test statistics. Besides non-sparse datasets, Neyman’s Truncation test also has great power against dense and sparse alternatives. The asymptotic distribution of the test statistic under null hypothesis is obtained and the power of the test is analyzed. To avoid the slow convergence rate of the asymptotic distribution, we realize our method by Bootstrap procedures. Simulation studies and the analysis of leukemia dataset are carried out to verify the numerical performance.","PeriodicalId":51242,"journal":{"name":"Brazilian Journal of Probability and Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43713670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast feature selection via streamwise procedure for massive data 通过流式过程对海量数据进行快速特征选择
IF 1 4区 数学 Q3 Mathematics Pub Date : 2022-03-01 DOI: 10.1214/21-bjps516
Bingqing Lin, Zhen Pang, Jun Zhang, Cuiqing Chen
{"title":"Fast feature selection via streamwise procedure for massive data","authors":"Bingqing Lin, Zhen Pang, Jun Zhang, Cuiqing Chen","doi":"10.1214/21-bjps516","DOIUrl":"https://doi.org/10.1214/21-bjps516","url":null,"abstract":"","PeriodicalId":51242,"journal":{"name":"Brazilian Journal of Probability and Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44260826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Valid properties of truncated Student-t regression model with applications in analysis of censored data 截断Student-t回归模型的有效性质及其在截尾数据分析中的应用
IF 1 4区 数学 Q3 Mathematics Pub Date : 2022-03-01 DOI: 10.1214/21-bjps521
Chi Zhang, Guozheng Tian, Yibo Zhai, Y. Fei
Kim (2008) introduced an incorrect stochastic representation (SR) for the truncated Student-t (Tt) random variable. By pointing out that the gamma mixture based on a truncated normal distribution actually cannot result in a true Tt distribution, in this paper, we first propose three correct SRs and then recalculate the corresponding moments of the Tt distribution. Different from those derived by following the invalid SR of Kim (2008), the correct moments of the Tt distribution play a crucial role in parameter estimations. Based on the third SR proposed and the correct expressions of truncated moments, expectation–maximization (EM) algorithms are developed for calculating the maximum likelihood estimates of parameters in the Tt distribution. Extensions to a Tt regression model and a t interval–censored regression model are provided as well. Simulated experiments are conducted to evaluate the performance of the proposed methods. Finally, two real data analyses corroborate the theoretical results.
Kim(2008)为截断的Student-t (Tt)随机变量引入了一个不正确的随机表示(SR)。通过指出基于截断正态分布的伽马混合实际上不能得到真正的Tt分布,本文首先提出了三个正确的SRs,然后重新计算了Tt分布的相应矩。与遵循Kim(2008)的无效SR得到的结果不同,Tt分布的正确矩在参数估计中起着至关重要的作用。基于所提出的第三种SR和截断矩的正确表达式,提出了用于计算Tt分布中参数的最大似然估计的期望最大化算法。还提供了对Tt回归模型和t区间截尾回归模型的扩展。通过仿真实验对所提方法的性能进行了评价。最后,通过两个实际数据分析验证了理论结果。
{"title":"Valid properties of truncated Student-t regression model with applications in analysis of censored data","authors":"Chi Zhang, Guozheng Tian, Yibo Zhai, Y. Fei","doi":"10.1214/21-bjps521","DOIUrl":"https://doi.org/10.1214/21-bjps521","url":null,"abstract":"Kim (2008) introduced an incorrect stochastic representation (SR) for the truncated Student-t (Tt) random variable. By pointing out that the gamma mixture based on a truncated normal distribution actually cannot result in a true Tt distribution, in this paper, we first propose three correct SRs and then recalculate the corresponding moments of the Tt distribution. Different from those derived by following the invalid SR of Kim (2008), the correct moments of the Tt distribution play a crucial role in parameter estimations. Based on the third SR proposed and the correct expressions of truncated moments, expectation–maximization (EM) algorithms are developed for calculating the maximum likelihood estimates of parameters in the Tt distribution. Extensions to a Tt regression model and a t interval–censored regression model are provided as well. Simulated experiments are conducted to evaluate the performance of the proposed methods. Finally, two real data analyses corroborate the theoretical results.","PeriodicalId":51242,"journal":{"name":"Brazilian Journal of Probability and Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43056982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The discrete renewal equation with nonsummable inhomogeneous term 具有不可和非齐次项的离散更新方程
IF 1 4区 数学 Q3 Mathematics Pub Date : 2022-03-01 DOI: 10.1214/21-bjps517
M. Sgibnev
{"title":"The discrete renewal equation with nonsummable inhomogeneous term","authors":"M. Sgibnev","doi":"10.1214/21-bjps517","DOIUrl":"https://doi.org/10.1214/21-bjps517","url":null,"abstract":"","PeriodicalId":51242,"journal":{"name":"Brazilian Journal of Probability and Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48743808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unit level model for small area estimation with count data under square root transformation 平方根变换下计数数据小面积估计的单位级模型
IF 1 4区 数学 Q3 Mathematics Pub Date : 2022-03-01 DOI: 10.1214/21-bjps513
Kelly C. M. Gonçalves, M. Ghosh
Abstract. In recent years, the demand for small area statistics has greatly increased worldwide. Small area models are formulated with random area-specific effects assumed to account for the between-area variation that is not explained by auxiliary variables. The unit level models relate the unit values of a study variable to unit-specific covariates. The main aim of this paper is to consider small area estimation under unit level models based on count data. In particular, instead of modelling the variables assuming the Poisson distribution, which is a usual choice, we consider the square root transformation of the original data. One practical advantage is that the proposed transformation achieves approximate homoscedasticity of the error variances, reducing one layer of estimation problem. Inference for the model is carried out under the hierarchical Bayes approach. The square root transformation is evaluated under a simulation study and two design-based studies with real datasets.
摘要近年来,世界范围内对小区域统计的需求大大增加。小区域模型是用随机的区域特定效应来制定的,假设它可以解释无法由辅助变量解释的区域间变化。单位水平模型将研究变量的单位值与单位特定协变量联系起来。本文的主要目的是考虑基于计数数据的单位级模型下的小面积估计。特别是,我们考虑原始数据的平方根变换,而不是假设泊松分布的变量建模,这是一种通常的选择。一个实际的优点是,所提出的变换实现了误差方差的近似同方差,减少了一层估计问题。模型的推理是在层次贝叶斯方法下进行的。在一个模拟研究和两个基于设计的实际数据集研究中,对平方根变换进行了评估。
{"title":"Unit level model for small area estimation with count data under square root transformation","authors":"Kelly C. M. Gonçalves, M. Ghosh","doi":"10.1214/21-bjps513","DOIUrl":"https://doi.org/10.1214/21-bjps513","url":null,"abstract":"Abstract. In recent years, the demand for small area statistics has greatly increased worldwide. Small area models are formulated with random area-specific effects assumed to account for the between-area variation that is not explained by auxiliary variables. The unit level models relate the unit values of a study variable to unit-specific covariates. The main aim of this paper is to consider small area estimation under unit level models based on count data. In particular, instead of modelling the variables assuming the Poisson distribution, which is a usual choice, we consider the square root transformation of the original data. One practical advantage is that the proposed transformation achieves approximate homoscedasticity of the error variances, reducing one layer of estimation problem. Inference for the model is carried out under the hierarchical Bayes approach. The square root transformation is evaluated under a simulation study and two design-based studies with real datasets.","PeriodicalId":51242,"journal":{"name":"Brazilian Journal of Probability and Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42563947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Unadjusted Langevin algorithm for sampling a mixture of weakly smooth potentials 弱光滑电位混合采样的未调整朗格万算法
IF 1 4区 数学 Q3 Mathematics Pub Date : 2021-12-17 DOI: 10.1214/22-bjps538
D. Nguyen
. Discretization of continuous-time diffusion processes is a widely recognized method for sampling. However, it seems to be a considerable restriction when the potentials are often required to be smooth (gradient Lipschitz). This paper studies the problem of sampling through Euler discretization, where the potential function is assumed to be a mixture of weakly smooth distributions and satisfies weakly dissipative. We establish the convergence in Kullback-Leibler (KL) divergence with the number of iterations to reach (cid:15) neighborhood of a target distribution in only polynomial dependence on the dimension. We relax the degenerated convex at infinity conditions of Erdogdu and Hosseinzadeh (2020) and prove convergence guarantees under Poincaré inequality or non-strongly convex outside the ball. In addition, we also provide convergence in L β -Wasserstein metric for the smoothing potential.
. 连续时间扩散过程的离散化是一种被广泛认可的采样方法。然而,这似乎是一个相当大的限制时,往往要求电位是光滑的(梯度Lipschitz)。本文研究了用欧拉离散方法进行抽样的问题,其中假定势函数是一个弱光滑分布的混合物,并且满足弱耗散。我们建立了Kullback-Leibler (KL)散度的收敛性,当迭代次数达到目标分布的(cid:15)邻域时,只依赖于维数的多项式。我们松弛了Erdogdu和Hosseinzadeh(2020)在无穷远条件下的退化凸,并证明了poincarcarr不等式或球外非强凸下的收敛保证。此外,我们还提供了平滑势的L β -Wasserstein度量的收敛性。
{"title":"Unadjusted Langevin algorithm for sampling a mixture of weakly smooth potentials","authors":"D. Nguyen","doi":"10.1214/22-bjps538","DOIUrl":"https://doi.org/10.1214/22-bjps538","url":null,"abstract":". Discretization of continuous-time diffusion processes is a widely recognized method for sampling. However, it seems to be a considerable restriction when the potentials are often required to be smooth (gradient Lipschitz). This paper studies the problem of sampling through Euler discretization, where the potential function is assumed to be a mixture of weakly smooth distributions and satisfies weakly dissipative. We establish the convergence in Kullback-Leibler (KL) divergence with the number of iterations to reach (cid:15) neighborhood of a target distribution in only polynomial dependence on the dimension. We relax the degenerated convex at infinity conditions of Erdogdu and Hosseinzadeh (2020) and prove convergence guarantees under Poincaré inequality or non-strongly convex outside the ball. In addition, we also provide convergence in L β -Wasserstein metric for the smoothing potential.","PeriodicalId":51242,"journal":{"name":"Brazilian Journal of Probability and Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45608926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A robust partial least squares approach for function-on-function regression 函数上函数回归的一种稳健偏最小二乘方法
IF 1 4区 数学 Q3 Mathematics Pub Date : 2021-11-01 DOI: 10.1214/21-bjps523
U. Beyaztas, H. Shang
The function-on-function linear regression model in which the response and predictors consist of random curves has become a general framework to investigate the relationship between the functional response and functional predictors. Existing methods to estimate the model parameters may be sensitive to outlying observations, common in empirical applications. In addition, these methods may be severely affected by such observations, leading to undesirable estimation and prediction results. A robust estimation method, based on iteratively reweighted simple partial least squares, is introduced to improve the prediction accuracy of the function-on-function linear regression model in the presence of outliers. The performance of the proposed method is based on the number of partial least squares components used to estimate the function-on-function linear regression model. Thus, the optimum number of components is determined via a data-driven error criterion. The finite-sample performance of the proposed method is investigated via several Monte Carlo experiments and an empirical data analysis. In addition, a nonparametric bootstrap method is applied to construct pointwise prediction intervals for the response function. The results are compared with some of the existing methods to illustrate the improvement potentially gained by the proposed method.
反应和预测因子由随机曲线组成的函数对函数线性回归模型已成为研究功能反应和功能预测因子之间关系的通用框架。现有的估计模型参数的方法可能对外围观测结果敏感,这在经验应用中很常见。此外,这些方法可能会受到此类观测的严重影响,导致不期望的估计和预测结果。在存在异常值的情况下,引入了一种基于迭代重加权简单偏最小二乘的稳健估计方法,以提高函数对函数线性回归模型的预测精度。所提出的方法的性能是基于用于估计函数上函数线性回归模型的偏最小二乘分量的数量。因此,通过数据驱动的误差准则来确定部件的最佳数量。通过几个蒙特卡罗实验和经验数据分析,研究了该方法的有限样本性能。此外,还应用了一种非参数自举方法来构造响应函数的逐点预测区间。将结果与现有的一些方法进行了比较,以说明所提出的方法可能获得的改进。
{"title":"A robust partial least squares approach for function-on-function regression","authors":"U. Beyaztas, H. Shang","doi":"10.1214/21-bjps523","DOIUrl":"https://doi.org/10.1214/21-bjps523","url":null,"abstract":"The function-on-function linear regression model in which the response and predictors consist of random curves has become a general framework to investigate the relationship between the functional response and functional predictors. Existing methods to estimate the model parameters may be sensitive to outlying observations, common in empirical applications. In addition, these methods may be severely affected by such observations, leading to undesirable estimation and prediction results. A robust estimation method, based on iteratively reweighted simple partial least squares, is introduced to improve the prediction accuracy of the function-on-function linear regression model in the presence of outliers. The performance of the proposed method is based on the number of partial least squares components used to estimate the function-on-function linear regression model. Thus, the optimum number of components is determined via a data-driven error criterion. The finite-sample performance of the proposed method is investigated via several Monte Carlo experiments and an empirical data analysis. In addition, a nonparametric bootstrap method is applied to construct pointwise prediction intervals for the response function. The results are compared with some of the existing methods to illustrate the improvement potentially gained by the proposed method.","PeriodicalId":51242,"journal":{"name":"Brazilian Journal of Probability and Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48499943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian inference for zero-and/or-one augmented beta rectangular regression models 零和/或一增广β矩形回归模型的贝叶斯推理
IF 1 4区 数学 Q3 Mathematics Pub Date : 2021-11-01 DOI: 10.1214/21-bjps505
Ana R. S. Silva, C. Azevedo, J. Bazán, J. Nobre
Abstract. In this paper we developed a full set of Bayesian inference tools, for zero-and/or-one augmented beta rectangular regression models to analyze limited-augmented data, under a new parameterization. This parameterization: facilitates the development of both regression models and inferential tools as well as make simplifies the respective computational implementations. The proposed Bayesian tools were parameter estimation, model fit assessment, model comparison (information criteria), residual analysis and case influence diagnostics, developed through MCMC algorithms. In addition, we adapted available methods of posterior predictive checking, using appropriate discrepancy measures. We conducted several simulation studies, considering some situations of practical interest, aiming to evaluate: prior sensitivity choice, parameter recovery of the proposed model and estimation method, the impact of transforming the observed zeros and ones, along with the use of non-augmented models, and the behavior of the proposed model fit assessment and model comparison tools. A psychometric real data set was analyzed to illustrate the performance of the developed tools, illustrating the advantages of the developed analysis framework.
摘要在本文中,我们开发了一套完整的贝叶斯推理工具,用于零和/或一增广β矩形回归模型在新的参数化下分析有限增广数据。这种参数化促进了回归模型和推理工具的开发,并简化了各自的计算实现。提出的贝叶斯工具包括参数估计、模型拟合评估、模型比较(信息标准)、残差分析和案例影响诊断,这些工具都是通过MCMC算法开发的。此外,我们采用了可用的后验预测检验方法,使用适当的差异测量。我们进行了几项模拟研究,考虑了一些实际情况,旨在评估:先验灵敏度选择,所提出的模型和估计方法的参数恢复,转换观测到的0和1的影响,以及使用非增广模型,以及所提出的模型拟合评估和模型比较工具的行为。通过对一个心理测量真实数据集的分析来说明所开发工具的性能,说明所开发的分析框架的优点。
{"title":"Bayesian inference for zero-and/or-one augmented beta rectangular regression models","authors":"Ana R. S. Silva, C. Azevedo, J. Bazán, J. Nobre","doi":"10.1214/21-bjps505","DOIUrl":"https://doi.org/10.1214/21-bjps505","url":null,"abstract":"Abstract. In this paper we developed a full set of Bayesian inference tools, for zero-and/or-one augmented beta rectangular regression models to analyze limited-augmented data, under a new parameterization. This parameterization: facilitates the development of both regression models and inferential tools as well as make simplifies the respective computational implementations. The proposed Bayesian tools were parameter estimation, model fit assessment, model comparison (information criteria), residual analysis and case influence diagnostics, developed through MCMC algorithms. In addition, we adapted available methods of posterior predictive checking, using appropriate discrepancy measures. We conducted several simulation studies, considering some situations of practical interest, aiming to evaluate: prior sensitivity choice, parameter recovery of the proposed model and estimation method, the impact of transforming the observed zeros and ones, along with the use of non-augmented models, and the behavior of the proposed model fit assessment and model comparison tools. A psychometric real data set was analyzed to illustrate the performance of the developed tools, illustrating the advantages of the developed analysis framework.","PeriodicalId":51242,"journal":{"name":"Brazilian Journal of Probability and Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43765996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A note on the Nielsen distribution 关于尼尔森分布的说明
IF 1 4区 数学 Q3 Mathematics Pub Date : 2021-11-01 DOI: 10.1214/21-bjps507
D. Gallardo, M. Bourguignon
Castellares, Lemonte, and Santos [Brazilian Journal of Probability and Statistics, 34(1), 90-111, 2020] introduced a two-parameter discrete Nielsen distribution, derived its properties, and illustrated the advantages of the model in three data applications. In this note, we will present a corrected version for some results for the particular case θ = 1.
Castellares、Lemonte和Santos【巴西概率与统计杂志,34(1),90-1111020】介绍了一种双参数离散尼尔森分布,推导了其性质,并说明了该模型在三个数据应用中的优势。在本说明中,我们将为特定情况θ=1的一些结果提供一个校正版本。
{"title":"A note on the Nielsen distribution","authors":"D. Gallardo, M. Bourguignon","doi":"10.1214/21-bjps507","DOIUrl":"https://doi.org/10.1214/21-bjps507","url":null,"abstract":"Castellares, Lemonte, and Santos [Brazilian Journal of Probability and Statistics, 34(1), 90-111, 2020] introduced a two-parameter discrete Nielsen distribution, derived its properties, and illustrated the advantages of the model in three data applications. In this note, we will present a corrected version for some results for the particular case θ = 1.","PeriodicalId":51242,"journal":{"name":"Brazilian Journal of Probability and Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47840078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Model-assisted SCAD calibration for non-probability samples 非概率样本的模型辅助SCAD校准
IF 1 4区 数学 Q3 Mathematics Pub Date : 2021-11-01 DOI: 10.1214/21-bjps506
Zhanxu Liu, Chao-Cheng Tu, Yingli Pan
Increasing costs and non-response rates of probability samples have provoked the extensive use of non-probability samples. However, non-probability samples are subject to selection bias, resulting in difficulty for inference. Calibration is a popular method to reduce selection bias in non-probability samples. When rich covariate information is available, a key problem is how to select covariates and estimate parameters in calibration for non-probability samples. In this paper, the model-assisted SCAD calibration is proposed to make population inference from non-probability samples. A parametric model between the study variable and covariates is first established. SCAD is then used to estimate the model parameters based on non-probability samples. The modified forward Kullback-Leibler distance is lastly explored to conduct calibration for non-probability samples based on the estimated parametric model. The theoretical properties of the model-assisted SCAD calibration estimator are further derived. Results from simulation studies show that the model-assisted SCAD calibration estimator yields the smallest bias and mean square error compared with other estimators. Also, a real data from the *Correspondence author: Yingli Pan, Email: panyingli220@163.com
不断增加的成本和概率样本的无响应率促使了非概率样本的广泛使用。然而,非概率样本存在选择偏差,导致推理困难。校准是非概率样本中减少选择偏差的常用方法。在协变量信息丰富的情况下,如何对非概率样本进行协变量选择和参数估计是一个关键问题。本文提出了一种模型辅助SCAD校准方法,从非概率样本中进行总体推断。首先建立了研究变量与协变量之间的参数化模型。然后利用SCAD来估计基于非概率样本的模型参数。最后探讨了基于估计参数模型的修正前向Kullback-Leibler距离对非概率样本进行校正。进一步推导了模型辅助SCAD校准估计器的理论性质。仿真结果表明,与其他估计方法相比,模型辅助SCAD校准估计方法的偏差和均方误差最小。另外,有一个真实数据来自*通讯作者:潘英利,Email: panyingli220@163.com
{"title":"Model-assisted SCAD calibration for non-probability samples","authors":"Zhanxu Liu, Chao-Cheng Tu, Yingli Pan","doi":"10.1214/21-bjps506","DOIUrl":"https://doi.org/10.1214/21-bjps506","url":null,"abstract":"Increasing costs and non-response rates of probability samples have provoked the extensive use of non-probability samples. However, non-probability samples are subject to selection bias, resulting in difficulty for inference. Calibration is a popular method to reduce selection bias in non-probability samples. When rich covariate information is available, a key problem is how to select covariates and estimate parameters in calibration for non-probability samples. In this paper, the model-assisted SCAD calibration is proposed to make population inference from non-probability samples. A parametric model between the study variable and covariates is first established. SCAD is then used to estimate the model parameters based on non-probability samples. The modified forward Kullback-Leibler distance is lastly explored to conduct calibration for non-probability samples based on the estimated parametric model. The theoretical properties of the model-assisted SCAD calibration estimator are further derived. Results from simulation studies show that the model-assisted SCAD calibration estimator yields the smallest bias and mean square error compared with other estimators. Also, a real data from the *Correspondence author: Yingli Pan, Email: panyingli220@163.com","PeriodicalId":51242,"journal":{"name":"Brazilian Journal of Probability and Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46547938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Brazilian Journal of Probability and Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1