首页 > 最新文献

Stats最新文献

英文 中文
Bayesian Logistic Regression Model for Sub-Areas 子区域的贝叶斯逻辑回归模型
Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-29 DOI: 10.3390/stats6010013
Lu Chen, B. Nandram
Many population-based surveys have binary responses from a large number of individuals in each household within small areas. One example is the Nepal Living Standards Survey (NLSS II), in which health status binary data (good versus poor) for each individual from sampled households (sub-areas) are available in the sampled wards (small areas). To make an inference for the finite population proportion of individuals in each household, we use the sub-area logistic regression model with reliable auxiliary information. The contribution of this model is twofold. First, we extend an area-level model to a sub-area level model. Second, because there are numerous sub-areas, standard Markov chain Monte Carlo (MCMC) methods to find the joint posterior density are very time-consuming. Therefore, we provide a sampling-based method, the integrated nested normal approximation (INNA), which permits fast computation. Our main goal is to describe this hierarchical Bayesian logistic regression model and to show that the computation is much faster than the exact MCMC method and also reasonably accurate. The performance of our method is studied by using NLSS II data. Our model can borrow strength from both areas and sub-areas to obtain more efficient and precise estimates. The hierarchical structure of our model captures the variation in the binary data reasonably well.
许多基于人口的调查都有来自小范围内每个家庭的大量个人的二元反应。一个例子是尼泊尔生活水平调查(NLSS II),在该调查中,抽样家庭(子地区)的每个人的健康状况二进制数据(良好与较差)可在抽样病房(小地区)中获得。为了推断每个家庭中个体的有限人口比例,我们使用了具有可靠辅助信息的子区域逻辑回归模型。这种模式的贡献是双重的。首先,我们将区域级模型扩展为子区域级模型。其次,由于有许多子区域,标准的马尔可夫链蒙特卡罗(MCMC)方法来寻找关节后验密度是非常耗时的。因此,我们提供了一种基于采样的方法,即集成嵌套正态近似(INNA),它允许快速计算。我们的主要目标是描述这种分层贝叶斯逻辑回归模型,并表明计算速度比精确的MCMC方法快得多,而且相当准确。利用NLSSⅡ数据对该方法的性能进行了研究。我们的模型可以从区域和子区域中汲取力量,以获得更高效、更精确的估计。我们模型的层次结构相当好地捕捉了二进制数据的变化。
{"title":"Bayesian Logistic Regression Model for Sub-Areas","authors":"Lu Chen, B. Nandram","doi":"10.3390/stats6010013","DOIUrl":"https://doi.org/10.3390/stats6010013","url":null,"abstract":"Many population-based surveys have binary responses from a large number of individuals in each household within small areas. One example is the Nepal Living Standards Survey (NLSS II), in which health status binary data (good versus poor) for each individual from sampled households (sub-areas) are available in the sampled wards (small areas). To make an inference for the finite population proportion of individuals in each household, we use the sub-area logistic regression model with reliable auxiliary information. The contribution of this model is twofold. First, we extend an area-level model to a sub-area level model. Second, because there are numerous sub-areas, standard Markov chain Monte Carlo (MCMC) methods to find the joint posterior density are very time-consuming. Therefore, we provide a sampling-based method, the integrated nested normal approximation (INNA), which permits fast computation. Our main goal is to describe this hierarchical Bayesian logistic regression model and to show that the computation is much faster than the exact MCMC method and also reasonably accurate. The performance of our method is studied by using NLSS II data. Our model can borrow strength from both areas and sub-areas to obtain more efficient and precise estimates. The hierarchical structure of our model captures the variation in the binary data reasonably well.","PeriodicalId":93142,"journal":{"name":"Stats","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41412551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing Robust Linking and Regularized Estimation for Linking Two Groups in the 1PL and 2PL Models in the Presence of Sparse Uniform Differential Item Functioning 比较1PL和2PL模型中存在稀疏一致微分项函数的两组连接的鲁棒连接和正则化估计
Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-25 DOI: 10.3390/stats6010012
A. Robitzsch
In the social sciences, the performance of two groups is frequently compared based on a cognitive test involving binary items. Item response models are often utilized for comparing the two groups. However, the presence of differential item functioning (DIF) can impact group comparisons. In order to avoid the biased estimation of groups, appropriate statistical methods for handling differential item functioning are required. This article compares the performance-regularized estimation and several robust linking approaches in three simulation studies that address the one-parameter logistic (1PL) and two-parameter logistic (2PL) models, respectively. It turned out that robust linking approaches are at least as effective as the regularized estimation approach in most of the conditions in the simulation studies.
在社会科学中,两组人的表现经常基于涉及二元项目的认知测试进行比较。项目反应模型通常用于比较两组。然而,差异项目功能(DIF)的存在会影响群体比较。为了避免对群体的偏估计,需要适当的统计方法来处理不同的项目功能。本文在三个分别处理单参数逻辑(1PL)和双参数逻辑(2PL)模型的仿真研究中比较了性能正则化估计和几种鲁棒连接方法。结果表明,在仿真研究的大多数情况下,鲁棒连接方法至少与正则化估计方法一样有效。
{"title":"Comparing Robust Linking and Regularized Estimation for Linking Two Groups in the 1PL and 2PL Models in the Presence of Sparse Uniform Differential Item Functioning","authors":"A. Robitzsch","doi":"10.3390/stats6010012","DOIUrl":"https://doi.org/10.3390/stats6010012","url":null,"abstract":"In the social sciences, the performance of two groups is frequently compared based on a cognitive test involving binary items. Item response models are often utilized for comparing the two groups. However, the presence of differential item functioning (DIF) can impact group comparisons. In order to avoid the biased estimation of groups, appropriate statistical methods for handling differential item functioning are required. This article compares the performance-regularized estimation and several robust linking approaches in three simulation studies that address the one-parameter logistic (1PL) and two-parameter logistic (2PL) models, respectively. It turned out that robust linking approaches are at least as effective as the regularized estimation approach in most of the conditions in the simulation studies.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42309810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Informative g-Priors for Mixed Models 混合模型的信息g先验
Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-16 DOI: 10.3390/stats6010011
Yu-Fang Chien, Haiming Zhou, T. Hanson, Theodore C. Lystig
Zellner’s objective g-prior has been widely used in linear regression models due to its simple interpretation and computational tractability in evaluating marginal likelihoods. However, the g-prior further allows portioning the prior variability explained by the linear predictor versus that of pure noise. In this paper, we propose a novel yet remarkably simple g-prior specification when a subject matter expert has information on the marginal distribution of the response yi. The approach is extended for use in mixed models with some surprising but intuitive results. Simulation studies are conducted to compare the model fitting under the proposed g-prior with that under other existing priors.
Zellner的目标g先验由于其解释简单,计算易于处理,在线性回归模型中得到了广泛的应用。然而,g-prior进一步允许将线性预测器与纯噪声解释的先验可变性进行分割。在本文中,我们提出了一个新颖但非常简单的g-先验规范,当主题专家有关于响应yi的边际分布的信息时。将该方法扩展到混合模型中,得到了一些令人惊讶但直观的结果。通过仿真研究,将所提出的g-prior下的模型拟合与其他已有的prior下的模型拟合进行了比较。
{"title":"Informative g-Priors for Mixed Models","authors":"Yu-Fang Chien, Haiming Zhou, T. Hanson, Theodore C. Lystig","doi":"10.3390/stats6010011","DOIUrl":"https://doi.org/10.3390/stats6010011","url":null,"abstract":"Zellner’s objective g-prior has been widely used in linear regression models due to its simple interpretation and computational tractability in evaluating marginal likelihoods. However, the g-prior further allows portioning the prior variability explained by the linear predictor versus that of pure noise. In this paper, we propose a novel yet remarkably simple g-prior specification when a subject matter expert has information on the marginal distribution of the response yi. The approach is extended for use in mixed models with some surprising but intuitive results. Simulation studies are conducted to compare the model fitting under the proposed g-prior with that under other existing priors.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48885348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Novel Flexible Class of Intervened Poisson Distribution by Lagrangian Approach 基于拉格朗日方法的一类新的柔性干涉泊松分布
Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-15 DOI: 10.3390/stats6010010
M. Irshad, M. Monisha, C. Chesneau, R. Maya, D. S. Shibu
The zero-truncated Poisson distribution (ZTPD) generates a statistical model that could be appropriate when observations begin once at least one event occurs. The intervened Poisson distribution (IPD) is a substitute for the ZTPD, in which some intervention processes may change the mean of the rare events. These two zero-truncated distributions exhibit underdispersion (i.e., their variance is less than their mean). In this research, we offer an alternative solution for dealing with intervention problems by proposing a generalization of the IPD by a Lagrangian approach called the Lagrangian intervened Poisson distribution (LIPD), which in fact generalizes both the ZTPD and the IPD. As a notable feature, it has the ability to analyze both overdispersed and underdispersed datasets. In addition, the LIPD has a closed-form expression of all of its statistical characteristics, as well as an increasing, decreasing, bathtub-shaped, and upside-down bathtub-shaped hazard rate function. A consequent part is devoted to its statistical application. The maximum likelihood estimation method is considered, and the effectiveness of the estimates is demonstrated through a simulated study. To evaluate the significance of the new parameter in the LIPD, a generalized likelihood ratio test is performed. Subsequently, we present a new count regression model that is suitable for both overdispersed and underdispersed datasets using the mean-parametrized form of the LIPD. Additionally, the LIPD’s relevance and application are shown using real-world datasets.
零截断泊松分布(ZTPD)生成的统计模型适用于至少一次事件发生后开始观测的情况。干预泊松分布(IPD)可以代替ZTPD,其中一些干预过程可能会改变罕见事件的平均值。这两个零截断分布表现为不充分分散(即,它们的方差小于平均值)。在这项研究中,我们提出了一种处理干预问题的替代解决方案,即通过拉格朗日方法对IPD进行推广,称为拉格朗日干预泊松分布(LIPD),它实际上推广了ZTPD和IPD。作为一个显著的特点,它具有分析过分散和欠分散数据集的能力。此外,LIPD具有其所有统计特征的封闭表达式,以及增加、减少、浴缸形和倒置浴缸形的危险率函数。随后的一部分专门讨论它的统计应用。考虑了极大似然估计方法,并通过仿真研究验证了估计的有效性。为了评估LIPD中新参数的显著性,进行了广义似然比检验。随后,我们提出了一种新的计数回归模型,该模型适用于使用LIPD的平均参数化形式的过分散和欠分散数据集。此外,LIPD的相关性和应用使用了真实世界的数据集。
{"title":"A Novel Flexible Class of Intervened Poisson Distribution by Lagrangian Approach","authors":"M. Irshad, M. Monisha, C. Chesneau, R. Maya, D. S. Shibu","doi":"10.3390/stats6010010","DOIUrl":"https://doi.org/10.3390/stats6010010","url":null,"abstract":"The zero-truncated Poisson distribution (ZTPD) generates a statistical model that could be appropriate when observations begin once at least one event occurs. The intervened Poisson distribution (IPD) is a substitute for the ZTPD, in which some intervention processes may change the mean of the rare events. These two zero-truncated distributions exhibit underdispersion (i.e., their variance is less than their mean). In this research, we offer an alternative solution for dealing with intervention problems by proposing a generalization of the IPD by a Lagrangian approach called the Lagrangian intervened Poisson distribution (LIPD), which in fact generalizes both the ZTPD and the IPD. As a notable feature, it has the ability to analyze both overdispersed and underdispersed datasets. In addition, the LIPD has a closed-form expression of all of its statistical characteristics, as well as an increasing, decreasing, bathtub-shaped, and upside-down bathtub-shaped hazard rate function. A consequent part is devoted to its statistical application. The maximum likelihood estimation method is considered, and the effectiveness of the estimates is demonstrated through a simulated study. To evaluate the significance of the new parameter in the LIPD, a generalized likelihood ratio test is performed. Subsequently, we present a new count regression model that is suitable for both overdispersed and underdispersed datasets using the mean-parametrized form of the LIPD. Additionally, the LIPD’s relevance and application are shown using real-world datasets.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42405726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Acknowledgment to the Reviewers of Stats in 2022 对2022年统计审稿人的感谢
Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-12 DOI: 10.3390/stats6010009
High-quality academic publishing is built on rigorous peer review [...]
高质量的学术出版建立在严格的同行评审的基础上[…]
{"title":"Acknowledgment to the Reviewers of Stats in 2022","authors":"","doi":"10.3390/stats6010009","DOIUrl":"https://doi.org/10.3390/stats6010009","url":null,"abstract":"High-quality academic publishing is built on rigorous peer review [...]","PeriodicalId":93142,"journal":{"name":"Stats","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135996027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical Prediction of Future Sports Records Based on Record Values 基于记录值的未来体育记录的统计预测
Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-11 DOI: 10.3390/stats6010008
Christina Empacher, U. Kamps, G. Volovskiy
Point prediction of future record values based on sequences of previous lower or upper records is considered by means of the method of maximum product of spacings, where the underlying distribution is assumed to be a power function distribution and a Pareto distribution, respectively. Moreover, exact and approximate prediction intervals are discussed and compared with regard to their expected lengths and their percentages of coverage. The focus is on deriving explicit expressions in the point and interval prediction procedures. Predictions and forecasts are of interest, e.g., in sports analytics, which is gaining more and more attention in several sports disciplines. Previous works on forecasting athletic records have mainly been based on extreme value theory. The presented statistical prediction methods are exemplarily applied to data from various disciplines of athletics as well as to data from American football based on fantasy football points according to the points per reception scoring scheme. The results are discussed along with basic assumptions and the choice of underlying distributions.
基于先前较低或较高记录的序列的未来记录值的点预测是通过间距的最大乘积方法来考虑的,其中假设基本分布分别是幂函数分布和帕累托分布。此外,还讨论并比较了精确和近似的预测区间的预期长度及其覆盖率。重点是在点和区间预测过程中导出显式表达式。预测和预测是人们感兴趣的,例如体育分析,它在几个体育学科中越来越受到关注。以往的运动记录预测工作主要基于极值理论。所提出的统计预测方法示例性地应用于来自田径各个学科的数据,以及根据每次接收得分方案基于梦幻足球积分的美式足球数据。讨论的结果与基本假设和基本分布的选择。
{"title":"Statistical Prediction of Future Sports Records Based on Record Values","authors":"Christina Empacher, U. Kamps, G. Volovskiy","doi":"10.3390/stats6010008","DOIUrl":"https://doi.org/10.3390/stats6010008","url":null,"abstract":"Point prediction of future record values based on sequences of previous lower or upper records is considered by means of the method of maximum product of spacings, where the underlying distribution is assumed to be a power function distribution and a Pareto distribution, respectively. Moreover, exact and approximate prediction intervals are discussed and compared with regard to their expected lengths and their percentages of coverage. The focus is on deriving explicit expressions in the point and interval prediction procedures. Predictions and forecasts are of interest, e.g., in sports analytics, which is gaining more and more attention in several sports disciplines. Previous works on forecasting athletic records have mainly been based on extreme value theory. The presented statistical prediction methods are exemplarily applied to data from various disciplines of athletics as well as to data from American football based on fantasy football points according to the points per reception scoring scheme. The results are discussed along with basic assumptions and the choice of underlying distributions.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43503521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Change Point Detection by State Space Modeling of Long-Term Air Temperature Series in Europe 欧洲长期气温序列状态空间模型的变化点检测
Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-04 DOI: 10.3390/stats6010007
Magda Monteiro, M. Costa
This work presents the statistical analysis of a monthly average temperatures time series in several European cities using a state space approach, which considers models with a deterministic seasonal component and a stochastic trend. Temperature rise rates in Europe seem to have increased in the last decades when compared with longer periods. Therefore, change point detection methods, both parametric and non-parametric methods, were applied to the standardized residuals of the state space models (or some other related component) in order to identify these possible changes in the monthly temperature rise rates. All of the used methods have identified at least one change point in each of the temperature time series, particularly in the late 1980s or early 1990s. The differences in the average temperature trend are more evident in Eastern European cities than in Western Europe. The smoother-based t-test framework proposed in this work showed an advantage over the other methods, precisely because it considers the time correlation presented in time series. Moreover, this framework focuses the change point detection on the stochastic trend component.
这项工作使用状态空间方法对几个欧洲城市的月平均温度时间序列进行了统计分析,该方法考虑了具有确定性季节成分和随机趋势的模型。与较长时期相比,欧洲的气温上升率在过去几十年中似乎有所上升。因此,将变化点检测方法,包括参数和非参数方法,应用于状态空间模型(或一些其他相关组件)的标准化残差,以识别月温升率的这些可能变化。所有使用的方法都在每个温度时间序列中确定了至少一个变化点,特别是在20世纪80年代末或90年代初。平均气温趋势的差异在东欧城市比在西欧更明显。这项工作中提出的基于更平滑的t检验框架显示出优于其他方法的优势,正是因为它考虑了时间序列中呈现的时间相关性。此外,该框架将变化点检测的重点放在随机趋势分量上。
{"title":"Change Point Detection by State Space Modeling of Long-Term Air Temperature Series in Europe","authors":"Magda Monteiro, M. Costa","doi":"10.3390/stats6010007","DOIUrl":"https://doi.org/10.3390/stats6010007","url":null,"abstract":"This work presents the statistical analysis of a monthly average temperatures time series in several European cities using a state space approach, which considers models with a deterministic seasonal component and a stochastic trend. Temperature rise rates in Europe seem to have increased in the last decades when compared with longer periods. Therefore, change point detection methods, both parametric and non-parametric methods, were applied to the standardized residuals of the state space models (or some other related component) in order to identify these possible changes in the monthly temperature rise rates. All of the used methods have identified at least one change point in each of the temperature time series, particularly in the late 1980s or early 1990s. The differences in the average temperature trend are more evident in Eastern European cities than in Western Europe. The smoother-based t-test framework proposed in this work showed an advantage over the other methods, precisely because it considers the time correlation presented in time series. Moreover, this framework focuses the change point detection on the stochastic trend component.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43119824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An ϵ-Greedy Multiarmed Bandit Approach to Markov Decision Processes 一种ϵ-Greedy马尔可夫决策过程的多臂强盗方法
Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-01-01 DOI: 10.3390/stats6010006
Isa Muqattash, Jiaqiao Hu
We present REGA, a new adaptive-sampling-based algorithm for the control of finite-horizon Markov decision processes (MDPs) with very large state spaces and small action spaces. We apply a variant of the ϵ-greedy multiarmed bandit algorithm to each stage of the MDP in a recursive manner, thus computing an estimation of the “reward-to-go” value at each stage of the MDP. We provide a finite-time analysis of REGA. In particular, we provide a bound on the probability that the approximation error exceeds a given threshold, where the bound is given in terms of the number of samples collected at each stage of the MDP. We empirically compare REGA against another sampling-based algorithm called RASA by running simulations against the SysAdmin benchmark problem with 210 states. The results show that REGA and RASA achieved similar performance. Moreover, REGA and RASA empirically outperformed an implementation of the algorithm that uses the “original” ϵ-greedy algorithm that commonly appears in the literature.
我们提出了一种新的基于自适应采样的REGA算法,用于控制具有非常大状态空间和小动作空间的有限时域马尔可夫决策过程。我们以递归的方式将ε-贪婪多臂土匪算法的变体应用于MDP的每个阶段,从而计算MDP每个阶段的“奖励”值的估计。我们提供了REGA的有限时间分析。特别地,我们提供了近似误差超过给定阈值的概率的界,其中该界是根据在MDP的每个阶段收集的样本数量给出的。我们通过对210个状态的SysAdmin基准问题进行模拟,将REGA与另一种基于采样的算法RASA进行了实证比较。结果表明,REGA和RASA具有相似的性能。此外,REGA和RASA在经验上优于使用文献中常见的“原始”ε-贪婪算法的算法实现。
{"title":"An ϵ-Greedy Multiarmed Bandit Approach to Markov Decision Processes","authors":"Isa Muqattash, Jiaqiao Hu","doi":"10.3390/stats6010006","DOIUrl":"https://doi.org/10.3390/stats6010006","url":null,"abstract":"We present REGA, a new adaptive-sampling-based algorithm for the control of finite-horizon Markov decision processes (MDPs) with very large state spaces and small action spaces. We apply a variant of the ϵ-greedy multiarmed bandit algorithm to each stage of the MDP in a recursive manner, thus computing an estimation of the “reward-to-go” value at each stage of the MDP. We provide a finite-time analysis of REGA. In particular, we provide a bound on the probability that the approximation error exceeds a given threshold, where the bound is given in terms of the number of samples collected at each stage of the MDP. We empirically compare REGA against another sampling-based algorithm called RASA by running simulations against the SysAdmin benchmark problem with 210 states. The results show that REGA and RASA achieved similar performance. Moreover, REGA and RASA empirically outperformed an implementation of the algorithm that uses the “original” ϵ-greedy algorithm that commonly appears in the literature.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45930650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applying the Multilevel Approach in Estimation of Income Population Differences 多层次方法在收入人口差异估计中的应用
Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-12-29 DOI: 10.3390/stats6010005
V. Timiryanova, Dina Krasnoselskaya, N. Kuzminykh
Income inequality remains one of the most burning issues discussed in the world. The difficulty of the problem arises from its multiple manifestations at regional and local levels and unique patterns within countries. This paper employs a multilevel approach to identify factors that influence income and wage inequalities at regional and municipal scales in Russia. We carried out the study on data from 2017 municipalities of 75 Russian regions from 2015 to 2019. A Hierarchical Linear Model with Cross-Classified Random Effects (HLMHCM) allowed us to establish that most of the total variances in population income and average wages accounted for the regional scale. Our analysis revealed different variances of income per capita and average wage; we disclosed the reasons for these disparities. We also found a mixed relationship between income inequality and social transfers. These variables influence income growth but change the relationship between income and labour productivity. Our study underlined that the impacts of shares of employees in agriculture and manufacturing should be considered together with labour productivity in these industries.
收入不平等仍然是世界上讨论最激烈的问题之一。这一问题的困难在于其在区域和地方各级的多种表现以及各国内部的独特模式。本文采用多层次的方法来确定影响俄罗斯地区和城市规模的收入和工资不平等的因素。我们对2015年至2019年俄罗斯75个地区2017年市政当局的数据进行了研究。具有交叉分类随机效应的层次线性模型(HLMHCM)使我们能够确定人口收入和平均工资的大部分总方差占区域尺度。我们的分析揭示了人均收入和平均工资的不同差异;我们披露了造成这些差异的原因。我们还发现收入不平等和社会转移之间存在混合关系。这些变量影响收入增长,但改变了收入与劳动生产率之间的关系。我们的研究强调,应将农业和制造业雇员比例的影响与这些行业的劳动生产率一并考虑。
{"title":"Applying the Multilevel Approach in Estimation of Income Population Differences","authors":"V. Timiryanova, Dina Krasnoselskaya, N. Kuzminykh","doi":"10.3390/stats6010005","DOIUrl":"https://doi.org/10.3390/stats6010005","url":null,"abstract":"Income inequality remains one of the most burning issues discussed in the world. The difficulty of the problem arises from its multiple manifestations at regional and local levels and unique patterns within countries. This paper employs a multilevel approach to identify factors that influence income and wage inequalities at regional and municipal scales in Russia. We carried out the study on data from 2017 municipalities of 75 Russian regions from 2015 to 2019. A Hierarchical Linear Model with Cross-Classified Random Effects (HLMHCM) allowed us to establish that most of the total variances in population income and average wages accounted for the regional scale. Our analysis revealed different variances of income per capita and average wage; we disclosed the reasons for these disparities. We also found a mixed relationship between income inequality and social transfers. These variables influence income growth but change the relationship between income and labour productivity. Our study underlined that the impacts of shares of employees in agriculture and manufacturing should be considered together with labour productivity in these industries.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43768845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating Smoothness and Optimal Bandwidth for Probability Density Functions 估计平滑度和最优带宽的概率密度函数
Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-12-27 DOI: 10.3390/stats6010003
D. Politis, P. Tarassenko, V. Vasiliev
The properties of non-parametric kernel estimators for probability density function from two special classes are investigated. Each class is parametrized with distribution smoothness parameter. One of the classes was introduced by Rosenblatt, another one is introduced in this paper. For the case of the known smoothness parameter, the rates of mean square convergence of optimal (on the bandwidth) density estimators are found. For the case of unknown smoothness parameter, the estimation procedure of the parameter is developed and almost surely convergency is proved. The convergence rates in the almost sure sense of these estimators are obtained. Adaptive estimators of densities from the given class on the basis of the constructed smoothness parameter estimators are presented. It is shown in examples how parameters of the adaptive density estimation procedures can be chosen. Non-asymptotic and asymptotic properties of these estimators are investigated. Specifically, the upper bounds for the mean square error of the adaptive density estimators for a fixed sample size are found and their strong consistency is proved. The convergence of these estimators in the almost sure sense is established. Simulation results illustrate the realization of the asymptotic behavior when the sample size grows large.
研究了两类特殊类概率密度函数的非参数核估计的性质。每个类都使用分布平滑度参数进行参数化。Rosenblatt介绍了其中一个类,本文介绍了另一个类。对于已知光滑度参数的情况,找到了最优(带宽上)密度估计器的均方收敛率。对于光滑度参数未知的情况,给出了参数的估计过程,并几乎肯定地证明了其收敛性。得到了这些估计量在几乎确定意义上的收敛速度。在构造的光滑度参数估计量的基础上,给出了给定类密度的自适应估计量。举例说明了如何选择自适应密度估计过程的参数。研究了这些估计量的非渐近性质和渐近性质。具体地,给出了固定样本量下自适应密度估计的均方误差的上界,并证明了它们的强一致性。建立了这些估计量在几乎确定意义上的收敛性。仿真结果说明了当样本量变大时渐近行为的实现。
{"title":"Estimating Smoothness and Optimal Bandwidth for Probability Density Functions","authors":"D. Politis, P. Tarassenko, V. Vasiliev","doi":"10.3390/stats6010003","DOIUrl":"https://doi.org/10.3390/stats6010003","url":null,"abstract":"The properties of non-parametric kernel estimators for probability density function from two special classes are investigated. Each class is parametrized with distribution smoothness parameter. One of the classes was introduced by Rosenblatt, another one is introduced in this paper. For the case of the known smoothness parameter, the rates of mean square convergence of optimal (on the bandwidth) density estimators are found. For the case of unknown smoothness parameter, the estimation procedure of the parameter is developed and almost surely convergency is proved. The convergence rates in the almost sure sense of these estimators are obtained. Adaptive estimators of densities from the given class on the basis of the constructed smoothness parameter estimators are presented. It is shown in examples how parameters of the adaptive density estimation procedures can be chosen. Non-asymptotic and asymptotic properties of these estimators are investigated. Specifically, the upper bounds for the mean square error of the adaptive density estimators for a fixed sample size are found and their strong consistency is proved. The convergence of these estimators in the almost sure sense is established. Simulation results illustrate the realization of the asymptotic behavior when the sample size grows large.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48907693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Stats
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1