首页 > 最新文献

Statistica Neerlandica最新文献

英文 中文
On global robustness of an adversarial risk analysis solution 论对抗性风险分析解决方案的全局稳健性
IF 1.5 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-09-09 DOI: 10.1111/stan.12361
Jinming Yang, Chaitanya Joshi, Fabrizio Ruggeri
Adversarial Risk Analysis (ARA) can be a more realistic and practical alternative to traditional game theoretic or decision theoretic approaches for modeling strategic decision‐making in the presence of an adversary. ARA relies on quantifying the decision‐maker's (DM's) uncertainties about the adversary's strategic thinking, choices and utilities via probability distributions to identify the optimal solution for the DM. ARA solution will be sensitive to the choices of prior distributions used for modelling the expert beliefs. Yet, to date, no mathematical results to characterize the robustness of the ARA solution to the misspecification of one or more prior distributions exist. Prior elicitation is known to be challenging. We present the very first mathematical results on the global robustness of the ARA solution. We use the distorted band class of priors and establish the conditions under which an ordering on the ARA solution can be established when modelling the first‐price sealed‐bid auctions using the nonstrategic play and level‐ thinking solution concepts. We illustrate these results using numerical examples and discuss further areas of research.
与传统的博弈论或决策论方法相比,对抗性风险分析(ARA)是一种更现实、更实用的方法,可用于在有对手的情况下建立战略决策模型。对抗风险分析依赖于通过概率分布量化决策者(DM)对对手战略思维、选择和效用的不确定性,从而为决策者确定最优解。ARA 解决方案对用于模拟专家信念的先验分布的选择非常敏感。然而,迄今为止,还没有任何数学结果可以描述 ARA 解决方案对一个或多个先验分布的错误指定的鲁棒性。众所周知,先验激发具有挑战性。我们首次提出了 ARA 解决方案全局鲁棒性的数学结果。我们使用了扭曲带类先验,并建立了一些条件,在这些条件下,使用非战略博弈和水平思维解决方案概念对第一价格密封出价拍卖进行建模时,可以建立 ARA 解决方案的排序。我们用数字示例说明了这些结果,并讨论了进一步的研究领域。
{"title":"On global robustness of an adversarial risk analysis solution","authors":"Jinming Yang, Chaitanya Joshi, Fabrizio Ruggeri","doi":"10.1111/stan.12361","DOIUrl":"https://doi.org/10.1111/stan.12361","url":null,"abstract":"Adversarial Risk Analysis (ARA) can be a more realistic and practical alternative to traditional game theoretic or decision theoretic approaches for modeling strategic decision‐making in the presence of an adversary. ARA relies on quantifying the decision‐maker's (DM's) uncertainties about the adversary's strategic thinking, choices and utilities via probability distributions to identify the optimal solution for the DM. ARA solution will be sensitive to the choices of prior distributions used for modelling the expert beliefs. Yet, to date, no mathematical results to characterize the robustness of the ARA solution to the misspecification of one or more prior distributions exist. Prior elicitation is known to be challenging. We present the very first mathematical results on the global robustness of the ARA solution. We use the distorted band class of priors and establish the conditions under which an ordering on the ARA solution can be established when modelling the first‐price sealed‐bid auctions using the <jats:italic>nonstrategic play</jats:italic> and <jats:italic>level‐</jats:italic> thinking solution concepts. We illustrate these results using numerical examples and discuss further areas of research.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142199870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Heterogeneous dense subhypergraph detection 异构密集子超图检测
IF 1.5 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-09-03 DOI: 10.1111/stan.12360
Mingao Yuan, Zuofeng Shang
We study the problem of testing the existence of a heterogeneous dense subhypergraph. The null hypothesis corresponds to a heterogeneous Erdös–Rényi uniform random hypergraph and the alternative hypothesis corresponds to a heterogeneous uniform random hypergraph that contains a dense subhypergraph. We establish detection boundaries when the edge probabilities are known and construct an asymptotically powerful test for distinguishing the hypotheses. We also construct an adaptive test which does not involve edge probabilities, and hence, is more practically useful.
我们研究了检验异质密集子超图存在性的问题。零假设对应于异质埃尔德斯-雷尼均匀随机超图,备择假设对应于包含密集子超图的异质均匀随机超图。当边缘概率已知时,我们建立了检测边界,并构建了一个渐近强大的检验来区分假设。我们还构建了一种自适应检验,它不涉及边缘概率,因此更实用。
{"title":"Heterogeneous dense subhypergraph detection","authors":"Mingao Yuan, Zuofeng Shang","doi":"10.1111/stan.12360","DOIUrl":"https://doi.org/10.1111/stan.12360","url":null,"abstract":"We study the problem of testing the existence of a heterogeneous dense subhypergraph. The null hypothesis corresponds to a heterogeneous Erdös–Rényi uniform random hypergraph and the alternative hypothesis corresponds to a heterogeneous uniform random hypergraph that contains a dense subhypergraph. We establish detection boundaries when the edge probabilities are known and construct an asymptotically powerful test for distinguishing the hypotheses. We also construct an adaptive test which does not involve edge probabilities, and hence, is more practically useful.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142199871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
General adapted‐threshold monitoring in discrete environments and rules for imbalanced classes 离散环境中的一般自适应阈值监测和不平衡类规则
IF 1.5 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-08-22 DOI: 10.1111/stan.12352
Ansgar Steland, Ewaryst Rafajłowicz, Wojciech Rafajłowicz
Having in mind applications in statistics and machine learning such as individualized care monitoring, or watermark detection in large language models, we consider the following general setting: When monitoring a sequence of observations, , there may be additional information, , on the environment which should be used to design the monitoring procedure. This additional information can be incorporated by applying threshold functions to the standardized measurements to adapt the detector to the environment. For the case of categorical data encoding of discrete‐valued environmental information we study several classes of level threshold functions including a proportional one which favors rare events among imbalanced classes. For the latter rule asymptotic theory is developed for independent and identically distributed and dependent learning samples including data from new discrete autoregressive moving average model (NDARMA) series and Hidden Markov Models. Further, we propose two‐stage designs which allow to distribute in a controlled way the budget over an a priori partition of the sample space of . The approach is illustrated by a real medical data set.
考虑到统计学和机器学习中的应用,如个性化护理监控或大型语言模型中的水印检测,我们考虑了以下一般情况:在监控一系列观察结果时,可能会有关于环境的附加信息,这些信息应被用于设计监控程序。可以通过对标准化测量应用阈值函数来纳入这些附加信息,从而使检测器适应环境。对于离散值环境信息的分类数据编码情况,我们研究了几类水平阈值函数,包括在不平衡类别中偏好罕见事件的比例函数。对于后一种规则,我们开发了独立同分布和依赖学习样本的渐近理论,包括新离散自回归移动平均模型(NDARMA)序列和隐马尔可夫模型的数据。此外,我们还提出了两阶段设计方案,允许在样本空间的先验分区上以可控方式分配预算。 我们通过一个真实的医疗数据集对该方法进行了说明。
{"title":"General adapted‐threshold monitoring in discrete environments and rules for imbalanced classes","authors":"Ansgar Steland, Ewaryst Rafajłowicz, Wojciech Rafajłowicz","doi":"10.1111/stan.12352","DOIUrl":"https://doi.org/10.1111/stan.12352","url":null,"abstract":"Having in mind applications in statistics and machine learning such as individualized care monitoring, or watermark detection in large language models, we consider the following general setting: When monitoring a sequence of observations, , there may be additional information, , on the environment which should be used to design the monitoring procedure. This additional information can be incorporated by applying threshold functions to the standardized measurements to adapt the detector to the environment. For the case of categorical data encoding of discrete‐valued environmental information we study several classes of level threshold functions including a proportional one which favors rare events among imbalanced classes. For the latter rule asymptotic theory is developed for independent and identically distributed and dependent learning samples including data from new discrete autoregressive moving average model (NDARMA) series and Hidden Markov Models. Further, we propose two‐stage designs which allow to distribute in a controlled way the budget over an a priori partition of the sample space of . The approach is illustrated by a real medical data set.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142199872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VC‐PCR: A prediction method based on variable selection and clustering VC-PCR:基于变量选择和聚类的预测方法
IF 1.5 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-08-22 DOI: 10.1111/stan.12358
Rebecca Marion, Johannes Lederer, Bernadette Goevarts, Rainer von Sachs
Sparse linear prediction methods suffer from decreased prediction accuracy when the predictor variables have cluster structure (e.g., highly correlated groups of variables). To improve prediction accuracy, various methods have been proposed to identify variable clusters from the data and integrate cluster information into a sparse modeling process. But none of these methods achieve satisfactory performance for prediction, variable selection and variable clustering performed simultaneously. This paper presents Variable Cluster Principal Component Regression (VC‐PCR), a prediction method that uses variable selection and variable clustering in order to solve this problem. Experiments with real and simulated data demonstrate that, compared to competitor methods, VC‐PCR is the only method that achieves simultaneously good prediction, variable selection, and clustering performance when cluster structure is present.
当预测变量具有聚类结构(如高度相关的变量组)时,稀疏线性预测方法的预测精度就会下降。为了提高预测精度,人们提出了各种方法来从数据中识别变量聚类,并将聚类信息整合到稀疏建模过程中。但这些方法都无法同时实现令人满意的预测、变量选择和变量聚类效果。为了解决这个问题,本文提出了一种使用变量选择和变量聚类的预测方法--变量聚类主成分回归(VC-PCR)。使用真实数据和模拟数据进行的实验表明,与其他竞争方法相比,VC-PCR 是唯一一种在存在聚类结构的情况下同时实现良好预测、变量选择和聚类性能的方法。
{"title":"VC‐PCR: A prediction method based on variable selection and clustering","authors":"Rebecca Marion, Johannes Lederer, Bernadette Goevarts, Rainer von Sachs","doi":"10.1111/stan.12358","DOIUrl":"https://doi.org/10.1111/stan.12358","url":null,"abstract":"Sparse linear prediction methods suffer from decreased prediction accuracy when the predictor variables have cluster structure (e.g., highly correlated groups of variables). To improve prediction accuracy, various methods have been proposed to identify variable clusters from the data and integrate cluster information into a sparse modeling process. But none of these methods achieve satisfactory performance for prediction, variable selection and variable clustering performed simultaneously. This paper presents Variable Cluster Principal Component Regression (VC‐PCR), a prediction method that uses variable selection and variable clustering in order to solve this problem. Experiments with real and simulated data demonstrate that, compared to competitor methods, VC‐PCR is the only method that achieves simultaneously good prediction, variable selection, and clustering performance when cluster structure is present.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142199873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation of density functionals via cross‐validation 通过交叉验证估计密度函数
IF 1.5 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-08-01 DOI: 10.1111/stan.12359
José E. Chacón, Carlos Tenreiro
In density estimation, the mean integrated squared error (MISE) is commonly used as a measure of performance. In that setting, the cross‐validation criterion provides an unbiased estimator of the MISE minus the integral of the squared density. Since the minimum MISE is known to converge to zero, this suggests that the minimum value of the cross‐validation criterion could be regarded as an estimator of minus the integrated squared density. This novel proposal presents the outstanding feature that, unlike all other existing estimators, it does not need the choice of any tuning parameter. Indeed, it is proved here that this approach results in a consistent and efficient estimator, with remarkable performance in practice. Moreover, apart from this base case, it is shown how several other problems on density functional estimation can be similarly handled using this new principle, thus demonstrating full potential for further applications.
在密度估计中,通常使用平均综合平方误差(MISE)来衡量性能。在这种情况下,交叉验证准则提供了 MISE 减去密度平方积分的无偏估计值。由于已知最小 MISE 趋于零,这表明交叉验证准则的最小值可被视为减去平方密度积分的估计值。与其他所有现有估计器不同的是,这一新颖的建议具有无需选择任何调整参数的突出特点。事实上,本文证明了这种方法能产生一致且高效的估计器,并在实践中表现出卓越的性能。此外,除了这个基本案例,本文还展示了如何利用这一新原理同样处理其他几个关于密度函数估计的问题,从而充分展示了进一步应用的潜力。
{"title":"Estimation of density functionals via cross‐validation","authors":"José E. Chacón, Carlos Tenreiro","doi":"10.1111/stan.12359","DOIUrl":"https://doi.org/10.1111/stan.12359","url":null,"abstract":"In density estimation, the mean integrated squared error (MISE) is commonly used as a measure of performance. In that setting, the cross‐validation criterion provides an unbiased estimator of the MISE minus the integral of the squared density. Since the minimum MISE is known to converge to zero, this suggests that the minimum value of the cross‐validation criterion could be regarded as an estimator of minus the integrated squared density. This novel proposal presents the outstanding feature that, unlike all other existing estimators, it does not need the choice of any tuning parameter. Indeed, it is proved here that this approach results in a consistent and efficient estimator, with remarkable performance in practice. Moreover, apart from this base case, it is shown how several other problems on density functional estimation can be similarly handled using this new principle, thus demonstrating full potential for further applications.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141885829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial neural network small‐sample‐bias‐corrections of the AR(1) parameter close to unit root 人工神经网络对接近单位根的 AR(1) 参数进行小样本偏置校正
IF 1.5 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-08-01 DOI: 10.1111/stan.12354
Haozhe Jiang, Ostap Okhrin, Michael Rockinger
This paper introduces an artificial neural network (ANN) approach to estimate the autoregressive process AR(1) when the autocorrelation parameter is near one. Traditional ordinary least squares (OLS) estimators suffer from biases in small samples, necessitating various correction methods proposed in the literature. The ANN, trained on simulated data, outperforms these methods due to its nonlinear structure. Unlike competitors requiring simulations for bias corrections based on specific sample sizes, the ANN directly incorporates sample size as input, eliminating the need for repeated simulations. Stability tests involve exploring different ANN architectures and activation functions and robustness to varying distributions of the process innovations. Empirical applications on financial and industrial data highlight significant differences among methods, with ANN estimates suggesting lower persistence than other approaches.
本文介绍了一种人工神经网络(ANN)方法,用于估计自相关参数接近 1 时的自回归过程 AR(1)。传统的普通最小二乘法(OLS)估计器在小样本时存在偏差,因此需要采用文献中提出的各种修正方法。在模拟数据基础上训练的方差网络因其非线性结构而优于这些方法。与需要根据特定样本大小进行模拟以纠正偏差的竞争对手不同,方差网络直接将样本大小作为输入,无需重复模拟。稳定性测试包括探索不同的 ANN 架构和激活函数,以及对过程创新的不同分布的稳健性。金融和工业数据的实证应用凸显了各种方法之间的显著差异,其中方差网络估算的持久性低于其他方法。
{"title":"Artificial neural network small‐sample‐bias‐corrections of the AR(1) parameter close to unit root","authors":"Haozhe Jiang, Ostap Okhrin, Michael Rockinger","doi":"10.1111/stan.12354","DOIUrl":"https://doi.org/10.1111/stan.12354","url":null,"abstract":"This paper introduces an artificial neural network (ANN) approach to estimate the autoregressive process AR(1) when the autocorrelation parameter is near one. Traditional ordinary least squares (OLS) estimators suffer from biases in small samples, necessitating various correction methods proposed in the literature. The ANN, trained on simulated data, outperforms these methods due to its nonlinear structure. Unlike competitors requiring simulations for bias corrections based on specific sample sizes, the ANN directly incorporates sample size as input, eliminating the need for repeated simulations. Stability tests involve exploring different ANN architectures and activation functions and robustness to varying distributions of the process innovations. Empirical applications on financial and industrial data highlight significant differences among methods, with ANN estimates suggesting lower persistence than other approaches.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141885828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inference for Kumaraswamy‐G family of distributions under unified progressive hybrid censoring with partially observed competing risks data 库马拉斯瓦米-G 系列分布在统一渐进混合删减法下的推断,以及部分观察到的竞争风险数据
IF 1.5 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-07-25 DOI: 10.1111/stan.12357
Subhankar Dutta, Hon Keung Tony Ng, Suchandan Kayal
In this study, statistical inference for competing risks model with latent failure times following the Kumaraswamy‐G (Kw‐G) family of distributions under a unified progressive hybrid censoring (UPHC) scheme is developed. Maximum likelihood estimates (MLEs) of the unknown model parameters are obtained, and their existence and uniqueness properties are discussed. Using the asymptotic properties of MLEs, the approximate confidence intervals for the model parameters are constructed. Further, Bayes estimates with associated highest posterior density credible intervals for the model parameters are developed under squared error loss function with informative and noninformative priors. These estimates are obtained under both restricted and nonrestricted parameter spaces. Moreover, frequentist and Bayesian approaches are developed to test the equality of shape parameters of the two competing failure causes. The comparison of censoring schemes based on different criteria is also discussed. Monte Carlo simulation studies are used to evaluate the performance of the proposed statistical inference procedures. An electrical appliances data set is analyzed to illustrate the applicability of the proposed methodologies.
本研究在统一渐进混合删减(UPHC)方案下,建立了具有潜在失败时间的库马拉斯瓦米-G(Kw-G)族分布的竞争风险模型的统计推断。得到了未知模型参数的最大似然估计值(MLE),并讨论了它们的存在性和唯一性。利用 MLE 的渐近特性,构建了模型参数的近似置信区间。此外,在有信息和无信息先验的平方误差损失函数下,建立了模型参数的贝叶斯估计和相关最高后验密度可信区间。这些估计值是在受限和非受限参数空间下获得的。此外,还开发了频数法和贝叶斯法来检验两种相互竞争的故障原因的形状参数是否相等。此外,还讨论了基于不同标准的剔除方案的比较。蒙特卡罗模拟研究用于评估所提出的统计推断程序的性能。对一个电器数据集进行了分析,以说明建议方法的适用性。
{"title":"Inference for Kumaraswamy‐G family of distributions under unified progressive hybrid censoring with partially observed competing risks data","authors":"Subhankar Dutta, Hon Keung Tony Ng, Suchandan Kayal","doi":"10.1111/stan.12357","DOIUrl":"https://doi.org/10.1111/stan.12357","url":null,"abstract":"In this study, statistical inference for competing risks model with latent failure times following the Kumaraswamy‐G (Kw‐G) family of distributions under a unified progressive hybrid censoring (UPHC) scheme is developed. Maximum likelihood estimates (MLEs) of the unknown model parameters are obtained, and their existence and uniqueness properties are discussed. Using the asymptotic properties of MLEs, the approximate confidence intervals for the model parameters are constructed. Further, Bayes estimates with associated highest posterior density credible intervals for the model parameters are developed under squared error loss function with informative and noninformative priors. These estimates are obtained under both restricted and nonrestricted parameter spaces. Moreover, frequentist and Bayesian approaches are developed to test the equality of shape parameters of the two competing failure causes. The comparison of censoring schemes based on different criteria is also discussed. Monte Carlo simulation studies are used to evaluate the performance of the proposed statistical inference procedures. An electrical appliances data set is analyzed to illustrate the applicability of the proposed methodologies.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computing optimal allocation of trials to subregions in crop‐variety testing in case of correlated genotype effects 在基因型效应相关的情况下,计算作物品种测试中试验在次区域的最佳分配方案
IF 1.5 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-07-25 DOI: 10.1111/stan.12353
Maryna Prus
The subject of this work is the allocation of trials to subregions in crop variety testing in the case of correlated genotype effects. A solution for computation of optimal allocations using the OptimalDesign package in R is proposed. The obtained optimal designs minimize linear criteria based on the mean squared error matrix of the best linear unbiased prediction of the genotype effects. The proposed computational approach allows for any kind of additional linear constraint on the designs. The results are illustrated by a real data example.
这项工作的主题是在基因型效应相关的情况下,如何在作物品种测试中将试验分配到子区域。本文提出了一种使用 R 软件包 OptimalDesign 计算最优分配的解决方案。根据基因型效应的最佳线性无偏预测的均方误差矩阵,所获得的最优设计能最大限度地降低线性标准。所提出的计算方法允许对设计进行任何额外的线性约束。一个真实数据实例对结果进行了说明。
{"title":"Computing optimal allocation of trials to subregions in crop‐variety testing in case of correlated genotype effects","authors":"Maryna Prus","doi":"10.1111/stan.12353","DOIUrl":"https://doi.org/10.1111/stan.12353","url":null,"abstract":"The subject of this work is the allocation of trials to subregions in crop variety testing in the case of correlated genotype effects. A solution for computation of optimal allocations using the OptimalDesign package in R is proposed. The obtained optimal designs minimize linear criteria based on the mean squared error matrix of the best linear unbiased prediction of the genotype effects. The proposed computational approach allows for any kind of additional linear constraint on the designs. The results are illustrated by a real data example.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Degree distributions in networks: Beyond the power law 网络中的学位分布:超越幂律
IF 1.5 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-07-23 DOI: 10.1111/stan.12355
Clement Lee, Emma F. Eastoe, Aiden Farrell
The power law is useful in describing count phenomena such as network degrees and word frequencies. With a single parameter, it captures the main feature that the frequencies are linear on the log‐log scale. Nevertheless, there have been criticisms of the power law, for example, that a threshold needs to be preselected without its uncertainty quantified, that the power law is simply inadequate, and that subsequent hypothesis tests are required to determine whether the data could have come from the power law. We propose a modeling framework that combines two different generalizations of the power law, namely the generalized Pareto distribution and the Zipf‐polylog distribution, to resolve these issues. The proposed mixture distributions are shown to fit the data well and quantify the threshold uncertainty in a natural way. A model selection step embedded in the Bayesian inference algorithm further answers the question whether the power law is adequate.
幂律适用于描述网络度和词频等计数现象。只需一个参数,它就能捕捉到频率在对数尺度上呈线性的主要特征。然而,也有人对幂律提出了批评,例如,需要预先选择一个阈值,而不对其不确定性进行量化;幂律根本不够充分;需要进行后续的假设检验来确定数据是否来自幂律。为了解决这些问题,我们提出了一个建模框架,将幂律的两种不同概括(即广义帕累托分布和 Zipf-Polylog 分布)结合起来。结果表明,所提出的混合分布能够很好地拟合数据,并以自然的方式量化阈值的不确定性。贝叶斯推理算法中的模型选择步骤进一步回答了幂律是否合适的问题。
{"title":"Degree distributions in networks: Beyond the power law","authors":"Clement Lee, Emma F. Eastoe, Aiden Farrell","doi":"10.1111/stan.12355","DOIUrl":"https://doi.org/10.1111/stan.12355","url":null,"abstract":"The power law is useful in describing count phenomena such as network degrees and word frequencies. With a single parameter, it captures the main feature that the frequencies are linear on the log‐log scale. Nevertheless, there have been criticisms of the power law, for example, that a threshold needs to be preselected without its uncertainty quantified, that the power law is simply inadequate, and that subsequent hypothesis tests are required to determine whether the data could have come from the power law. We propose a modeling framework that combines two different generalizations of the power law, namely the generalized Pareto distribution and the Zipf‐polylog distribution, to resolve these issues. The proposed mixture distributions are shown to fit the data well and quantify the threshold uncertainty in a natural way. A model selection step embedded in the Bayesian inference algorithm further answers the question whether the power law is adequate.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A note on convergence of calibration weights to inverse probability weights 关于校准权重向逆概率权重收敛的说明
IF 1.5 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-07-20 DOI: 10.1111/stan.12356
Tadayoshi Fushiki
Recently, nonresponse rates in sample surveys have been increasing. Nonresponse bias is a serious concern in the analysis of sample surveys. The calibration and propensity score methods are used to adjust nonresponse bias. The propensity score method uses the weights of the inverse probability of response. The inverse probability of response is estimated by the auxiliary variables observed in respondents and nonrespondents. The calibration method can use additional auxiliary variables observed only in respondents if the population distributions of the variables are known. The calibration method is widely used; however, the theoretical property in the nonresponse situation has not been investigated. This study provides a condition that the calibration weights asymptotically go to the inverse probability of response and clarifies the relationship between the calibration and propensity score methods.
近来,抽样调查中的无应答率不断上升。非响应偏差是抽样调查分析中的一个严重问题。校准法和倾向得分法可用于调整非响应偏差。倾向得分法使用响应的反概率权重。反向响应概率是通过在受访者和非受访者中观察到的辅助变量估算得出的。如果变量的总体分布已知,校准法可使用仅在受访者中观察到的额外辅助变量。校准法得到了广泛应用,但其在非响应情况下的理论属性尚未得到研究。本研究提供了一个条件,即校准权重近似于响应概率的倒数,并阐明了校准法与倾向得分法之间的关系。
{"title":"A note on convergence of calibration weights to inverse probability weights","authors":"Tadayoshi Fushiki","doi":"10.1111/stan.12356","DOIUrl":"https://doi.org/10.1111/stan.12356","url":null,"abstract":"Recently, nonresponse rates in sample surveys have been increasing. Nonresponse bias is a serious concern in the analysis of sample surveys. The calibration and propensity score methods are used to adjust nonresponse bias. The propensity score method uses the weights of the inverse probability of response. The inverse probability of response is estimated by the auxiliary variables observed in respondents and nonrespondents. The calibration method can use additional auxiliary variables observed only in respondents if the population distributions of the variables are known. The calibration method is widely used; however, the theoretical property in the nonresponse situation has not been investigated. This study provides a condition that the calibration weights asymptotically go to the inverse probability of response and clarifies the relationship between the calibration and propensity score methods.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141740612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistica Neerlandica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1