首页 > 最新文献

Statistics and Computing最新文献

英文 中文
Fused lasso nearly-isotonic signal approximation in general dimensions 一般维度下的融合套索近似等调信号近似法
IF 2.2 2区 数学 Q1 Mathematics Pub Date : 2024-05-22 DOI: 10.1007/s11222-024-10432-6
Vladimir Pastukhov

In this paper, we introduce and study fused lasso nearly-isotonic signal approximation, which is a combination of fused lasso and generalized nearly-isotonic regression. We show how these three estimators relate to each other and derive solution to a general problem. Our estimator is computationally feasible and provides a trade-off between monotonicity, block sparsity, and goodness-of-fit. Next, we prove that fusion and near-isotonisation in a one-dimensional case can be applied interchangably, and this step-wise procedure gives the solution to the original optimization problem. This property of the estimator is very important, because it provides a direct way to construct a path solution when one of the penalization parameters is fixed. Also, we derive an unbiased estimator of degrees of freedom of the estimator.

本文介绍并研究了融合套索近等信号近似法,它是融合套索和广义近等回归的结合。我们展示了这三种估计器之间的关系,并推导出一般问题的解决方案。我们的估计器在计算上是可行的,并能在单调性、块稀疏性和拟合度之间进行权衡。接下来,我们证明了一维情况下的融合和近等子化可以互换应用,并且这种分步过程给出了原始优化问题的解决方案。估计器的这一特性非常重要,因为当其中一个惩罚参数固定时,它提供了构建路径解的直接方法。此外,我们还推导出了估计器自由度的无偏估计器。
{"title":"Fused lasso nearly-isotonic signal approximation in general dimensions","authors":"Vladimir Pastukhov","doi":"10.1007/s11222-024-10432-6","DOIUrl":"https://doi.org/10.1007/s11222-024-10432-6","url":null,"abstract":"<p>In this paper, we introduce and study fused lasso nearly-isotonic signal approximation, which is a combination of fused lasso and generalized nearly-isotonic regression. We show how these three estimators relate to each other and derive solution to a general problem. Our estimator is computationally feasible and provides a trade-off between monotonicity, block sparsity, and goodness-of-fit. Next, we prove that fusion and near-isotonisation in a one-dimensional case can be applied interchangably, and this step-wise procedure gives the solution to the original optimization problem. This property of the estimator is very important, because it provides a direct way to construct a path solution when one of the penalization parameters is fixed. Also, we derive an unbiased estimator of degrees of freedom of the estimator.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141150213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust multipe imputation with GAM 使用 GAM 进行稳健的多重估算
IF 2.2 2区 数学 Q1 Mathematics Pub Date : 2024-05-22 DOI: 10.1007/s11222-024-10429-1
Matthias Templ
{"title":"Robust multipe imputation with GAM","authors":"Matthias Templ","doi":"10.1007/s11222-024-10429-1","DOIUrl":"https://doi.org/10.1007/s11222-024-10429-1","url":null,"abstract":"","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141112906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian cross-validation by parallel Markov chain Monte Carlo 通过并行马尔科夫链蒙特卡罗进行贝叶斯交叉验证
IF 2.2 2区 数学 Q1 Mathematics Pub Date : 2024-05-21 DOI: 10.1007/s11222-024-10404-w
Alex Cooper, Aki Vehtari, Catherine Forbes, Dan Simpson, Lauren Kennedy

Brute force cross-validation (CV) is a method for predictive assessment and model selection that is general and applicable to a wide range of Bayesian models. Naive or ‘brute force’ CV approaches are often too computationally costly for interactive modeling workflows, especially when inference relies on Markov chain Monte Carlo (MCMC). We propose overcoming this limitation using massively parallel MCMC. Using accelerator hardware such as graphics processor units, our approach can be about as fast (in wall clock time) as a single full-data model fit. Parallel CV is flexible because it can easily exploit a wide range data partitioning schemes, such as those designed for non-exchangeable data. It can also accommodate a range of scoring rules. We propose MCMC diagnostics, including a summary of MCMC mixing based on the popular potential scale reduction factor ((widehat{textrm{R}})) and MCMC effective sample size ((widehat{textrm{ESS}})) measures. We also describe a method for determining whether an (widehat{textrm{R}}) diagnostic indicates approximate stationarity of the chains, that may be of more general interest for applications beyond parallel CV. Finally, we show that parallel CV and its diagnostics can be implemented with online algorithms, allowing parallel CV to scale up to very large blocking designs on memory-constrained computing accelerators.

强制交叉验证(CV)是一种用于预测评估和模型选择的方法,具有通用性,适用于各种贝叶斯模型。对于交互式建模工作流来说,天真或 "蛮力 "交叉验证方法的计算成本往往过高,尤其是当推理依赖于马尔可夫链蒙特卡罗(MCMC)时。我们建议使用大规模并行 MCMC 来克服这一限制。利用图形处理器等加速器硬件,我们的方法可以达到与单个全数据模型拟合同样快的速度(按挂钟时间计算)。并行 CV 非常灵活,因为它可以轻松利用各种数据分区方案,例如为不可交换数据设计的方案。它还能适应一系列评分规则。我们提出了 MCMC 诊断方法,包括基于流行的潜在规模缩减因子((widehat{textrm{R}})和 MCMC 有效样本大小((widehat{textrm{ESS}}))测量方法的 MCMC 混合总结。我们还描述了一种方法,用于确定((widehat{textrm{R}})诊断是否表明链的近似静止性,这可能对并行 CV 以外的应用具有更普遍的意义。最后,我们展示了并行 CV 及其诊断可以通过在线算法实现,从而允许并行 CV 在内存受限的计算加速器上扩展到非常大的阻塞设计。
{"title":"Bayesian cross-validation by parallel Markov chain Monte Carlo","authors":"Alex Cooper, Aki Vehtari, Catherine Forbes, Dan Simpson, Lauren Kennedy","doi":"10.1007/s11222-024-10404-w","DOIUrl":"https://doi.org/10.1007/s11222-024-10404-w","url":null,"abstract":"<p>Brute force cross-validation (CV) is a method for predictive assessment and model selection that is general and applicable to a wide range of Bayesian models. Naive or ‘brute force’ CV approaches are often too computationally costly for interactive modeling workflows, especially when inference relies on Markov chain Monte Carlo (MCMC). We propose overcoming this limitation using massively parallel MCMC. Using accelerator hardware such as graphics processor units, our approach can be about as fast (in wall clock time) as a single full-data model fit. Parallel CV is flexible because it can easily exploit a wide range data partitioning schemes, such as those designed for non-exchangeable data. It can also accommodate a range of scoring rules. We propose MCMC diagnostics, including a summary of MCMC mixing based on the popular potential scale reduction factor (<span>(widehat{textrm{R}})</span>) and MCMC effective sample size (<span>(widehat{textrm{ESS}})</span>) measures. We also describe a method for determining whether an <span>(widehat{textrm{R}})</span> diagnostic indicates approximate stationarity of the chains, that may be of more general interest for applications beyond parallel CV. Finally, we show that parallel CV and its diagnostics can be implemented with online algorithms, allowing parallel CV to scale up to very large blocking designs on memory-constrained computing accelerators.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141150196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spike and slab Bayesian sparse principal component analysis 尖峰和板块贝叶斯稀疏主成分分析
IF 2.2 2区 数学 Q1 Mathematics Pub Date : 2024-05-13 DOI: 10.1007/s11222-024-10430-8
Yu-Chien Bo Ning, Ning Ning

Sparse principal component analysis (SPCA) is a popular tool for dimensionality reduction in high-dimensional data. However, there is still a lack of theoretically justified Bayesian SPCA methods that can scale well computationally. One of the major challenges in Bayesian SPCA is selecting an appropriate prior for the loadings matrix, considering that principal components are mutually orthogonal. We propose a novel parameter-expanded coordinate ascent variational inference (PX-CAVI) algorithm. This algorithm utilizes a spike and slab prior, which incorporates parameter expansion to cope with the orthogonality constraint. Besides comparing to two popular SPCA approaches, we introduce the PX-EM algorithm as an EM analogue to the PX-CAVI algorithm for comparison. Through extensive numerical simulations, we demonstrate that the PX-CAVI algorithm outperforms these SPCA approaches, showcasing its superiority in terms of performance. We study the posterior contraction rate of the variational posterior, providing a novel contribution to the existing literature. The PX-CAVI algorithm is then applied to study a lung cancer gene expression dataset. The (textsf{R}) package (textsf{VBsparsePCA}) with an implementation of the algorithm is available on the Comprehensive R Archive Network (CRAN).

稀疏主成分分析(SPCA)是一种常用的高维数据降维工具。然而,目前仍缺乏理论上合理、计算上可扩展的贝叶斯 SPCA 方法。考虑到主成分是相互正交的,贝叶斯 SPCA 的主要挑战之一是为载荷矩阵选择一个合适的先验值。我们提出了一种新颖的参数扩展坐标上升变异推理(PX-CAVI)算法。该算法利用尖峰和板块先验,结合参数扩展来应对正交约束。除了与两种流行的 SPCA 方法进行比较外,我们还引入了 PX-EM 算法作为 PX-CAVI 算法的 EM 类似算法进行比较。通过大量的数值模拟,我们证明了 PX-CAVI 算法的性能优于这些 SPCA 方法,展示了其在性能方面的优势。我们研究了变分后验的后验收缩率,为现有文献做出了新的贡献。然后,我们将 PX-CAVI 算法应用于研究肺癌基因表达数据集。带有该算法实现的 (textsf{R}) 软件包 (textsf{VBsparsePCA}) 可在综合 R 档案网络(CRAN)上获取。
{"title":"Spike and slab Bayesian sparse principal component analysis","authors":"Yu-Chien Bo Ning, Ning Ning","doi":"10.1007/s11222-024-10430-8","DOIUrl":"https://doi.org/10.1007/s11222-024-10430-8","url":null,"abstract":"<p>Sparse principal component analysis (SPCA) is a popular tool for dimensionality reduction in high-dimensional data. However, there is still a lack of theoretically justified Bayesian SPCA methods that can scale well computationally. One of the major challenges in Bayesian SPCA is selecting an appropriate prior for the loadings matrix, considering that principal components are mutually orthogonal. We propose a novel parameter-expanded coordinate ascent variational inference (PX-CAVI) algorithm. This algorithm utilizes a spike and slab prior, which incorporates parameter expansion to cope with the orthogonality constraint. Besides comparing to two popular SPCA approaches, we introduce the PX-EM algorithm as an EM analogue to the PX-CAVI algorithm for comparison. Through extensive numerical simulations, we demonstrate that the PX-CAVI algorithm outperforms these SPCA approaches, showcasing its superiority in terms of performance. We study the posterior contraction rate of the variational posterior, providing a novel contribution to the existing literature. The PX-CAVI algorithm is then applied to study a lung cancer gene expression dataset. The <span>(textsf{R})</span> package <span>(textsf{VBsparsePCA})</span> with an implementation of the algorithm is available on the Comprehensive R Archive Network (CRAN).</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140941321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A general model-checking procedure for semiparametric accelerated failure time models 半参数加速故障时间模型的一般模型检查程序
IF 2.2 2区 数学 Q1 Mathematics Pub Date : 2024-05-07 DOI: 10.1007/s11222-024-10431-7
Dongrak Choi, Woojung Bae, Jun Yan, Sangwook Kang

We propose a set of goodness-of-fit tests for the semiparametric accelerated failure time (AFT) model, including an omnibus test, a link function test, and a functional form test. This set of tests is derived from a multi-parameter cumulative sum process shown to follow asymptotically a zero-mean Gaussian process. Its evaluation is based on the asymptotically equivalent perturbed version, which enables both graphical and numerical evaluations of the assumed AFT model. Empirical p-values are obtained using the Kolmogorov-type supremum test, which provides a reliable approach for estimating the significance of both proposed un-standardized and standardized test statistics. The proposed procedure is illustrated using the rank-based estimator but is general in the sense that it is directly applicable to some other popular estimators such as induced smoothed rank-based estimator or least-squares estimator that satisfies certain properties. Our proposed methods are rigorously evaluated using extensive simulation experiments that demonstrate their effectiveness in maintaining a Type I error rate and detecting departures from the assumed AFT model in practical sample sizes and censoring rates. Furthermore, the proposed approach is applied to the analysis of the Primary Biliary Cirrhosis data, a widely studied dataset in survival analysis, providing further evidence of the practical usefulness of the proposed methods in real-world scenarios. To make the proposed methods more accessible to researchers, we have implemented them in the R package afttest, which is publicly available on the Comprehensive R Archive Network.

我们为半参数加速失效时间(AFT)模型提出了一套拟合优度检验,包括总括检验、链接函数检验和函数形式检验。这组检验来自一个多参数累积和过程,该过程在渐近上遵循零均值高斯过程。它的评估基于渐近等效的扰动版本,可以对假定的 AFT 模型进行图形和数值评估。使用 Kolmogorov 型 supremum 检验获得经验 p 值,该检验为估计拟议的非标准化和标准化检验统计量的显著性提供了一种可靠的方法。我们使用基于秩的估计器对所提出的程序进行了说明,但该程序具有通用性,可直接适用于其他一些流行的估计器,如满足某些属性的诱导平滑秩估计器或最小二乘估计器。我们通过大量的模拟实验对所提出的方法进行了严格评估,证明了这些方法在保持 I 类错误率以及检测实际样本量和删减率偏离假定 AFT 模型方面的有效性。此外,我们还将所提出的方法应用于原发性胆汁性肝硬化数据的分析,这是一个在生存分析中被广泛研究的数据集,进一步证明了所提出的方法在现实世界中的实用性。为了让研究人员更容易使用所提出的方法,我们在 R 软件包 afttest 中实现了这些方法,该软件包可在 R 综合存档网络上公开获取。
{"title":"A general model-checking procedure for semiparametric accelerated failure time models","authors":"Dongrak Choi, Woojung Bae, Jun Yan, Sangwook Kang","doi":"10.1007/s11222-024-10431-7","DOIUrl":"https://doi.org/10.1007/s11222-024-10431-7","url":null,"abstract":"<p>We propose a set of goodness-of-fit tests for the semiparametric accelerated failure time (AFT) model, including an omnibus test, a link function test, and a functional form test. This set of tests is derived from a multi-parameter cumulative sum process shown to follow asymptotically a zero-mean Gaussian process. Its evaluation is based on the asymptotically equivalent perturbed version, which enables both graphical and numerical evaluations of the assumed AFT model. Empirical <i>p</i>-values are obtained using the Kolmogorov-type supremum test, which provides a reliable approach for estimating the significance of both proposed un-standardized and standardized test statistics. The proposed procedure is illustrated using the rank-based estimator but is general in the sense that it is directly applicable to some other popular estimators such as induced smoothed rank-based estimator or least-squares estimator that satisfies certain properties. Our proposed methods are rigorously evaluated using extensive simulation experiments that demonstrate their effectiveness in maintaining a Type I error rate and detecting departures from the assumed AFT model in practical sample sizes and censoring rates. Furthermore, the proposed approach is applied to the analysis of the Primary Biliary Cirrhosis data, a widely studied dataset in survival analysis, providing further evidence of the practical usefulness of the proposed methods in real-world scenarios. To make the proposed methods more accessible to researchers, we have implemented them in the <b>R</b> package <b>afttest</b>, which is publicly available on the Comprehensive R Archive Network.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140882172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A flexible Bayesian tool for CoDa mixed models: logistic-normal distribution with Dirichlet covariance 用于 CoDa 混合模型的灵活贝叶斯工具:具有 Dirichlet 协方差的逻辑正态分布
IF 2.2 2区 数学 Q1 Mathematics Pub Date : 2024-04-16 DOI: 10.1007/s11222-024-10427-3
Joaquín Martínez-Minaya, Haavard Rue

Compositional Data Analysis (CoDa) has gained popularity in recent years. This type of data consists of values from disjoint categories that sum up to a constant. Both Dirichlet regression and logistic-normal regression have become popular as CoDa analysis methods. However, fitting this kind of multivariate models presents challenges, especially when structured random effects are included in the model, such as temporal or spatial effects. To overcome these challenges, we propose the logistic-normal Dirichlet Model (LNDM). We seamlessly incorporate this approach into the R-INLA package, facilitating model fitting and model prediction within the framework of Latent Gaussian Models. Moreover, we explore metrics like Deviance Information Criteria, Watanabe Akaike information criterion, and cross-validation measure conditional predictive ordinate for model selection in R-INLA for CoDa. Illustrating LNDM through two simulated examples and with an ecological case study on Arabidopsis thaliana in the Iberian Peninsula, we underscore its potential as an effective tool for managing CoDa and large CoDa databases.

合成数据分析(CoDa)近年来越来越受欢迎。这类数据由来自不同类别的数值组成,这些数值的总和为一个常数。作为 CoDa 分析方法,Dirichlet 回归和 logistic-normal 回归都很流行。然而,拟合这类多元模型面临着挑战,尤其是当模型中包含结构随机效应(如时间或空间效应)时。为了克服这些挑战,我们提出了逻辑正态 Dirichlet 模型(LNDM)。我们将这一方法无缝集成到 R-INLA 软件包中,在潜在高斯模型框架内促进模型拟合和模型预测。此外,我们还探索了 Deviance Information Criteria、Watanabe Akaike Information criterion 和交叉验证测量条件预测序数等指标,用于在 R-INLA 中为 CoDa 选择模型。我们通过两个模拟示例和伊比利亚半岛拟南芥的生态案例研究来说明 LNDM,强调它作为管理 CoDa 和大型 CoDa 数据库的有效工具的潜力。
{"title":"A flexible Bayesian tool for CoDa mixed models: logistic-normal distribution with Dirichlet covariance","authors":"Joaquín Martínez-Minaya, Haavard Rue","doi":"10.1007/s11222-024-10427-3","DOIUrl":"https://doi.org/10.1007/s11222-024-10427-3","url":null,"abstract":"<p>Compositional Data Analysis (CoDa) has gained popularity in recent years. This type of data consists of values from disjoint categories that sum up to a constant. Both Dirichlet regression and logistic-normal regression have become popular as CoDa analysis methods. However, fitting this kind of multivariate models presents challenges, especially when structured random effects are included in the model, such as temporal or spatial effects. To overcome these challenges, we propose the logistic-normal Dirichlet Model (LNDM). We seamlessly incorporate this approach into the <b>R-INLA</b> package, facilitating model fitting and model prediction within the framework of Latent Gaussian Models. Moreover, we explore metrics like Deviance Information Criteria, Watanabe Akaike information criterion, and cross-validation measure conditional predictive ordinate for model selection in <b>R-INLA</b> for CoDa. Illustrating LNDM through two simulated examples and with an ecological case study on <i>Arabidopsis thaliana</i> in the Iberian Peninsula, we underscore its potential as an effective tool for managing CoDa and large CoDa databases.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140613645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A communication-efficient, online changepoint detection method for monitoring distributed sensor networks 用于监测分布式传感器网络的通信效率高的在线变化点检测方法
IF 2.2 2区 数学 Q1 Mathematics Pub Date : 2024-04-14 DOI: 10.1007/s11222-024-10428-2
Ziyang Yang, Idris A. Eckley, Paul Fearnhead

We consider the challenge of efficiently detecting changes within a network of sensors, where we also need to minimise communication between sensors and the cloud. We propose an online, communication-efficient method to detect such changes. The procedure works by performing likelihood ratio tests at each time point, and two thresholds are chosen to filter unimportant test statistics and make decisions based on the aggregated test statistics respectively. We provide asymptotic theory concerning consistency and the asymptotic distribution if there are no changes. Simulation results suggest that our method can achieve similar performance to the idealised setting, where we have no constraints on communication between sensors, but substantially reduce the transmission costs.

我们考虑了在传感器网络内有效检测变化的挑战,在这种情况下,我们还需要尽量减少传感器与云之间的通信。我们提出了一种在线、通信效率高的方法来检测此类变化。该方法在每个时间点进行似然比检验,并选择两个阈值分别用于过滤不重要的检验统计量和根据综合检验统计量做出决策。我们提供了有关一致性的渐近理论以及没有变化时的渐近分布。仿真结果表明,我们的方法可以达到与理想化设置类似的性能,在理想化设置中,我们对传感器之间的通信没有任何限制,但却大大降低了传输成本。
{"title":"A communication-efficient, online changepoint detection method for monitoring distributed sensor networks","authors":"Ziyang Yang, Idris A. Eckley, Paul Fearnhead","doi":"10.1007/s11222-024-10428-2","DOIUrl":"https://doi.org/10.1007/s11222-024-10428-2","url":null,"abstract":"<p>We consider the challenge of efficiently detecting changes within a network of sensors, where we also need to minimise communication between sensors and the cloud. We propose an online, communication-efficient method to detect such changes. The procedure works by performing likelihood ratio tests at each time point, and two thresholds are chosen to filter unimportant test statistics and make decisions based on the aggregated test statistics respectively. We provide asymptotic theory concerning consistency and the asymptotic distribution if there are no changes. Simulation results suggest that our method can achieve similar performance to the idealised setting, where we have no constraints on communication between sensors, but substantially reduce the transmission costs.\u0000</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140573112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parsimonious consensus hierarchies, partitions and fuzzy partitioning of a set of hierarchies 一组层次结构的准共识层次结构、分区和模糊分区
IF 2.2 2区 数学 Q1 Mathematics Pub Date : 2024-04-12 DOI: 10.1007/s11222-024-10415-7
Ilaria Bombelli, Maurizio Vichi

Methodology is described for fitting a fuzzy partition and a parsimonious consensus hierarchy (ultrametric matrix) to a set of hierarchies of the same set of objects. A model defining a fuzzy partition of a set of hierarchical classifications, with every class of the partition synthesized by a parsimonious consensus hierarchy is described. Each consensus includes an optimal consensus hard partition of objects and all the hierarchical agglomerative aggregations among the clusters of the consensus partition. The performances of the methodology are illustrated by an extended simulation study and applications to real data. A discussion is provided on the new methodology and some interesting future developments are described.

介绍了将模糊分区和准共识层次结构(超对称矩阵)拟合到同一组对象的层次结构中的方法。描述了一个模型,该模型定义了一组分级分类的模糊分区,分区中的每个类都由一个准共识层次结构合成。每个共识包括对象的最佳共识硬分区和共识分区各簇之间的所有分层聚合。该方法的性能通过扩展的模拟研究和真实数据的应用得到了说明。此外,还对新方法进行了讨论,并介绍了一些有趣的未来发展。
{"title":"Parsimonious consensus hierarchies, partitions and fuzzy partitioning of a set of hierarchies","authors":"Ilaria Bombelli, Maurizio Vichi","doi":"10.1007/s11222-024-10415-7","DOIUrl":"https://doi.org/10.1007/s11222-024-10415-7","url":null,"abstract":"<p>Methodology is described for fitting a fuzzy partition and a parsimonious consensus hierarchy (ultrametric matrix) to a set of hierarchies of the same set of objects. A model defining a fuzzy partition of a set of hierarchical classifications, with every class of the partition synthesized by a parsimonious consensus hierarchy is described. Each consensus includes an optimal consensus hard partition of objects and all the hierarchical agglomerative aggregations among the clusters of the consensus partition. The performances of the methodology are illustrated by an extended simulation study and applications to real data. A discussion is provided on the new methodology and some interesting future developments are described.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140573123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reversed particle filtering for hidden markov models 隐马尔可夫模型的反向粒子滤波
IF 2.2 2区 数学 Q1 Mathematics Pub Date : 2024-04-08 DOI: 10.1007/s11222-024-10426-4
Frank Rotiroti, Stephen G. Walker

We present an approach to selecting the distributions in sampling-resampling which improves the efficiency of the weighted bootstrap. To complement the standard scheme of sampling from the prior and reweighting with the likelihood, we introduce a reversed scheme, which samples from the (normalized) likelihood and reweights with the prior. We begin with some motivating examples, before developing the relevant theory. We then apply the approach to the particle filtering of time series, including nonlinear and non-Gaussian Bayesian state-space models, a task that demands efficiency, given the repeated application of the weighted bootstrap. Through simulation studies on a normal dynamic linear model, Poisson hidden Markov model, and stochastic volatility model, we demonstrate the gains in efficiency obtained by the approach, involving the choice of the standard or reversed filter. In addition, for the stochastic volatility model, we provide three real-data examples, including a comparison with importance sampling methods that attempt to incorporate information about the data indirectly into the standard filtering scheme and an extension to multivariate models. We determine that the reversed filtering scheme offers an advantage over such auxiliary methods owing to its ability to incorporate information about the data directly into the sampling, an ability that further facilitates its performance in higher-dimensional settings.

我们提出了一种在采样-再采样中选择分布的方法,它提高了加权自举法的效率。为了补充从先验值取样并用似然值重新加权的标准方案,我们引入了一种相反的方案,即从(归一化)似然值取样并用先验值重新加权。在发展相关理论之前,我们先举一些激励性的例子。然后,我们将该方法应用于时间序列的粒子滤波,包括非线性和非高斯贝叶斯状态空间模型,由于加权自举法的重复应用,这项任务对效率要求很高。通过对正态动态线性模型、泊松隐马尔可夫模型和随机波动模型的模拟研究,我们证明了该方法在效率方面的收益,其中涉及标准或反向滤波器的选择。此外,对于随机波动模型,我们提供了三个实际数据示例,包括与试图将数据信息间接纳入标准过滤方案的重要性抽样方法的比较,以及对多元模型的扩展。我们认为,反向滤波方案由于能够将数据信息直接纳入采样,因此比此类辅助方法更具优势,这种能力进一步促进了反向滤波方案在高维环境中的表现。
{"title":"Reversed particle filtering for hidden markov models","authors":"Frank Rotiroti, Stephen G. Walker","doi":"10.1007/s11222-024-10426-4","DOIUrl":"https://doi.org/10.1007/s11222-024-10426-4","url":null,"abstract":"<p>We present an approach to selecting the distributions in sampling-resampling which improves the efficiency of the weighted bootstrap. To complement the standard scheme of sampling from the prior and reweighting with the likelihood, we introduce a reversed scheme, which samples from the (normalized) likelihood and reweights with the prior. We begin with some motivating examples, before developing the relevant theory. We then apply the approach to the particle filtering of time series, including nonlinear and non-Gaussian Bayesian state-space models, a task that demands efficiency, given the repeated application of the weighted bootstrap. Through simulation studies on a normal dynamic linear model, Poisson hidden Markov model, and stochastic volatility model, we demonstrate the gains in efficiency obtained by the approach, involving the choice of the standard or reversed filter. In addition, for the stochastic volatility model, we provide three real-data examples, including a comparison with importance sampling methods that attempt to incorporate information about the data indirectly into the standard filtering scheme and an extension to multivariate models. We determine that the reversed filtering scheme offers an advantage over such auxiliary methods owing to its ability to incorporate information about the data directly into the sampling, an ability that further facilitates its performance in higher-dimensional settings.</p>","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140573264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Bayesian high-dimensional covariate selection in non-linear mixed-effects models using the SAEM algorithm 更正:使用 SAEM 算法在非线性混合效应模型中进行贝叶斯高维协变量选择
IF 2.2 2区 数学 Q1 Mathematics Pub Date : 2024-04-08 DOI: 10.1007/s11222-024-10421-9
Marion Naveau, Guillaume Kon Kam King, Renaud Rincent, Laure Sansonnet, Maud Delattre
{"title":"Correction to: Bayesian high-dimensional covariate selection in non-linear mixed-effects models using the SAEM algorithm","authors":"Marion Naveau, Guillaume Kon Kam King, Renaud Rincent, Laure Sansonnet, Maud Delattre","doi":"10.1007/s11222-024-10421-9","DOIUrl":"https://doi.org/10.1007/s11222-024-10421-9","url":null,"abstract":"","PeriodicalId":22058,"journal":{"name":"Statistics and Computing","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140573247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistics and Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1