首页 > 最新文献

Computational Statistics最新文献

英文 中文
A novel nonconvex, smooth-at-origin penalty for statistical learning 用于统计学习的新型非凸、平滑原点罚则
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-08-07 DOI: 10.1007/s00180-024-01525-x
Majnu John, Sujit Vettam, Yihren Wu

Nonconvex penalties are utilized for regularization in high-dimensional statistical learning algorithms primarily because they yield unbiased or nearly unbiased estimators for the parameters in the model. Nonconvex penalties existing in the literature such as SCAD, MCP, Laplace and arctan have a singularity at origin which makes them useful also for variable selection. However, in several high-dimensional frameworks such as deep learning, variable selection is less of a concern. In this paper, we present a nonconvex penalty which is smooth at origin. The paper includes asymptotic results for ordinary least squares estimators regularized with the new penalty function, showing asymptotic bias that vanishes exponentially fast. We also conducted simulations to better understand the finite sample properties and conducted an empirical study employing deep neural network architecture on three datasets and convolutional neural network on four datasets. The empirical study based on artificial neural networks showed better performance for the new regularization approach in five out of the seven datasets.

在高维统计学习算法中,非凸惩罚被用于正则化,主要是因为它们能为模型中的参数提供无偏或接近无偏的估计值。文献中现有的非凸惩罚,如 SCAD、MCP、Laplace 和 arctan,在原点处都有一个奇点,这使它们也适用于变量选择。然而,在深度学习等一些高维框架中,变量选择就不那么重要了。在本文中,我们提出了一种在原点处平滑的非凸罚分。本文包括用新惩罚函数正则化的普通最小二乘估计器的渐近结果,显示了以指数速度消失的渐近偏差。我们还进行了模拟以更好地理解有限样本特性,并在三个数据集上使用深度神经网络架构进行了实证研究,在四个数据集上使用卷积神经网络进行了实证研究。基于人工神经网络的实证研究表明,在七个数据集中,有五个数据集的新正则化方法性能更好。
{"title":"A novel nonconvex, smooth-at-origin penalty for statistical learning","authors":"Majnu John, Sujit Vettam, Yihren Wu","doi":"10.1007/s00180-024-01525-x","DOIUrl":"https://doi.org/10.1007/s00180-024-01525-x","url":null,"abstract":"<p>Nonconvex penalties are utilized for regularization in high-dimensional statistical learning algorithms primarily because they yield unbiased or nearly unbiased estimators for the parameters in the model. Nonconvex penalties existing in the literature such as SCAD, MCP, Laplace and arctan have a singularity at origin which makes them useful also for variable selection. However, in several high-dimensional frameworks such as deep learning, variable selection is less of a concern. In this paper, we present a nonconvex penalty which is smooth at origin. The paper includes asymptotic results for ordinary least squares estimators regularized with the new penalty function, showing asymptotic bias that vanishes exponentially fast. We also conducted simulations to better understand the finite sample properties and conducted an empirical study employing deep neural network architecture on three datasets and convolutional neural network on four datasets. The empirical study based on artificial neural networks showed better performance for the new regularization approach in five out of the seven datasets.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"4 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141969706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantinar: a blockchain peer-to-peer ecosystem for modern data analytics Quantinar:用于现代数据分析的区块链点对点生态系统
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-08-06 DOI: 10.1007/s00180-024-01529-7
Raul Bag, Bruno Spilak, Julian Winkel, Wolfgang Karl Härdle

The power of data and correct statistical analysis has never been more prevalent. Academics and practitioners require nowadays an accurate application of quantitative methods. Yet many branches are subject to a crisis of integrity, which is shown in an improper use of statistical models, p-hacking, HARKing, or failure to replicate results. We propose the use of a Peer-to-Peer (P2P) ecosystem based on a blockchain network, Quantinar, to support quantitative analytics knowledge paired with code in the form of Quantlets or software snippets. The integration of blockchain technology allows Quantinar to ensure fully transparent and reproducible scientific research.

数据和正确统计分析的力量从未像今天这样强大。如今,学术界和从业人员都需要准确地应用定量方法。然而,许多学科都面临着诚信危机,具体表现为统计模型使用不当、P 黑客、HARKing 或无法复制结果。我们建议使用一个基于区块链网络的点对点(P2P)生态系统--Quantinar,以支持量化分析知识与 Quantlets 或软件片段形式的代码配对。通过整合区块链技术,Quantinar 可以确保科学研究完全透明、可重复。
{"title":"Quantinar: a blockchain peer-to-peer ecosystem for modern data analytics","authors":"Raul Bag, Bruno Spilak, Julian Winkel, Wolfgang Karl Härdle","doi":"10.1007/s00180-024-01529-7","DOIUrl":"https://doi.org/10.1007/s00180-024-01529-7","url":null,"abstract":"<p>The power of data and correct statistical analysis has never been more prevalent. Academics and practitioners require nowadays an accurate application of quantitative methods. Yet many branches are subject to a crisis of integrity, which is shown in an improper use of statistical models, <i>p</i>-hacking, HARKing, or failure to replicate results. We propose the use of a Peer-to-Peer (P2P) ecosystem based on a blockchain network, Quantinar, to support quantitative analytics knowledge paired with code in the form of Quantlets or software snippets. The integration of blockchain technology allows Quantinar to ensure fully transparent and reproducible scientific research.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"142 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141969707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BARMPy: Bayesian additive regression models Python package BARMPy:贝叶斯加性回归模型 Python 软件包
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-08-04 DOI: 10.1007/s00180-024-01535-9
Danielle Van Boxel

We make Bayesian additive regression networks (BARN) available as a Python package, barmpy, with documentation at https://dvbuntu.github.io/barmpy/ for general machine learning practitioners. Our object-oriented design is compatible with SciKit-Learn, allowing usage of their tools like cross-validation. To ease learning to use barmpy, we produce a companion tutorial that expands on reference information in the documentation. Any interested user can pip install barmpy from the official PyPi repository. barmpy also serves as a baseline Python library for generic Bayesian additive regression models.

我们将贝叶斯加性回归网络(BARN)作为一个 Python 软件包(barmpy)提供给广大机器学习从业者,其文档请访问 https://dvbuntu.github.io/barmpy/。我们面向对象的设计与 SciKit-Learn 兼容,允许使用交叉验证等工具。为了方便学习使用 barmpy,我们编写了配套教程,对文档中的参考信息进行了扩展。任何感兴趣的用户都可以从官方 PyPi 代码库中 pip 安装 barmpy。barmpy 还是通用贝叶斯加法回归模型的 Python 基线库。
{"title":"BARMPy: Bayesian additive regression models Python package","authors":"Danielle Van Boxel","doi":"10.1007/s00180-024-01535-9","DOIUrl":"https://doi.org/10.1007/s00180-024-01535-9","url":null,"abstract":"<p>We make Bayesian additive regression networks (BARN) available as a Python package, <span>barmpy</span>, with documentation at https://dvbuntu.github.io/barmpy/ for general machine learning practitioners. Our object-oriented design is compatible with SciKit-Learn, allowing usage of their tools like cross-validation. To ease learning to use <span>barmpy</span>, we produce a companion tutorial that expands on reference information in the documentation. Any interested user can <span>pip install barmpy</span> from the official PyPi repository. <span>barmpy</span> also serves as a baseline Python library for generic Bayesian additive regression models.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"55 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141946678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust confidence intervals for meta-regression with interaction effects 具有交互效应的元回归的稳健置信区间
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-08-02 DOI: 10.1007/s00180-024-01530-0
Maria Thurow, Thilo Welz, Eric Knop, Tim Friede, Markus Pauly

Meta-analysis is an important statistical technique for synthesizing the results of multiple studies regarding the same or closely related research question. So-called meta-regression extends meta-analysis models by accounting for study-level covariates. Mixed-effects meta-regression models provide a powerful tool for evidence synthesis, by appropriately accounting for between-study heterogeneity. In fact, modelling the study effect in terms of random effects and moderators not only allows to examine the impact of the moderators, but often leads to more accurate estimates of the involved parameters. Nevertheless, due to the often small number of studies on a specific research topic, interactions are often neglected in meta-regression. In this work we consider the research questions (i) how moderator interactions influence inference in mixed-effects meta-regression models and (ii) whether some inference methods are more reliable than others. Here we review robust methods for confidence intervals in meta-regression models including interaction effects. These methods are based on the application of robust sandwich estimators of Hartung-Knapp-Sidik-Jonkman (HKSJ) or heteroscedasticity-consistent (HC)-type for estimating the variance-covariance matrix of the vector of model coefficients. Furthermore, we compare different versions of these robust estimators in an extensive simulation study. We thereby investigate coverage and width of seven different confidence intervals under varying conditions. Our simulation study shows that the coverage rates as well as the interval widths of the parameter estimates are only slightly affected by adjustment of the parameters. It also turned out that using the Satterthwaite approximation for the degrees of freedom seems to be advantageous for accurate coverage rates. In addition, different to previous analyses for simpler models, the (textbf{HKSJ})-estimator shows a worse performance in this more complex setting compared to some of the (textbf{HC})-estimators.

荟萃分析是一种重要的统计技术,用于综合有关相同或密切相关研究问题的多项研究结果。所谓的元回归通过考虑研究层面的协变量来扩展元分析模型。混合效应元回归模型通过适当考虑研究间的异质性,为证据综合提供了强有力的工具。事实上,用随机效应和调节因子来模拟研究效应,不仅可以考察调节因子的影响,而且往往能更准确地估计相关参数。然而,由于特定研究课题的研究数量通常较少,元回归往往忽略了交互作用。在这项工作中,我们考虑了以下研究问题:(i) 在混合效应元回归模型中,调节因子的交互作用如何影响推断;(ii) 某些推断方法是否比其他方法更可靠。在此,我们回顾了元回归模型(包括交互效应)中置信区间的稳健方法。这些方法的基础是应用 Hartung-Knapp-Sidik-Jonkman(HKSJ)或异方差一致(HC)型稳健三明治估计器来估计模型系数向量的方差-协方差矩阵。此外,我们还在广泛的模拟研究中比较了这些稳健估计器的不同版本。因此,我们研究了不同条件下七个不同置信区间的覆盖率和宽度。我们的模拟研究表明,参数估计的覆盖率和区间宽度只受到参数调整的轻微影响。结果还表明,使用萨特斯韦特自由度近似值似乎更有利于获得准确的覆盖率。此外,与之前对较简单模型的分析不同,在这种较复杂的情况下,与某些(textbf{HC})估计器相比,(textbf{HKSJ})估计器的性能较差。
{"title":"Robust confidence intervals for meta-regression with interaction effects","authors":"Maria Thurow, Thilo Welz, Eric Knop, Tim Friede, Markus Pauly","doi":"10.1007/s00180-024-01530-0","DOIUrl":"https://doi.org/10.1007/s00180-024-01530-0","url":null,"abstract":"<p>Meta-analysis is an important statistical technique for synthesizing the results of multiple studies regarding the same or closely related research question. So-called meta-regression extends meta-analysis models by accounting for study-level covariates. Mixed-effects meta-regression models provide a powerful tool for evidence synthesis, by appropriately accounting for between-study heterogeneity. In fact, modelling the study effect in terms of random effects and moderators not only allows to examine the impact of the moderators, but often leads to more accurate estimates of the involved parameters. Nevertheless, due to the often small number of studies on a specific research topic, interactions are often neglected in meta-regression. In this work we consider the research questions (i) how moderator interactions influence inference in mixed-effects meta-regression models and (ii) whether some inference methods are more reliable than others. Here we review robust methods for confidence intervals in meta-regression models including interaction effects. These methods are based on the application of robust sandwich estimators of Hartung-Knapp-Sidik-Jonkman (<b>HKSJ</b>) or heteroscedasticity-consistent (<b>HC</b>)-type for estimating the variance-covariance matrix of the vector of model coefficients. Furthermore, we compare different versions of these robust estimators in an extensive simulation study. We thereby investigate coverage and width of seven different confidence intervals under varying conditions. Our simulation study shows that the coverage rates as well as the interval widths of the parameter estimates are only slightly affected by adjustment of the parameters. It also turned out that using the Satterthwaite approximation for the degrees of freedom seems to be advantageous for accurate coverage rates. In addition, different to previous analyses for simpler models, the <span>(textbf{HKSJ})</span>-estimator shows a worse performance in this more complex setting compared to some of the <span>(textbf{HC})</span>-estimators.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"182 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141884981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ordinal causal discovery based on Markov blankets 基于马尔可夫毛毯的序数因果发现
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-30 DOI: 10.1007/s00180-024-01513-1
Yu Du, Yi Sun, Luyao Tan

This work focuses on learning causal network structures from ordinal categorical data. By combining constraint-based with score-and-search methodologies in structural learning, we propose a hybrid method called Markov Blanket Based Ordinal Causal Discovery (MBOCD) algorithm, which can capture the ordinal relationship of values in ordinal categorical variables. Theoretically, it is proved that for ordinal causal networks, two adjacent DAGs belonging to the same Markov equivalence class are identifiable, which results in the generation of a causal graph. Simulation experiments demonstrate that the proposed algorithm outperforms existing methods in terms of computational efficiency and accuracy. The code of this work is open at: https://github.com/leoydu/MBOCDcode.git.

这项研究的重点是从顺序分类数据中学习因果网络结构。通过将结构学习中的基于约束的方法与基于分数和搜索的方法相结合,我们提出了一种称为基于马尔可夫空白的序因果发现(MBOCD)算法的混合方法,它可以捕捉序分类变量中值的序关系。理论证明,对于顺序因果网络,属于同一马尔可夫等价类的两个相邻 DAG 是可识别的,从而生成因果图。仿真实验证明,所提出的算法在计算效率和准确性方面都优于现有方法。这项工作的代码公开于:https://github.com/leoydu/MBOCDcode.git。
{"title":"Ordinal causal discovery based on Markov blankets","authors":"Yu Du, Yi Sun, Luyao Tan","doi":"10.1007/s00180-024-01513-1","DOIUrl":"https://doi.org/10.1007/s00180-024-01513-1","url":null,"abstract":"<p>This work focuses on learning causal network structures from ordinal categorical data. By combining constraint-based with score-and-search methodologies in structural learning, we propose a hybrid method called Markov Blanket Based Ordinal Causal Discovery (MBOCD) algorithm, which can capture the ordinal relationship of values in ordinal categorical variables. Theoretically, it is proved that for ordinal causal networks, two adjacent DAGs belonging to the same Markov equivalence class are identifiable, which results in the generation of a causal graph. Simulation experiments demonstrate that the proposed algorithm outperforms existing methods in terms of computational efficiency and accuracy. The code of this work is open at: https://github.com/leoydu/MBOCDcode.git.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"262 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141862642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Metropolis–Hastings Robbins–Monro algorithm via variational inference for estimating the multidimensional graded response model: a calculationally efficient estimation scheme to deal with complex test structures 通过变分推理估算多维分级响应模型的 Metropolis-Hastings Robbins-Monro 算法:处理复杂测试结构的高效计算估算方案
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-29 DOI: 10.1007/s00180-024-01533-x
Xue Wang, Jing Lu, Jiwei Zhang

This paper introduces the Metropolis–Hastings variational inference Robbins–Monro (MHVIRM) algorithm, a modification of the Metropolis–Hastings Robbins–Monro (MHRM) method, designed for estimating parameters in complex multidimensional graded response models (MGRM). By integrating a black-box variational inference (BBVI) approach, MHVIRM enhances computational efficiency and estimation accuracy, particularly for models with high-dimensional data and complex test structures. The algorithms effectiveness is demonstrated through simulations, showing improved precision over traditional MHRM, especially in scenarios with complex structures and small sample sizes. Moreover, MHVIRM is robust to initial values. The applicability is further illustrated with a real dataset analysis.

本文介绍了 Metropolis-Hastings 变分推理 Robbins-Monro 算法(MHVIRM),它是 Metropolis-Hastings Robbins-Monro 方法(MHRM)的改进版,专为估计复杂多维分级响应模型(MGRM)中的参数而设计。通过整合黑箱变分推理(BBVI)方法,MHVIRM 提高了计算效率和估算精度,尤其适用于具有高维数据和复杂测试结构的模型。该算法通过仿真证明了其有效性,与传统的 MHRM 相比,精度有所提高,尤其是在结构复杂和样本量较小的情况下。此外,MHVIRM 对初始值具有鲁棒性。实际数据集分析进一步说明了该算法的适用性。
{"title":"A Metropolis–Hastings Robbins–Monro algorithm via variational inference for estimating the multidimensional graded response model: a calculationally efficient estimation scheme to deal with complex test structures","authors":"Xue Wang, Jing Lu, Jiwei Zhang","doi":"10.1007/s00180-024-01533-x","DOIUrl":"https://doi.org/10.1007/s00180-024-01533-x","url":null,"abstract":"<p>This paper introduces the Metropolis–Hastings variational inference Robbins–Monro (MHVIRM) algorithm, a modification of the Metropolis–Hastings Robbins–Monro (MHRM) method, designed for estimating parameters in complex multidimensional graded response models (MGRM). By integrating a black-box variational inference (BBVI) approach, MHVIRM enhances computational efficiency and estimation accuracy, particularly for models with high-dimensional data and complex test structures. The algorithms effectiveness is demonstrated through simulations, showing improved precision over traditional MHRM, especially in scenarios with complex structures and small sample sizes. Moreover, MHVIRM is robust to initial values. The applicability is further illustrated with a real dataset analysis.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"41 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141862644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semiparametric regression analysis of panel binary data with an informative observation process 具有信息观测过程的面板二元数据的半参数回归分析
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-29 DOI: 10.1007/s00180-024-01528-8
Lei Ge, Yang Li, Jianguo Sun

Panel binary data arise in an event history study when study subjects are observed only at discrete time points instead of continuously and the only available information on the occurrence of the recurrent event of interest is whether the event has occurred over two consecutive observation times or each observation window. Although some methods have been proposed for regression analysis of such data, all of them assume independent observation times or processes, which may not be true sometimes. To address this, we propose a joint modeling procedure that allows for informative observation processes. For the implementation of the proposed method, a computationally efficient EM algorithm is developed and the resulting estimators are consistent and asymptotically normal. The simulation study conducted to assess its performance indicates that it works well in practical situations, and the proposed approach is applied to the motivating data set from the Health and Retirement Study.

事件史研究中会出现面板二元数据,即研究对象只在离散的时间点而不是连续的时间点接受观察,而关于所关注的重复事件发生情况的唯一可用信息是该事件是否在两个连续的观察时间或每个观察窗口中发生。虽然已经提出了一些对此类数据进行回归分析的方法,但所有这些方法都假定观察时间或观察过程是独立的,但有时可能并非如此。为了解决这个问题,我们提出了一种联合建模程序,允许有信息的观测过程。为了实现所提出的方法,我们开发了一种计算效率高的 EM 算法,所得到的估计值具有一致性和渐近正态性。为评估该方法的性能而进行的模拟研究表明,该方法在实际情况下运行良好。
{"title":"Semiparametric regression analysis of panel binary data with an informative observation process","authors":"Lei Ge, Yang Li, Jianguo Sun","doi":"10.1007/s00180-024-01528-8","DOIUrl":"https://doi.org/10.1007/s00180-024-01528-8","url":null,"abstract":"<p>Panel binary data arise in an event history study when study subjects are observed only at discrete time points instead of continuously and the only available information on the occurrence of the recurrent event of interest is whether the event has occurred over two consecutive observation times or each observation window. Although some methods have been proposed for regression analysis of such data, all of them assume independent observation times or processes, which may not be true sometimes. To address this, we propose a joint modeling procedure that allows for informative observation processes. For the implementation of the proposed method, a computationally efficient EM algorithm is developed and the resulting estimators are consistent and asymptotically normal. The simulation study conducted to assess its performance indicates that it works well in practical situations, and the proposed approach is applied to the motivating data set from the Health and Retirement Study.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"44 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141862646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Profile transformations for reciprocal averaging and singular value decomposition 用于倒数平均和奇异值分解的轮廓变换
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-26 DOI: 10.1007/s00180-024-01517-x
Ting-Wu Wang, Eric J. Beh, Rosaria Lombardo, Ian W. Renner

Power transformations of count data, including cell frequencies of a contingency table, have been well understood for nearly 100 years, with much of the attention focused on the square root transformation. Over the past 15 years, this topic has been the focus of some new insights into areas of correspondence analysis where two forms of power transformation have been discussed. One type considers the impact of raising the joint proportions of the cell frequencies of a table to a known power while the other examines the power transformation of the relative distribution of the cell frequencies. While the foundations of the graphical features of correspondence analysis rest with the numerical algorithms like reciprocal averaging, and other analogous techniques, discussions of the role of power transformations in reciprocal averaging have not been described. Therefore, this paper examines this link where a power transformation is applied to the cell frequencies of a two-way contingency table. In doing so, we show that reciprocal averaging can be performed under such a transformation to obtain row and column scores that provide the maximum association between the variables and the greatest discrimination between the categories. Finally, we discuss the connection between performing reciprocal averaging and singular value decomposition under this type of power transformation. The R function, powerRA.exe is included in the Appendix and performs reciprocal averaging of a power transformation of the cell frequencies of a two-way contingency table.

近 100 年来,人们对计数数据(包括或然率表中的单元频率)的幂变换已经有了很好的理解,其中大部分注意力都集中在平方根变换上。在过去的 15 年里,这个话题成为了对应分析领域一些新见解的焦点,其中有两种形式的幂变换得到了讨论。一种是考虑将表格中单元格频率的联合比例提高到已知幂的影响,另一种是研究单元格频率相对分布的幂变换。虽然对应分析图形特征的基础是倒数平均等数值算法和其他类似技术,但关于幂变换在倒数平均中的作用的讨论却未曾涉及。因此,本文在对双向或然表的单元频率进行幂变换时,对这一联系进行了研究。在此过程中,我们证明了在这种变换下可以进行往复平均,从而获得行和列分数,使变量之间的关联度最大,类别之间的区分度最大。最后,我们讨论了在这种幂变换下进行倒数平均和奇异值分解之间的联系。附录中包含了 R 函数 powerRA.exe,它可以对双向或然表的单元频率进行幂变换的倒数平均。
{"title":"Profile transformations for reciprocal averaging and singular value decomposition","authors":"Ting-Wu Wang, Eric J. Beh, Rosaria Lombardo, Ian W. Renner","doi":"10.1007/s00180-024-01517-x","DOIUrl":"https://doi.org/10.1007/s00180-024-01517-x","url":null,"abstract":"<p>Power transformations of count data, including cell frequencies of a contingency table, have been well understood for nearly 100 years, with much of the attention focused on the square root transformation. Over the past 15 years, this topic has been the focus of some new insights into areas of correspondence analysis where two forms of power transformation have been discussed. One type considers the impact of raising the joint proportions of the cell frequencies of a table to a known power while the other examines the power transformation of the relative distribution of the cell frequencies. While the foundations of the graphical features of correspondence analysis rest with the numerical algorithms like reciprocal averaging, and other analogous techniques, discussions of the role of power transformations in reciprocal averaging have not been described. Therefore, this paper examines this link where a power transformation is applied to the cell frequencies of a two-way contingency table. In doing so, we show that reciprocal averaging can be performed under such a transformation to obtain row and column scores that provide the maximum association between the variables and the greatest discrimination between the categories. Finally, we discuss the connection between performing reciprocal averaging and singular value decomposition under this type of power transformation. The <span>R</span> function, <span>powerRA.exe</span> is included in the Appendix and performs reciprocal averaging of a power transformation of the cell frequencies of a two-way contingency table.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"17 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141772120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Positive time series regression models: theoretical and computational aspects 正时间序列回归模型:理论与计算方面
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-24 DOI: 10.1007/s00180-024-01531-z
Taiane Schaedler Prass, Guilherme Pumi, Cleiton Guollo Taufemback, Jonas Hendler Carlos

This paper discusses dynamic ARMA-type regression models for positive time series, which can handle bounded non-Gaussian time series without requiring data transformations. Our proposed model includes a conditional mean modeled by a dynamic structure containing autoregressive and moving average terms, time-varying covariates, unknown parameters, and link functions. Additionally, we present the PTSR package and discuss partial maximum likelihood estimation, asymptotic theory, hypothesis testing inference, diagnostic analysis, and forecasting for a variety of regression-based dynamic models for positive time series. A Monte Carlo simulation and a real data application are provided.

本文讨论了正时间序列的动态 ARMA 型回归模型,该模型无需数据转换即可处理有界非高斯时间序列。我们提出的模型包括一个由动态结构建模的条件均值,其中包含自回归项和移动平均项、时变协变量、未知参数和链接函数。此外,我们还介绍了 PTSR 软件包,并讨论了各种基于回归的正时间序列动态模型的偏极大似然估计、渐近理论、假设检验推理、诊断分析和预测。此外,还提供了蒙特卡罗模拟和真实数据应用。
{"title":"Positive time series regression models: theoretical and computational aspects","authors":"Taiane Schaedler Prass, Guilherme Pumi, Cleiton Guollo Taufemback, Jonas Hendler Carlos","doi":"10.1007/s00180-024-01531-z","DOIUrl":"https://doi.org/10.1007/s00180-024-01531-z","url":null,"abstract":"<p>This paper discusses dynamic ARMA-type regression models for positive time series, which can handle bounded non-Gaussian time series without requiring data transformations. Our proposed model includes a conditional mean modeled by a dynamic structure containing autoregressive and moving average terms, time-varying covariates, unknown parameters, and link functions. Additionally, we present the <span>PTSR</span> package and discuss partial maximum likelihood estimation, asymptotic theory, hypothesis testing inference, diagnostic analysis, and forecasting for a variety of regression-based dynamic models for positive time series. A Monte Carlo simulation and a real data application are provided.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"40 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141772237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The root-Gaussian Cox Process for spatial-temporal disease mapping with aggregated data 根高斯考克斯过程(root-Gaussian Cox Process):利用汇总数据绘制时空疾病图谱
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-18 DOI: 10.1007/s00180-024-01532-y
Zeytu Gashaw Asfaw, Patrick E. Brown, Jamie Stafford

The study of aggregated data influenced by time, space, and extra changes in geographic region borders was the main emphasis of the current paper. This may occur if the regions used to count the reported incidences of a health outcome over time change periodically. In order to handle the spatial-temporal scenario, we enhance the spatial root-Gaussian Cox Process (RGCP), which makes use of the square-root link function rather than the more typical log-link function. The algorithm’s ability to estimate a risk surface has been proven by a simulation study, and it has also been validated by real datasets.

本文的重点是研究受时间、空间和地理区域边界额外变化影响的汇总数据。如果用于统计健康结果报告发病率的地区随时间发生周期性变化,就可能出现这种情况。为了处理时空情景,我们改进了空间根高斯考克斯过程(RGCP),该过程使用平方根链接函数,而不是更典型的对数链接函数。该算法估计风险面的能力已通过模拟研究得到证实,并通过真实数据集得到验证。
{"title":"The root-Gaussian Cox Process for spatial-temporal disease mapping with aggregated data","authors":"Zeytu Gashaw Asfaw, Patrick E. Brown, Jamie Stafford","doi":"10.1007/s00180-024-01532-y","DOIUrl":"https://doi.org/10.1007/s00180-024-01532-y","url":null,"abstract":"<p>The study of aggregated data influenced by time, space, and extra changes in geographic region borders was the main emphasis of the current paper. This may occur if the regions used to count the reported incidences of a health outcome over time change periodically. In order to handle the spatial-temporal scenario, we enhance the spatial root-Gaussian Cox Process (RGCP), which makes use of the square-root link function rather than the more typical log-link function. The algorithm’s ability to estimate a risk surface has been proven by a simulation study, and it has also been validated by real datasets.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"24 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141742746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computational Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1