Pub Date : 2024-08-07DOI: 10.1007/s00180-024-01525-x
Majnu John, Sujit Vettam, Yihren Wu
Nonconvex penalties are utilized for regularization in high-dimensional statistical learning algorithms primarily because they yield unbiased or nearly unbiased estimators for the parameters in the model. Nonconvex penalties existing in the literature such as SCAD, MCP, Laplace and arctan have a singularity at origin which makes them useful also for variable selection. However, in several high-dimensional frameworks such as deep learning, variable selection is less of a concern. In this paper, we present a nonconvex penalty which is smooth at origin. The paper includes asymptotic results for ordinary least squares estimators regularized with the new penalty function, showing asymptotic bias that vanishes exponentially fast. We also conducted simulations to better understand the finite sample properties and conducted an empirical study employing deep neural network architecture on three datasets and convolutional neural network on four datasets. The empirical study based on artificial neural networks showed better performance for the new regularization approach in five out of the seven datasets.
{"title":"A novel nonconvex, smooth-at-origin penalty for statistical learning","authors":"Majnu John, Sujit Vettam, Yihren Wu","doi":"10.1007/s00180-024-01525-x","DOIUrl":"https://doi.org/10.1007/s00180-024-01525-x","url":null,"abstract":"<p>Nonconvex penalties are utilized for regularization in high-dimensional statistical learning algorithms primarily because they yield unbiased or nearly unbiased estimators for the parameters in the model. Nonconvex penalties existing in the literature such as SCAD, MCP, Laplace and arctan have a singularity at origin which makes them useful also for variable selection. However, in several high-dimensional frameworks such as deep learning, variable selection is less of a concern. In this paper, we present a nonconvex penalty which is smooth at origin. The paper includes asymptotic results for ordinary least squares estimators regularized with the new penalty function, showing asymptotic bias that vanishes exponentially fast. We also conducted simulations to better understand the finite sample properties and conducted an empirical study employing deep neural network architecture on three datasets and convolutional neural network on four datasets. The empirical study based on artificial neural networks showed better performance for the new regularization approach in five out of the seven datasets.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"4 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141969706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-06DOI: 10.1007/s00180-024-01529-7
Raul Bag, Bruno Spilak, Julian Winkel, Wolfgang Karl Härdle
The power of data and correct statistical analysis has never been more prevalent. Academics and practitioners require nowadays an accurate application of quantitative methods. Yet many branches are subject to a crisis of integrity, which is shown in an improper use of statistical models, p-hacking, HARKing, or failure to replicate results. We propose the use of a Peer-to-Peer (P2P) ecosystem based on a blockchain network, Quantinar, to support quantitative analytics knowledge paired with code in the form of Quantlets or software snippets. The integration of blockchain technology allows Quantinar to ensure fully transparent and reproducible scientific research.
{"title":"Quantinar: a blockchain peer-to-peer ecosystem for modern data analytics","authors":"Raul Bag, Bruno Spilak, Julian Winkel, Wolfgang Karl Härdle","doi":"10.1007/s00180-024-01529-7","DOIUrl":"https://doi.org/10.1007/s00180-024-01529-7","url":null,"abstract":"<p>The power of data and correct statistical analysis has never been more prevalent. Academics and practitioners require nowadays an accurate application of quantitative methods. Yet many branches are subject to a crisis of integrity, which is shown in an improper use of statistical models, <i>p</i>-hacking, HARKing, or failure to replicate results. We propose the use of a Peer-to-Peer (P2P) ecosystem based on a blockchain network, Quantinar, to support quantitative analytics knowledge paired with code in the form of Quantlets or software snippets. The integration of blockchain technology allows Quantinar to ensure fully transparent and reproducible scientific research.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"142 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141969707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-04DOI: 10.1007/s00180-024-01535-9
Danielle Van Boxel
We make Bayesian additive regression networks (BARN) available as a Python package, barmpy, with documentation at https://dvbuntu.github.io/barmpy/ for general machine learning practitioners. Our object-oriented design is compatible with SciKit-Learn, allowing usage of their tools like cross-validation. To ease learning to use barmpy, we produce a companion tutorial that expands on reference information in the documentation. Any interested user can pip install barmpy from the official PyPi repository. barmpy also serves as a baseline Python library for generic Bayesian additive regression models.
{"title":"BARMPy: Bayesian additive regression models Python package","authors":"Danielle Van Boxel","doi":"10.1007/s00180-024-01535-9","DOIUrl":"https://doi.org/10.1007/s00180-024-01535-9","url":null,"abstract":"<p>We make Bayesian additive regression networks (BARN) available as a Python package, <span>barmpy</span>, with documentation at https://dvbuntu.github.io/barmpy/ for general machine learning practitioners. Our object-oriented design is compatible with SciKit-Learn, allowing usage of their tools like cross-validation. To ease learning to use <span>barmpy</span>, we produce a companion tutorial that expands on reference information in the documentation. Any interested user can <span>pip install barmpy</span> from the official PyPi repository. <span>barmpy</span> also serves as a baseline Python library for generic Bayesian additive regression models.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"55 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141946678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-02DOI: 10.1007/s00180-024-01530-0
Maria Thurow, Thilo Welz, Eric Knop, Tim Friede, Markus Pauly
Meta-analysis is an important statistical technique for synthesizing the results of multiple studies regarding the same or closely related research question. So-called meta-regression extends meta-analysis models by accounting for study-level covariates. Mixed-effects meta-regression models provide a powerful tool for evidence synthesis, by appropriately accounting for between-study heterogeneity. In fact, modelling the study effect in terms of random effects and moderators not only allows to examine the impact of the moderators, but often leads to more accurate estimates of the involved parameters. Nevertheless, due to the often small number of studies on a specific research topic, interactions are often neglected in meta-regression. In this work we consider the research questions (i) how moderator interactions influence inference in mixed-effects meta-regression models and (ii) whether some inference methods are more reliable than others. Here we review robust methods for confidence intervals in meta-regression models including interaction effects. These methods are based on the application of robust sandwich estimators of Hartung-Knapp-Sidik-Jonkman (HKSJ) or heteroscedasticity-consistent (HC)-type for estimating the variance-covariance matrix of the vector of model coefficients. Furthermore, we compare different versions of these robust estimators in an extensive simulation study. We thereby investigate coverage and width of seven different confidence intervals under varying conditions. Our simulation study shows that the coverage rates as well as the interval widths of the parameter estimates are only slightly affected by adjustment of the parameters. It also turned out that using the Satterthwaite approximation for the degrees of freedom seems to be advantageous for accurate coverage rates. In addition, different to previous analyses for simpler models, the (textbf{HKSJ})-estimator shows a worse performance in this more complex setting compared to some of the (textbf{HC})-estimators.
{"title":"Robust confidence intervals for meta-regression with interaction effects","authors":"Maria Thurow, Thilo Welz, Eric Knop, Tim Friede, Markus Pauly","doi":"10.1007/s00180-024-01530-0","DOIUrl":"https://doi.org/10.1007/s00180-024-01530-0","url":null,"abstract":"<p>Meta-analysis is an important statistical technique for synthesizing the results of multiple studies regarding the same or closely related research question. So-called meta-regression extends meta-analysis models by accounting for study-level covariates. Mixed-effects meta-regression models provide a powerful tool for evidence synthesis, by appropriately accounting for between-study heterogeneity. In fact, modelling the study effect in terms of random effects and moderators not only allows to examine the impact of the moderators, but often leads to more accurate estimates of the involved parameters. Nevertheless, due to the often small number of studies on a specific research topic, interactions are often neglected in meta-regression. In this work we consider the research questions (i) how moderator interactions influence inference in mixed-effects meta-regression models and (ii) whether some inference methods are more reliable than others. Here we review robust methods for confidence intervals in meta-regression models including interaction effects. These methods are based on the application of robust sandwich estimators of Hartung-Knapp-Sidik-Jonkman (<b>HKSJ</b>) or heteroscedasticity-consistent (<b>HC</b>)-type for estimating the variance-covariance matrix of the vector of model coefficients. Furthermore, we compare different versions of these robust estimators in an extensive simulation study. We thereby investigate coverage and width of seven different confidence intervals under varying conditions. Our simulation study shows that the coverage rates as well as the interval widths of the parameter estimates are only slightly affected by adjustment of the parameters. It also turned out that using the Satterthwaite approximation for the degrees of freedom seems to be advantageous for accurate coverage rates. In addition, different to previous analyses for simpler models, the <span>(textbf{HKSJ})</span>-estimator shows a worse performance in this more complex setting compared to some of the <span>(textbf{HC})</span>-estimators.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"182 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141884981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-30DOI: 10.1007/s00180-024-01513-1
Yu Du, Yi Sun, Luyao Tan
This work focuses on learning causal network structures from ordinal categorical data. By combining constraint-based with score-and-search methodologies in structural learning, we propose a hybrid method called Markov Blanket Based Ordinal Causal Discovery (MBOCD) algorithm, which can capture the ordinal relationship of values in ordinal categorical variables. Theoretically, it is proved that for ordinal causal networks, two adjacent DAGs belonging to the same Markov equivalence class are identifiable, which results in the generation of a causal graph. Simulation experiments demonstrate that the proposed algorithm outperforms existing methods in terms of computational efficiency and accuracy. The code of this work is open at: https://github.com/leoydu/MBOCDcode.git.
这项研究的重点是从顺序分类数据中学习因果网络结构。通过将结构学习中的基于约束的方法与基于分数和搜索的方法相结合,我们提出了一种称为基于马尔可夫空白的序因果发现(MBOCD)算法的混合方法,它可以捕捉序分类变量中值的序关系。理论证明,对于顺序因果网络,属于同一马尔可夫等价类的两个相邻 DAG 是可识别的,从而生成因果图。仿真实验证明,所提出的算法在计算效率和准确性方面都优于现有方法。这项工作的代码公开于:https://github.com/leoydu/MBOCDcode.git。
{"title":"Ordinal causal discovery based on Markov blankets","authors":"Yu Du, Yi Sun, Luyao Tan","doi":"10.1007/s00180-024-01513-1","DOIUrl":"https://doi.org/10.1007/s00180-024-01513-1","url":null,"abstract":"<p>This work focuses on learning causal network structures from ordinal categorical data. By combining constraint-based with score-and-search methodologies in structural learning, we propose a hybrid method called Markov Blanket Based Ordinal Causal Discovery (MBOCD) algorithm, which can capture the ordinal relationship of values in ordinal categorical variables. Theoretically, it is proved that for ordinal causal networks, two adjacent DAGs belonging to the same Markov equivalence class are identifiable, which results in the generation of a causal graph. Simulation experiments demonstrate that the proposed algorithm outperforms existing methods in terms of computational efficiency and accuracy. The code of this work is open at: https://github.com/leoydu/MBOCDcode.git.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"262 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141862642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-29DOI: 10.1007/s00180-024-01533-x
Xue Wang, Jing Lu, Jiwei Zhang
This paper introduces the Metropolis–Hastings variational inference Robbins–Monro (MHVIRM) algorithm, a modification of the Metropolis–Hastings Robbins–Monro (MHRM) method, designed for estimating parameters in complex multidimensional graded response models (MGRM). By integrating a black-box variational inference (BBVI) approach, MHVIRM enhances computational efficiency and estimation accuracy, particularly for models with high-dimensional data and complex test structures. The algorithms effectiveness is demonstrated through simulations, showing improved precision over traditional MHRM, especially in scenarios with complex structures and small sample sizes. Moreover, MHVIRM is robust to initial values. The applicability is further illustrated with a real dataset analysis.
{"title":"A Metropolis–Hastings Robbins–Monro algorithm via variational inference for estimating the multidimensional graded response model: a calculationally efficient estimation scheme to deal with complex test structures","authors":"Xue Wang, Jing Lu, Jiwei Zhang","doi":"10.1007/s00180-024-01533-x","DOIUrl":"https://doi.org/10.1007/s00180-024-01533-x","url":null,"abstract":"<p>This paper introduces the Metropolis–Hastings variational inference Robbins–Monro (MHVIRM) algorithm, a modification of the Metropolis–Hastings Robbins–Monro (MHRM) method, designed for estimating parameters in complex multidimensional graded response models (MGRM). By integrating a black-box variational inference (BBVI) approach, MHVIRM enhances computational efficiency and estimation accuracy, particularly for models with high-dimensional data and complex test structures. The algorithms effectiveness is demonstrated through simulations, showing improved precision over traditional MHRM, especially in scenarios with complex structures and small sample sizes. Moreover, MHVIRM is robust to initial values. The applicability is further illustrated with a real dataset analysis.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"41 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141862644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-29DOI: 10.1007/s00180-024-01528-8
Lei Ge, Yang Li, Jianguo Sun
Panel binary data arise in an event history study when study subjects are observed only at discrete time points instead of continuously and the only available information on the occurrence of the recurrent event of interest is whether the event has occurred over two consecutive observation times or each observation window. Although some methods have been proposed for regression analysis of such data, all of them assume independent observation times or processes, which may not be true sometimes. To address this, we propose a joint modeling procedure that allows for informative observation processes. For the implementation of the proposed method, a computationally efficient EM algorithm is developed and the resulting estimators are consistent and asymptotically normal. The simulation study conducted to assess its performance indicates that it works well in practical situations, and the proposed approach is applied to the motivating data set from the Health and Retirement Study.
事件史研究中会出现面板二元数据,即研究对象只在离散的时间点而不是连续的时间点接受观察,而关于所关注的重复事件发生情况的唯一可用信息是该事件是否在两个连续的观察时间或每个观察窗口中发生。虽然已经提出了一些对此类数据进行回归分析的方法,但所有这些方法都假定观察时间或观察过程是独立的,但有时可能并非如此。为了解决这个问题,我们提出了一种联合建模程序,允许有信息的观测过程。为了实现所提出的方法,我们开发了一种计算效率高的 EM 算法,所得到的估计值具有一致性和渐近正态性。为评估该方法的性能而进行的模拟研究表明,该方法在实际情况下运行良好。
{"title":"Semiparametric regression analysis of panel binary data with an informative observation process","authors":"Lei Ge, Yang Li, Jianguo Sun","doi":"10.1007/s00180-024-01528-8","DOIUrl":"https://doi.org/10.1007/s00180-024-01528-8","url":null,"abstract":"<p>Panel binary data arise in an event history study when study subjects are observed only at discrete time points instead of continuously and the only available information on the occurrence of the recurrent event of interest is whether the event has occurred over two consecutive observation times or each observation window. Although some methods have been proposed for regression analysis of such data, all of them assume independent observation times or processes, which may not be true sometimes. To address this, we propose a joint modeling procedure that allows for informative observation processes. For the implementation of the proposed method, a computationally efficient EM algorithm is developed and the resulting estimators are consistent and asymptotically normal. The simulation study conducted to assess its performance indicates that it works well in practical situations, and the proposed approach is applied to the motivating data set from the Health and Retirement Study.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"44 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141862646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-26DOI: 10.1007/s00180-024-01517-x
Ting-Wu Wang, Eric J. Beh, Rosaria Lombardo, Ian W. Renner
Power transformations of count data, including cell frequencies of a contingency table, have been well understood for nearly 100 years, with much of the attention focused on the square root transformation. Over the past 15 years, this topic has been the focus of some new insights into areas of correspondence analysis where two forms of power transformation have been discussed. One type considers the impact of raising the joint proportions of the cell frequencies of a table to a known power while the other examines the power transformation of the relative distribution of the cell frequencies. While the foundations of the graphical features of correspondence analysis rest with the numerical algorithms like reciprocal averaging, and other analogous techniques, discussions of the role of power transformations in reciprocal averaging have not been described. Therefore, this paper examines this link where a power transformation is applied to the cell frequencies of a two-way contingency table. In doing so, we show that reciprocal averaging can be performed under such a transformation to obtain row and column scores that provide the maximum association between the variables and the greatest discrimination between the categories. Finally, we discuss the connection between performing reciprocal averaging and singular value decomposition under this type of power transformation. The R function, powerRA.exe is included in the Appendix and performs reciprocal averaging of a power transformation of the cell frequencies of a two-way contingency table.
近 100 年来,人们对计数数据(包括或然率表中的单元频率)的幂变换已经有了很好的理解,其中大部分注意力都集中在平方根变换上。在过去的 15 年里,这个话题成为了对应分析领域一些新见解的焦点,其中有两种形式的幂变换得到了讨论。一种是考虑将表格中单元格频率的联合比例提高到已知幂的影响,另一种是研究单元格频率相对分布的幂变换。虽然对应分析图形特征的基础是倒数平均等数值算法和其他类似技术,但关于幂变换在倒数平均中的作用的讨论却未曾涉及。因此,本文在对双向或然表的单元频率进行幂变换时,对这一联系进行了研究。在此过程中,我们证明了在这种变换下可以进行往复平均,从而获得行和列分数,使变量之间的关联度最大,类别之间的区分度最大。最后,我们讨论了在这种幂变换下进行倒数平均和奇异值分解之间的联系。附录中包含了 R 函数 powerRA.exe,它可以对双向或然表的单元频率进行幂变换的倒数平均。
{"title":"Profile transformations for reciprocal averaging and singular value decomposition","authors":"Ting-Wu Wang, Eric J. Beh, Rosaria Lombardo, Ian W. Renner","doi":"10.1007/s00180-024-01517-x","DOIUrl":"https://doi.org/10.1007/s00180-024-01517-x","url":null,"abstract":"<p>Power transformations of count data, including cell frequencies of a contingency table, have been well understood for nearly 100 years, with much of the attention focused on the square root transformation. Over the past 15 years, this topic has been the focus of some new insights into areas of correspondence analysis where two forms of power transformation have been discussed. One type considers the impact of raising the joint proportions of the cell frequencies of a table to a known power while the other examines the power transformation of the relative distribution of the cell frequencies. While the foundations of the graphical features of correspondence analysis rest with the numerical algorithms like reciprocal averaging, and other analogous techniques, discussions of the role of power transformations in reciprocal averaging have not been described. Therefore, this paper examines this link where a power transformation is applied to the cell frequencies of a two-way contingency table. In doing so, we show that reciprocal averaging can be performed under such a transformation to obtain row and column scores that provide the maximum association between the variables and the greatest discrimination between the categories. Finally, we discuss the connection between performing reciprocal averaging and singular value decomposition under this type of power transformation. The <span>R</span> function, <span>powerRA.exe</span> is included in the Appendix and performs reciprocal averaging of a power transformation of the cell frequencies of a two-way contingency table.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"17 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141772120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-24DOI: 10.1007/s00180-024-01531-z
Taiane Schaedler Prass, Guilherme Pumi, Cleiton Guollo Taufemback, Jonas Hendler Carlos
This paper discusses dynamic ARMA-type regression models for positive time series, which can handle bounded non-Gaussian time series without requiring data transformations. Our proposed model includes a conditional mean modeled by a dynamic structure containing autoregressive and moving average terms, time-varying covariates, unknown parameters, and link functions. Additionally, we present the PTSR package and discuss partial maximum likelihood estimation, asymptotic theory, hypothesis testing inference, diagnostic analysis, and forecasting for a variety of regression-based dynamic models for positive time series. A Monte Carlo simulation and a real data application are provided.
本文讨论了正时间序列的动态 ARMA 型回归模型,该模型无需数据转换即可处理有界非高斯时间序列。我们提出的模型包括一个由动态结构建模的条件均值,其中包含自回归项和移动平均项、时变协变量、未知参数和链接函数。此外,我们还介绍了 PTSR 软件包,并讨论了各种基于回归的正时间序列动态模型的偏极大似然估计、渐近理论、假设检验推理、诊断分析和预测。此外,还提供了蒙特卡罗模拟和真实数据应用。
{"title":"Positive time series regression models: theoretical and computational aspects","authors":"Taiane Schaedler Prass, Guilherme Pumi, Cleiton Guollo Taufemback, Jonas Hendler Carlos","doi":"10.1007/s00180-024-01531-z","DOIUrl":"https://doi.org/10.1007/s00180-024-01531-z","url":null,"abstract":"<p>This paper discusses dynamic ARMA-type regression models for positive time series, which can handle bounded non-Gaussian time series without requiring data transformations. Our proposed model includes a conditional mean modeled by a dynamic structure containing autoregressive and moving average terms, time-varying covariates, unknown parameters, and link functions. Additionally, we present the <span>PTSR</span> package and discuss partial maximum likelihood estimation, asymptotic theory, hypothesis testing inference, diagnostic analysis, and forecasting for a variety of regression-based dynamic models for positive time series. A Monte Carlo simulation and a real data application are provided.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"40 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141772237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-18DOI: 10.1007/s00180-024-01532-y
Zeytu Gashaw Asfaw, Patrick E. Brown, Jamie Stafford
The study of aggregated data influenced by time, space, and extra changes in geographic region borders was the main emphasis of the current paper. This may occur if the regions used to count the reported incidences of a health outcome over time change periodically. In order to handle the spatial-temporal scenario, we enhance the spatial root-Gaussian Cox Process (RGCP), which makes use of the square-root link function rather than the more typical log-link function. The algorithm’s ability to estimate a risk surface has been proven by a simulation study, and it has also been validated by real datasets.
{"title":"The root-Gaussian Cox Process for spatial-temporal disease mapping with aggregated data","authors":"Zeytu Gashaw Asfaw, Patrick E. Brown, Jamie Stafford","doi":"10.1007/s00180-024-01532-y","DOIUrl":"https://doi.org/10.1007/s00180-024-01532-y","url":null,"abstract":"<p>The study of aggregated data influenced by time, space, and extra changes in geographic region borders was the main emphasis of the current paper. This may occur if the regions used to count the reported incidences of a health outcome over time change periodically. In order to handle the spatial-temporal scenario, we enhance the spatial root-Gaussian Cox Process (RGCP), which makes use of the square-root link function rather than the more typical log-link function. The algorithm’s ability to estimate a risk surface has been proven by a simulation study, and it has also been validated by real datasets.</p>","PeriodicalId":55223,"journal":{"name":"Computational Statistics","volume":"24 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141742746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}