Journal of Statistical Software最新文献

英文中文

spsur: An R Package for Dealing with Spatial Seemingly Unrelated Regression Models spsur:一个处理空间看似不相关回归模型的R包

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Statistical Software

Pub Date : 2022-01-01 DOI: 10.18637/jss.v104.i11

R. Mínguez, F. López, J. Mur

Spatial seemingly unrelated regression (spatial SUR) models are a useful multiequational econometric specification to simultaneously incorporate spatial effects and correlated error terms across equations. The purpose of the spsur R package is to supply a complete set of functions to test for spatial structures in the residual of a SUR model;to estimate the most popular specifications by applying different methods and test for linear restrictions on the parameters. The package also facilitates the estimation of so-called spatial impacts, conveniently adapted to a SUR framework. The package includes functions to simulate datasets with the features decided by the user, which may be useful in teaching activities or in more general research projects. The article concludes with a real data application showing the potential that spsur has to examine the relation of individual mobility over geographic areas and the incidence of COVID-19 in Spain during the first lockdown. © 2022, American Statistical Association. All rights reserved.

空间看似不相关回归(Spatial SUR)模型是一种有用的多方程计量经济学规范，可以同时包含空间效应和跨方程的相关误差项。spsur R包的目的是提供一套完整的函数来测试SUR模型残差中的空间结构;通过应用不同的方法来估计最流行的规范，并测试参数的线性限制。该方案还有助于估计所谓的空间影响，方便地适应SUR框架。该软件包包括模拟具有用户决定的特征的数据集的功能，这可能在教学活动或更一般的研究项目中有用。文章最后用一个真实的数据应用程序显示了spsur在第一次封锁期间必须研究地理区域内个人流动性与西班牙COVID-19发病率之间关系的潜力。©2022，美国统计协会。版权所有。

引用次数: 4

More on Multidimensional Scaling and Unfolding in R: smacof Version 2 更多关于R: smacof版本2中的多维缩放和展开

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Statistical Software

Pub Date : 2022-01-01 DOI: 10.18637/jss.v102.i10

P. Mair, P. Groenen, J. Leeuw

The smacof package offers a comprehensive implementation of multidimensional scaling (MDS) techniques in R . Since its first publication (De Leeuw and Mair 2009b) the functionality of the package has been enhanced, and several additional methods, features and utilities were added. Major updates include a complete re-implementation of multidimensional unfolding allowing for monotone dissimilarity transformations, including row-conditional, circular, and external unfolding. Additionally, the constrained MDS implementation was extended in terms of optimal scaling of the external variables. Further package additions include various tools and functions for goodness-of-fit assessment, unidimensional scaling, gravity MDS, asymmetric MDS, Procrustes, and MDS biplots. All these new package functionalities are illustrated using a variety of real-life applications.

smacof包在R中提供了多维缩放(MDS)技术的全面实现。自从第一次发布(De Leeuw and maair 2009b)以来，包的功能得到了增强，并添加了一些额外的方法、特性和实用程序。主要的更新包括完全重新实现多维展开，允许单调的不相似转换，包括行条件展开、循环展开和外部展开。此外，根据外部变量的最优缩放对约束MDS实现进行了扩展。进一步增加的包包括各种工具和功能，用于拟合优度评估、一维缩放、重力MDS、非对称MDS、Procrustes和MDS双标图。所有这些新的包功能都使用各种实际应用程序进行了说明。

引用次数: 24

Event History Regression with Pseudo-Observations: Computational Approaches and an Implementation in R 伪观测的事件历史回归:计算方法和R中的实现

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Statistical Software

Pub Date : 2022-01-01 DOI: 10.18637/jss.v102.i09

M. Sachs, E. Gabriel

Due to tradition and ease of estimation, the vast majority of clinical and epidemiological papers with time-to-event data report hazard ratios from Cox proportional hazards regression models. Although hazard ratios are well known, they can be difficult to interpret, particularly as causal contrasts, in many settings. Nonparametric or fully parametric estimators allow for the direct estimation of more easily causally interpretable estimands such as the cumulative incidence and restricted mean survival. However, modeling these quantities as functions of covariates is limited to a few categorical covariates with nonparametric estimators, and often requires simulation or numeric integration with parametric estimators. Combining pseudo-observations based on non-parametric estimands with parametric regression on the pseudo-observations allows for the best of these two approaches and has many nice properties. In this paper, we develop a user friendly, easy to understand way of doing event history regression for the cumulative incidence and the restricted mean survival, using the pseudo-observation framework for estimation. The interface uses the well known formulation of a generalized linear model and allows for features including plotting of residuals, the use of sampling weights, and correct variance estimation.

由于传统和易于估计，绝大多数具有时间-事件数据的临床和流行病学论文报告的风险比来自Cox比例风险回归模型。虽然风险比是众所周知的，但在许多情况下，它们很难解释，特别是作为因果对比。非参数或全参数估计允许直接估计更容易解释的因果估计，如累积发病率和限制平均生存。然而，将这些量建模为协变量的函数仅限于使用非参数估计器的几个分类协变量，并且通常需要使用参数估计器进行模拟或数值集成。将基于非参数估计的伪观测值与基于伪观测值的参数回归相结合，可以发挥这两种方法的优点，并具有许多良好的特性。在本文中，我们开发了一种用户友好，易于理解的方法，对累积发生率和限制平均生存进行事件历史回归，使用伪观测框架进行估计。该界面使用了众所周知的广义线性模型的公式，并允许包括绘制残差，使用采样权重和正确的方差估计在内的功能。

{"title":"Event History Regression with Pseudo-Observations: Computational Approaches and an Implementation in R","authors":"M. Sachs, E. Gabriel","doi":"10.18637/jss.v102.i09","DOIUrl":"https://doi.org/10.18637/jss.v102.i09","url":null,"abstract":"Due to tradition and ease of estimation, the vast majority of clinical and epidemiological papers with time-to-event data report hazard ratios from Cox proportional hazards regression models. Although hazard ratios are well known, they can be difficult to interpret, particularly as causal contrasts, in many settings. Nonparametric or fully parametric estimators allow for the direct estimation of more easily causally interpretable estimands such as the cumulative incidence and restricted mean survival. However, modeling these quantities as functions of covariates is limited to a few categorical covariates with nonparametric estimators, and often requires simulation or numeric integration with parametric estimators. Combining pseudo-observations based on non-parametric estimands with parametric regression on the pseudo-observations allows for the best of these two approaches and has many nice properties. In this paper, we develop a user friendly, easy to understand way of doing event history regression for the cumulative incidence and the restricted mean survival, using the pseudo-observation framework for estimation. The interface uses the well known formulation of a generalized linear model and allows for features including plotting of residuals, the use of sampling weights, and correct variance estimation.","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":"74 1","pages":""},"PeriodicalIF":5.8,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72655071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

tidypaleo: Visualizing Paleoenvironmental Archives Using ggplot2 利用ggplot2可视化古环境档案

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Statistical Software

Pub Date : 2022-01-01 DOI: 10.18637/jss.v101.i07

D. Dunnington, Nell Libera, J. Kurek, I. Spooner, G. Gagnon

This paper presents the tidypaleo package for R, which enables high-quality reproducible visualizations of time-stratigraphic multivariate data that is common to several disciplines of the natural sciences. Rather than introduce new plotting functions, the tidypaleo package defines several orthogonal components of the ggplot2 package that, when combined, enable most types of stratigraphic diagrams to be created. We do so by conceptualizing multi-parameter data as a series of measurements (rows) with attributes (columns), enabling the use of the ggplot2 facet mechanism to display multi-parameter data. The orthogonal components include (1) scales that represent relative abundance and concentration values, (2) geometries that are commonly used in paleoenvironmental diagrams created elsewhere, (3) facets that correctly assign scales and sizes to panels representing multiple parameters, and (4) theme elements that enable tidypaleo to create elegant graphics. Collectively, this approach demonstrates the efficacy of a minimal ggplot2 wrapper to create domain-specific plots.

本文介绍了R的titypaleo包，它可以实现高质量的可重复的时间地层多元数据可视化，这是自然科学的几个学科共同的。tiypaleo软件包并没有引入新的绘图功能，而是定义了ggplot2软件包的几个正交组件，当这些组件组合在一起时，可以创建大多数类型的地层图。为此，我们将多参数数据概念化为具有属性(列)的一系列度量值(行)，从而支持使用ggplot2 facet机制来显示多参数数据。正交组件包括(1)表示相对丰度和浓度值的尺度，(2)在其他地方创建的古环境图中常用的几何形状，(3)正确分配表示多个参数的面板的尺度和大小的切面，以及(4)使tiypaleo能够创建优雅图形的主题元素。总的来说，这种方法证明了最小的ggplot2包装器在创建特定于域的图方面的有效性。

引用次数: 5

econet: An R Package for Parameter-Dependent Network Centrality Measures econet:一个R包，用于参数依赖的网络中心性度量

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Statistical Software

Pub Date : 2022-01-01 DOI: 10.18637/jss.v102.i08

M. Battaglini, Valerio Leone Sciabolazza, Eleonora Patacchini, Sida Peng

The R package econet provides methods for estimating parameter-dependent network centrality measures with linear-in-means models. Both nonlinear least squares and maximum likelihood estimators are implemented. The methods allow for both link and node heterogeneity in network eﬀects, endogenous network formation and the presence of unconnected nodes. The routines also compare the explanatory power of parameter-dependent network centrality measures with those of standard measures of network centrality. Ben-eﬁts and features of the econet package are illustrated using data from Battaglini and Patacchini (2018) and Battaglini, Leone Sciabolazza, and Patacchini (2020).

R包econet提供了使用线性均值模型估计参数相关网络中心性度量的方法。实现了非线性最小二乘估计和极大似然估计。这些方法允许链路和节点在网络效应、内生网络形成和未连接节点的存在方面的异质性。例程还比较了参数依赖的网络中心性度量与标准网络中心性度量的解释力。使用Battaglini和Patacchini(2018)以及Battaglini、Leone Sciabolazza和Patacchini(2020)的数据说明了经济一揽子计划的效益和特征。

引用次数: 4

modelsummary: Data and Model Summaries in R modelsummary: R中的数据和模型摘要

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Statistical Software

Pub Date : 2022-01-01 DOI: 10.18637/jss.v103.i01

Vincent Arel‐Bundock

modelsummary is a package to summarize data and statistical models in R . It supports over one hundred types of models out-of-the-box, and allows users to report the results of those models side-by-side in a table, or in coefficient plots. It makes it easy to execute common tasks such as computing robust standard errors, adding significance stars, and manipulating coefficient and model labels. Beyond model summaries, the package also includes a suite of tools to produce highly flexible data summary tables, such as dataset overviews, correlation matrices, (multi-level) cross-tabulations, and balance tables (also known as “Table 1”). The appearance of the tables produced by modelsummary can be customized using external packages such as kableExtra , gt , flextable , or huxtable ; the plots can be customized using ggplot2 . Tables can be exported to many output formats, including HTML, L A TEX, Text/Markdown, Microsoft Word, Powerpoint, Excel, RTF, PDF, and image files. Tables and plots can be embedded seamlessly in rmarkdown , knitr , or Sweave dynamic documents. The modelsummary package is designed to be simple, robust, modular, and extensible.

modelsummary是一个在R中汇总数据和统计模型的包。它支持一百多种现成的模型，并允许用户在表格或系数图中并排报告这些模型的结果。它使执行诸如计算健壮标准误差、添加显著性星型以及操纵系数和模型标签等常见任务变得容易。除了模型摘要之外，该软件包还包括一套工具，用于生成高度灵活的数据摘要表，例如数据集概述、相关矩阵、(多级)交叉表和平衡表(也称为“表1”)。modelsummary生成的表的外观可以使用外部包(如kableExtra、gt、flextable或huxtable)进行定制;可以使用ggplot2定制这些图。表格可以导出为多种输出格式，包括HTML、la TEX、Text/Markdown、Microsoft Word、Powerpoint、Excel、RTF、PDF和图像文件。表格和绘图可以无缝地嵌入到markdown、knitr或Sweave动态文档中。modelsummary包被设计成简单、健壮、模块化和可扩展的。

引用次数: 11

Multivariate Normal Variance Mixtures in R: The R Package nvmix R中的多元正态方差混合:R包混合

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Statistical Software

Pub Date : 2022-01-01 DOI: 10.18637/jss.v102.i02

Erik Hintz, M. Hofert, C. Lemieux

We present the features and implementation of the R package nvmix for the class of normal variance mixtures including Student t and normal distributions. The package provides functionalities for such distributions, notably the evaluation of the distribution and density function as well as likelihood-based parameter estimation. The distributional family is specified through the quantile function of the underlying mixing random variable. The R package nvmix thus allows one to model multivariate distributions well beyond the classical multivariate normal and t case. Additional functionalities include graphical goodness-of-fit assessment, the estimation of the risk measures value-at-risk and expected shortfall for univariate normal variance mixture distributions and functions to work with normal variance mixture copulas, such as sampling and the evaluation of normal variance mixture copulas and their densities. Furthermore, the package nvmix also provides functionalities for the evaluation of the distribution and density function as well as random variate generation for the more general class of grouped normal variance mixtures.

我们介绍了R包nvmix的特征和实现，用于包括Student t和正态分布在内的正态方差混合类。该软件包为这种分布提供了功能，特别是分布和密度函数的评估以及基于似然的参数估计。分布族是通过底层混合随机变量的分位数函数指定的。因此，R包nvmix允许对多变量分布进行建模，远远超出了经典的多变量正态分布和t情况。附加功能包括图形拟合优度评估，单变量正态方差混合分布的风险度量值和预期不足的估计，以及与正态方差混合copuls一起工作的函数，例如采样和正态方差混合copuls及其密度的评估。此外，nvmix包还提供了用于评估分布和密度函数的功能，以及用于更一般类别的分组正态方差混合物的随机变量生成功能。

引用次数: 2

Pro Data Visualization Using R and JavaScript: Analyze and Visualize Key Data on the Web 使用R和JavaScript的Pro数据可视化:分析和可视化Web上的关键数据

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Statistical Software

Pub Date : 2022-01-01 DOI: 10.18637/jss.v102.b01

U. Grömping

引用次数: 0

Monotone Regression: A Simple and Fast O(n) PAVA Implementation 单调回归:一种简单快速的O(n) PAVA实现

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Statistical Software

Pub Date : 2022-01-01 DOI: 10.18637/jss.v102.c01

F. Busing

Efficient coding and improvements in the execution order of the up-and-down-blocks algorithm for monotone or isotonic regression leads to a significant increase in speed as well as a short and simple O ( n ) implementation. Algorithms that use monotone regression as a subroutine, e.g., unimodal or bivariate monotone regression, also benefit from the acceleration. A substantive comparison with and characterization of currently available implementations provides an extensive overview of up-and-down-blocks implementations for the pool-adjacent-violators algorithm for simple linear ordered monotone regression.

单调或等压回归的上下块算法的高效编码和执行顺序的改进导致速度的显着提高以及短而简单的O (n)实现。使用单调回归作为子程序的算法，例如单峰单调回归或二元单调回归，也受益于加速。对当前可用实现的实质性比较和特征描述提供了简单线性有序单调回归的池邻接违反者算法的上下块实现的广泛概述。

引用次数: 3

tlrmvnmvt: Computing High-Dimensional Multivariate Normal and Student- t Probabilities with Low-Rank Methods in R 用R中的低秩方法计算高维多元正态和学生- t概率

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Statistical Software

Pub Date : 2022-01-01 DOI: 10.18637/jss.v101.i04

Jian Cao, M. Genton, D. Keyes, G. Turkiyyah

This paper introduces the usage and performance of the R package tlrmvnmvt, aimed at computing high-dimensional multivariate normal and Student-t probabilities. The package implements the tile-low-rank methods with block reordering and the separationof-variable methods with univariate reordering. The performance is compared with two other state-of-the-art R packages, namely the mvtnorm and the TruncatedNormal packages. Our package has the best scalability and is likely to be the only option in thousands of dimensions. However, for applications with high accuracy requirements, the TruncatedNormal package is more suitable. As an application example, we show that the excursion sets of a latent Gaussian random field can be computed with the tlrmvnmvt package without any model approximation and hence, the accuracy of the produced excursion sets is improved.

本文介绍了用于计算高维多元正态概率和Student-t概率的R包tlrmvnmvt的使用方法和性能。该包实现了具有块重排序的低秩瓦片方法和具有单变量重排序的分离变量方法。将性能与另外两个最先进的R包(即mvtnorm和TruncatedNormal包)进行比较。我们的包具有最好的可伸缩性，并且可能是数千个维度中唯一的选择。然而，对于精度要求较高的应用，TruncatedNormal包更合适。作为一个应用实例，我们证明了使用tlrmvnmvt包可以在没有任何模型近似的情况下计算潜在高斯随机场的偏移集，从而提高了所产生偏移集的精度。

引用次数: 6

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Journal of Statistical Software

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀