首页 > 最新文献

Journal of Statistical Software最新文献

英文 中文
dalmatian: A Package for Fitting Double Hierarchical Linear Models in R via JAGS and nimble dalmatian:一个通过JAGS和nimble在R中拟合双重层次线性模型的包
IF 5.8 2区 计算机科学 Q1 Mathematics Pub Date : 2021-01-01 DOI: 10.18637/jss.v100.i10
S. Bonner, Hanjoe Kim, D. Westneat, A. Mutzel, Jonathan Wright, Matthew R. Schofield
Traditional regression models, including generalized linear mixed models, focus on understanding the deterministic factors that affect the mean of a response variable. Many biological studies seek to understand non-deterministic patterns in the variance or dispersion of a phenotypic or ecological response variable. We describe a new R package, dalmatian, that provides methods for fitting double hierarchical generalized linear models incorporating fixed and random predictors of both the mean and variance. Models are fit via Markov chain Monte Carlo sampling implemented in either JAGS or nimble and the package provides simple functions for monitoring the sampler and summarizing the results. We illustrate these functions through an application to data on food delivery by breeding pied flycatchers (Ficedula hypoleuca). Our intent is that this package makes it easier for practitioners to implement these models without having to learn the intricacies of Markov chain Monte Carlo methods.
传统的回归模型,包括广义线性混合模型,侧重于理解影响响应变量均值的确定性因素。许多生物学研究试图理解表型或生态反应变量的变异或分散的非确定性模式。我们描述了一个新的R包,dalmatian,它提供了拟合双层次广义线性模型的方法,该模型包含固定和随机的均值和方差预测因子。通过JAGS或nimble实现的马尔可夫链蒙特卡罗采样来拟合模型,并且该软件包提供了简单的功能来监控采样器和总结结果。我们将这些功能应用于通过繁殖斑蝇(Ficedula hypoleuca)来传递食物的数据。我们的目的是,这个包使从业者更容易实现这些模型,而不必学习复杂的马尔可夫链蒙特卡洛方法。
{"title":"dalmatian: A Package for Fitting Double Hierarchical Linear Models in R via JAGS and nimble","authors":"S. Bonner, Hanjoe Kim, D. Westneat, A. Mutzel, Jonathan Wright, Matthew R. Schofield","doi":"10.18637/jss.v100.i10","DOIUrl":"https://doi.org/10.18637/jss.v100.i10","url":null,"abstract":"Traditional regression models, including generalized linear mixed models, focus on understanding the deterministic factors that affect the mean of a response variable. Many biological studies seek to understand non-deterministic patterns in the variance or dispersion of a phenotypic or ecological response variable. We describe a new R package, dalmatian, that provides methods for fitting double hierarchical generalized linear models incorporating fixed and random predictors of both the mean and variance. Models are fit via Markov chain Monte Carlo sampling implemented in either JAGS or nimble and the package provides simple functions for monitoring the sampler and summarizing the results. We illustrate these functions through an application to data on food delivery by breeding pied flycatchers (Ficedula hypoleuca). Our intent is that this package makes it easier for practitioners to implement these models without having to learn the intricacies of Markov chain Monte Carlo methods.","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74155214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
D-STEM v2: A Software for Modeling Functional Spatio-Temporal Data D-STEM v2:功能时空数据建模软件
IF 5.8 2区 计算机科学 Q1 Mathematics Pub Date : 2021-01-01 DOI: 10.18637/jss.v099.i10
Yaqiong Wang,Francesco Finazzi,Alessandro Fassò
Functional spatio-temporal data naturally arise in many environmental and climate applications where data are collected in a three-dimensional space over time. The MATLAB D-STEM v1 software package was first introduced for modelling multivariate space-time data and has been recently extended to D-STEM v2 to handle functional data indexed across space and over time. This paper introduces the new modelling capabilities of DSTEM v2 as well as the complexity reduction techniques required when dealing with large data sets. Model estimation, validation and dynamic kriging are demonstrated in two case studies, one related to ground-level air quality data in Beijing, China, and the other one related to atmospheric profile data collected globally through radio sounding.
功能时空数据自然出现在许多环境和气候应用中,其中数据是在三维空间中随时间收集的。MATLAB D-STEM v1软件包最初用于建模多元时空数据,最近已扩展到D-STEM v2,以处理跨空间和时间索引的功能数据。本文介绍了DSTEM v2的新建模功能,以及在处理大型数据集时所需的复杂性降低技术。模型估计、验证和动态克里格在两个案例中进行了演示,一个与中国北京的地面空气质量数据有关,另一个与通过无线电探测收集的全球大气剖面数据有关。
{"title":"D-STEM v2: A Software for Modeling Functional Spatio-Temporal Data","authors":"Yaqiong Wang,Francesco Finazzi,Alessandro Fassò","doi":"10.18637/jss.v099.i10","DOIUrl":"https://doi.org/10.18637/jss.v099.i10","url":null,"abstract":"Functional spatio-temporal data naturally arise in many environmental and climate applications where data are collected in a three-dimensional space over time. The MATLAB D-STEM v1 software package was first introduced for modelling multivariate space-time data and has been recently extended to D-STEM v2 to handle functional data indexed across space and over time. This paper introduces the new modelling capabilities of DSTEM v2 as well as the complexity reduction techniques required when dealing with large data sets. Model estimation, validation and dynamic kriging are demonstrated in two case studies, one related to ground-level air quality data in Beijing, China, and the other one related to atmospheric profile data collected globally through radio sounding.","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138494701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
BayesCTDesign: An R Package for Bayesian Trial Design Using Historical Control Data. BayesCTDesign:一个使用历史控制数据的贝叶斯试验设计R包。
IF 5.8 2区 计算机科学 Q1 Mathematics Pub Date : 2021-01-01 Epub Date: 2021-11-30 DOI: 10.18637/jss.v100.i21
Barry S Eggleston, Joseph G Ibrahim, Becky McNeil, Diane Catellier

This article introduces the R (R Core Team 2019) package BayesCTDesign for two-arm randomized Bayesian trial design using historical control data when available, and simple two-arm randomized Bayesian trial design when historical control data is not available. The package BayesCTDesign, which is available on CRAN, has two simulation functions, historic_sim() and simple_sim() for studying trial characteristics under user defined scenarios, and two methods print() and plot() for displaying summaries of the simulated trial characteristics. The package BayesCTDesign works with two-arm trials with equal sample sizes per arm. The package BayesCTDesign allows a user to study Gaussian, Poisson, Bernoulli, Weibull, Lognormal, and Piecewise Exponential (pwe) outcomes. Power for two-sided hypothesis tests at a user defined alpha is estimated via simulation using a test within each simulation replication that involves comparing a 95% credible interval for the outcome specific treatment effect measure to the null case value. If the 95% credible interval excludes the null case value, then the null hypothesis is rejected, else the null hypothesis is accepted. In the article, the idea of including historical control data in a Bayesian analysis is reviewed, the estimation process of BayesCTDesign is explained, and the user interface is described. Finally, the BayesCTDesign is illustrated via several examples.

本文介绍了R(R Core Team 2019)软件包BayesCTDesign,用于在可用的情况下使用历史对照数据进行双臂随机贝叶斯试验设计,以及在没有历史对照数据时进行简单的双臂随机贝叶斯试验设计。CRAN上提供的包BayesCTDesign有两个模拟函数,historic_sim()和simple_sim(。BayesCTDesign软件包适用于两个手臂试验,每个手臂的样本量相等。包BayesCTDesign允许用户研究高斯、泊松、伯努利、威布尔、对数正态和逐段指数(pwe)结果。通过在每个模拟复制中使用测试的模拟来估计用户定义的α下的双侧假设测试的功率,该测试涉及将结果特异性治疗效果测量的95%可信区间与零病例值进行比较。如果95%可信区间排除了零情况值,则拒绝零假设,否则接受零假设。本文回顾了将历史控制数据纳入贝叶斯分析的思想,解释了贝叶斯设计的估计过程,并描述了用户界面。最后,通过几个例子说明了贝叶斯TDesign。
{"title":"BayesCTDesign: An R Package for Bayesian Trial Design Using Historical Control Data.","authors":"Barry S Eggleston,&nbsp;Joseph G Ibrahim,&nbsp;Becky McNeil,&nbsp;Diane Catellier","doi":"10.18637/jss.v100.i21","DOIUrl":"10.18637/jss.v100.i21","url":null,"abstract":"<p><p>This article introduces the R (R Core Team 2019) package <b>BayesCTDesign</b> for two-arm randomized Bayesian trial design using historical control data when available, and simple two-arm randomized Bayesian trial design when historical control data is not available. The package <b>BayesCTDesign</b>, which is available on CRAN, has two simulation functions, historic_sim() and simple_sim() for studying trial characteristics under user defined scenarios, and two methods print() and plot() for displaying summaries of the simulated trial characteristics. The package <b>BayesCTDesign</b> works with two-arm trials with equal sample sizes per arm. The package <b>BayesCTDesign</b> allows a user to study Gaussian, Poisson, Bernoulli, Weibull, Lognormal, and Piecewise Exponential (pwe) outcomes. Power for two-sided hypothesis tests at a user defined alpha is estimated via simulation using a test within each simulation replication that involves comparing a 95% credible interval for the outcome specific treatment effect measure to the null case value. If the 95% credible interval excludes the null case value, then the null hypothesis is rejected, else the null hypothesis is accepted. In the article, the idea of including historical control data in a Bayesian analysis is reviewed, the estimation process of <b>BayesCTDesign</b> is explained, and the user interface is described. Finally, the <b>BayesCTDesign</b> is illustrated via several examples.</p>","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8715862/pdf/nihms-1661326.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39778165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Data Visualization: Charts, Maps, and Interactive Graphics 数据可视化:图表、地图和交互式图形
IF 5.8 2区 计算机科学 Q1 Mathematics Pub Date : 2021-01-01 DOI: 10.18637/jss.v098.b01
U. Grömping
{"title":"Data Visualization: Charts, Maps, and Interactive Graphics","authors":"U. Grömping","doi":"10.18637/jss.v098.b01","DOIUrl":"https://doi.org/10.18637/jss.v098.b01","url":null,"abstract":"","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76672675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Analysis of Multiplex Social Networks with R 基于R的多元社会网络分析
IF 5.8 2区 计算机科学 Q1 Mathematics Pub Date : 2021-01-01 DOI: 10.18637/jss.v098.i08
Matteo Magnani, L. Rossi, Davide Vega
Multiplex social networks are characterized by a common set of actors connected through multiple types of relations. The multinet package provides a set of R functions to analyze multiplex social networks within the more general framework of multilayer networks, where each type of relation is represented as a layer in the network. The package contains functions to import/export, create and manipulate multilayer networks, implementations of several state-of-the-art multiplex network analysis algorithms, e.g., for centrality measures, layer comparison, community detection and visualization. Internally, the package is mainly written in native C++ and integrated with R using the Rcpp package.
多元社会网络的特点是一组共同的参与者通过多种类型的关系联系在一起。多网络包提供了一组R函数,用于在多层网络的更一般框架内分析多重社会网络,其中每种类型的关系都表示为网络中的一层。该软件包包含导入/导出、创建和操作多层网络的功能,以及几种最先进的多路网络分析算法的实现,例如,用于中心性测量、层比较、社区检测和可视化。在内部,该包主要是用本地c++编写的,并使用Rcpp包与R集成。
{"title":"Analysis of Multiplex Social Networks with R","authors":"Matteo Magnani, L. Rossi, Davide Vega","doi":"10.18637/jss.v098.i08","DOIUrl":"https://doi.org/10.18637/jss.v098.i08","url":null,"abstract":"Multiplex social networks are characterized by a common set of actors connected through multiple types of relations. The multinet package provides a set of R functions to analyze multiplex social networks within the more general framework of multilayer networks, where each type of relation is represented as a layer in the network. The package contains functions to import/export, create and manipulate multilayer networks, implementations of several state-of-the-art multiplex network analysis algorithms, e.g., for centrality measures, layer comparison, community detection and visualization. Internally, the package is mainly written in native C++ and integrated with R using the Rcpp package.","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82013385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Software for Bayesian Statistics 贝叶斯统计软件
IF 5.8 2区 计算机科学 Q1 Mathematics Pub Date : 2021-01-01 DOI: 10.18637/jss.v100.i01
M. Cameletti, V. Gómez‐Rubio
In this summary we introduce the papers published in the special issue on Bayesian statistics. This special issue comprises 20 papers on Bayesian statistics and Bayesian inference on different topics such as general packages for hierarchical linear model fitting, survival models, clinical trials, missing values, time series, hypothesis testing, priors, approximate Bayesian computation, and others.
在这篇摘要中,我们介绍了发表在贝叶斯统计专刊上的论文。本期特刊包括20篇关于贝叶斯统计和贝叶斯推理的论文,涉及不同的主题,如层次线性模型拟合的通用包、生存模型、临床试验、缺失值、时间序列、假设检验、先验、近似贝叶斯计算等。
{"title":"Software for Bayesian Statistics","authors":"M. Cameletti, V. Gómez‐Rubio","doi":"10.18637/jss.v100.i01","DOIUrl":"https://doi.org/10.18637/jss.v100.i01","url":null,"abstract":"In this summary we introduce the papers published in the special issue on Bayesian statistics. This special issue comprises 20 papers on Bayesian statistics and Bayesian inference on different topics such as general packages for hierarchical linear model fitting, survival models, clinical trials, missing values, time series, hypothesis testing, priors, approximate Bayesian computation, and others.","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90921509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Sequential Monte Carlo Methods in the nimble and nimbleSMC R Packages 灵活的esmc R包中的顺序蒙特卡罗方法
IF 5.8 2区 计算机科学 Q1 Mathematics Pub Date : 2021-01-01 DOI: 10.18637/jss.v100.i03
Nick Michaud, P. Valpine, Daniel Turek, C. Paciorek, D. Nguyen
nimble is an R package for constructing algorithms and conducting inference on hierarchical models. The nimble package provides a unique combination of flexible model specification and the ability to program model-generic algorithms. Specifically, the package allows users to code models in the BUGS language, and it allows users to write algorithms that can be applied to any appropriate model. In this paper, we introduce the nimbleSMC R package. nimbleSMC contains algorithms for state-space model analysis using sequential Monte Carlo (SMC) techniques that are built using nimble . We first provide an overview of state-space models and commonly-used SMC algorithms. We then describe how to build a state-space model in nimble and conduct inference using existing SMC algorithms within nimbleSMC . SMC algorithms within nimbleSMC currently include the bootstrap filter, auxiliary particle filter, ensemble Kalman filter, IF2 method of iterated filtering, and a particle Markov chain Monte Carlo (MCMC) sampler. These algorithms can be run in R or compiled into C++ for more efficient execution. Examples of applying SMC algorithms to linear autoregressive models and a stochastic volatility model are provided. Finally, we give an overview of how model-generic algorithms are coded within nimble by providing code for a simple SMC algorithm. This illustrates how users can easily extend nimble’s SMC methods in high-level code.
nimble是一个R包,用于构造算法和对层次模型进行推理。灵活的软件包提供了灵活的模型规范和编程模型通用算法的能力的独特组合。具体地说,这个包允许用户用BUGS语言编写模型,并且允许用户编写可应用于任何适当模型的算法。在本文中,我们介绍了灵活的esmc R包。nimbleSMC包含使用顺序蒙特卡罗(SMC)技术进行状态空间模型分析的算法,该技术是使用nimble构建的。我们首先概述了状态空间模型和常用的SMC算法。然后,我们描述了如何在nimble中构建状态空间模型,并在nimbleSMC中使用现有的SMC算法进行推理。nimbleSMC中的SMC算法目前包括自举滤波、辅助粒子滤波、集成卡尔曼滤波、IF2迭代滤波方法和粒子马尔可夫链蒙特卡罗(MCMC)采样器。这些算法可以在R中运行或编译成c++以提高执行效率。给出了将SMC算法应用于线性自回归模型和随机波动模型的实例。最后,我们通过提供一个简单的SMC算法的代码,概述了如何在敏捷中编码模型泛型算法。这说明了用户如何在高级代码中轻松扩展nimble的SMC方法。
{"title":"Sequential Monte Carlo Methods in the nimble and nimbleSMC R Packages","authors":"Nick Michaud, P. Valpine, Daniel Turek, C. Paciorek, D. Nguyen","doi":"10.18637/jss.v100.i03","DOIUrl":"https://doi.org/10.18637/jss.v100.i03","url":null,"abstract":"nimble is an R package for constructing algorithms and conducting inference on hierarchical models. The nimble package provides a unique combination of flexible model specification and the ability to program model-generic algorithms. Specifically, the package allows users to code models in the BUGS language, and it allows users to write algorithms that can be applied to any appropriate model. In this paper, we introduce the nimbleSMC R package. nimbleSMC contains algorithms for state-space model analysis using sequential Monte Carlo (SMC) techniques that are built using nimble . We first provide an overview of state-space models and commonly-used SMC algorithms. We then describe how to build a state-space model in nimble and conduct inference using existing SMC algorithms within nimbleSMC . SMC algorithms within nimbleSMC currently include the bootstrap filter, auxiliary particle filter, ensemble Kalman filter, IF2 method of iterated filtering, and a particle Markov chain Monte Carlo (MCMC) sampler. These algorithms can be run in R or compiled into C++ for more efficient execution. Examples of applying SMC algorithms to linear autoregressive models and a stochastic volatility model are provided. Finally, we give an overview of how model-generic algorithms are coded within nimble by providing code for a simple SMC algorithm. This illustrates how users can easily extend nimble’s SMC methods in high-level code.","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78438267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
TRES: An R Package for Tensor Regression and Envelope Algorithms 一个用于张量回归和包络算法的R包
IF 5.8 2区 计算机科学 Q1 Mathematics Pub Date : 2021-01-01 DOI: 10.18637/jss.v099.i12
Jing Zeng, Wenjing Wang, Xin Zhang
Recently, there has been a growing interest in tensor data analysis, where tensor regression is the cornerstone of statistical modeling for tensor data. This package provides the standard least squares estimators and the more efficient envelope estimators for the tensor response regression (TRR) and the tensor predictor regression (TPR) models. Envelope methodology is a relatively new class of dimension reduction techniques that jointly models the regression mean and covariance parameters. Three types of widely applicable envelope estimation algorithms are implemented and applied to both TRR and TPR models.
最近,人们对张量数据分析越来越感兴趣,其中张量回归是张量数据统计建模的基石。该软件包为张量响应回归(TRR)和张量预测回归(TPR)模型提供了标准最小二乘估计器和更有效的包络估计器。包络方法是一类比较新的降维技术,它联合建模回归均值和协方差参数。实现了三种广泛应用的包络估计算法,并将其应用于TRR和TPR模型。
{"title":"TRES: An R Package for Tensor Regression and Envelope Algorithms","authors":"Jing Zeng, Wenjing Wang, Xin Zhang","doi":"10.18637/jss.v099.i12","DOIUrl":"https://doi.org/10.18637/jss.v099.i12","url":null,"abstract":"Recently, there has been a growing interest in tensor data analysis, where tensor regression is the cornerstone of statistical modeling for tensor data. This package provides the standard least squares estimators and the more efficient envelope estimators for the tensor response regression (TRR) and the tensor predictor regression (TPR) models. Envelope methodology is a relatively new class of dimension reduction techniques that jointly models the regression mean and covariance parameters. Three types of widely applicable envelope estimation algorithms are implemented and applied to both TRR and TPR models.","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83063781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Model-Based Clustering, Classification, and Discriminant Analysis Using the Generalized Hyperbolic Distribution: MixGHD R package 使用广义双曲分布的基于模型的聚类、分类和判别分析:MixGHD R包
IF 5.8 2区 计算机科学 Q1 Mathematics Pub Date : 2021-01-01 DOI: 10.18637/jss.v098.i03
C. Tortora, R. Browne, Aisha Elsherbiny, B. Franczak, P. McNicholas
The MixGHD package for R performs model-based clustering, classification, and discriminant analysis using the generalized hyperbolic distribution (GHD). This approach is suitable for data that can be considered a realization of a (multivariate) continuous random variable. The GHD has the advantage of being flexible due to skewness, concentration, and index parameters; as such, clustering methods that use this distribution are capable of estimating clusters characterized by different shapes. The package provides five different models all based on the GHD, an efficient routine for discriminant analysis, and a function to measure cluster agreement. This paper is split into three parts: the first is devoted to the formulation of each method, extending them for classification and discriminant analysis applications, the second focuses on the algorithms, and the third shows the use of the package on real datasets.
用于R的MixGHD包使用广义双曲分布(GHD)执行基于模型的聚类、分类和判别分析。这种方法适用于可以被认为是一个(多变量)连续随机变量的实现的数据。GHD的优点是由于偏度、浓度和指标参数而具有灵活性;因此,使用这种分布的聚类方法能够估计具有不同形状特征的聚类。该包提供了五个不同的模型都基于GHD,一个有效的例程判别分析,和一个功能来衡量聚类协议。本文分为三部分:第一部分致力于每种方法的制定,将其扩展到分类和判别分析应用;第二部分侧重于算法;第三部分展示了该软件包在实际数据集上的使用。
{"title":"Model-Based Clustering, Classification, and Discriminant Analysis Using the Generalized Hyperbolic Distribution: MixGHD R package","authors":"C. Tortora, R. Browne, Aisha Elsherbiny, B. Franczak, P. McNicholas","doi":"10.18637/jss.v098.i03","DOIUrl":"https://doi.org/10.18637/jss.v098.i03","url":null,"abstract":"The MixGHD package for R performs model-based clustering, classification, and discriminant analysis using the generalized hyperbolic distribution (GHD). This approach is suitable for data that can be considered a realization of a (multivariate) continuous random variable. The GHD has the advantage of being flexible due to skewness, concentration, and index parameters; as such, clustering methods that use this distribution are capable of estimating clusters characterized by different shapes. The package provides five different models all based on the GHD, an efficient routine for discriminant analysis, and a function to measure cluster agreement. This paper is split into three parts: the first is devoted to the formulation of each method, extending them for classification and discriminant analysis applications, the second focuses on the algorithms, and the third shows the use of the package on real datasets.","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84220551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Conformal Prediction with Orange 橙色适形预测
IF 5.8 2区 计算机科学 Q1 Mathematics Pub Date : 2021-01-01 DOI: 10.18637/jss.v098.i07
Tomaz Hocevar, B. Zupan, Jonna C. Stålring
Conformal predictors estimate the reliability of outcomes made by supervised machine learning models. Instead of a point value, conformal prediction defines an outcome region that meets a user-specified reliability threshold. Provided that the data are independently and identically distributed, the user can control the level of the prediction errors and adjust it following the requirements of a given application. The quality of conformal predictions often depends on the choice of nonconformity estimate for a given machine learning method. To promote the selection of a successful approach, we have developed Orange3-Conformal, a Python library that provides a range of conformal prediction methods for classification and regression. The library also implements several nonconformity scores. It has a modular design and can be extended to add new conformal prediction methods and nonconformities.
适形预测器估计由监督机器学习模型得出的结果的可靠性。与点值不同,保形预测定义了满足用户指定的可靠性阈值的结果区域。如果数据是独立且相同分布的,则用户可以控制预测误差的级别,并根据给定应用程序的要求进行调整。适形预测的质量通常取决于对给定机器学习方法的不合格估计的选择。为了促进对成功方法的选择,我们开发了Orange3-Conformal,这是一个Python库,提供了一系列用于分类和回归的保形预测方法。该库还实现了几个不符合分数。它具有模块化设计,可以扩展到添加新的适形预测方法和不合格项。
{"title":"Conformal Prediction with Orange","authors":"Tomaz Hocevar, B. Zupan, Jonna C. Stålring","doi":"10.18637/jss.v098.i07","DOIUrl":"https://doi.org/10.18637/jss.v098.i07","url":null,"abstract":"Conformal predictors estimate the reliability of outcomes made by supervised machine learning models. Instead of a point value, conformal prediction defines an outcome region that meets a user-specified reliability threshold. Provided that the data are independently and identically distributed, the user can control the level of the prediction errors and adjust it following the requirements of a given application. The quality of conformal predictions often depends on the choice of nonconformity estimate for a given machine learning method. To promote the selection of a successful approach, we have developed Orange3-Conformal, a Python library that provides a range of conformal prediction methods for classification and regression. The library also implements several nonconformity scores. It has a modular design and can be extended to add new conformal prediction methods and nonconformities.","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88355408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Journal of Statistical Software
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1