首页 > 最新文献

arXiv: Computation最新文献

英文 中文
An introduction to sampling via measure transport 通过测量传输取样的介绍
Pub Date : 2016-02-16 DOI: 10.1007/978-3-319-11259-6_23-1
Y. Marzouk, T. Moselhy, M. Parno, Alessio Spantini
{"title":"An introduction to sampling via measure transport","authors":"Y. Marzouk, T. Moselhy, M. Parno, Alessio Spantini","doi":"10.1007/978-3-319-11259-6_23-1","DOIUrl":"https://doi.org/10.1007/978-3-319-11259-6_23-1","url":null,"abstract":"","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"108 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74659204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 84
A Unified Monte-Carlo Jackknife for Small Area Estimation after Model Selection 模型选择后小面积估计的统一蒙特卡罗折刀
Pub Date : 2016-02-16 DOI: 10.4310/AMSA.2018.V3.N2.A2
Jiming Jiang, P. Lahiri, Thuan Nguyen
We consider estimation of measure of uncertainty in small area estimation (SAE) when a procedure of model selection is involved prior to the estimation. A unified Monte-Carlo jackknife method, called McJack, is proposed for estimating the logarithm of the mean squared prediction error. We prove the second-order unbiasedness of McJack, and demonstrate the performance of McJack in assessing uncertainty in SAE after model selection through empirical investigations that include simulation studies and real-data analyses.
研究了小面积估计中不确定测度的估计问题,在估计前先进行模型选择。提出了一种统一的蒙特卡罗折刀法,即McJack,用于估计均方预测误差的对数。我们证明了McJack的二阶无偏性,并通过包括仿真研究和实际数据分析在内的实证研究证明了McJack在模型选择后评估SAE不确定性方面的性能。
{"title":"A Unified Monte-Carlo Jackknife for Small Area Estimation after Model Selection","authors":"Jiming Jiang, P. Lahiri, Thuan Nguyen","doi":"10.4310/AMSA.2018.V3.N2.A2","DOIUrl":"https://doi.org/10.4310/AMSA.2018.V3.N2.A2","url":null,"abstract":"We consider estimation of measure of uncertainty in small area estimation (SAE) when a procedure of model selection is involved prior to the estimation. A unified Monte-Carlo jackknife method, called McJack, is proposed for estimating the logarithm of the mean squared prediction error. We prove the second-order unbiasedness of McJack, and demonstrate the performance of McJack in assessing uncertainty in SAE after model selection through empirical investigations that include simulation studies and real-data analyses.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86255428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Sparse Generalized Principal Component Analysis for Large-scale Applications beyond Gaussianity 非高斯大规模应用的稀疏广义主成分分析
Pub Date : 2015-12-12 DOI: 10.4310/SII.2016.V9.N4.A11
Qiaoya Zhang, Yiyuan She
Principal Component Analysis (PCA) is a dimension reduction technique. It produces inconsistent estimators when the dimensionality is moderate to high, which is often the problem in modern large-scale applications where algorithm scalability and model interpretability are difficult to achieve, not to mention the prevalence of missing values. While existing sparse PCA methods alleviate inconsistency, they are constrained to the Gaussian assumption of classical PCA and fail to address algorithm scalability issues. We generalize sparse PCA to the broad exponential family distributions under high-dimensional setup, with built-in treatment for missing values. Meanwhile we propose a family of iterative sparse generalized PCA (SG-PCA) algorithms such that despite the non-convexity and non-smoothness of the optimization task, the loss function decreases in every iteration. In terms of ease and intuitive parameter tuning, our sparsity-inducing regularization is far superior to the popular Lasso. Furthermore, to promote overall scalability, accelerated gradient is integrated for fast convergence, while a progressive screening technique gradually squeezes out nuisance dimensions of a large-scale problem for feasible optimization. High-dimensional simulation and real data experiments demonstrate the efficiency and efficacy of SG-PCA.
主成分分析(PCA)是一种降维技术。当维数中高时,它会产生不一致的估计量,这是现代大规模应用程序中经常出现的问题,其中算法可伸缩性和模型可解释性难以实现,更不用说普遍存在的缺失值。虽然现有的稀疏主成分分析方法可以缓解不一致性,但它们受限于经典主成分分析的高斯假设,无法解决算法的可扩展性问题。我们将稀疏PCA推广到高维设置下的广义指数族分布,并对缺失值进行了内置处理。同时,我们提出了一组迭代稀疏广义PCA (SG-PCA)算法,尽管优化任务具有非凸性和非光滑性,但每次迭代的损失函数都在减小。在简单和直观的参数调整方面,我们的稀疏性诱导正则化远远优于流行的Lasso。此外,为了提高整体的可扩展性,采用加速梯度快速收敛,渐进筛选技术逐步剔除大规模问题的干扰维度,实现可行的优化。高维仿真和实际数据实验验证了SG-PCA的有效性和有效性。
{"title":"Sparse Generalized Principal Component Analysis for Large-scale Applications beyond Gaussianity","authors":"Qiaoya Zhang, Yiyuan She","doi":"10.4310/SII.2016.V9.N4.A11","DOIUrl":"https://doi.org/10.4310/SII.2016.V9.N4.A11","url":null,"abstract":"Principal Component Analysis (PCA) is a dimension reduction technique. It produces inconsistent estimators when the dimensionality is moderate to high, which is often the problem in modern large-scale applications where algorithm scalability and model interpretability are difficult to achieve, not to mention the prevalence of missing values. While existing sparse PCA methods alleviate inconsistency, they are constrained to the Gaussian assumption of classical PCA and fail to address algorithm scalability issues. We generalize sparse PCA to the broad exponential family distributions under high-dimensional setup, with built-in treatment for missing values. Meanwhile we propose a family of iterative sparse generalized PCA (SG-PCA) algorithms such that despite the non-convexity and non-smoothness of the optimization task, the loss function decreases in every iteration. In terms of ease and intuitive parameter tuning, our sparsity-inducing regularization is far superior to the popular Lasso. Furthermore, to promote overall scalability, accelerated gradient is integrated for fast convergence, while a progressive screening technique gradually squeezes out nuisance dimensions of a large-scale problem for feasible optimization. High-dimensional simulation and real data experiments demonstrate the efficiency and efficacy of SG-PCA.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"9 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73104443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Robust Estimation of the Generalized Loggamma Model. The R Package robustloggamma 广义对数模型的鲁棒估计。R包鲁棒对数
Pub Date : 2015-12-05 DOI: 10.18637/JSS.V070.I07
C. Agostinelli, A. Marazzi, V. Yohai, A. Randriamiharisoa
robustloggamma is an R package for robust estimation and inference in the generalized loggamma model. We briefly introduce the model, the estimation procedures and the computational algorithms. Then, we illustrate the use of the package with the help of a real data set.
robustloggamma是一个R包,用于在广义loggamma模型中进行稳健估计和推理。简要介绍了该模型、估计过程和计算算法。然后,我们借助一个真实的数据集来说明该包的使用。
{"title":"Robust Estimation of the Generalized Loggamma Model. The R Package robustloggamma","authors":"C. Agostinelli, A. Marazzi, V. Yohai, A. Randriamiharisoa","doi":"10.18637/JSS.V070.I07","DOIUrl":"https://doi.org/10.18637/JSS.V070.I07","url":null,"abstract":"robustloggamma is an R package for robust estimation and inference in the generalized loggamma model. We briefly introduce the model, the estimation procedures and the computational algorithms. Then, we illustrate the use of the package with the help of a real data set.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77094174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Embarrassingly Parallel Sequential Markov-chain Monte Carlo for Large Sets of Time Series 大时间序列集的令人尴尬的平行序列马尔可夫链蒙特卡罗
Pub Date : 2015-12-04 DOI: 10.4310/SII.2016.V9.N4.A9
R. Casarin, Radu V. Craiu, F. Leisen
Bayesian computation crucially relies on Markov chain Monte Carlo (MCMC) algorithms. In the case of massive data sets, running the Metropolis-Hastings sampler to draw from the posterior distribution becomes prohibitive due to the large number of likelihood terms that need to be calculated at each iteration. In order to perform Bayesian inference for a large set of time series, we consider an algorithm that combines 'divide and conquer" ideas previously used to design MCMC algorithms for big data with a sequential MCMC strategy. The performance of the method is illustrated using a large set of financial data.
贝叶斯计算关键依赖于马尔可夫链蒙特卡罗(MCMC)算法。在大量数据集的情况下,运行Metropolis-Hastings采样器从后验分布中提取数据变得令人望而却步,因为每次迭代都需要计算大量的似然项。为了对大时间序列集执行贝叶斯推理,我们考虑了一种算法,该算法结合了以前用于为大数据设计MCMC算法的“分而治之”思想和顺序MCMC策略。用一组大型财务数据说明了该方法的性能。
{"title":"Embarrassingly Parallel Sequential Markov-chain Monte Carlo for Large Sets of Time Series","authors":"R. Casarin, Radu V. Craiu, F. Leisen","doi":"10.4310/SII.2016.V9.N4.A9","DOIUrl":"https://doi.org/10.4310/SII.2016.V9.N4.A9","url":null,"abstract":"Bayesian computation crucially relies on Markov chain Monte Carlo (MCMC) algorithms. In the case of massive data sets, running the Metropolis-Hastings sampler to draw from the posterior distribution becomes prohibitive due to the large number of likelihood terms that need to be calculated at each iteration. In order to perform Bayesian inference for a large set of time series, we consider an algorithm that combines 'divide and conquer\" ideas previously used to design MCMC algorithms for big data with a sequential MCMC strategy. The performance of the method is illustrated using a large set of financial data.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"34 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76134836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Convergence of the risk for nonparametric IV quantile regression and nonparametric IV regression with full independence 非参数IV分位数回归和完全独立的非参数IV回归风险的收敛性
Pub Date : 2015-12-03 DOI: 10.17877/DE290R-16447
Fabian Dunker
In econometrics some nonparametric instrumental regression models and nonparametric demand models with endogeneity lead to nonlinear integral equations with unknown integral kernels. We prove convergence rates of the risk for the iteratively regularized Newton method applied to these problems. Compared to related results we relay on a weaker non-linearity condition and have stronger convergence results. We demonstrate by numerical simulations for a nonparametric IV regression problem with continuous instrument and regressor that the method produces better results than the standard method.
在计量经济学中,一些具有内生性的非参数工具回归模型和非参数需求模型会导致具有未知积分核的非线性积分方程。我们证明了迭代正则牛顿法在这些问题上的收敛速度。与相关结果相比,我们所依赖的非线性条件较弱,收敛性较强。通过对具有连续仪器和回归器的非参数IV回归问题的数值模拟,证明了该方法比标准方法产生更好的结果。
{"title":"Convergence of the risk for nonparametric IV quantile regression and nonparametric IV regression with full independence","authors":"Fabian Dunker","doi":"10.17877/DE290R-16447","DOIUrl":"https://doi.org/10.17877/DE290R-16447","url":null,"abstract":"In econometrics some nonparametric instrumental regression models and nonparametric demand models with endogeneity lead to nonlinear integral equations with unknown integral kernels. We prove convergence rates of the risk for the iteratively regularized Newton method applied to these problems. Compared to related results we relay on a weaker non-linearity condition and have stronger convergence results. We demonstrate by numerical simulations for a nonparametric IV regression problem with continuous instrument and regressor that the method produces better results than the standard method.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"31 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89871681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
blavaan: Bayesian structural equation models via parameter expansion blavaan:通过参数展开的贝叶斯结构方程模型
Pub Date : 2015-11-17 DOI: 10.18637/jss.v085.i04
E. Merkle, Y. Rosseel
This article describes blavaan, an R package for estimating Bayesian structural equation models (SEMs) via JAGS and for summarizing the results. It also describes a novel parameter expansion approach for estimating specific types of models with residual covariances, which facilitates estimation of these models in JAGS. The methodology and software are intended to provide users with a general means of estimating Bayesian SEMs, both classical and novel, in a straightforward fashion. Users can estimate Bayesian versions of classical SEMs with lavaan syntax, they can obtain state-of-the-art Bayesian fit measures associated with the models, and they can export JAGS code to modify the SEMs as desired. These features and more are illustrated by example, and the parameter expansion approach is explained in detail.
本文介绍了blavaan,这是一个R包,用于通过JAGS估计贝叶斯结构方程模型(sem)并总结结果。本文还描述了一种新的参数展开方法,用于估计具有残差协方差的特定类型的模型,这有助于在JAGS中对这些模型进行估计。该方法和软件旨在以简单的方式为用户提供估计贝叶斯sem的一般方法,包括经典的和新颖的。用户可以使用lavaan语法估计经典sem的贝叶斯版本,他们可以获得与模型相关的最先进的贝叶斯拟合度量,并且可以导出JAGS代码以根据需要修改sem。通过实例说明了这些特点,并详细说明了参数展开方法。
{"title":"blavaan: Bayesian structural equation models via parameter expansion","authors":"E. Merkle, Y. Rosseel","doi":"10.18637/jss.v085.i04","DOIUrl":"https://doi.org/10.18637/jss.v085.i04","url":null,"abstract":"This article describes blavaan, an R package for estimating Bayesian structural equation models (SEMs) via JAGS and for summarizing the results. It also describes a novel parameter expansion approach for estimating specific types of models with residual covariances, which facilitates estimation of these models in JAGS. The methodology and software are intended to provide users with a general means of estimating Bayesian SEMs, both classical and novel, in a straightforward fashion. Users can estimate Bayesian versions of classical SEMs with lavaan syntax, they can obtain state-of-the-art Bayesian fit measures associated with the models, and they can export JAGS code to modify the SEMs as desired. These features and more are illustrated by example, and the parameter expansion approach is explained in detail.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"16 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86681229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 215
spTest: An R Package Implementing Nonparametric Tests of Isotropy spTest:一个实现各向同性非参数测试的R包
Pub Date : 2015-09-24 DOI: 10.18637/JSS.V083.I04
Zachary D. Weller
An important step of modeling spatially-referenced data is appropriately specifying the second order properties of the random field. A scientist developing a model for spatial data has a number of options regarding the nature of the dependence between observations. One of these options is deciding whether or not the dependence between observations depends on direction, or, in other words, whether or not the spatial covariance function is isotropic. Isotropy implies that spatial dependence is a function of only the distance and not the direction of the spatial separation between sampling locations. A researcher may use graphical techniques, such as directional sample semivariograms, to determine whether an assumption of isotropy holds. These graphical diagnostics can be difficult to assess, subject to personal interpretation, and potentially misleading as they typically do not include a measure of uncertainty. In order to escape these issues, a hypothesis test of the assumption of isotropy may be more desirable. To avoid specification of the covariance function, a number of nonparametric tests of isotropy have been developed using both the spatial and spectral representations of random fields. Several of these nonparametric tests are implemented in the R package spTest, available on CRAN. We demonstrate how graphical techniques and the hypothesis tests programmed in spTest can be used in practice to assess isotropy properties.
对空间引用数据建模的一个重要步骤是适当地指定随机场的二阶属性。开发空间数据模型的科学家对于观测之间的依赖性的性质有许多选择。其中一个选择是决定观测之间的依赖是否取决于方向,或者换句话说,空间协方差函数是否各向同性。各向同性意味着空间依赖性仅是采样位置之间空间分离的距离而不是方向的函数。研究人员可以使用图形技术,如定向样本半变异图,来确定各向同性的假设是否成立。这些图形诊断可能难以评估,受个人解释的影响,并且可能具有误导性,因为它们通常不包括不确定性的测量。为了避免这些问题,各向同性假设的假设检验可能更可取。为了避免协方差函数的规范,已经使用随机场的空间和光谱表示开发了一些各向同性的非参数测试。其中一些非参数测试是在R包spTest中实现的,可以在CRAN上获得。我们演示了在spTest中编程的图形技术和假设检验如何在实践中用于评估各向同性性质。
{"title":"spTest: An R Package Implementing Nonparametric Tests of Isotropy","authors":"Zachary D. Weller","doi":"10.18637/JSS.V083.I04","DOIUrl":"https://doi.org/10.18637/JSS.V083.I04","url":null,"abstract":"An important step of modeling spatially-referenced data is appropriately specifying the second order properties of the random field. A scientist developing a model for spatial data has a number of options regarding the nature of the dependence between observations. One of these options is deciding whether or not the dependence between observations depends on direction, or, in other words, whether or not the spatial covariance function is isotropic. Isotropy implies that spatial dependence is a function of only the distance and not the direction of the spatial separation between sampling locations. A researcher may use graphical techniques, such as directional sample semivariograms, to determine whether an assumption of isotropy holds. These graphical diagnostics can be difficult to assess, subject to personal interpretation, and potentially misleading as they typically do not include a measure of uncertainty. In order to escape these issues, a hypothesis test of the assumption of isotropy may be more desirable. To avoid specification of the covariance function, a number of nonparametric tests of isotropy have been developed using both the spatial and spectral representations of random fields. Several of these nonparametric tests are implemented in the R package spTest, available on CRAN. We demonstrate how graphical techniques and the hypothesis tests programmed in spTest can be used in practice to assess isotropy properties.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"27 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83305095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Simulations on the combinatorial structure of D-optimal designs 组合结构的d -最优设计仿真
Pub Date : 2015-09-21 DOI: 10.1007/978-3-319-76035-3_24
R. Fontana, Fabio Rapallo
{"title":"Simulations on the combinatorial structure of D-optimal designs","authors":"R. Fontana, Fabio Rapallo","doi":"10.1007/978-3-319-76035-3_24","DOIUrl":"https://doi.org/10.1007/978-3-319-76035-3_24","url":null,"abstract":"","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"98 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73903311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EMMIXcskew: an R Package for the Fitting of a Mixture of Canonical Fundamental Skew t-Distributions EMMIXcskew:一个正则基本偏态t分布混合拟合的R包
Pub Date : 2015-09-07 DOI: 10.18637/JSS.V083.I03
Sharon X. Lee, G. J. Mclachlan
This paper presents an R package EMMIXcskew for the fitting of the canonical fundamental skew t-distribution (CFUST) and finite mixtures of this distribution (FM-CFUST) via maximum likelihood (ML). The CFUST distribution provides a flexible family of models to handle non-normal data, with parameters for capturing skewness and heavy-tails in the data. It formally encompasses the normal, t, and skew-normal distributions as special and/or limiting cases. A few other versions of the skew t-distributions are also nested within the CFUST distribution. In this paper, an Expectation-Maximization (EM) algorithm is described for computing the ML estimates of the parameters of the FM-CFUST model, and different strategies for initializing the algorithm are discussed and illustrated. The methodology is implemented in the EMMIXcskew package, and examples are presented using two real datasets. The EMMIXcskew package contains functions to fit the FM-CFUST model, including procedures for generating different initial values. Additional features include random sample generation and contour visualization in 2D and 3D.
本文提出了一个R包EMMIXcskew,用于通过极大似然(ML)拟合典型基本偏态t分布(CFUST)和该分布的有限混合(FM-CFUST)。CFUST分布提供了一组灵活的模型来处理非正态数据,其中包含用于捕获数据偏度和重尾的参数。它形式上包括正态分布、t分布和偏正态分布作为特殊和/或极限情况。一些其他版本的倾斜t分布也嵌套在CFUST分布中。本文描述了一种用于计算FM-CFUST模型参数的ML估计的期望最大化(EM)算法,并讨论和说明了初始化算法的不同策略。该方法在EMMIXcskew包中实现,并使用两个真实数据集给出了示例。EMMIXcskew包包含适合FM-CFUST模型的函数,包括生成不同初始值的过程。附加功能包括随机样本生成和轮廓可视化在2D和3D。
{"title":"EMMIXcskew: an R Package for the Fitting of a Mixture of Canonical Fundamental Skew t-Distributions","authors":"Sharon X. Lee, G. J. Mclachlan","doi":"10.18637/JSS.V083.I03","DOIUrl":"https://doi.org/10.18637/JSS.V083.I03","url":null,"abstract":"This paper presents an R package EMMIXcskew for the fitting of the canonical fundamental skew t-distribution (CFUST) and finite mixtures of this distribution (FM-CFUST) via maximum likelihood (ML). The CFUST distribution provides a flexible family of models to handle non-normal data, with parameters for capturing skewness and heavy-tails in the data. It formally encompasses the normal, t, and skew-normal distributions as special and/or limiting cases. A few other versions of the skew t-distributions are also nested within the CFUST distribution. In this paper, an Expectation-Maximization (EM) algorithm is described for computing the ML estimates of the parameters of the FM-CFUST model, and different strategies for initializing the algorithm are discussed and illustrated. The methodology is implemented in the EMMIXcskew package, and examples are presented using two real datasets. The EMMIXcskew package contains functions to fit the FM-CFUST model, including procedures for generating different initial values. Additional features include random sample generation and contour visualization in 2D and 3D.","PeriodicalId":8446,"journal":{"name":"arXiv: Computation","volume":"31 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80013752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
arXiv: Computation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1