首页 > 最新文献

Statistical Theory and Related Fields最新文献

英文 中文
Discussion on the paper ‘A review of distributed statistical inference’ 关于“分布式统计推断综述”一文的讨论
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-12-16 DOI: 10.1080/24754269.2021.2015861
Junlong Zhao
Distributed statistical inferences have attracted more and more attention in recent years with the emergence of massive data. We are grateful to the authors for the excellent review of the literature in this active area. Besides the progress mentioned by the authors, we would like to discuss some additional development in this interesting area. Specifically, we focus on the balance of communication cost and the statistical efficiency of divide-and-conquer (DC) type estimators in linear discriminant analysis and hypothesis testing. It is seen that the DC approach has different behaviours in these problems, which is different from that in estimation problems. Furthermore, we discuss some issues on the statistical inferences under restricted communication budgets.
近年来,随着海量数据的出现,分布式统计推断越来越受到关注。我们感谢作者们对这一活跃领域的文献进行了出色的评论。除了作者提到的进展之外,我们还想讨论这个有趣领域的一些额外发展。具体而言,我们关注线性判别分析和假设检验中的通信成本平衡和分治(DC)型估计量的统计效率。可以看出,DC方法在这些问题中具有不同的行为,这与估计问题中的行为不同。此外,我们还讨论了在通信预算受限的情况下统计推断的一些问题。
{"title":"Discussion on the paper ‘A review of distributed statistical inference’","authors":"Junlong Zhao","doi":"10.1080/24754269.2021.2015861","DOIUrl":"https://doi.org/10.1080/24754269.2021.2015861","url":null,"abstract":"Distributed statistical inferences have attracted more and more attention in recent years with the emergence of massive data. We are grateful to the authors for the excellent review of the literature in this active area. Besides the progress mentioned by the authors, we would like to discuss some additional development in this interesting area. Specifically, we focus on the balance of communication cost and the statistical efficiency of divide-and-conquer (DC) type estimators in linear discriminant analysis and hypothesis testing. It is seen that the DC approach has different behaviours in these problems, which is different from that in estimation problems. Furthermore, we discuss some issues on the statistical inferences under restricted communication budgets.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"108 - 110"},"PeriodicalIF":0.5,"publicationDate":"2021-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46176137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Eight predictive powers with historical and interim data for futility and efficacy analysis 具有历史和中期数据的八种预测能力,用于徒劳和疗效分析
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-10-25 DOI: 10.1080/24754269.2021.1991557
Ying-Ying Zhang, Tengzhong Rong, Man-Man Li
ABSTRACT When the historical data of the early phase trial and the interim data of the Phase III trial are available, we should use them to give a more accurate prediction in both futility and efficacy analysis. The predictive power is an important measure of the practical utility of a proposed trial, and it is better than the classical statistical power in giving a good indication of the probability that the trial will demonstrate a positive or statistically significant outcome. In addition to the four predictive powers with historical and interim data available in the literature and summarized in Table 1, we discover and calculate another four predictive powers also summarized in Table 1, for one-sided hypotheses. Moreover, we calculate eight predictive powers summarized in Table 2, for the reversed hypotheses. The combination of the two tables gives us a complete picture of the predictive powers with historical and interim data for futility and efficacy analysis. Furthermore, the eight predictive powers with historical and interim data are utilized to guide the futility analysis in the tamoxifen example. Finally, extensive simulations have been conducted to investigate the sensitivity analysis of priors, sample sizes, interim result and interim time on different predictive powers.
摘要当早期试验的历史数据和III期试验的中期数据可用时,我们应该使用它们来在无效性和有效性分析中给出更准确的预测。预测能力是衡量拟议试验实际效用的重要指标,它比经典统计能力更好地表明试验将显示积极或统计显著结果的概率。除了文献中可用的、表1中总结的具有历史和中期数据的四种预测能力外,我们发现并计算了表1中也总结的另四种单方面假设的预测能力。此外,我们计算了表2中总结的八种预测能力,用于反向假设。这两个表的结合为我们提供了一个完整的预测能力的图片,以及徒劳和有效性分析的历史和中期数据。此外,在他莫昔芬的例子中,利用具有历史和中期数据的八种预测能力来指导无效性分析。最后,进行了广泛的模拟,以研究先验、样本量、中期结果和中期时间对不同预测能力的敏感性分析。
{"title":"Eight predictive powers with historical and interim data for futility and efficacy analysis","authors":"Ying-Ying Zhang, Tengzhong Rong, Man-Man Li","doi":"10.1080/24754269.2021.1991557","DOIUrl":"https://doi.org/10.1080/24754269.2021.1991557","url":null,"abstract":"ABSTRACT When the historical data of the early phase trial and the interim data of the Phase III trial are available, we should use them to give a more accurate prediction in both futility and efficacy analysis. The predictive power is an important measure of the practical utility of a proposed trial, and it is better than the classical statistical power in giving a good indication of the probability that the trial will demonstrate a positive or statistically significant outcome. In addition to the four predictive powers with historical and interim data available in the literature and summarized in Table 1, we discover and calculate another four predictive powers also summarized in Table 1, for one-sided hypotheses. Moreover, we calculate eight predictive powers summarized in Table 2, for the reversed hypotheses. The combination of the two tables gives us a complete picture of the predictive powers with historical and interim data for futility and efficacy analysis. Furthermore, the eight predictive powers with historical and interim data are utilized to guide the futility analysis in the tamoxifen example. Finally, extensive simulations have been conducted to investigate the sensitivity analysis of priors, sample sizes, interim result and interim time on different predictive powers.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"277 - 298"},"PeriodicalIF":0.5,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49302617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
High-dimensional proportionality test of two covariance matrices and its application to gene expression data 两个协方差矩阵的高维比例检验及其在基因表达数据中的应用
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-10-06 DOI: 10.1080/24754269.2021.1984373
Long Feng, Xiaoxu Zhang, Binghui Liu
With the development of modern science and technology, more and more high-dimensional data appear in the application fields. Since the high dimension can potentially increase the complexity of the covariance structure, comparing the covariance matrices among populations is strongly motivated in high-dimensional data analysis. In this article, we consider the proportionality test of two high-dimensional covariance matrices, where the data dimension is potentially much larger than the sample sizes, or even larger than the squares of the sample sizes. We devise a novel high-dimensional spatial rank test that has much-improved power than many existing popular tests, especially for the data generated from some heavy-tailed distributions. The asymptotic normality of the proposed test statistics is established under the family of elliptically symmetric distributions, which is a more general distribution family than the normal distribution family, including numerous commonly used heavy-tailed distributions. Extensive numerical experiments demonstrate the superiority of the proposed test in terms of both empirical size and power. Then, a real data analysis demonstrates the practicability of the proposed test for high-dimensional gene expression data.
随着现代科学技术的发展,越来越多的高维数据出现在应用领域。由于高维可能会增加协方差结构的复杂性,因此在高维数据分析中,比较总体之间的协方差矩阵是非常有动力的。在本文中,我们考虑两个高维协方差矩阵的比例检验,其中数据维度可能远大于样本大小,甚至大于样本大小的平方。我们设计了一种新的高维空间秩检验,它比许多现有的流行检验有很大的改进,特别是对于一些重尾分布产生的数据。所提出的检验统计量的渐近正态性是在椭圆对称分布族下建立的,椭圆对称分布是一个比正态分布族更一般的分布族,包括许多常用的重尾分布。大量的数值实验证明了所提出的测试在经验大小和功率方面的优越性。然后,实际数据分析证明了所提出的高维基因表达数据测试的实用性。
{"title":"High-dimensional proportionality test of two covariance matrices and its application to gene expression data","authors":"Long Feng, Xiaoxu Zhang, Binghui Liu","doi":"10.1080/24754269.2021.1984373","DOIUrl":"https://doi.org/10.1080/24754269.2021.1984373","url":null,"abstract":"With the development of modern science and technology, more and more high-dimensional data appear in the application fields. Since the high dimension can potentially increase the complexity of the covariance structure, comparing the covariance matrices among populations is strongly motivated in high-dimensional data analysis. In this article, we consider the proportionality test of two high-dimensional covariance matrices, where the data dimension is potentially much larger than the sample sizes, or even larger than the squares of the sample sizes. We devise a novel high-dimensional spatial rank test that has much-improved power than many existing popular tests, especially for the data generated from some heavy-tailed distributions. The asymptotic normality of the proposed test statistics is established under the family of elliptically symmetric distributions, which is a more general distribution family than the normal distribution family, including numerous commonly used heavy-tailed distributions. Extensive numerical experiments demonstrate the superiority of the proposed test in terms of both empirical size and power. Then, a real data analysis demonstrates the practicability of the proposed test for high-dimensional gene expression data.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"161 - 174"},"PeriodicalIF":0.5,"publicationDate":"2021-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43264799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variable screening in multivariate linear regression with high-dimensional covariates 高维协变量多元线性回归的变量筛选
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-10-06 DOI: 10.1080/24754269.2021.1982607
Shiferaw B. Bizuayehu, Luquan Li, Jin Xu
We propose two variable selection methods in multivariate linear regression with high-dimensional covariates. The first method uses a multiple correlation coefficient to fast reduce the dimension of the relevant predictors to a moderate or low level. The second method extends the univariate forward regression of Wang [(2009). Forward regression for ultra-high dimensional variable screening. Journal of the American Statistical Association, 104(488), 1512–1524. https://doi.org/10.1198/jasa.2008.tm08516] in a unified way such that the variable selection and model estimation can be obtained simultaneously. We establish the sure screening property for both methods. Simulation and real data applications are presented to show the finite sample performance of the proposed methods in comparison with some naive method.
在具有高维协变量的多元线性回归中,我们提出了两种变量选择方法。第一种方法使用多重相关系数将相关预测因子的维数快速降低到中等或低水平。第二种方法扩展了王的单变量正向回归[(2009).超高维变量筛选的正向回归.美国统计协会杂志,104(488),1512-1524。https://doi.org/10.1198/jasa.2008.tm08516]以统一的方式使得可以同时获得变量选择和模型估计。我们建立了两种方法的确定筛选性质。仿真和实际数据应用表明,与一些朴素方法相比,所提出的方法具有有限样本的性能。
{"title":"Variable screening in multivariate linear regression with high-dimensional covariates","authors":"Shiferaw B. Bizuayehu, Luquan Li, Jin Xu","doi":"10.1080/24754269.2021.1982607","DOIUrl":"https://doi.org/10.1080/24754269.2021.1982607","url":null,"abstract":"We propose two variable selection methods in multivariate linear regression with high-dimensional covariates. The first method uses a multiple correlation coefficient to fast reduce the dimension of the relevant predictors to a moderate or low level. The second method extends the univariate forward regression of Wang [(2009). Forward regression for ultra-high dimensional variable screening. Journal of the American Statistical Association, 104(488), 1512–1524. https://doi.org/10.1198/jasa.2008.tm08516] in a unified way such that the variable selection and model estimation can be obtained simultaneously. We establish the sure screening property for both methods. Simulation and real data applications are presented to show the finite sample performance of the proposed methods in comparison with some naive method.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"241 - 253"},"PeriodicalIF":0.5,"publicationDate":"2021-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47622019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An introduction to statistical learning with applications in R 统计学习在R中的应用介绍
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-09-26 DOI: 10.1080/24754269.2021.1980261
Fariha Sohil, Muhammad Umair Sohali, J. Shabbir
The fundamental mathematical tools needed to understand machine learning include linear algebra, analytic geometry, matrix decompositions, vector calculus, optimization, probability and statistics. These topics are traditionally taught in disparate courses, making it hard for data science or computer science students, or professionals, to efficiently learn the mathematics. This self-contained textbook bridges the gap between mathematical and machine learning texts, introducing the mathematical concepts with a minimum of prerequisites. It uses these concepts to derive four central machine learning methods: linear regression, principal component analysis, Gaussian mixture models and support vector machines. For students and others with a mathematical background, these derivations provide a starting point to machine learning texts. For those learning the mathematics for the first time, the methods help build intuition and practical experience with applying mathematical concepts. Every chapter includes worked examples and exercises to test understanding. Programming tutorials are offered on the book's web site. This textbook considers statistical learning applications when interest centers on the conditional distribution of a response variable, given a set of predictors, and in the absence of a credible model that can be specified before the data analysis begins. Consistent with modern data analytics, it emphasizes that a proper statistical learning data analysis depends in an integrated fashion on sound data collection, intelligent data management, appropriate statistical procedures, and an
理解机器学习所需的基本数学工具包括线性代数、解析几何、矩阵分解、向量微积分、优化、概率论和统计学。这些主题传统上是在不同的课程中教授的,这使得数据科学或计算机科学的学生或专业人士很难有效地学习数学。这本自成一体的教科书弥合了数学和机器学习文本之间的差距,以最低的先决条件介绍了数学概念。它使用这些概念推导出四种核心机器学习方法:线性回归、主成分分析、高斯混合模型和支持向量机。对于学生和其他有数学背景的人来说,这些推导为机器学习文本提供了一个起点。对于那些第一次学习数学的人来说,这些方法有助于建立应用数学概念的直觉和实践经验。每一章都包括一些例子和练习来测试理解。本书的网站上提供了编程教程。本教材考虑了统计学习的应用,当兴趣集中在响应变量的条件分布上,给定一组预测因子,并且在数据分析开始之前没有可以指定的可信模型。与现代数据分析一致,它强调正确的统计学习数据分析依赖于健全的数据收集,智能的数据管理,适当的统计程序和一个综合的方式
{"title":"An introduction to statistical learning with applications in R","authors":"Fariha Sohil, Muhammad Umair Sohali, J. Shabbir","doi":"10.1080/24754269.2021.1980261","DOIUrl":"https://doi.org/10.1080/24754269.2021.1980261","url":null,"abstract":"The fundamental mathematical tools needed to understand machine learning include linear algebra, analytic geometry, matrix decompositions, vector calculus, optimization, probability and statistics. These topics are traditionally taught in disparate courses, making it hard for data science or computer science students, or professionals, to efficiently learn the mathematics. This self-contained textbook bridges the gap between mathematical and machine learning texts, introducing the mathematical concepts with a minimum of prerequisites. It uses these concepts to derive four central machine learning methods: linear regression, principal component analysis, Gaussian mixture models and support vector machines. For students and others with a mathematical background, these derivations provide a starting point to machine learning texts. For those learning the mathematics for the first time, the methods help build intuition and practical experience with applying mathematical concepts. Every chapter includes worked examples and exercises to test understanding. Programming tutorials are offered on the book's web site. This textbook considers statistical learning applications when interest centers on the conditional distribution of a response variable, given a set of predictors, and in the absence of a credible model that can be specified before the data analysis begins. Consistent with modern data analytics, it emphasizes that a proper statistical learning data analysis depends in an integrated fashion on sound data collection, intelligent data management, appropriate statistical procedures, and an","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"87 - 87"},"PeriodicalIF":0.5,"publicationDate":"2021-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42115335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2608
Exponential tilted likelihood for stationary time series models 平稳时间序列模型的指数倾斜似然
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-09-23 DOI: 10.1080/24754269.2021.1978207
Xiuzhen Zhang, Yukun Liu, Riquan Zhang, Zhiping Lu
Depending on the asymptotical independence of periodograms, exponential tilted (ET) likelihood, as an effective nonparametric statistical method, is developed to deal with time series in this paper. Similar to empirical likelihood (EL), it still suffers from two drawbacks: the non-definition problem of the likelihood function and the under-coverage probability of confidence region. To overcome these two problems, we further proposed the adjusted ET (AET) likelihood. With a specific adjustment level, our simulation studies indicate that the AET method achieves a higher-order coverage precision than the unadjusted ET method. In addition, due to the good performance of ET under moment model misspecification [Schennach, S. M. (2007). Point estimation with exponentially tilted empirical likelihood. The Annals of Statistics, 35(2), 634–672. https://doi.org/10.1214/009053606000001208], we show that the one-order property of point estimate is preserved for the misspecified spectral estimating equations of the autoregressive coefficient of AR(1). The simulation results illustrate that the point estimates of the ET outperform those of the EL and their hybrid in terms of standard deviation. A real data set is analyzed for illustration purpose.
基于周期图的渐近独立性,本文提出了一种有效的非参数统计方法——指数倾斜似然方法。与经验似然(EL)相似,它仍然存在两个缺点:似然函数的非定义问题和置信区域的概率覆盖不足。为了克服这两个问题,我们进一步提出了调整后ET (AET)似然。在特定平差水平下,我们的模拟研究表明,AET方法比未平差的ET方法获得更高的阶覆盖精度。此外,由于ET在矩模型错误规范下的良好性能[Schennach, s.m.(2007)]。具有指数倾斜经验似然的点估计。统计年鉴,35(2),634-672。https://doi.org/10.1214/009053606000001208],我们证明了对于AR(1)的自回归系数的错误谱估计方程,点估计的一阶性质是保持的。仿真结果表明,在标准差方面,ET的点估计优于EL及其混合估计。为了说明目的,分析了一个真实的数据集。
{"title":"Exponential tilted likelihood for stationary time series models","authors":"Xiuzhen Zhang, Yukun Liu, Riquan Zhang, Zhiping Lu","doi":"10.1080/24754269.2021.1978207","DOIUrl":"https://doi.org/10.1080/24754269.2021.1978207","url":null,"abstract":"Depending on the asymptotical independence of periodograms, exponential tilted (ET) likelihood, as an effective nonparametric statistical method, is developed to deal with time series in this paper. Similar to empirical likelihood (EL), it still suffers from two drawbacks: the non-definition problem of the likelihood function and the under-coverage probability of confidence region. To overcome these two problems, we further proposed the adjusted ET (AET) likelihood. With a specific adjustment level, our simulation studies indicate that the AET method achieves a higher-order coverage precision than the unadjusted ET method. In addition, due to the good performance of ET under moment model misspecification [Schennach, S. M. (2007). Point estimation with exponentially tilted empirical likelihood. The Annals of Statistics, 35(2), 634–672. https://doi.org/10.1214/009053606000001208], we show that the one-order property of point estimate is preserved for the misspecified spectral estimating equations of the autoregressive coefficient of AR(1). The simulation results illustrate that the point estimates of the ET outperform those of the EL and their hybrid in terms of standard deviation. A real data set is analyzed for illustration purpose.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"254 - 263"},"PeriodicalIF":0.5,"publicationDate":"2021-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44094075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-dependent reliability analysis for repairable consecutive-k-out-of-n:F system 可修连续n-:F系统的时变可靠性分析
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-09-13 DOI: 10.1080/24754269.2021.1971489
Gökhan Gökdere, H. K. Tony Ng
In a repairable consecutive system, after the system operates for a certain time, some components may fail, some failed components may be repaired and the state of the system may change. The models developed in the existing literature usually assume that the state of the system varies over time depending on the values of n and k and the state of the system is known. Since the system reliability will vary over time, it is of great interest to analyse the time-dependent system reliability. In this paper, we develop a novel and simple method that utilizes the eigenvalues of the transition rate matrix of the system for the computation of time-dependent system reliability when the system state is known. In addition, the transition performance probabilities of the system from a known state to the possible states are also analysed. Computational results are presented to illustrate the applicability and accuracy of the proposed method.
在可修连续系统中,系统运行一段时间后,可能会出现一些部件故障,一些故障部件可能被修复,系统的状态可能发生变化。现有文献中开发的模型通常假设系统的状态随时间变化,取决于n和k的值,并且系统的状态是已知的。由于系统的可靠性会随时间而变化,因此分析随时间变化的系统可靠性具有重要的意义。本文提出了一种新颖而简单的方法,即在系统状态已知的情况下,利用系统转移率矩阵的特征值计算时变系统的可靠度。此外,还分析了系统从已知状态到可能状态的转换性能概率。计算结果说明了该方法的适用性和准确性。
{"title":"Time-dependent reliability analysis for repairable consecutive-k-out-of-n:F system","authors":"Gökhan Gökdere, H. K. Tony Ng","doi":"10.1080/24754269.2021.1971489","DOIUrl":"https://doi.org/10.1080/24754269.2021.1971489","url":null,"abstract":"In a repairable consecutive system, after the system operates for a certain time, some components may fail, some failed components may be repaired and the state of the system may change. The models developed in the existing literature usually assume that the state of the system varies over time depending on the values of n and k and the state of the system is known. Since the system reliability will vary over time, it is of great interest to analyse the time-dependent system reliability. In this paper, we develop a novel and simple method that utilizes the eigenvalues of the transition rate matrix of the system for the computation of time-dependent system reliability when the system state is known. In addition, the transition performance probabilities of the system from a known state to the possible states are also analysed. Computational results are presented to illustrate the applicability and accuracy of the proposed method.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"139 - 147"},"PeriodicalIF":0.5,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44643166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A review of distributed statistical inference 分布式统计推理综述
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-09-13 DOI: 10.1080/24754269.2021.1974158
Yuan Gao, Weidong Liu, Hansheng Wang, Xiaozhou Wang, Yibo Yan, Riquan Zhang
The rapid emergence of massive datasets in various fields poses a serious challenge to traditional statistical methods. Meanwhile, it provides opportunities for researchers to develop novel algorithms. Inspired by the idea of divide-and-conquer, various distributed frameworks for statistical estimation and inference have been proposed. They were developed to deal with large-scale statistical optimization problems. This paper aims to provide a comprehensive review for related literature. It includes parametric models, nonparametric models, and other frequently used models. Their key ideas and theoretical properties are summarized. The trade-off between communication cost and estimate precision together with other concerns is discussed.
各领域海量数据的迅速涌现,对传统的统计方法提出了严峻的挑战。同时,它为研究人员提供了开发新算法的机会。受分而治之思想的启发,人们提出了各种用于统计估计和推断的分布式框架。它们是为了处理大规模的统计优化问题而开发的。本文旨在对相关文献进行全面的综述。它包括参数模型、非参数模型和其他常用模型。总结了他们的主要思想和理论性质。讨论了通信成本和估计精度之间的权衡以及其他问题。
{"title":"A review of distributed statistical inference","authors":"Yuan Gao, Weidong Liu, Hansheng Wang, Xiaozhou Wang, Yibo Yan, Riquan Zhang","doi":"10.1080/24754269.2021.1974158","DOIUrl":"https://doi.org/10.1080/24754269.2021.1974158","url":null,"abstract":"The rapid emergence of massive datasets in various fields poses a serious challenge to traditional statistical methods. Meanwhile, it provides opportunities for researchers to develop novel algorithms. Inspired by the idea of divide-and-conquer, various distributed frameworks for statistical estimation and inference have been proposed. They were developed to deal with large-scale statistical optimization problems. This paper aims to provide a comprehensive review for related literature. It includes parametric models, nonparametric models, and other frequently used models. Their key ideas and theoretical properties are summarized. The trade-off between communication cost and estimate precision together with other concerns is discussed.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"89 - 99"},"PeriodicalIF":0.5,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46736414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Interpreting uninterpretable predictors: kernel methods, Shtarkov solutions, and random forests 解释不可解释的预测因子:核方法、Shtarkov解和随机森林
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-09-08 DOI: 10.1080/24754269.2021.1974157
Tri Le, B. Clarke
Many of the best predictors for complex problems are typically regarded as hard to interpret physically. These include kernel methods, Shtarkov solutions, and random forests. We show that, despite the inability to interpret these three predictors to infinite precision, they can be asymptotically approximated and admit conceptual interpretations in terms of their mathematical/statistical properties. The resulting expressions can be in terms of polynomials, basis elements, or other functions that an analyst may regard as interpretable.
许多复杂问题的最佳预测因子通常被认为很难从物理上解释。其中包括核方法、Shtarkov解和随机森林。我们表明,尽管无法无限精确地解释这三个预测因子,但它们可以渐近近似,并允许根据其数学/统计特性进行概念解释。所得到的表达式可以是多项式、基元或分析师可能认为可解释的其他函数。
{"title":"Interpreting uninterpretable predictors: kernel methods, Shtarkov solutions, and random forests","authors":"Tri Le, B. Clarke","doi":"10.1080/24754269.2021.1974157","DOIUrl":"https://doi.org/10.1080/24754269.2021.1974157","url":null,"abstract":"Many of the best predictors for complex problems are typically regarded as hard to interpret physically. These include kernel methods, Shtarkov solutions, and random forests. We show that, despite the inability to interpret these three predictors to infinite precision, they can be asymptotically approximated and admit conceptual interpretations in terms of their mathematical/statistical properties. The resulting expressions can be in terms of polynomials, basis elements, or other functions that an analyst may regard as interpretable.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"6 1","pages":"10 - 28"},"PeriodicalIF":0.5,"publicationDate":"2021-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46715213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
On the non-local priors for sparsity selection in high-dimensional Gaussian DAG models 高维高斯DAG模型稀疏度选择的非局部先验
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2021-09-05 DOI: 10.1080/24754269.2021.1963182
Xuan Cao, F. Yang
We consider sparsity selection for the Cholesky factor L of the inverse covariance matrix in high-dimensional Gaussian DAG models. The sparsity is induced over the space of L via non-local priors, namely the product moment (pMOM) prior [Johnson, V., & Rossell, D. (2012). Bayesian model selection in high-dimensional settings. Journal of the American Statistical Association, 107(498), 649–660. https://doi.org/10.1080/01621459.2012.682536] and the hierarchical hyper-pMOM prior [Cao, X., Khare, K., & Ghosh, M. (2020). High-dimensional posterior consistency for hierarchical non-local priors in regression. Bayesian Analysis, 15(1), 241–262. https://doi.org/10.1214/19-BA1154]. We establish model selection consistency for Cholesky factor under more relaxed conditions compared to those in the literature and implement an efficient MCMC algorithm for parallel selecting the sparsity pattern for each column of L. We demonstrate the validity of our theoretical results via numerical simulations, and also use further simulations to demonstrate that our sparsity selection approach is competitive with existing methods.
我们考虑了高维高斯DAG模型中逆协方差矩阵的Cholesky因子L的稀疏度选择。稀疏性是通过非局部先验,即积矩(pMOM)先验在L空间上产生的[Johnson, V., & Rossell, D.(2012)]。高维环境下的贝叶斯模型选择。美国统计学会学报,107(498),649-660。https://doi.org/10.1080/01621459.2012.682536]和分层hyper-pMOM先验[Cao, X., Khare, K., Ghosh, M.(2020)。回归中层次非局部先验的高维后验一致性。贝叶斯分析,15(1),241-262。https://doi.org/10.1214/19-BA1154]。与文献相比,我们在更宽松的条件下建立了Cholesky因子的模型选择一致性,并实现了一种高效的MCMC算法来并行选择l的每列的稀疏性模式。我们通过数值模拟证明了理论结果的有效性,并使用进一步的模拟来证明我们的稀疏性选择方法与现有方法具有竞争力。
{"title":"On the non-local priors for sparsity selection in high-dimensional Gaussian DAG models","authors":"Xuan Cao, F. Yang","doi":"10.1080/24754269.2021.1963182","DOIUrl":"https://doi.org/10.1080/24754269.2021.1963182","url":null,"abstract":"We consider sparsity selection for the Cholesky factor L of the inverse covariance matrix in high-dimensional Gaussian DAG models. The sparsity is induced over the space of L via non-local priors, namely the product moment (pMOM) prior [Johnson, V., & Rossell, D. (2012). Bayesian model selection in high-dimensional settings. Journal of the American Statistical Association, 107(498), 649–660. https://doi.org/10.1080/01621459.2012.682536] and the hierarchical hyper-pMOM prior [Cao, X., Khare, K., & Ghosh, M. (2020). High-dimensional posterior consistency for hierarchical non-local priors in regression. Bayesian Analysis, 15(1), 241–262. https://doi.org/10.1214/19-BA1154]. We establish model selection consistency for Cholesky factor under more relaxed conditions compared to those in the literature and implement an efficient MCMC algorithm for parallel selecting the sparsity pattern for each column of L. We demonstrate the validity of our theoretical results via numerical simulations, and also use further simulations to demonstrate that our sparsity selection approach is competitive with existing methods.","PeriodicalId":22070,"journal":{"name":"Statistical Theory and Related Fields","volume":"5 1","pages":"332 - 345"},"PeriodicalIF":0.5,"publicationDate":"2021-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47443670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistical Theory and Related Fields
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1