首页 > 最新文献

Journal of Statistical Distributions and Applications最新文献

英文 中文
A generalization to the log-inverse Weibull distribution and its applications in cancer research 对数逆威布尔分布的推广及其在癌症研究中的应用
Q2 Mathematics Pub Date : 2021-12-12 DOI: 10.1186/s40488-021-00116-1
Kumar, C. Satheesh, Nair, Subha R.
In this paper we consider a generalization of a log-transformed version of the inverse Weibull distribution. Several theoretical properties of the distribution are studied in detail including expressions for its probability density function, reliability function, hazard rate function, quantile function, characteristic function, raw moments, percentile measures, entropy measures, median, mode etc. Certain structural properties of the distribution along with expressions for reliability measures as well as the distribution and moments of order statistics are obtained. Also we discuss the maximum likelihood estimation of the parameters of the proposed distribution and illustrate the usefulness of the model through real life examples. In addition, the asymptotic behaviour of the maximum likelihood estimators are examined with the help of simulated data sets.
在本文中,我们考虑了逆威布尔分布的对数变换版本的推广。详细研究了该分布的若干理论性质,包括其概率密度函数、可靠性函数、危险率函数、分位数函数、特征函数、原始矩、百分位测度、熵测度、中位数、众数等的表达式。得到了该分布的某些结构性质,给出了可靠性测度的表达式以及阶统计量的分布和矩。我们还讨论了所提出的分布参数的最大似然估计,并通过实际例子说明了该模型的实用性。此外,利用模拟数据集检验了极大似然估计量的渐近性。
{"title":"A generalization to the log-inverse Weibull distribution and its applications in cancer research","authors":"Kumar, C. Satheesh, Nair, Subha R.","doi":"10.1186/s40488-021-00116-1","DOIUrl":"https://doi.org/10.1186/s40488-021-00116-1","url":null,"abstract":"In this paper we consider a generalization of a log-transformed version of the inverse Weibull distribution. Several theoretical properties of the distribution are studied in detail including expressions for its probability density function, reliability function, hazard rate function, quantile function, characteristic function, raw moments, percentile measures, entropy measures, median, mode etc. Certain structural properties of the distribution along with expressions for reliability measures as well as the distribution and moments of order statistics are obtained. Also we discuss the maximum likelihood estimation of the parameters of the proposed distribution and illustrate the usefulness of the model through real life examples. In addition, the asymptotic behaviour of the maximum likelihood estimators are examined with the help of simulated data sets.","PeriodicalId":52216,"journal":{"name":"Journal of Statistical Distributions and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138503770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Approximations of conditional probability density functions in Lebesgue spaces via mixture of experts models 勒贝格空间条件概率密度函数的混合专家模型逼近
Q2 Mathematics Pub Date : 2021-08-06 DOI: 10.1186/s40488-021-00125-0
H. Nguyen, TrungTin Nguyen, Faicel Chamroukhi, Geoffrey John McLachlan
{"title":"Approximations of conditional probability density functions in Lebesgue spaces via mixture of experts models","authors":"H. Nguyen, TrungTin Nguyen, Faicel Chamroukhi, Geoffrey John McLachlan","doi":"10.1186/s40488-021-00125-0","DOIUrl":"https://doi.org/10.1186/s40488-021-00125-0","url":null,"abstract":"","PeriodicalId":52216,"journal":{"name":"Journal of Statistical Distributions and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40488-021-00125-0","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65895569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Structural properties of generalised Planck distributions 广义普朗克分布的结构性质
Q2 Mathematics Pub Date : 2021-08-01 DOI: 10.1186/s40488-021-00124-1
Pakes, Anthony G.
A family of generalised Planck (GP) laws is defined and its structural properties explored. Sometimes subject to parameter restrictions, a GP law is a randomly scaled gamma law; it arises as the equilibrium law of a perturbed version of the Feller mean reverting diffusion; the density functions can be decreasing, unimodal or bimodal; it is infinitely divisible. It is argued that the GP law is not a generalised gamma convolution. Characterisations are obtained in terms of invariance under random contraction of a weighted version of a related law. The GP law is a particular instance of equilibrium laws obtained from a recursion suggested by a genetic mutation-selection balance model. Some related infinitely divisible laws are exhibited.
定义了一组广义普朗克(GP)定律,并对其结构性质进行了探索。有时受参数限制,GP定律是随机缩放的伽马定律;它是扰动版Feller均值恢复扩散的平衡定律;密度函数可以是递减的、单峰的或双峰的;它是无限可分的。本文论证了GP定律不是一个广义的卷积。在相关定律的加权版本的随机收缩下,根据不变性获得特征。GP定律是由遗传突变-选择平衡模型提出的递归得到的平衡定律的一个特例。给出了一些相关的无限可分定律。
{"title":"Structural properties of generalised Planck distributions","authors":"Pakes, Anthony G.","doi":"10.1186/s40488-021-00124-1","DOIUrl":"https://doi.org/10.1186/s40488-021-00124-1","url":null,"abstract":"A family of generalised Planck (GP) laws is defined and its structural properties explored. Sometimes subject to parameter restrictions, a GP law is a randomly scaled gamma law; it arises as the equilibrium law of a perturbed version of the Feller mean reverting diffusion; the density functions can be decreasing, unimodal or bimodal; it is infinitely divisible. It is argued that the GP law is not a generalised gamma convolution. Characterisations are obtained in terms of invariance under random contraction of a weighted version of a related law. The GP law is a particular instance of equilibrium laws obtained from a recursion suggested by a genetic mutation-selection balance model. Some related infinitely divisible laws are exhibited.","PeriodicalId":52216,"journal":{"name":"Journal of Statistical Distributions and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138503769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
New class of Lindley distributions: properties and applications 新一类Lindley分布:属性和应用
Q2 Mathematics Pub Date : 2021-07-19 DOI: 10.1186/s40488-021-00127-y
D. Hamed, Ahmad Alzaghal
{"title":"New class of Lindley distributions: properties and applications","authors":"D. Hamed, Ahmad Alzaghal","doi":"10.1186/s40488-021-00127-y","DOIUrl":"https://doi.org/10.1186/s40488-021-00127-y","url":null,"abstract":"","PeriodicalId":52216,"journal":{"name":"Journal of Statistical Distributions and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40488-021-00127-y","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65895647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Tolerance intervals in statistical software and robustness under model misspecification 统计软件的容差区间与模型错配下的鲁棒性
Q2 Mathematics Pub Date : 2021-07-18 DOI: 10.1186/s40488-021-00123-2
Kyung Serk Cho, Hon Keung Tony Ng
A tolerance interval is a statistical interval that covers at least 100ρ% of the population of interest with a 100(1−α)% confidence, where ρ and α are pre-specified values in (0, 1). In many scientific fields, such as pharmaceutical sciences, manufacturing processes, clinical sciences, and environmental sciences, tolerance intervals are used for statistical inference and quality control. Despite the usefulness of tolerance intervals, the procedures to compute tolerance intervals are not commonly implemented in statistical software packages. This paper aims to provide a comparative study of the computational procedures for tolerance intervals in some commonly used statistical software packages including JMP, Minitab, NCSS, Python, R, and SAS. On the other hand, we also investigate the effect of misspecifying the underlying probability model on the performance of tolerance intervals. We study the performance of tolerance intervals when the assumed distribution is the same as the true underlying distribution and when the assumed distribution is different from the true distribution via a Monte Carlo simulation study. We also propose a robust model selection approach to obtain tolerance intervals that are relatively insensitive to the model misspecification. We show that the proposed robust model selection approach performs well when the underlying distribution is unknown but candidate distributions are available.
容差区间是一个统计区间,它以100(1−α)%的置信度覆盖至少100ρ%的感兴趣总体,其中ρ和α是(0,1)中预先指定的值。在许多科学领域,如制药科学,制造工艺,临床科学和环境科学,容差区间用于统计推断和质量控制。尽管公差区间很有用,但是计算公差区间的过程在统计软件包中通常没有实现。比较研究了JMP、Minitab、NCSS、Python、R、SAS等常用统计软件包中公差区间的计算过程。另一方面,我们还研究了错误指定潜在概率模型对公差区间性能的影响。通过蒙特卡罗模拟研究了假设分布与真实底层分布相同以及假设分布与真实底层分布不同时容差区间的性能。我们还提出了一种鲁棒模型选择方法,以获得对模型错误规范相对不敏感的公差区间。我们表明,当潜在分布未知但候选分布可用时,所提出的鲁棒模型选择方法表现良好。
{"title":"Tolerance intervals in statistical software and robustness under model misspecification","authors":"Kyung Serk Cho, Hon Keung Tony Ng","doi":"10.1186/s40488-021-00123-2","DOIUrl":"https://doi.org/10.1186/s40488-021-00123-2","url":null,"abstract":"A tolerance interval is a statistical interval that covers at least 100ρ% of the population of interest with a 100(1−α)% confidence, where ρ and α are pre-specified values in (0, 1). In many scientific fields, such as pharmaceutical sciences, manufacturing processes, clinical sciences, and environmental sciences, tolerance intervals are used for statistical inference and quality control. Despite the usefulness of tolerance intervals, the procedures to compute tolerance intervals are not commonly implemented in statistical software packages. This paper aims to provide a comparative study of the computational procedures for tolerance intervals in some commonly used statistical software packages including JMP, Minitab, NCSS, Python, R, and SAS. On the other hand, we also investigate the effect of misspecifying the underlying probability model on the performance of tolerance intervals. We study the performance of tolerance intervals when the assumed distribution is the same as the true underlying distribution and when the assumed distribution is different from the true distribution via a Monte Carlo simulation study. We also propose a robust model selection approach to obtain tolerance intervals that are relatively insensitive to the model misspecification. We show that the proposed robust model selection approach performs well when the underlying distribution is unknown but candidate distributions are available.","PeriodicalId":52216,"journal":{"name":"Journal of Statistical Distributions and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138503768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Combining assumptions and graphical network into gene expression data analysis 结合假设和图形网络进行基因表达数据分析
Q2 Mathematics Pub Date : 2021-07-08 DOI: 10.1186/s40488-021-00126-z
Demba Fofana, E. O. George, Dale Bowman
Analyzing gene expression data rigorously requires taking assumptions into consideration but also relies on using information about network relations that exist among genes. Combining these different elements cannot only improve statistical power, but also provide a better framework through which gene expression can be properly analyzed. We propose a novel statistical model that combines assumptions and gene network information into the analysis. Assumptions are important since every test statistic is valid only when required assumptions hold. So, we propose hybrid p-values and show that, under the null hypothesis of primary interest, these p-values are uniformly distributed. These proposed hybrid p-values take assumptions into consideration. We incorporate gene network information into the analysis because neighboring genes share biological functions. This correlation factor is taken into account via similar prior probabilities for neighboring genes. With a series of simulations our approach is compared with other approaches. Area Under the ROC Curves (AUCs) are constructed to compare the different methodologies; the AUC based on our methodology is larger than others. For regression analysis, AUC from our proposed method contains AUCs of Spearman test and of Pearson test. In addition, true negative rates (TNRs) also known as specificities are higher with our approach than with the other approaches. For two group comparison analysis, for instance, with a sample size of n=10, specificity corresponding to our proposed methodology is 0.716146 and specificities for t-test and rank sum are 0.689223 and 0.69797, respectively. Our method that combines assumptions and network information into the analysis is shown to be more powerful. These proposed procedures are introduced as a general class of methods that can incorporate procedure-selection, account for multiple-testing, and incorporate graphical network information into the analysis. We obtain very good performance in simulations, and in real data analysis.
严格分析基因表达数据需要考虑假设,但也依赖于使用有关基因之间存在的网络关系的信息。结合这些不同的元素不仅可以提高统计能力,而且还提供了一个更好的框架,通过这个框架可以正确地分析基因表达。我们提出了一个新的统计模型,将假设和基因网络信息结合到分析中。假设是重要的,因为每个检验统计量只有在必要的假设成立时才有效。因此,我们提出混合p值,并证明,在原假设下,这些p值是均匀分布的。这些建议的混合p值考虑了假设。我们将基因网络信息纳入分析,因为邻近基因共享生物学功能。这种相关因素是通过邻近基因的相似先验概率来考虑的。通过一系列的仿真,将该方法与其他方法进行了比较。构建ROC曲线下面积(auc)来比较不同的方法;基于我们方法的AUC比其他方法的AUC大。对于回归分析,我们提出的方法的AUC包含Spearman检验和Pearson检验的AUC。此外,与其他方法相比,我们的方法的真阴性率(tnr)也被称为特异性更高。以两组比较分析为例,当样本量为n=10时,我们提出的方法对应的特异性为0.716146,t检验特异性为0.689223,秩和特异性为0.69797。我们将假设和网络信息结合到分析中的方法被证明是更强大的。这些建议的程序被介绍为一般类型的方法,可以结合程序选择,考虑多重测试,并将图形网络信息纳入分析。我们在仿真和实际数据分析中都取得了很好的效果。
{"title":"Combining assumptions and graphical network into gene expression data analysis","authors":"Demba Fofana, E. O. George, Dale Bowman","doi":"10.1186/s40488-021-00126-z","DOIUrl":"https://doi.org/10.1186/s40488-021-00126-z","url":null,"abstract":"Analyzing gene expression data rigorously requires taking assumptions into consideration but also relies on using information about network relations that exist among genes. Combining these different elements cannot only improve statistical power, but also provide a better framework through which gene expression can be properly analyzed. We propose a novel statistical model that combines assumptions and gene network information into the analysis. Assumptions are important since every test statistic is valid only when required assumptions hold. So, we propose hybrid p-values and show that, under the null hypothesis of primary interest, these p-values are uniformly distributed. These proposed hybrid p-values take assumptions into consideration. We incorporate gene network information into the analysis because neighboring genes share biological functions. This correlation factor is taken into account via similar prior probabilities for neighboring genes. With a series of simulations our approach is compared with other approaches. Area Under the ROC Curves (AUCs) are constructed to compare the different methodologies; the AUC based on our methodology is larger than others. For regression analysis, AUC from our proposed method contains AUCs of Spearman test and of Pearson test. In addition, true negative rates (TNRs) also known as specificities are higher with our approach than with the other approaches. For two group comparison analysis, for instance, with a sample size of n=10, specificity corresponding to our proposed methodology is 0.716146 and specificities for t-test and rank sum are 0.689223 and 0.69797, respectively. Our method that combines assumptions and network information into the analysis is shown to be more powerful. These proposed procedures are introduced as a general class of methods that can incorporate procedure-selection, account for multiple-testing, and incorporate graphical network information into the analysis. We obtain very good performance in simulations, and in real data analysis.","PeriodicalId":52216,"journal":{"name":"Journal of Statistical Distributions and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138503767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A general stochastic model for bivariate episodes driven by a gamma sequence 由伽马序列驱动的双变量事件的一般随机模型
Q2 Mathematics Pub Date : 2021-04-12 DOI: 10.1186/s40488-021-00120-5
Charles K. Amponsah, Tomasz J. Kozubowski, Anna K. Panorska
We propose a new stochastic model describing the joint distribution of (X,N), where N is a counting variable while X is the sum of N independent gamma random variables. We present the main properties of this general model, which include marginal and conditional distributions, integral transforms, moments and parameter estimation. We also discuss in more detail a special case where N has a heavy tailed discrete Pareto distribution. An example from finance illustrates the modeling potential of this new mixed bivariate distribution.
我们提出了一个描述(X,N)联合分布的新随机模型,其中N是计数变量,而X是N个独立随机变量的和。给出了该模型的主要性质,包括边缘分布和条件分布、积分变换、矩和参数估计。我们还详细讨论了N具有重尾离散Pareto分布的特殊情况。金融领域的一个例子说明了这种新的混合二元分布的建模潜力。
{"title":"A general stochastic model for bivariate episodes driven by a gamma sequence","authors":"Charles K. Amponsah, Tomasz J. Kozubowski, Anna K. Panorska","doi":"10.1186/s40488-021-00120-5","DOIUrl":"https://doi.org/10.1186/s40488-021-00120-5","url":null,"abstract":"We propose a new stochastic model describing the joint distribution of (X,N), where N is a counting variable while X is the sum of N independent gamma random variables. We present the main properties of this general model, which include marginal and conditional distributions, integral transforms, moments and parameter estimation. We also discuss in more detail a special case where N has a heavy tailed discrete Pareto distribution. An example from finance illustrates the modeling potential of this new mixed bivariate distribution.","PeriodicalId":52216,"journal":{"name":"Journal of Statistical Distributions and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138503766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A flexible multivariate model for high-dimensional correlated count data 高维相关计数数据的灵活多元模型
Q2 Mathematics Pub Date : 2021-03-16 DOI: 10.1186/s40488-021-00119-y
Alexander D. Knudson, Tomasz J. Kozubowski, Anna K. Panorska, A. Grant Schissler
We propose a flexible multivariate stochastic model for over-dispersed count data. Our methodology is built upon mixed Poisson random vectors (Y1,…,Yd), where the {Yi} are conditionally independent Poisson random variables. The stochastic rates of the {Yi} are multivariate distributions with arbitrary non-negative margins linked by a copula function. We present basic properties of these mixed Poisson multivariate distributions and provide several examples. A particular case with geometric and negative binomial marginal distributions is studied in detail. We illustrate an application of our model by conducting a high-dimensional simulation motivated by RNA-sequencing data.
我们提出了一个灵活的多元随机模型,用于过度分散的计数数据。我们的方法建立在混合泊松随机向量(Y1,…,Yd)之上,其中{Yi}是条件独立的泊松随机变量。{Yi}的随机率是由一个联结函数连接的任意非负边界的多元分布。我们给出了这些混合泊松多元分布的基本性质,并给出了几个例子。详细研究了具有几何负二项边际分布的一种特殊情况。我们通过进行由rna测序数据驱动的高维模拟来说明我们模型的应用。
{"title":"A flexible multivariate model for high-dimensional correlated count data","authors":"Alexander D. Knudson, Tomasz J. Kozubowski, Anna K. Panorska, A. Grant Schissler","doi":"10.1186/s40488-021-00119-y","DOIUrl":"https://doi.org/10.1186/s40488-021-00119-y","url":null,"abstract":"We propose a flexible multivariate stochastic model for over-dispersed count data. Our methodology is built upon mixed Poisson random vectors (Y1,…,Yd), where the {Yi} are conditionally independent Poisson random variables. The stochastic rates of the {Yi} are multivariate distributions with arbitrary non-negative margins linked by a copula function. We present basic properties of these mixed Poisson multivariate distributions and provide several examples. A particular case with geometric and negative binomial marginal distributions is studied in detail. We illustrate an application of our model by conducting a high-dimensional simulation motivated by RNA-sequencing data.","PeriodicalId":52216,"journal":{"name":"Journal of Statistical Distributions and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138503765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalized fiducial inference on the mean of zero-inflated Poisson and Poisson hurdle models 零膨胀泊松和泊松障碍模型均值的广义基准推断
Q2 Mathematics Pub Date : 2021-03-06 DOI: 10.1186/s40488-021-00117-0
Yixuan Zou, Jan Hannig, D. S. Young
{"title":"Generalized fiducial inference on the mean of zero-inflated Poisson and Poisson hurdle models","authors":"Yixuan Zou, Jan Hannig, D. S. Young","doi":"10.1186/s40488-021-00117-0","DOIUrl":"https://doi.org/10.1186/s40488-021-00117-0","url":null,"abstract":"","PeriodicalId":52216,"journal":{"name":"Journal of Statistical Distributions and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40488-021-00117-0","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65895438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multivariate distributions of correlated binary variables generated by pair-copulas 由成对耦合产生的相关二元变量的多元分布
Q2 Mathematics Pub Date : 2021-03-05 DOI: 10.1186/s40488-021-00118-z
Huihui Lin, N. Rao Chaganty
Correlated binary data are prevalent in a wide range of scientific disciplines, including healthcare and medicine. The generalized estimating equations (GEEs) and the multivariate probit (MP) model are two of the popular methods for analyzing such data. However, both methods have some significant drawbacks. The GEEs may not have an underlying likelihood and the MP model may fail to generate a multivariate binary distribution with specified marginals and bivariate correlations. In this paper, we study multivariate binary distributions that are based on D-vine pair-copula models as a superior alternative to these methods. We elucidate the construction of these binary distributions in two and three dimensions with numerical examples. For higher dimensions, we provide a method of constructing a multidimensional binary distribution with specified marginals and equicorrelated correlation matrix. We present a real-life data analysis to illustrate the application of our results.
相关二进制数据在广泛的科学学科中很流行,包括医疗保健和医学。广义估计方程(GEEs)和多元概率模型(MP)是分析此类数据的两种常用方法。然而,这两种方法都有一些明显的缺点。GEEs可能没有潜在的可能性,MP模型可能无法生成具有特定边际和二元相关性的多元二元分布。在本文中,我们研究了基于D-vine对-copula模型的多元二元分布,作为这些方法的一个较好的替代。我们用数值例子说明了二维和三维二进制分布的构造。对于高维,我们提供了一种构造具有指定边缘和等相关矩阵的多维二元分布的方法。我们提出了一个现实生活中的数据分析来说明我们的结果的应用。
{"title":"Multivariate distributions of correlated binary variables generated by pair-copulas","authors":"Huihui Lin, N. Rao Chaganty","doi":"10.1186/s40488-021-00118-z","DOIUrl":"https://doi.org/10.1186/s40488-021-00118-z","url":null,"abstract":"Correlated binary data are prevalent in a wide range of scientific disciplines, including healthcare and medicine. The generalized estimating equations (GEEs) and the multivariate probit (MP) model are two of the popular methods for analyzing such data. However, both methods have some significant drawbacks. The GEEs may not have an underlying likelihood and the MP model may fail to generate a multivariate binary distribution with specified marginals and bivariate correlations. In this paper, we study multivariate binary distributions that are based on D-vine pair-copula models as a superior alternative to these methods. We elucidate the construction of these binary distributions in two and three dimensions with numerical examples. For higher dimensions, we provide a method of constructing a multidimensional binary distribution with specified marginals and equicorrelated correlation matrix. We present a real-life data analysis to illustrate the application of our results.","PeriodicalId":52216,"journal":{"name":"Journal of Statistical Distributions and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138503845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Journal of Statistical Distributions and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1