首页 > 最新文献

Communications in Statistics Case Studies Data Analysis and Applications最新文献

英文 中文
Tail aligned composite quantile estimator for bootstrapping of high quantiles 用于高分位数自举的尾部对齐复合分位数估计器
Q4 Mathematics Pub Date : 2021-05-02 DOI: 10.1080/23737484.2021.1915900
R. S. Jagtap, Mohan Kale, V. K. Gedam
Abstract Reliable assessment of high quantiles, namely quantile with relatively low exceedance probability, based on available sample is of interest in hydrology, meteorology, finance and many other fields. Interval estimation of extreme quantities in real-world mechanisms is essential, but it is challenging due to complexities in underlying data-generating processes, small sample sizes, data are not normal, failure of the standard statistical assumptions etc. leading to huge stochastic uncertainties. A composite quantile function estimator aligned using tail of generalized extreme value distribution is employed to construct bootstrap confidence intervals for high-order quantiles. The proposed semi-parametric estimator is shown to be asymptotically unbiased and consistent. The utility of the proposed estimator in comparison with traditional nonparametric and parametric bootstrap in terms of coverage probability for small size and case study application to real-world precipitation datasets has been illustrated. Limitations posed in computations and scope for future work is highlighted.
摘要基于现有样本对高分位数(即超出概率相对较低的分位数)进行可靠评估,是水文、气象、金融等诸多领域关注的问题。在现实世界的机制中,极值的区间估计是必不可少的,但由于底层数据生成过程的复杂性、小样本量、数据不正常、标准统计假设的失败等导致巨大的随机不确定性,这是具有挑战性的。利用广义极值分布尾部对齐的复合分位数函数估计量构造高阶分位数的自举置信区间。证明了所提出的半参数估计量是渐近无偏和一致的。与传统的非参数自举和参数自举相比,所提出的估计器在小尺寸的覆盖概率和实际降水数据集的案例研究应用方面的效用已经得到说明。强调了计算中的局限性和未来工作的范围。
{"title":"Tail aligned composite quantile estimator for bootstrapping of high quantiles","authors":"R. S. Jagtap, Mohan Kale, V. K. Gedam","doi":"10.1080/23737484.2021.1915900","DOIUrl":"https://doi.org/10.1080/23737484.2021.1915900","url":null,"abstract":"Abstract Reliable assessment of high quantiles, namely quantile with relatively low exceedance probability, based on available sample is of interest in hydrology, meteorology, finance and many other fields. Interval estimation of extreme quantities in real-world mechanisms is essential, but it is challenging due to complexities in underlying data-generating processes, small sample sizes, data are not normal, failure of the standard statistical assumptions etc. leading to huge stochastic uncertainties. A composite quantile function estimator aligned using tail of generalized extreme value distribution is employed to construct bootstrap confidence intervals for high-order quantiles. The proposed semi-parametric estimator is shown to be asymptotically unbiased and consistent. The utility of the proposed estimator in comparison with traditional nonparametric and parametric bootstrap in terms of coverage probability for small size and case study application to real-world precipitation datasets has been illustrated. Limitations posed in computations and scope for future work is highlighted.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"73 1","pages":"494 - 515"},"PeriodicalIF":0.0,"publicationDate":"2021-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74084020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
School motivation profiles of Dutch 9th graders 荷兰九年级学生求学动机分析
Q4 Mathematics Pub Date : 2021-05-01 DOI: 10.1080/23737484.2021.1911719
Denise M. Blom, M. Warrens, Meike Faber
Abstract The aim of this study was to identify school motivation profiles of Dutch 9th grade students in a four-dimensional motivation space, including mastery, performance, social and extrinsic motivation. Multiple clustering methods (K-means, K-medoids, restricted latent profile analysis) and multiple indices for selecting the optimal number of clusters were applied. The statistical selection methods did not completely concur on the optimal number of clusters, but a clear common denominator was provided by the Calinski-Harabasz index and the minimum and mean Silhouette values. All three indices indicated two clusters as the optimal number, regardless of the clustering method used: one cluster of 9th graders with high average scores on all dimensions and one cluster with low mean scores on all dimensions. In addition, we explored the substantive interpretation of multiple cluster solutions. It was discovered that most students are in clusters that can be classified into one of three profile types that may differ in level: (1) approximately equal mean scores on all dimensions, (2) relative high mean scores on mastery and social motivation, and (3) a relatively low mean score on performance motivation. The latter profile type is believed to be a new discovery.
摘要本研究的目的是在四维动机空间中识别荷兰九年级学生的学习动机特征,包括掌握、表现、社会动机和外在动机。采用多聚类方法(K-means、k - medidoids、限制性潜在剖面分析)和多指标选择最优聚类数量。统计选择方法在最优聚类数量上并不完全一致,但Calinski-Harabasz指数和剪影值的最小值和平均值提供了一个明确的公分母。无论采用何种聚类方法,这三个指标都表明两个聚类是最优数量:一个聚类是所有维度平均得分高的九年级学生,另一个聚类是所有维度平均得分低的九年级学生。此外,我们还探讨了多集群解决方案的实质性解释。研究发现,大多数学生所在的集群可以被划分为三种不同水平的概况类型之一:(1)所有维度的平均得分大致相等,(2)掌握和社会动机的平均得分相对较高,(3)绩效动机的平均得分相对较低。后一种剖面类型被认为是新发现的。
{"title":"School motivation profiles of Dutch 9th graders","authors":"Denise M. Blom, M. Warrens, Meike Faber","doi":"10.1080/23737484.2021.1911719","DOIUrl":"https://doi.org/10.1080/23737484.2021.1911719","url":null,"abstract":"Abstract The aim of this study was to identify school motivation profiles of Dutch 9th grade students in a four-dimensional motivation space, including mastery, performance, social and extrinsic motivation. Multiple clustering methods (K-means, K-medoids, restricted latent profile analysis) and multiple indices for selecting the optimal number of clusters were applied. The statistical selection methods did not completely concur on the optimal number of clusters, but a clear common denominator was provided by the Calinski-Harabasz index and the minimum and mean Silhouette values. All three indices indicated two clusters as the optimal number, regardless of the clustering method used: one cluster of 9th graders with high average scores on all dimensions and one cluster with low mean scores on all dimensions. In addition, we explored the substantive interpretation of multiple cluster solutions. It was discovered that most students are in clusters that can be classified into one of three profile types that may differ in level: (1) approximately equal mean scores on all dimensions, (2) relative high mean scores on mastery and social motivation, and (3) a relatively low mean score on performance motivation. The latter profile type is believed to be a new discovery.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"58 2 1","pages":"359 - 381"},"PeriodicalIF":0.0,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76868487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disease mapping of biomarkers for breast cancer in Tehran using spatial joint model: A Bayesian perspective 利用空间联合模型绘制德黑兰乳腺癌生物标志物的疾病图谱:贝叶斯视角
Q4 Mathematics Pub Date : 2021-03-14 DOI: 10.1080/23737484.2021.1882354
T. Baghfalaki, M. Kamarehee, M. Ganjali, A. Shabbak, M. Khayamzadeh, M. Akbari
Abstract Breast cancer is one of the most important medical concerns that women face today. There are some biomarkers for detection of this cancer. Modeling these biomarkers, finding important factors that are associated with them and estimating the spatial pattern in disease risk across the areal units by disease mapping are the main foci of many studies. In this article, three binary biomarkers (the presence of estrogen receptors, the presence of progesterone receptors, and the absence of human epidermal growth factor receptor-2) are considered simultaneously for disease mapping of breast cancer. The association of these three biomarkers and spatial effects on them are jointly considered by using a convolution model. The proposed approach is applied to disease mapping of biomarkers of breast cancer in Tehran.
乳腺癌是当今女性面临的最重要的医学问题之一。有一些生物标志物可以检测这种癌症。对这些生物标志物进行建模,发现与它们相关的重要因素,并通过疾病制图估计疾病风险的空间格局,是许多研究的主要焦点。在这篇文章中,三种二元生物标志物(雌激素受体的存在、孕激素受体的存在和人表皮生长因子受体-2的缺失)被同时考虑用于乳腺癌的疾病定位。这三种生物标志物的关联及其空间效应通过使用卷积模型共同考虑。提出的方法应用于德黑兰乳腺癌生物标志物的疾病制图。
{"title":"Disease mapping of biomarkers for breast cancer in Tehran using spatial joint model: A Bayesian perspective","authors":"T. Baghfalaki, M. Kamarehee, M. Ganjali, A. Shabbak, M. Khayamzadeh, M. Akbari","doi":"10.1080/23737484.2021.1882354","DOIUrl":"https://doi.org/10.1080/23737484.2021.1882354","url":null,"abstract":"Abstract Breast cancer is one of the most important medical concerns that women face today. There are some biomarkers for detection of this cancer. Modeling these biomarkers, finding important factors that are associated with them and estimating the spatial pattern in disease risk across the areal units by disease mapping are the main foci of many studies. In this article, three binary biomarkers (the presence of estrogen receptors, the presence of progesterone receptors, and the absence of human epidermal growth factor receptor-2) are considered simultaneously for disease mapping of breast cancer. The association of these three biomarkers and spatial effects on them are jointly considered by using a convolution model. The proposed approach is applied to disease mapping of biomarkers of breast cancer in Tehran.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"3 1","pages":"289 - 314"},"PeriodicalIF":0.0,"publicationDate":"2021-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86470919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting time lag between a pair of time series using visibility graph algorithm 利用可见性图算法检测一对时间序列之间的时间滞后
Q4 Mathematics Pub Date : 2021-02-26 DOI: 10.1080/23737484.2021.1882356
Majnu John, J. Ferbinteanu
Abstract Estimating the time lag between a pair of time series is of significance in many practical applications. In this article, we introduce a method to quantify such lags by adapting the visibility graph algorithm, which converts time series into a mathematical graph. Currently widely used method to detect such lags is based on cross-correlations, which has certain limitations. We present simulated examples where the new method identifies the lag correctly and unambiguously while as the cross-correlation method does not. The article includes results from an extensive simulation study conducted to better understand the scenarios where the new method performed better or worse than the existing approach. We also present a likelihood based parametric modeling framework and consider frameworks for quantifying uncertainty and hypothesis testing for the new approach. We apply the current and new methods to two case studies, one from neuroscience and the other from environmental epidemiology, to illustrate the methods further.
摘要估计一对时间序列之间的时滞在许多实际应用中具有重要意义。在本文中,我们介绍了一种量化这种滞后的方法,该方法采用可见性图算法,将时间序列转换为数学图。目前广泛使用的滞后检测方法是基于相互关系的,这种方法有一定的局限性。我们给出了一些模拟实例,其中新方法可以正确和明确地识别滞后,而互相关方法则不能。本文包括一项广泛的模拟研究的结果,该研究旨在更好地理解新方法比现有方法执行得更好或更差的场景。我们还提出了一个基于似然的参数化建模框架,并考虑了量化新方法的不确定性和假设检验的框架。我们将现有方法和新方法应用于两个案例研究,一个来自神经科学,另一个来自环境流行病学,以进一步说明这些方法。
{"title":"Detecting time lag between a pair of time series using visibility graph algorithm","authors":"Majnu John, J. Ferbinteanu","doi":"10.1080/23737484.2021.1882356","DOIUrl":"https://doi.org/10.1080/23737484.2021.1882356","url":null,"abstract":"Abstract Estimating the time lag between a pair of time series is of significance in many practical applications. In this article, we introduce a method to quantify such lags by adapting the visibility graph algorithm, which converts time series into a mathematical graph. Currently widely used method to detect such lags is based on cross-correlations, which has certain limitations. We present simulated examples where the new method identifies the lag correctly and unambiguously while as the cross-correlation method does not. The article includes results from an extensive simulation study conducted to better understand the scenarios where the new method performed better or worse than the existing approach. We also present a likelihood based parametric modeling framework and consider frameworks for quantifying uncertainty and hypothesis testing for the new approach. We apply the current and new methods to two case studies, one from neuroscience and the other from environmental epidemiology, to illustrate the methods further.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"40 1","pages":"315 - 343"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85748506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A flexible probability model for proportion data: Unit-half-normal distribution 一种灵活的比例数据概率模型:单位半正态分布
Q4 Mathematics Pub Date : 2021-02-11 DOI: 10.1080/23737484.2021.1882355
H. Bakouch, A. S. Nik, A. Asgharzadeh, Hugo S. Salinas
Abstract A new class of unimodal asymmetric distributions is introduced to the unit interval and these distributions are useful for modeling data of percentages, proportions and fractions. Therefore, we propose the unit-half-normal distribution as a contribution to the earlier path and investigate some of its mathematical properties. The maximum likelihood estimator is obtained with a comprehensive inference. This new class of distributions belongs to the exponential family, hence the uniformly minimum variance unbiased estimator of the distribution parameter is obtained. The distribution represents a power alternative to the unit interval distributions, namely the beta, Kumaraswamy and other recent ones. We investigate a small simulation study to analyze the behavior of the obtained estimators for different sample sizes. Moreover, we illustrate the goodness of fit of the proposed model for image data. Lastly, we describe a procedure of incorporating covariates into regression analysis of the proposed distribution.
在单位区间中引入了一类新的单峰不对称分布,这些分布可用于百分比、比例和分数数据的建模。因此,我们提出了单位半正态分布作为早期路径的贡献,并研究了它的一些数学性质。通过综合推理得到了极大似然估计量。这类分布属于指数族,因此得到了分布参数的一致最小方差无偏估计量。该分布代表了单位区间分布(即beta、Kumaraswamy和其他最近的分布)的一个有力替代。我们研究了一个小型的模拟研究,以分析得到的估计量在不同样本量下的行为。此外,我们还说明了该模型对图像数据的拟合良好性。最后,我们描述了将协变量纳入拟分布回归分析的过程。
{"title":"A flexible probability model for proportion data: Unit-half-normal distribution","authors":"H. Bakouch, A. S. Nik, A. Asgharzadeh, Hugo S. Salinas","doi":"10.1080/23737484.2021.1882355","DOIUrl":"https://doi.org/10.1080/23737484.2021.1882355","url":null,"abstract":"Abstract A new class of unimodal asymmetric distributions is introduced to the unit interval and these distributions are useful for modeling data of percentages, proportions and fractions. Therefore, we propose the unit-half-normal distribution as a contribution to the earlier path and investigate some of its mathematical properties. The maximum likelihood estimator is obtained with a comprehensive inference. This new class of distributions belongs to the exponential family, hence the uniformly minimum variance unbiased estimator of the distribution parameter is obtained. The distribution represents a power alternative to the unit interval distributions, namely the beta, Kumaraswamy and other recent ones. We investigate a small simulation study to analyze the behavior of the obtained estimators for different sample sizes. Moreover, we illustrate the goodness of fit of the proposed model for image data. Lastly, we describe a procedure of incorporating covariates into regression analysis of the proposed distribution.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"51 1","pages":"271 - 288"},"PeriodicalIF":0.0,"publicationDate":"2021-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81790928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A double mixture autoregressive model of commodity prices 商品价格的双混合自回归模型
Q4 Mathematics Pub Date : 2021-02-11 DOI: 10.1080/23737484.2021.1882353
Gilbert Mbara
Abstract Many commodity prices exhibit boom-bust type behavior: sustained periods of price increases, followed by sudden sharp collapses. Since around the year 2000, booms have become longer while busts have tended to be short but steep, suggesting a structural change in growth and persistence. We model these features of the data using a novel double mixture autoregression with two independent hidden Markov chains. One chain tracks shifts in mean growth rates that account for rising and falling prices, while a second chain tracks changes in volatility and lag-structure. While the two chains are independent, the persistence of price growth depends on the volatility state, which allows the lag-structure to vary across variance regimes. Estimation requires a two-stage Fisherian approach. Initially, location-related parameters are estimated while suppressing the underlying autoregressive structure. These parameters are then held fixed while the optimal lag-structure across variance regimes is determined. We apply the model to three industrial commodities price time series: Crude Oil, Aluminum, and Rubber. We find that in each case, the model captures boom and bust cycles, with data from more recent periods exhibiting higher volatility, longer price rallies, and steeper collapses.
许多商品价格表现出盛衰型行为:价格持续上涨,随后突然急剧下跌。自2000年左右以来,繁荣变得更长,而萧条往往是短暂而陡峭的,这表明增长和持久性发生了结构性变化。我们使用一种新颖的双混合自回归模型来模拟数据的这些特征,该模型具有两个独立的隐马尔可夫链。一条链追踪反映价格涨跌的平均增长率的变化,另一条链追踪波动性和滞后结构的变化。虽然这两条链是独立的,但价格增长的持久性取决于波动性状态,这使得滞后结构在不同的方差制度下变化。估计需要两阶段费雪方法。首先,估计与位置相关的参数,同时抑制潜在的自回归结构。这些参数然后保持固定,同时确定跨方差制度的最优滞后结构。我们将模型应用于原油、铝和橡胶这三种工业商品的价格时间序列。我们发现,在每种情况下,该模型都捕捉到了繁荣和萧条的周期,来自最近时期的数据显示出更高的波动性,更长的价格反弹和更陡峭的崩溃。
{"title":"A double mixture autoregressive model of commodity prices","authors":"Gilbert Mbara","doi":"10.1080/23737484.2021.1882353","DOIUrl":"https://doi.org/10.1080/23737484.2021.1882353","url":null,"abstract":"Abstract Many commodity prices exhibit boom-bust type behavior: sustained periods of price increases, followed by sudden sharp collapses. Since around the year 2000, booms have become longer while busts have tended to be short but steep, suggesting a structural change in growth and persistence. We model these features of the data using a novel double mixture autoregression with two independent hidden Markov chains. One chain tracks shifts in mean growth rates that account for rising and falling prices, while a second chain tracks changes in volatility and lag-structure. While the two chains are independent, the persistence of price growth depends on the volatility state, which allows the lag-structure to vary across variance regimes. Estimation requires a two-stage Fisherian approach. Initially, location-related parameters are estimated while suppressing the underlying autoregressive structure. These parameters are then held fixed while the optimal lag-structure across variance regimes is determined. We apply the model to three industrial commodities price time series: Crude Oil, Aluminum, and Rubber. We find that in each case, the model captures boom and bust cycles, with data from more recent periods exhibiting higher volatility, longer price rallies, and steeper collapses.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"97 1","pages":"249 - 270"},"PeriodicalIF":0.0,"publicationDate":"2021-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79188194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Tweedie, Bar-Lev, and Enis class of leptokurtic distributions as a candidate for modeling real data Tweedie, Bar-Lev和Enis类的细峰分布作为真实数据建模的候选
Q4 Mathematics Pub Date : 2021-02-11 DOI: 10.1080/23737484.2021.1880988
S. Bar-Lev, A. Batsidis, P. Economou
Abstract Modeling real life data is often a demanding task and plethora of distributional models have been proposed in the statistical literature in an attempt to describe different data sets in a better way than those used to describe them. In this article, we establish a broad pool of families of parametric distributions previously used in the literature. This pool, which includes 23 parametric models of distributions, is implemented to test the fit of its models to 17 data sets having different characteristics. In doing so, we will mainly pay attention to a three-parameter model that includes the class of natural exponential families generated by positive stable distributions. Indeed, this is the class we wish to pinpoint in this article and highlight its importance for modeling real data sets. The class is shown to be rather competitive alternative to some well-known parametric models in the pool especially when applied to leptokurtic data sets is available. Appropriate R codes which include all parametric models in the pool are provided in a supplementary file for further applications and implementations for other data sets. Supplemental data for this article is available online at https://doi.org/10.1080/23737484.2021.1880988
对现实生活中的数据进行建模通常是一项艰巨的任务,统计文献中提出了大量的分布模型,试图以比用来描述它们的方法更好的方式描述不同的数据集。在本文中,我们建立了以前在文献中使用的参数分布族的广泛池。该池包含23个分布参数模型,并对17个具有不同特征的数据集进行拟合检验。在此过程中,我们将主要关注一个三参数模型,该模型包括由正稳定分布生成的一类自然指数族。实际上,这就是我们希望在本文中指出的类,并强调它对真实数据集建模的重要性。该类被证明是池中一些众所周知的参数模型的相当有竞争力的替代方案,特别是当应用于可用的细峰数据集时。在补充文件中提供了适当的R代码,其中包括池中的所有参数模型,用于其他数据集的进一步应用和实现。本文的补充数据可在https://doi.org/10.1080/23737484.2021.1880988上在线获得
{"title":"Tweedie, Bar-Lev, and Enis class of leptokurtic distributions as a candidate for modeling real data","authors":"S. Bar-Lev, A. Batsidis, P. Economou","doi":"10.1080/23737484.2021.1880988","DOIUrl":"https://doi.org/10.1080/23737484.2021.1880988","url":null,"abstract":"Abstract Modeling real life data is often a demanding task and plethora of distributional models have been proposed in the statistical literature in an attempt to describe different data sets in a better way than those used to describe them. In this article, we establish a broad pool of families of parametric distributions previously used in the literature. This pool, which includes 23 parametric models of distributions, is implemented to test the fit of its models to 17 data sets having different characteristics. In doing so, we will mainly pay attention to a three-parameter model that includes the class of natural exponential families generated by positive stable distributions. Indeed, this is the class we wish to pinpoint in this article and highlight its importance for modeling real data sets. The class is shown to be rather competitive alternative to some well-known parametric models in the pool especially when applied to leptokurtic data sets is available. Appropriate R codes which include all parametric models in the pool are provided in a supplementary file for further applications and implementations for other data sets. Supplemental data for this article is available online at https://doi.org/10.1080/23737484.2021.1880988","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"65 1","pages":"229 - 248"},"PeriodicalIF":0.0,"publicationDate":"2021-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85802369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Bayesian inference for the Birnbaum–Saunders autoregressive conditional duration model with application to high-frequency financial data Birnbaum-Saunders自回归条件期限模型的贝叶斯推断及其在高频金融数据中的应用
Q4 Mathematics Pub Date : 2021-01-28 DOI: 10.1080/23737484.2021.1874571
Nascimento Fernando, Leao Jeremias, H. Saulo
Abstract Autoregressive conditional duration (ACD) models have been preponderant when the subject is the modeling of high-frequency financial data. A prominent model that has demonstrated great adjustment capacity is the ACD model based on the Birnbaum–Saunders distribution (BS-ACD). Recent works have shown that this model outperforms the existing models in the literature. Nevertheless, these works explore only classical estimation approaches. In this article, we perform a Bayesian approach of the BS-ACD model. The scale parameter was modeled considering a dynamic linear model. Estimation of posterior distribution of parameters was approximated through Markov chain Monte Carlo methods. A simulation study is conducted to evaluate the performance of Bayesian estimators and two applications to real high frequency data illustrate the proposed methodology.
自回归条件持续时间(ACD)模型在高频金融数据建模中占有优势。基于Birnbaum-Saunders分布的ACD模型(BS-ACD)是具有较强调节能力的突出模型。最近的研究表明,该模型优于文献中现有的模型。然而,这些工作只探讨了经典的估计方法。在本文中,我们执行了BS-ACD模型的贝叶斯方法。尺度参数采用动态线性模型建模。通过马尔可夫链蒙特卡罗方法逼近了参数后验分布的估计。通过仿真研究评估了贝叶斯估计器的性能,并在实际高频数据中的两个应用验证了所提出的方法。
{"title":"Bayesian inference for the Birnbaum–Saunders autoregressive conditional duration model with application to high-frequency financial data","authors":"Nascimento Fernando, Leao Jeremias, H. Saulo","doi":"10.1080/23737484.2021.1874571","DOIUrl":"https://doi.org/10.1080/23737484.2021.1874571","url":null,"abstract":"Abstract Autoregressive conditional duration (ACD) models have been preponderant when the subject is the modeling of high-frequency financial data. A prominent model that has demonstrated great adjustment capacity is the ACD model based on the Birnbaum–Saunders distribution (BS-ACD). Recent works have shown that this model outperforms the existing models in the literature. Nevertheless, these works explore only classical estimation approaches. In this article, we perform a Bayesian approach of the BS-ACD model. The scale parameter was modeled considering a dynamic linear model. Estimation of posterior distribution of parameters was approximated through Markov chain Monte Carlo methods. A simulation study is conducted to evaluate the performance of Bayesian estimators and two applications to real high frequency data illustrate the proposed methodology.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"90 3 1","pages":"215 - 228"},"PeriodicalIF":0.0,"publicationDate":"2021-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82563076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Geographically weighted Poisson regression models with different kernels: Application to road traffic accident data 不同核的地理加权泊松回归模型:在道路交通事故数据中的应用
Q4 Mathematics Pub Date : 2021-01-25 DOI: 10.1080/23737484.2020.1869628
Ghanim Al-Hasani, M. Asaduzzaman, A. Soliman
Abstract Geographically weighted Poisson regression (GWPR) models are the class of spatial count regression models that capture the localization effect on various influencing factors on the dependent variable. The main challenge with the GWPR models is to set appropriate kernel function to give weights for each neighboring point during the model calibration. In this article, we consider GWPR models for many different kernel functions, including box-car, bi-square, tri-cube, exponential, and Gaussian function. Likelihood function, parameter estimation, and model selection criteria have been shown in details. We applied the model formulation to the road traffic accident (RTA) data in Oman as the country is one of the largest RTA-prone countries in the Gulf region. Akaike information criterion, corrected Akaike information criterion, and geographically weighted deviance have been used to assess the model fitting. The model with the exponential kernel weighted function provides the best fit for the data and captures the spatial heterogeneity and factors better with the exponential kernel weighting function.
地理加权泊松回归(GWPR)模型是一类空间计数回归模型,它捕捉了因变量上各种影响因素的局部化效应。GWPR模型的主要挑战是在模型标定过程中如何设置合适的核函数来赋予每个相邻点的权值。在本文中,我们将考虑许多不同核函数的GWPR模型,包括箱形函数、双平方函数、三立方函数、指数函数和高斯函数。似然函数、参数估计和模型选择标准已详细说明。我们将模型公式应用于阿曼的道路交通事故(RTA)数据,因为该国是海湾地区最大的RTA易发国家之一。采用赤池信息准则、修正赤池信息准则和地理加权偏差评价模型拟合。采用指数核加权函数的模型对数据的拟合效果最好,能更好地捕捉到空间异质性和影响因素。
{"title":"Geographically weighted Poisson regression models with different kernels: Application to road traffic accident data","authors":"Ghanim Al-Hasani, M. Asaduzzaman, A. Soliman","doi":"10.1080/23737484.2020.1869628","DOIUrl":"https://doi.org/10.1080/23737484.2020.1869628","url":null,"abstract":"Abstract Geographically weighted Poisson regression (GWPR) models are the class of spatial count regression models that capture the localization effect on various influencing factors on the dependent variable. The main challenge with the GWPR models is to set appropriate kernel function to give weights for each neighboring point during the model calibration. In this article, we consider GWPR models for many different kernel functions, including box-car, bi-square, tri-cube, exponential, and Gaussian function. Likelihood function, parameter estimation, and model selection criteria have been shown in details. We applied the model formulation to the road traffic accident (RTA) data in Oman as the country is one of the largest RTA-prone countries in the Gulf region. Akaike information criterion, corrected Akaike information criterion, and geographically weighted deviance have been used to assess the model fitting. The model with the exponential kernel weighted function provides the best fit for the data and captures the spatial heterogeneity and factors better with the exponential kernel weighting function.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"70 1","pages":"166 - 181"},"PeriodicalIF":0.0,"publicationDate":"2021-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86905651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Modeling and forecasting life expectancy in India: A systematic approach 印度人预期寿命的建模和预测:一种系统的方法
Q4 Mathematics Pub Date : 2021-01-12 DOI: 10.1080/23737484.2020.1869630
Abhishek Singh, S. Hasija
Abstract In this study, the autoregressive integrated moving average models were used to fit yearly life expectancy data collected from the official website of the World Bank. The results show a steady growth of life expectancy at birth for males, females, and the total population during 2017–2030. Moreover, this study also attempted to examine the risk factors associated with life expectancy at birth in India. The results of the multiple linear regression showed that the employment rate, school enrollment rate, and healthcare expenditure were significant risk factors associated with life expectancy at birth for males, females, and the total population.
摘要本研究采用自回归综合移动平均模型对世界银行官方网站收集的年预期寿命数据进行拟合。结果显示,在2017-2030年期间,男性、女性和总人口的出生时预期寿命都在稳步增长。此外,本研究还试图检查与印度出生时预期寿命相关的风险因素。多元线性回归结果显示,就业率、入学率和医疗费用是影响男性、女性和总人口出生时预期寿命的显著危险因素。
{"title":"Modeling and forecasting life expectancy in India: A systematic approach","authors":"Abhishek Singh, S. Hasija","doi":"10.1080/23737484.2020.1869630","DOIUrl":"https://doi.org/10.1080/23737484.2020.1869630","url":null,"abstract":"Abstract In this study, the autoregressive integrated moving average models were used to fit yearly life expectancy data collected from the official website of the World Bank. The results show a steady growth of life expectancy at birth for males, females, and the total population during 2017–2030. Moreover, this study also attempted to examine the risk factors associated with life expectancy at birth in India. The results of the multiple linear regression showed that the employment rate, school enrollment rate, and healthcare expenditure were significant risk factors associated with life expectancy at birth for males, females, and the total population.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"82 1","pages":"200 - 214"},"PeriodicalIF":0.0,"publicationDate":"2021-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83407844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Communications in Statistics Case Studies Data Analysis and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1