首页 > 最新文献

Communications in Statistics Case Studies Data Analysis and Applications最新文献

英文 中文
Pointwise norm-based clustering of data in arbitrary dimensional space 任意维空间中基于点向范数的数据聚类
Q4 Mathematics Pub Date : 2023-04-03 DOI: 10.1080/23737484.2023.2199952
Soumita Modak
ABSTRACT In this paper, a novel nonparametric norm-based clustering algorithm is proposed to classify real-valued continuous data sets given in arbitrary dimensional space. For data univariate, multivariate or high-dimensional, with the number of study variables close to or larger than the data size, our straightforward algorithm is implemented throughout under an univariate set-up, where we make use of the observation-wise (or pointwise) norms which quantify the distances of the observations from the origin zero or the null vector. The method begins with determination of the sample quantile using nonparamteric bootstrapping on the computed norms and always converges independently. By its design, the suggested algorithm is fast enough to detect the number of existing clusters itself and to form well-defined groups. Data study demonstrates its competitiveness in comparison to 2 popular clustering algorithms K-means and K-medoids.
本文提出了一种新的基于非参数范数的聚类算法,用于对任意维空间中给定的实值连续数据集进行分类。对于单变量、多变量或高维数据,研究变量的数量接近或大于数据大小,我们的直接算法在单变量设置下实现,其中我们利用观测智能(或点向)规范,量化观测值与原点零或零向量的距离。该方法首先利用非参数自举法对计算的范数确定样本分位数,并且总是独立收敛。通过其设计,建议的算法足够快,可以检测现有集群的数量,并形成定义良好的组。数据研究表明,与两种流行的聚类算法K-means和K-medoids相比,该算法具有竞争力。
{"title":"Pointwise norm-based clustering of data in arbitrary dimensional space","authors":"Soumita Modak","doi":"10.1080/23737484.2023.2199952","DOIUrl":"https://doi.org/10.1080/23737484.2023.2199952","url":null,"abstract":"ABSTRACT In this paper, a novel nonparametric norm-based clustering algorithm is proposed to classify real-valued continuous data sets given in arbitrary dimensional space. For data univariate, multivariate or high-dimensional, with the number of study variables close to or larger than the data size, our straightforward algorithm is implemented throughout under an univariate set-up, where we make use of the observation-wise (or pointwise) norms which quantify the distances of the observations from the origin zero or the null vector. The method begins with determination of the sample quantile using nonparamteric bootstrapping on the computed norms and always converges independently. By its design, the suggested algorithm is fast enough to detect the number of existing clusters itself and to form well-defined groups. Data study demonstrates its competitiveness in comparison to 2 popular clustering algorithms K-means and K-medoids.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"10 1","pages":"121 - 134"},"PeriodicalIF":0.0,"publicationDate":"2023-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77030573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Spatial patterns in Brazilian state legislative elections 巴西州立法选举的空间格局
Q4 Mathematics Pub Date : 2023-04-03 DOI: 10.1080/23737484.2023.2203840
Pedro Chaim, M. Laurini
Abstract In this paper we explore spatial patterns in voting behavior of the Brazilian electorate in the state legislative elections of 2014. With data aggregated at the municipality level, we employ a Beta regression model augmented with spatially correlated random effects to model the share of votes received by the three largest political parties: PMDB, PSDB, and PT. Results suggest PT is more preferred by the electorate in poorer and more densely populated areas, specially in states of the Northwest and South regions, while PSDB performs better in municipalities with relatively higher standards of living. Also, analysis of the spatial random effects indicates this component is especially important to account for the major stylized fact that is the simultaneous PSDB hegemony and relative lack of PMDB presence in the state of São Paulo.
摘要本文探讨了2014年巴西州议会选举中选民投票行为的空间格局。基于城市层面的数据汇总,我们采用了一个增强了空间相关随机效应的Beta回归模型来模拟PMDB、PSDB和PT这三个最大政党的得票份额。结果表明,在较贫穷和人口稠密的地区,特别是在西北和南部地区,PT更受选民的青睐,而PSDB在生活水平相对较高的城市表现更好。此外,对空间随机效应的分析表明,这一成分对于解释主要的程式化事实尤其重要,即在圣保罗州,PSDB霸权和相对缺乏PMDB的存在同时存在。
{"title":"Spatial patterns in Brazilian state legislative elections","authors":"Pedro Chaim, M. Laurini","doi":"10.1080/23737484.2023.2203840","DOIUrl":"https://doi.org/10.1080/23737484.2023.2203840","url":null,"abstract":"Abstract In this paper we explore spatial patterns in voting behavior of the Brazilian electorate in the state legislative elections of 2014. With data aggregated at the municipality level, we employ a Beta regression model augmented with spatially correlated random effects to model the share of votes received by the three largest political parties: PMDB, PSDB, and PT. Results suggest PT is more preferred by the electorate in poorer and more densely populated areas, specially in states of the Northwest and South regions, while PSDB performs better in municipalities with relatively higher standards of living. Also, analysis of the spatial random effects indicates this component is especially important to account for the major stylized fact that is the simultaneous PSDB hegemony and relative lack of PMDB presence in the state of São Paulo.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"1 1","pages":"181 - 195"},"PeriodicalIF":0.0,"publicationDate":"2023-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83016033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward statistical real-time power fault detection 实现电力故障的实时统计检测
Q4 Mathematics Pub Date : 2023-04-03 DOI: 10.1080/23737484.2023.2199957
Mantautas Rimkus, P. Kokoszka, K. Prabakar, Haonan Wang
Abstract We propose statistical fault detection methodology based on high-frequency data streams that are becoming available in modern power grids. Our approach can be treated as an online (sequential) change point monitoring methodology. However, due to the mostly unexplored and very nonstandard structure of high-frequency power grid streaming data, substantial new statistical development is required to make this methodology practically applicable. The paper includes development of scalar detectors based on multichannel data streams, determination of data-driven alarm thresholds and investigation of the performance and robustness of the new tools. Due to a reasonably large database of faults, we can calculate frequencies of false and correct fault signals, and recommend implementations that optimize these empirical success rates.
摘要本文提出了一种基于高频数据流的统计故障检测方法。我们的方法可以被视为一种在线(连续的)变更点监控方法。然而,由于高频电网流数据的大部分未开发和非常不标准的结构,需要大量新的统计发展使该方法实际适用。本文包括基于多通道数据流的标量检测器的开发,数据驱动报警阈值的确定以及新工具的性能和鲁棒性的研究。由于有一个相当大的故障数据库,我们可以计算错误和正确故障信号的频率,并推荐优化这些经验成功率的实现。
{"title":"Toward statistical real-time power fault detection","authors":"Mantautas Rimkus, P. Kokoszka, K. Prabakar, Haonan Wang","doi":"10.1080/23737484.2023.2199957","DOIUrl":"https://doi.org/10.1080/23737484.2023.2199957","url":null,"abstract":"Abstract We propose statistical fault detection methodology based on high-frequency data streams that are becoming available in modern power grids. Our approach can be treated as an online (sequential) change point monitoring methodology. However, due to the mostly unexplored and very nonstandard structure of high-frequency power grid streaming data, substantial new statistical development is required to make this methodology practically applicable. The paper includes development of scalar detectors based on multichannel data streams, determination of data-driven alarm thresholds and investigation of the performance and robustness of the new tools. Due to a reasonably large database of faults, we can calculate frequencies of false and correct fault signals, and recommend implementations that optimize these empirical success rates.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"18 1","pages":"196 - 217"},"PeriodicalIF":0.0,"publicationDate":"2023-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75165080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Spatio-temporal dependence modelling of extreme rainfall in South Africa: A Bayesian integrated nested Laplace approximation technique 南非极端降雨的时空依赖模型:贝叶斯集成嵌套拉普拉斯近似技术
Q4 Mathematics Pub Date : 2023-04-03 DOI: 10.1080/23737484.2023.2207503
T. A. Diriba, L. K. Debusho
Abstract The spatial and spatio-temporal dependence modeling to extreme value distributions have been used to analyze the extremes of daily maximum rainfall data across selected weather stations in South Africa combining generalized Pareto distribution (GPD) with the flexible Bayesian Latent Gaussian Model (LGM). The paper demonstrated the spatio-temporal GPD model for applications in extreme rainfall data that capture systematic variation through spatial and spatio-temporal modeling framework, in which the temporal constitutes the week and month as random separately. The paper uses the Bayesian integrated Nested Laplace approximation (INLA) algorithm to estimate marginal posterior means of the parameters and hyper-parameters for Bayesian spatio-temporal models. The Bayesian inferences using INLA technique were applied to obtain prediction of the return levels at each station, which incorporate uncertainty due to model estimation, as well as the randomness that is inherent in the processes.
摘要结合广义帕累托分布(GPD)和灵活贝叶斯隐高斯模型(LGM),利用极值分布的时空依赖模型对南非部分气象站的日最大降雨量极值进行了分析。本文通过时空建模框架,以周、月为随机时间,建立了时空GPD模型,该模型可以捕捉极端降雨数据的系统变化。本文采用贝叶斯积分嵌套拉普拉斯近似(INLA)算法估计贝叶斯时空模型参数和超参数的边际后验均值。使用INLA技术的贝叶斯推断被用于获得每个站点的返回水平的预测,其中包括由于模型估计而产生的不确定性,以及过程中固有的随机性。
{"title":"Spatio-temporal dependence modelling of extreme rainfall in South Africa: A Bayesian integrated nested Laplace approximation technique","authors":"T. A. Diriba, L. K. Debusho","doi":"10.1080/23737484.2023.2207503","DOIUrl":"https://doi.org/10.1080/23737484.2023.2207503","url":null,"abstract":"Abstract The spatial and spatio-temporal dependence modeling to extreme value distributions have been used to analyze the extremes of daily maximum rainfall data across selected weather stations in South Africa combining generalized Pareto distribution (GPD) with the flexible Bayesian Latent Gaussian Model (LGM). The paper demonstrated the spatio-temporal GPD model for applications in extreme rainfall data that capture systematic variation through spatial and spatio-temporal modeling framework, in which the temporal constitutes the week and month as random separately. The paper uses the Bayesian integrated Nested Laplace approximation (INLA) algorithm to estimate marginal posterior means of the parameters and hyper-parameters for Bayesian spatio-temporal models. The Bayesian inferences using INLA technique were applied to obtain prediction of the return levels at each station, which incorporate uncertainty due to model estimation, as well as the randomness that is inherent in the processes.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"5 1","pages":"152 - 180"},"PeriodicalIF":0.0,"publicationDate":"2023-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89805499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uses of a new asymmetric loss-based process capability index in the electronic industries 一种新的基于非对称损耗的过程能力指数在电子工业中的应用
Q4 Mathematics Pub Date : 2023-04-03 DOI: 10.1080/23737484.2023.2207499
Mahendra Saha, S. Dey
Abstract This article suggests a novel process capability index (PCI) termed as , which is based on an asymmetric loss function (linear exponential) for a normal process and offers a specific method of incorporating the loss in capability analysis. Next, we estimate the suggested PCI using the moment estimation approach when the process follows a normal distribution, and we compare the effectiveness of the investigated estimation methods in terms of their mean squared errors through simulation analysis. Additionally, the confidence intervals for the index are constructed using the generalized confidence interval (GCI) and parametric bootstrap confidence interval (BCI) approach. Using Monte Carlo simulation, the performance of the GCI and BCI is compared in terms of average width, associated coverage probabilities, and relative coverage. Finally, three real data sets from the electronic industries are re-analyzed to show the usefulness of the suggested index, MOM estimation, GCI and BCI.
本文提出了一种基于非对称损失函数(线性指数)的新型过程能力指数(PCI),并提供了一种将损失纳入能力分析的具体方法。接下来,当过程服从正态分布时,我们使用矩估计方法估计建议的PCI,并通过仿真分析比较所研究的估计方法的均方误差的有效性。此外,采用广义置信区间(GCI)和参数自举置信区间(BCI)方法构造了指数的置信区间。通过蒙特卡罗模拟,比较了GCI和BCI在平均宽度、相关覆盖概率和相对覆盖方面的性能。最后,对三个电子行业的真实数据集进行了重新分析,以证明建议指数、MOM估计、GCI和BCI的有效性。
{"title":"Uses of a new asymmetric loss-based process capability index in the electronic industries","authors":"Mahendra Saha, S. Dey","doi":"10.1080/23737484.2023.2207499","DOIUrl":"https://doi.org/10.1080/23737484.2023.2207499","url":null,"abstract":"Abstract This article suggests a novel process capability index (PCI) termed as , which is based on an asymmetric loss function (linear exponential) for a normal process and offers a specific method of incorporating the loss in capability analysis. Next, we estimate the suggested PCI using the moment estimation approach when the process follows a normal distribution, and we compare the effectiveness of the investigated estimation methods in terms of their mean squared errors through simulation analysis. Additionally, the confidence intervals for the index are constructed using the generalized confidence interval (GCI) and parametric bootstrap confidence interval (BCI) approach. Using Monte Carlo simulation, the performance of the GCI and BCI is compared in terms of average width, associated coverage probabilities, and relative coverage. Finally, three real data sets from the electronic industries are re-analyzed to show the usefulness of the suggested index, MOM estimation, GCI and BCI.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"6 1","pages":"135 - 151"},"PeriodicalIF":0.0,"publicationDate":"2023-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90488465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Zero-inflated poisson quasi-Lindley regression for modeling number of doctor visit data 零膨胀poisson准lindley回归对就诊数据的建模
Q4 Mathematics Pub Date : 2023-01-02 DOI: 10.1080/23737484.2023.2164941
Hossein Zamani, Zohreh Pakdaman, Marzieh Shekari
Abstract The Poisson regression is a popular approach in modeling count data. However, in many situations often the variance of data is greater than the mean (over-dispersed data) and the generalized Poisson or mixed Poisson models such as the Poisson gamma (negative binomial), Poisson inverse Gaussian, Poisson lognormal, and Poisson Lindley have been proposed as the alternatives to the Poisson for describing over-dispersed count data. In some situations, the source of over-dispersion is the large percentage of zeros in the dataset. In the other words, the dataset involves an excessive number of zeros than are expected in the common discrete distributions which are known as the zero-inflated events. In order to analyze these data, zero-inflated models such as the zero-inflated Poisson, zero-inflated generalized Poisson, and zero-inflated negative binomial have been applied. This work proposes the functional form and the regression model of the zero-inflated Poisson quasi-Lindley (ZIPQL) and then, beside the alternative models, it was fitted and compared to US National Medical Expenditure Survey data.
泊松回归是一种常用的计数数据建模方法。然而,在许多情况下,数据的方差往往大于平均值(过度分散的数据),广义泊松或混合泊松模型,如泊松伽马(负二项)、泊松逆高斯、泊松对数正态和泊松林德利,已被提出作为泊松的替代品,用于描述过度分散的计数数据。在某些情况下,过度分散的来源是数据集中很大比例的零。换句话说,数据集涉及的零数量超过了常见离散分布中所期望的零数量,这些分布被称为零膨胀事件。为了分析这些数据,应用了零膨胀泊松、零膨胀广义泊松和零膨胀负二项等零膨胀模型。本文提出了零膨胀泊松准林德利(ZIPQL)的函数形式和回归模型,并将其与美国国家医疗支出调查数据进行拟合和比较。
{"title":"Zero-inflated poisson quasi-Lindley regression for modeling number of doctor visit data","authors":"Hossein Zamani, Zohreh Pakdaman, Marzieh Shekari","doi":"10.1080/23737484.2023.2164941","DOIUrl":"https://doi.org/10.1080/23737484.2023.2164941","url":null,"abstract":"Abstract The Poisson regression is a popular approach in modeling count data. However, in many situations often the variance of data is greater than the mean (over-dispersed data) and the generalized Poisson or mixed Poisson models such as the Poisson gamma (negative binomial), Poisson inverse Gaussian, Poisson lognormal, and Poisson Lindley have been proposed as the alternatives to the Poisson for describing over-dispersed count data. In some situations, the source of over-dispersion is the large percentage of zeros in the dataset. In the other words, the dataset involves an excessive number of zeros than are expected in the common discrete distributions which are known as the zero-inflated events. In order to analyze these data, zero-inflated models such as the zero-inflated Poisson, zero-inflated generalized Poisson, and zero-inflated negative binomial have been applied. This work proposes the functional form and the regression model of the zero-inflated Poisson quasi-Lindley (ZIPQL) and then, beside the alternative models, it was fitted and compared to US National Medical Expenditure Survey data.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"27 1","pages":"1 - 15"},"PeriodicalIF":0.0,"publicationDate":"2023-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84907773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weighted least squares estimation of the risk-free rate from derivative prices 衍生品价格对无风险利率的加权最小二乘估计
Q4 Mathematics Pub Date : 2023-01-02 DOI: 10.1080/23737484.2023.2186283
Jörgen Blomvall, Pontus Söderbäck, M. Singull
Abstract This study proposes a method for estimating the interest spread over an OIS-implied spot rate used in market-consistent derivative pricing. Our method generalizes previous proposed ordinary least squares methods in the literature in two ways. First, it utilizes intraday data rather than data from a single time. Second, it is formulated as weighted least squares to counteract heteroscedasticity. Additionally, we present a general methodology to quantify the performance difference between methods when the true value is unknown. We find that our method outperforms previously proposed methods with statistical significance and that the primary improvement is the utilization of intraday data.
摘要本研究提出了一种在市场一致的衍生品定价中使用ois隐含即期利率来估计利差的方法。我们的方法从两个方面推广了以往文献中提出的普通最小二乘法。首先,它利用日内数据而不是单一时间的数据。其次,将其表述为加权最小二乘来抵消异方差。此外,我们提出了一种通用的方法来量化方法之间的性能差异,当真实值是未知的。我们发现我们的方法优于以前提出的方法,具有统计显著性,主要改进是对日内数据的利用。
{"title":"Weighted least squares estimation of the risk-free rate from derivative prices","authors":"Jörgen Blomvall, Pontus Söderbäck, M. Singull","doi":"10.1080/23737484.2023.2186283","DOIUrl":"https://doi.org/10.1080/23737484.2023.2186283","url":null,"abstract":"Abstract This study proposes a method for estimating the interest spread over an OIS-implied spot rate used in market-consistent derivative pricing. Our method generalizes previous proposed ordinary least squares methods in the literature in two ways. First, it utilizes intraday data rather than data from a single time. Second, it is formulated as weighted least squares to counteract heteroscedasticity. Additionally, we present a general methodology to quantify the performance difference between methods when the true value is unknown. We find that our method outperforms previously proposed methods with statistical significance and that the primary improvement is the utilization of intraday data.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"79 1","pages":"72 - 105"},"PeriodicalIF":0.0,"publicationDate":"2023-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76642575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Nonlinearity and forecast performance of realized volatility 已实现波动率的非线性及预测性能
Q4 Mathematics Pub Date : 2023-01-02 DOI: 10.1080/23737484.2023.2175277
Daiki Maki
Abstract This study examines whether accounting for the nonlinearity of realized volatility leads to better forecast performance. We propose a new realized volatility forecasting model that considers nonlinearities without the assumption of a particular nonlinear model. The proposed model uses the Taylor series approximation method to account for nonlinearities. We applied it to the realized volatility of representative stock indices from the U.S., Japan, the U.K., and China and observed their in-sample nonlinearities. Additionally, we evaluate out-of-sample forecast performance. The empirical results show that realized volatility has nonlinearity, and the proposed models exhibit better forecast performance than standard models.
摘要本研究探讨考虑已实现波动率的非线性是否会导致更好的预测绩效。本文提出了一种新的考虑非线性因素的可实现波动率预测模型,而不需要假设特定的非线性模型。提出的模型使用泰勒级数近似方法来解释非线性。我们将其应用于美国、日本、英国和中国的代表性股票指数的已实现波动率,并观察了它们的样本内非线性。此外,我们评估样本外预测性能。实证结果表明,实际波动率具有非线性,所提模型的预测效果优于标准模型。
{"title":"Nonlinearity and forecast performance of realized volatility","authors":"Daiki Maki","doi":"10.1080/23737484.2023.2175277","DOIUrl":"https://doi.org/10.1080/23737484.2023.2175277","url":null,"abstract":"Abstract This study examines whether accounting for the nonlinearity of realized volatility leads to better forecast performance. We propose a new realized volatility forecasting model that considers nonlinearities without the assumption of a particular nonlinear model. The proposed model uses the Taylor series approximation method to account for nonlinearities. We applied it to the realized volatility of representative stock indices from the U.S., Japan, the U.K., and China and observed their in-sample nonlinearities. Additionally, we evaluate out-of-sample forecast performance. The empirical results show that realized volatility has nonlinearity, and the proposed models exhibit better forecast performance than standard models.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"252 1","pages":"51 - 71"},"PeriodicalIF":0.0,"publicationDate":"2023-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75826594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel application of finite Gaussian mixture model (GMM) using real and simulated biomarkers of cardiovascular disease to distinguish adolescents with and without obesity 有限高斯混合模型(GMM)的新应用,使用真实和模拟的心血管疾病生物标志物来区分青少年是否肥胖
Q4 Mathematics Pub Date : 2023-01-02 DOI: 10.1080/23737484.2023.2194249
M. J. Hossain, P. Balagopal
Abstract Obesity-induced derangements in adipose tissue and other organs lead to the development of cardiovascular disease (CVD). The loss of CV-health in children is a continuum, and the manifestation of overt CVD takes several years. Therefore, robust biomarkers are crucial for its early prediction, prevention, and management. Biomarkers of CVD are highly mutually correlated, and typical regression approaches do not precisely appraise the obesity-induced summative alterations of these overlapping variables. This study examines if the confluence of biomarkers of CVD can distinguish adolescents with obesity from their normal-weight counterparts to illustrate obesity as a strong risk factor of CVD. The biomarkers were measured in a well-controlled study in 21 adolescents. Application of the Gaussian mixture model to these biomarkers identified two distinct groups that matched with the obesity status of participants, which was further confirmed using supervised learning methods. Classification of biomarkers from a simulation study of 1,000 data points, each comprising a vector of five biomarkers and the classification identifier, resulted in two groups that matched with the classification in the simulated dataset. The precise identification of obesity by the pattern of concurring CVD biomarkers in real and simulated datasets confirms obesity as a strong risk factor of CVD.
肥胖引起的脂肪组织和其他器官紊乱可导致心血管疾病(CVD)的发展。儿童心血管健康的丧失是一个连续的过程,明显的心血管疾病的表现需要几年的时间。因此,强大的生物标志物对其早期预测、预防和管理至关重要。CVD的生物标志物是高度相互关联的,典型的回归方法并不能精确地评估肥胖引起的这些重叠变量的总结性改变。本研究考察了CVD的生物标志物是否可以将肥胖青少年与正常体重的青少年区分开来,以说明肥胖是CVD的一个重要危险因素。在一项控制良好的研究中,对21名青少年进行了生物标志物测量。将高斯混合模型应用于这些生物标志物,确定了与参与者的肥胖状况相匹配的两个不同组,并使用监督学习方法进一步证实了这一点。从1000个数据点的模拟研究中对生物标志物进行分类,每个数据点由5个生物标志物和分类标识符组成的向量组成,结果产生了与模拟数据集中的分类相匹配的两组。通过在真实和模拟数据集中同时出现CVD生物标志物的模式来精确识别肥胖,证实了肥胖是CVD的一个强大危险因素。
{"title":"A novel application of finite Gaussian mixture model (GMM) using real and simulated biomarkers of cardiovascular disease to distinguish adolescents with and without obesity","authors":"M. J. Hossain, P. Balagopal","doi":"10.1080/23737484.2023.2194249","DOIUrl":"https://doi.org/10.1080/23737484.2023.2194249","url":null,"abstract":"Abstract Obesity-induced derangements in adipose tissue and other organs lead to the development of cardiovascular disease (CVD). The loss of CV-health in children is a continuum, and the manifestation of overt CVD takes several years. Therefore, robust biomarkers are crucial for its early prediction, prevention, and management. Biomarkers of CVD are highly mutually correlated, and typical regression approaches do not precisely appraise the obesity-induced summative alterations of these overlapping variables. This study examines if the confluence of biomarkers of CVD can distinguish adolescents with obesity from their normal-weight counterparts to illustrate obesity as a strong risk factor of CVD. The biomarkers were measured in a well-controlled study in 21 adolescents. Application of the Gaussian mixture model to these biomarkers identified two distinct groups that matched with the obesity status of participants, which was further confirmed using supervised learning methods. Classification of biomarkers from a simulation study of 1,000 data points, each comprising a vector of five biomarkers and the classification identifier, resulted in two groups that matched with the classification in the simulated dataset. The precise identification of obesity by the pattern of concurring CVD biomarkers in real and simulated datasets confirms obesity as a strong risk factor of CVD.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"27 1","pages":"106 - 120"},"PeriodicalIF":0.0,"publicationDate":"2023-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80987134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using a Bayesian model of the joint distribution of pain and time on medication to decide on pain medication for neuropathy. 使用疼痛和服药时间联合分布的贝叶斯模型来决定神经病变的止痛药物。
Q4 Mathematics Pub Date : 2023-01-01 Epub Date: 2023-05-19 DOI: 10.1080/23737484.2023.2212262
Guangyi Gao, Jo A Wick, Alexandra R Brown, Richard J Barohn, Byron J Gajewski

The PAIN-CONTRoLS trial compared four medications in treating Cryptogenic sensory polyneuropathy. The primary outcome was a utility function that combined two outcomes, patients' pain score reduction and patients' quit rate. However, additional analysis of the individual outcomes could also be leveraged to inform selecting an optimal medication for future patients. We demonstrate how joint modeling of longitudinal and time-to-event data from PAIN-CONTRoLS can be used to predict the effects of medication in a patient-specific manner and helps to make patient-focused decisions. A joint model was used to evaluate the two outcomes while accounting for the association between the longitudinal process and the time-to-event processes. Results suggested no significant association between the patients' pain scores and time to the medication quit in the PAIN-CONTRoLS study, but the joint model still provided robust estimates and a better model fit. Using the model estimates, given patients' baseline characteristics, a drug profile on both the pain reduction and medication time could be obtained for each drug, providing information on how likely they would quit and how much pain reduction they should expect. Our analysis suggested that drugs viable for one patient may not be beneficial for others.

PAIN-CONTRoLS 试验比较了四种治疗隐源性感觉性多发性神经病的药物。主要结果是一个效用函数,它综合了两种结果,即患者疼痛评分的降低和患者的戒烟率。然而,对单个结果的额外分析也可以为未来患者选择最佳药物提供依据。我们展示了如何利用 PAIN-CONTRoLS 的纵向数据和时间到事件数据联合建模,以特定患者的方式预测药物治疗效果,并帮助做出以患者为中心的决策。我们采用了一个联合模型来评估这两个结果,同时考虑纵向过程和时间到事件过程之间的关联。结果表明,在 PAIN-CONTRoLS 研究中,患者的疼痛评分与戒药时间之间并无明显关联,但联合模型仍能提供可靠的估计值,且模型拟合度更高。利用模型估算值,考虑到患者的基线特征,可以得到每种药物在减轻疼痛和用药时间方面的药物概况,从而为患者戒药的可能性和预期减轻疼痛的程度提供信息。我们的分析表明,对某位患者可行的药物可能对其他患者无益。
{"title":"Using a Bayesian model of the joint distribution of pain and time on medication to decide on pain medication for neuropathy.","authors":"Guangyi Gao, Jo A Wick, Alexandra R Brown, Richard J Barohn, Byron J Gajewski","doi":"10.1080/23737484.2023.2212262","DOIUrl":"10.1080/23737484.2023.2212262","url":null,"abstract":"<p><p>The PAIN-CONTRoLS trial compared four medications in treating Cryptogenic sensory polyneuropathy. The primary outcome was a utility function that combined two outcomes, patients' pain score reduction and patients' quit rate. However, additional analysis of the individual outcomes could also be leveraged to inform selecting an optimal medication for future patients. We demonstrate how joint modeling of longitudinal and time-to-event data from PAIN-CONTRoLS can be used to predict the effects of medication in a patient-specific manner and helps to make patient-focused decisions. A joint model was used to evaluate the two outcomes while accounting for the association between the longitudinal process and the time-to-event processes. Results suggested no significant association between the patients' pain scores and time to the medication quit in the PAIN-CONTRoLS study, but the joint model still provided robust estimates and a better model fit. Using the model estimates, given patients' baseline characteristics, a drug profile on both the pain reduction and medication time could be obtained for each drug, providing information on how likely they would quit and how much pain reduction they should expect. Our analysis suggested that drugs viable for one patient may not be beneficial for others.</p>","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"9 3","pages":"252-269"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491414/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10241375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Communications in Statistics Case Studies Data Analysis and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1