首页 > 最新文献

Statistics in Transition最新文献

英文 中文
Scaled Fisher consistency for the partial likelihood estimation in various extensions of the Cox model Cox模型各种扩展中偏似然估计的标度Fisher一致性
Q4 Mathematics Pub Date : 2022-06-01 DOI: 10.2478/stattrans-2022-0023
T. Bednarski, Piotr B. Nowak, Magdalena Skolimowska-Kulig
Abstract The Cox proportional hazards model has become the most widely used procedure in survival analysis. The theoretical basis of the original model has been developed in various extensions. In the recent years, vital research has been undertaken involving the incorporation of random effects to survival models. In this setting, the random effect is a variable (frailty) which embraces a variation among individuals or groups of individuals which cannot be explained by observable covariates. The right choice of the frailty distribution is essential for an accurate description of the dependence structure present in the data. In this paper, we aim to investigate the accuracy of inference based on the primer Cox model in the existence of unobserved heterogeneity, that is, when the data generating mechanism is more complex than presumed and described by the kind of an extension of the Cox model with undefined frailty. We show that the conventional partial likelihood estimator under the considered extension is Fisher-consistent up to a scaling factor, provided symmetry-type distributional assumptions on covariates. We also present the results of simulation experiments that reveal an exemplary behaviour of the estimators.
Cox比例风险模型已成为生存分析中应用最广泛的方法。原始模型的理论基础在各种扩展中得到了发展。近年来,人们进行了一些重要的研究,包括将随机效应纳入生存模型。在这种情况下,随机效应是一种变量(脆弱性),它包含个体或个体群体之间的变化,这种变化不能用可观察到的协变量来解释。正确选择脆弱分布对于准确描述数据中存在的依赖结构至关重要。在本文中,我们的目的是研究在未观察到异质性存在的情况下,即当数据生成机制比假设的更复杂,并且由具有未定义脆弱性的Cox模型的一种扩展描述时,基于引物Cox模型的推理的准确性。在对协变量的对称型分布假设下,我们证明了在考虑的扩展下,传统的部分似然估计在一个比例因子内是费雪一致的。我们还提出了模拟实验的结果,揭示了估计器的示范行为。
{"title":"Scaled Fisher consistency for the partial likelihood estimation in various extensions of the Cox model","authors":"T. Bednarski, Piotr B. Nowak, Magdalena Skolimowska-Kulig","doi":"10.2478/stattrans-2022-0023","DOIUrl":"https://doi.org/10.2478/stattrans-2022-0023","url":null,"abstract":"Abstract The Cox proportional hazards model has become the most widely used procedure in survival analysis. The theoretical basis of the original model has been developed in various extensions. In the recent years, vital research has been undertaken involving the incorporation of random effects to survival models. In this setting, the random effect is a variable (frailty) which embraces a variation among individuals or groups of individuals which cannot be explained by observable covariates. The right choice of the frailty distribution is essential for an accurate description of the dependence structure present in the data. In this paper, we aim to investigate the accuracy of inference based on the primer Cox model in the existence of unobserved heterogeneity, that is, when the data generating mechanism is more complex than presumed and described by the kind of an extension of the Cox model with undefined frailty. We show that the conventional partial likelihood estimator under the considered extension is Fisher-consistent up to a scaling factor, provided symmetry-type distributional assumptions on covariates. We also present the results of simulation experiments that reveal an exemplary behaviour of the estimators.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"185 - 196"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46189480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advances on Permutation Multivariate Analysis of Variance for big data 大数据变异多元方差分析研究进展
Q4 Mathematics Pub Date : 2022-06-01 DOI: 10.2478/stattrans-2022-0022
S. Bonnini, Getnet Melak Assegie
Abstract In many applications of the multivariate analyses of variance, the classic parametric solutions for testing hypotheses of equality in population means or multisample and multivariate location problems might not be suitable for various reasons. Multivariate multisample location problems lack a comparative study of the power behaviour of the most important combined permutation tests as the number of variables diverges. In particular, it is useful to know under which conditions each of the different tests is preferable in terms of power, how the power of each test increases when the number of variables under the alternative hypothesis diverges, and the power behaviour of each test as the function of the proportion of true alternative hypotheses. The purpose of this paper is to fill the gap in the literature about combined permutation tests, in particular for big data with a large number of variables. A Monte Carlo simulation study was carried out to investigate the power behaviour of the tests, and the application to a real case study was performed to show the utility of the method.
摘要在多变量方差分析的许多应用中,用于检验总体均值相等假设或多样本和多变量位置问题的经典参数解可能由于各种原因而不适用。多变量多样本定位问题缺乏对最重要的组合排列检验的功率行为的比较研究,因为变量的数量不同。特别是,了解在哪些条件下,不同的测试在功率方面更可取,当替代假设下的变量数量出现偏差时,每个测试的功率如何增加,以及每个测试的性能作为真实替代假设比例的函数,都是有用的。本文的目的是填补文献中关于组合排列测试的空白,特别是对于具有大量变量的大数据。进行了蒙特卡罗模拟研究,以研究测试的功率行为,并将其应用于实际案例研究,以表明该方法的实用性。
{"title":"Advances on Permutation Multivariate Analysis of Variance for big data","authors":"S. Bonnini, Getnet Melak Assegie","doi":"10.2478/stattrans-2022-0022","DOIUrl":"https://doi.org/10.2478/stattrans-2022-0022","url":null,"abstract":"Abstract In many applications of the multivariate analyses of variance, the classic parametric solutions for testing hypotheses of equality in population means or multisample and multivariate location problems might not be suitable for various reasons. Multivariate multisample location problems lack a comparative study of the power behaviour of the most important combined permutation tests as the number of variables diverges. In particular, it is useful to know under which conditions each of the different tests is preferable in terms of power, how the power of each test increases when the number of variables under the alternative hypothesis diverges, and the power behaviour of each test as the function of the proportion of true alternative hypotheses. The purpose of this paper is to fill the gap in the literature about combined permutation tests, in particular for big data with a large number of variables. A Monte Carlo simulation study was carried out to investigate the power behaviour of the tests, and the application to a real case study was performed to show the utility of the method.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"163 - 183"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48600894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
On the quick estimation of probability of recovery from COVID-19 during first wave of epidemic in India: a logistic regression approach 关于印度第一波疫情期间COVID-19恢复概率的快速估计:逻辑回归方法
Q4 Mathematics Pub Date : 2022-06-01 DOI: 10.2478/stattrans-2022-0024
Hemlata Joshi, S. Azarudheen, M. Nagaraja, Singh Chandraketu
Abstract The COVID-19 pandemic has recently become a threat all across the globe with the rising cases every day and many countries experiencing its outbreak. According to the WHO, the virus is capable of spreading at an exponential rate across countries, and India is now one of the worst-affected country in the world. Researchers all around the world are racing to come up with a cure or treatment for COVID-19, and this is creating extreme pressure on the policy makers and epidemiologists. However, in India the recovery rate has been far better than in other countries, and is steadily improving. Still in such a difficult situation with no effective medicine, it is essential to know if a patient with the COVID-19 is going to recover or die. To meet this end, a model has been developed in this article to estimate the probability of a recovery of a patient based on the demographic characteristics. The study used data published by the Ministry of Health and Family Welfare of India for the empirical analysis.
最近,COVID-19大流行已成为全球范围内的威胁,病例每天都在上升,许多国家都出现了疫情。据世界卫生组织称,这种病毒能够以指数速度在各国传播,印度现在是世界上受影响最严重的国家之一。世界各地的研究人员都在竞相提出COVID-19的治愈或治疗方法,这给政策制定者和流行病学家带来了极大的压力。然而,印度的回收率远远好于其他国家,并且正在稳步提高。在没有有效药物的情况下,仍然处于这种困难的情况下,了解COVID-19患者是会康复还是会死亡至关重要。为了达到这一目的,本文开发了一个模型来估计基于人口统计学特征的患者康复的概率。本研究使用印度卫生和家庭福利部公布的数据进行实证分析。
{"title":"On the quick estimation of probability of recovery from COVID-19 during first wave of epidemic in India: a logistic regression approach","authors":"Hemlata Joshi, S. Azarudheen, M. Nagaraja, Singh Chandraketu","doi":"10.2478/stattrans-2022-0024","DOIUrl":"https://doi.org/10.2478/stattrans-2022-0024","url":null,"abstract":"Abstract The COVID-19 pandemic has recently become a threat all across the globe with the rising cases every day and many countries experiencing its outbreak. According to the WHO, the virus is capable of spreading at an exponential rate across countries, and India is now one of the worst-affected country in the world. Researchers all around the world are racing to come up with a cure or treatment for COVID-19, and this is creating extreme pressure on the policy makers and epidemiologists. However, in India the recovery rate has been far better than in other countries, and is steadily improving. Still in such a difficult situation with no effective medicine, it is essential to know if a patient with the COVID-19 is going to recover or die. To meet this end, a model has been developed in this article to estimate the probability of a recovery of a patient based on the demographic characteristics. The study used data published by the Ministry of Health and Family Welfare of India for the empirical analysis.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"197 - 208"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44463368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Bayesian estimation of the Gini index and the Bonferroni index for the Dagum distribution with the application of different priors 不同先验条件下Dagum分布的Gini指数和Bonferroni指数的贝叶斯估计
Q4 Mathematics Pub Date : 2022-06-01 DOI: 10.2478/stattrans-2022-0016
Sangeeta Arora, K. Mahajan, Vikas Jangra
Abstract Bayesian estimators and highest posterior density credible intervals are obtained for two popular inequality measures, viz. the Gini index and the Bonferroni index in the case of the Dagum distribution. The study considers informative and non-informative priors, i.e. the Mukherjee-Islam prior and the extension of Jeffrey’s prior, respectively, under the presumption of the Linear Exponential (LINEX) loss function. A Monte Carlo simulation study is carried out in order to obtain the relative efficiency of both the Gini and Bonferroni indices while taking into consideration different priors and loss functions. The estimated loss proves lower when using the Mukherjee-Islam prior in comparison to the extension of Jeffrey’s prior and the LINEX loss function outperforms the squared error loss function (SELF) in terms of the estimated loss. Highest posterior density credible intervals are also obtained for both these measures. The study used real-life data sets for illustration purposes.
摘要对Dagum分布下的基尼指数和Bonferroni指数这两种常用的不等式测度,得到了贝叶斯估计量和最高后验密度可信区间。在线性指数损失函数的假设下,研究考虑了信息先验和非信息先验,即Mukherjee-Islam先验和Jeffrey先验的扩展。为了在考虑不同先验函数和损失函数的情况下获得基尼指数和邦费罗尼指数的相对效率,进行了蒙特卡罗模拟研究。与杰弗里先验的扩展相比,使用Mukherjee-Islam先验的估计损失更低,而LINEX损失函数在估计损失方面优于平方误差损失函数(SELF)。这两种测量方法也获得了最高的后验密度可信区间。为了说明问题,该研究使用了真实的数据集。
{"title":"A Bayesian estimation of the Gini index and the Bonferroni index for the Dagum distribution with the application of different priors","authors":"Sangeeta Arora, K. Mahajan, Vikas Jangra","doi":"10.2478/stattrans-2022-0016","DOIUrl":"https://doi.org/10.2478/stattrans-2022-0016","url":null,"abstract":"Abstract Bayesian estimators and highest posterior density credible intervals are obtained for two popular inequality measures, viz. the Gini index and the Bonferroni index in the case of the Dagum distribution. The study considers informative and non-informative priors, i.e. the Mukherjee-Islam prior and the extension of Jeffrey’s prior, respectively, under the presumption of the Linear Exponential (LINEX) loss function. A Monte Carlo simulation study is carried out in order to obtain the relative efficiency of both the Gini and Bonferroni indices while taking into consideration different priors and loss functions. The estimated loss proves lower when using the Mukherjee-Islam prior in comparison to the extension of Jeffrey’s prior and the LINEX loss function outperforms the squared error loss function (SELF) in terms of the estimated loss. Highest posterior density credible intervals are also obtained for both these measures. The study used real-life data sets for illustration purposes.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"49 - 68"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41581835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The length-biased power hazard rate distribution: Some properties and applications 长度偏置功率危险率分布:一些特性和应用
Q4 Mathematics Pub Date : 2022-06-01 DOI: 10.2478/stattrans-2022-0013
A. Mustafa, M. I. Khan
Abstract In this article, the length-biased power hazard rate distribution has introduced and investigated several statistical properties. This distribution reports an extension of several probability distributions, namely: exponential, Rayleigh, Weibull, and linear hazard rate. The procedure of maximum likelihood estimation is taken for parameters. Finally, the applicability of the model is explored by three real data sets. To examine, the performance of the technique, a simulation study is extracted.
摘要在本文中,引入并研究了长度偏差功率危害率分布的几个统计特性。该分布报告了几种概率分布的扩展,即:指数、瑞利、威布尔和线性危险率。参数采用最大似然估计的方法。最后,通过三个真实数据集对模型的适用性进行了探讨。为了检验该技术的性能,提取了一个模拟研究。
{"title":"The length-biased power hazard rate distribution: Some properties and applications","authors":"A. Mustafa, M. I. Khan","doi":"10.2478/stattrans-2022-0013","DOIUrl":"https://doi.org/10.2478/stattrans-2022-0013","url":null,"abstract":"Abstract In this article, the length-biased power hazard rate distribution has introduced and investigated several statistical properties. This distribution reports an extension of several probability distributions, namely: exponential, Rayleigh, Weibull, and linear hazard rate. The procedure of maximum likelihood estimation is taken for parameters. Finally, the applicability of the model is explored by three real data sets. To examine, the performance of the technique, a simulation study is extracted.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"1 - 16"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43633596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Institutional equilibrium in EU economies in 2008 and 2018: SEM-PLS models 2008年和2018年欧盟经济体的制度均衡:SEM-PLS模型
Q4 Mathematics Pub Date : 2022-06-01 DOI: 10.2478/stattrans-2022-0019
Mateusz Borkowski
Abstract The aim of the research is to identify the strength and direction of the development of the relationship between formal and informal institutions and to assess the institutional equilibrium of modern economies. The structural equations modelling based on partial least squares (SEM-PLS) is applied to achieve the purpose of the article. It is an econometric method that allows the measurement and analysis of the dependencies between latent variables (measures that cannot be directly observed). The study included 27 EU economies and the research period covered the years 2008 and 2018. The results of the study demonstrate that the quality of informal institutions strongly, positively determines the quality of formal institutions. The conducted analyses indicate that modern economies are diversified in terms of the quality of informal and formal institutions and, consequently, in institutional equilibrium. Considerable institutional disparities also translate into a large diversification in economic development. The article proposes a different meaning of institutional equilibrium, understood as the achieved state of institutional structure characterised by high quality informal institutions which interact with each other to improve the efficiency of formal institutions. The article presents a comprehensive model of the institutional structure and a unique method of measuring institutional equilibrium.
摘要本研究的目的是确定正式和非正式制度之间关系的强度和发展方向,并评估现代经济的制度平衡。为了达到本文的目的,采用了基于偏最小二乘的结构方程建模方法。这是一种计量经济学方法,允许测量和分析潜在变量(无法直接观察到的指标)之间的相关性。该研究涵盖了27个欧盟经济体,研究期为2008年和2018年。研究结果表明,非正式制度的质量强烈、积极地决定着正式制度的质量。所进行的分析表明,现代经济在非正规和正规制度的质量方面是多样化的,因此处于制度平衡。巨大的体制差异也转化为经济发展的巨大多样化。本文提出了制度均衡的不同含义,即以高质量的非正式制度为特征的制度结构的实现状态,这些非正式制度相互作用以提高正式制度的效率。本文提出了一个全面的制度结构模型和一种独特的衡量制度均衡的方法。
{"title":"Institutional equilibrium in EU economies in 2008 and 2018: SEM-PLS models","authors":"Mateusz Borkowski","doi":"10.2478/stattrans-2022-0019","DOIUrl":"https://doi.org/10.2478/stattrans-2022-0019","url":null,"abstract":"Abstract The aim of the research is to identify the strength and direction of the development of the relationship between formal and informal institutions and to assess the institutional equilibrium of modern economies. The structural equations modelling based on partial least squares (SEM-PLS) is applied to achieve the purpose of the article. It is an econometric method that allows the measurement and analysis of the dependencies between latent variables (measures that cannot be directly observed). The study included 27 EU economies and the research period covered the years 2008 and 2018. The results of the study demonstrate that the quality of informal institutions strongly, positively determines the quality of formal institutions. The conducted analyses indicate that modern economies are diversified in terms of the quality of informal and formal institutions and, consequently, in institutional equilibrium. Considerable institutional disparities also translate into a large diversification in economic development. The article proposes a different meaning of institutional equilibrium, understood as the achieved state of institutional structure characterised by high quality informal institutions which interact with each other to improve the efficiency of formal institutions. The article presents a comprehensive model of the institutional structure and a unique method of measuring institutional equilibrium.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"107 - 127"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41885672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Socio-economic development and quality of life of NUTS-2 units in the European Union 欧洲联盟NUTS-2单位的社会经济发展和生活质量
Q4 Mathematics Pub Date : 2022-06-01 DOI: 10.2478/stattrans-2022-0015
Maciej Jewczak, Magdalena Brudz
Abstract Analyses regarding socio-economic development and quality of life are an important aspect of research and discussion for many international organisations, states and local authorities. Due to the complexity and multidimensionality of these issues, conducting research can be problematic. The conclusions of various analytical centres indicate that there are many paths towards establishing a set of factors which affect quality of life and ways of assessing socio-economic development levels. Depending on the criteria considered, the most common methods for determining the degree of the advancement of life quality or socio-economic development include taxonomical techniques and analyses of potential, which are based mainly on objective data sourced from official registers. The main purpose of the paper is to investigate the level of socio-economic development and quality of life in the European Union in the years 2004 and 2018. The analyses were conducted for a rarely used level of spatial data aggregation, i.e. for NUTS-2 units. The analysis covers only those European regions that were EU members in 2004. As the primary research tool, the two-dimensional development matrix was adopted, which enabled the verification of the hypothesis regarding the convergence of synthetic measures that indicate the levels of socio-economic development and quality of life in the EU regions. For these indices, the development matrix is also used to identify the strengths and weaknesses as well as the opportunities and threats for selected spatial units, and, at the same time, to estimate the rates of change of the socio-economic development and quality of life levels.
关于社会经济发展和生活质量的分析是许多国际组织、国家和地方当局研究和讨论的一个重要方面。由于这些问题的复杂性和多维性,进行研究可能会有问题。各分析中心的结论表明,有许多途径可以确定一套影响生活质量的因素和评估社会经济发展水平的方法。根据所考虑的标准,确定生活质量或社会经济发展进步程度的最常用方法包括分类学技术和潜力分析,这些方法主要基于来自官方登记册的客观数据。本文的主要目的是调查欧盟在2004年和2018年的社会经济发展水平和生活质量。这些分析是针对很少使用的空间数据汇总水平,即nut -2单位进行的。该分析仅涵盖2004年欧盟成员国的欧洲地区。作为主要的研究工具,采用了二维发展矩阵,这使得能够验证关于表明欧盟地区社会经济发展水平和生活质量的综合措施趋同的假设。对于这些指数,还使用发展矩阵来确定选定空间单元的优势和劣势以及机会和威胁,同时估计社会经济发展和生活质量水平的变化率。
{"title":"Socio-economic development and quality of life of NUTS-2 units in the European Union","authors":"Maciej Jewczak, Magdalena Brudz","doi":"10.2478/stattrans-2022-0015","DOIUrl":"https://doi.org/10.2478/stattrans-2022-0015","url":null,"abstract":"Abstract Analyses regarding socio-economic development and quality of life are an important aspect of research and discussion for many international organisations, states and local authorities. Due to the complexity and multidimensionality of these issues, conducting research can be problematic. The conclusions of various analytical centres indicate that there are many paths towards establishing a set of factors which affect quality of life and ways of assessing socio-economic development levels. Depending on the criteria considered, the most common methods for determining the degree of the advancement of life quality or socio-economic development include taxonomical techniques and analyses of potential, which are based mainly on objective data sourced from official registers. The main purpose of the paper is to investigate the level of socio-economic development and quality of life in the European Union in the years 2004 and 2018. The analyses were conducted for a rarely used level of spatial data aggregation, i.e. for NUTS-2 units. The analysis covers only those European regions that were EU members in 2004. As the primary research tool, the two-dimensional development matrix was adopted, which enabled the verification of the hypothesis regarding the convergence of synthetic measures that indicate the levels of socio-economic development and quality of life in the EU regions. For these indices, the development matrix is also used to identify the strengths and weaknesses as well as the opportunities and threats for selected spatial units, and, at the same time, to estimate the rates of change of the socio-economic development and quality of life levels.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"33 - 48"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41984122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the nonparametric estimation of the conditional hazard estimator in a single functional index 单函数指标下条件危险估计量的非参数估计
Q4 Mathematics Pub Date : 2022-06-01 DOI: 10.2478/stattrans-2022-0018
Abdelmalek Gagui, Abdelhak Chouaf
Abstract This paper deals with the conditional hazard estimator of a real response where the variable is given a functional random variable (i.e it takes values in an infinite-dimensional space). Specifically, we focus on the functional index model. This approach offers a good compromise between nonparametric and parametric models. The principle aim is to prove the asymptotic normality of the proposed estimator under general conditions and in cases where the variables satisfy the strong mixing dependency. This was achieved by means of the kernel estimator method, based on a single-index structure. Finally, a simulation of our methodology shows that it is efficient for large sample sizes.
摘要本文研究了给定一个泛函随机变量(即在无限维空间中取值)的实响应的条件危险估计量。具体来说,我们关注的是函数索引模型。这种方法在非参数模型和参数模型之间提供了一个很好的折衷。主要目的是证明在一般条件下和在变量满足强混合依赖的情况下所提出的估计量的渐近正态性。这是通过基于单索引结构的核估计器方法实现的。最后,我们的方法的模拟表明,它是有效的大样本量。
{"title":"On the nonparametric estimation of the conditional hazard estimator in a single functional index","authors":"Abdelmalek Gagui, Abdelhak Chouaf","doi":"10.2478/stattrans-2022-0018","DOIUrl":"https://doi.org/10.2478/stattrans-2022-0018","url":null,"abstract":"Abstract This paper deals with the conditional hazard estimator of a real response where the variable is given a functional random variable (i.e it takes values in an infinite-dimensional space). Specifically, we focus on the functional index model. This approach offers a good compromise between nonparametric and parametric models. The principle aim is to prove the asymptotic normality of the proposed estimator under general conditions and in cases where the variables satisfy the strong mixing dependency. This was achieved by means of the kernel estimator method, based on a single-index structure. Finally, a simulation of our methodology shows that it is efficient for large sample sizes.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"89 - 105"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48262383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extracting relevant predictors of the severity of mental illnesses from clinical information using regularisation regression models 利用正则化回归模型从临床信息中提取精神疾病严重程度的相关预测因子
Q4 Mathematics Pub Date : 2022-06-01 DOI: 10.2478/stattrans-2022-0020
Sakshi Kaushik, A. Sabharwal, G. Grover
Abstract Mental disorders are common non-communicable diseases whose occurrence rises at epidemic rates globally. The determination of the severity of a mental illness has important clinical implications and it serves as a prognostic factor for effective intervention planning and management. This paper aims to identify the relevant predictors of the severity of mental illnesses (measured by psychiatric rating scales) from a wide range of clinical variables consisting of information on both laboratory test results and psychiatric factors. The laboratory test results collectively indicate the measurements of 23 components derived from vital signs and blood tests results for the evaluation of the complete blood count. The 8 psychiatric factors known to affect the severity of mental illnesses are considered, viz. the family history, course and onset of an illness, etc. Retrospective data of 78 patients diagnosed with mental and behavioural disorders were collected from the Lady Hardinge Medical College & Smt. S.K, Hospital in New Delhi, India. The observations missing in the data are imputed using the non-parametric random forest algorithm. The multicollinearity is detected based on the variance inflation factor. Owing to the presence of multicollinearity, regularisation techniques such as ridge regression and extensions of the least absolute shrinkage and selection operator (LASSO), viz. adaptive and group LASSO are used for fitting the regression model. Optimal tuning parameter λ is obtained through 13-fold cross-validation. It was observed that the coefficients of the quantitative predictors extracted by the adaptive LASSO and the group of predictors extracted by the group LASSO were comparable to the coefficients obtained through ridge regression.
摘要精神障碍是一种常见的非传染性疾病,其发病率在全球范围内呈上升趋势。精神疾病严重程度的确定具有重要的临床意义,它是有效干预计划和管理的预后因素。本文旨在从一系列临床变量中确定精神疾病严重程度的相关预测因素(通过精神评定量表测量),这些变量包括实验室测试结果和精神因素的信息。实验室测试结果共同指示了来自生命体征的23个成分的测量值和用于评估全血细胞计数的血液测试结果。考虑了已知影响精神疾病严重程度的8个精神因素,即家族史、病程和发病等。从哈丁夫人医学院和史密斯医学院收集了78名被诊断为精神和行为障碍的患者的回顾性数据。S.K,印度新德里医院。使用非参数随机森林算法估算数据中缺失的观测值。基于方差膨胀因子来检测多重共线性。由于多重共线性的存在,使用正则化技术,如岭回归和最小绝对收缩和选择算子(LASSO)的扩展,即自适应和群LASSO来拟合回归模型。通过13次交叉验证得到了最佳调谐参数λ。观察到由自适应LASSO提取的定量预测因子的系数和由LASSO组提取的预测因子组的系数与通过岭回归获得的系数相当。
{"title":"Extracting relevant predictors of the severity of mental illnesses from clinical information using regularisation regression models","authors":"Sakshi Kaushik, A. Sabharwal, G. Grover","doi":"10.2478/stattrans-2022-0020","DOIUrl":"https://doi.org/10.2478/stattrans-2022-0020","url":null,"abstract":"Abstract Mental disorders are common non-communicable diseases whose occurrence rises at epidemic rates globally. The determination of the severity of a mental illness has important clinical implications and it serves as a prognostic factor for effective intervention planning and management. This paper aims to identify the relevant predictors of the severity of mental illnesses (measured by psychiatric rating scales) from a wide range of clinical variables consisting of information on both laboratory test results and psychiatric factors. The laboratory test results collectively indicate the measurements of 23 components derived from vital signs and blood tests results for the evaluation of the complete blood count. The 8 psychiatric factors known to affect the severity of mental illnesses are considered, viz. the family history, course and onset of an illness, etc. Retrospective data of 78 patients diagnosed with mental and behavioural disorders were collected from the Lady Hardinge Medical College & Smt. S.K, Hospital in New Delhi, India. The observations missing in the data are imputed using the non-parametric random forest algorithm. The multicollinearity is detected based on the variance inflation factor. Owing to the presence of multicollinearity, regularisation techniques such as ridge regression and extensions of the least absolute shrinkage and selection operator (LASSO), viz. adaptive and group LASSO are used for fitting the regression model. Optimal tuning parameter λ is obtained through 13-fold cross-validation. It was observed that the coefficients of the quantitative predictors extracted by the adaptive LASSO and the group of predictors extracted by the group LASSO were comparable to the coefficients obtained through ridge regression.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"129 - 152"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42749561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Jackknife winsorized variance estimator under imputed data 估算数据下的Jackknife winsorized方差估计
Q4 Mathematics Pub Date : 2022-06-01 DOI: 10.2478/stattrans-2022-0014
Fariha Sohil, M. U. Sohail, J. Shabbir, Sat Gupta
Abstract In the present study, we consider the problem of missing and extreme values for the estimation of population variance. The presence of extreme values either in the study variable, or the auxiliary variable, or in both of them, can adversely affect the performance of the estimation procedure. We consider three different situations for the presence of extreme values and also consider jackknife variance estimators for the population variance by handling these extreme values under stratified random sampling. Bootstrap technique ABB is carried out to understand the relative relationship more precisely.
摘要在本研究中,我们考虑了群体方差估计的缺失值和极值问题。研究变量、辅助变量或二者中存在极值,可能会对估计程序的性能产生不利影响。我们考虑了极值存在的三种不同情况,并通过在分层随机抽样下处理这些极值,考虑了总体方差的jackknife方差估计量。ABB的Bootstrap技术是为了更准确地理解相对关系。
{"title":"Jackknife winsorized variance estimator under imputed data","authors":"Fariha Sohil, M. U. Sohail, J. Shabbir, Sat Gupta","doi":"10.2478/stattrans-2022-0014","DOIUrl":"https://doi.org/10.2478/stattrans-2022-0014","url":null,"abstract":"Abstract In the present study, we consider the problem of missing and extreme values for the estimation of population variance. The presence of extreme values either in the study variable, or the auxiliary variable, or in both of them, can adversely affect the performance of the estimation procedure. We consider three different situations for the presence of extreme values and also consider jackknife variance estimators for the population variance by handling these extreme values under stratified random sampling. Bootstrap technique ABB is carried out to understand the relative relationship more precisely.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"17 - 32"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49648591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistics in Transition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1