首页 > 最新文献

Communications in Statistics Case Studies Data Analysis and Applications最新文献

英文 中文
Stepwise multiple testing procedures for the successive comparison of variances 方差连续比较的逐步多重检验程序
Q4 Mathematics Pub Date : 2022-10-02 DOI: 10.1080/23737484.2022.2133028
Jatesh Kumar, Parminder Singh, A. N. Gill
Abstract In this paper, stepwise multiple testing procedures are proposed for comparing successive populations in a sequence of several independent normal populations using index parameter variance. The proposed procedures have advantages over the single-step procedures and closed testing procedures available in the existing literature. The proposed stepwise testing procedures, control the family-wise error rate (FWER) strongly, and dramatically improve in power over the relevant single-step procedures. The closed testing procedure, which is step-down in nature and is developed for a testing problem, is very complex in its implementation, and this complexity increases further as the number of successive comparisons increases. The relevant critical constants have been tabulated to facilitate the implementation of the proposed procedures. We also proposed testing procedures for comparing the successive populations in a sequence of two-parametric exponential populations with regards to their scale parameters. In an effort to discern the efficiency of proposed procedures, simulated power comparisons with relevant existing procedures are presented, and the working of the proposed procedures is exemplified by means of two real-life data sets.
摘要本文提出了利用指标参数方差比较若干独立正态总体序列中连续总体的逐步多重检验方法。与现有文献中可用的单步程序和封闭测试程序相比,所建议的程序具有优势。所提出的逐步测试程序,强有力地控制了家族错误率(FWER),并显著提高了相关单步程序的能力。封闭测试程序本质上是逐步下降的,是为测试问题开发的,它的实现非常复杂,并且随着连续比较次数的增加,这种复杂性进一步增加。已将有关的临界常数制成表格,以方便实施所建议的程序。我们还提出了比较双参数指数种群序列中连续种群与其尺度参数的检验程序。为了了解拟议程序的效率,本文与相关现有程序进行了模拟功率比较,并通过两个实际数据集举例说明了拟议程序的工作。
{"title":"Stepwise multiple testing procedures for the successive comparison of variances","authors":"Jatesh Kumar, Parminder Singh, A. N. Gill","doi":"10.1080/23737484.2022.2133028","DOIUrl":"https://doi.org/10.1080/23737484.2022.2133028","url":null,"abstract":"Abstract In this paper, stepwise multiple testing procedures are proposed for comparing successive populations in a sequence of several independent normal populations using index parameter variance. The proposed procedures have advantages over the single-step procedures and closed testing procedures available in the existing literature. The proposed stepwise testing procedures, control the family-wise error rate (FWER) strongly, and dramatically improve in power over the relevant single-step procedures. The closed testing procedure, which is step-down in nature and is developed for a testing problem, is very complex in its implementation, and this complexity increases further as the number of successive comparisons increases. The relevant critical constants have been tabulated to facilitate the implementation of the proposed procedures. We also proposed testing procedures for comparing the successive populations in a sequence of two-parametric exponential populations with regards to their scale parameters. In an effort to discern the efficiency of proposed procedures, simulated power comparisons with relevant existing procedures are presented, and the working of the proposed procedures is exemplified by means of two real-life data sets.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"61 1","pages":"649 - 662"},"PeriodicalIF":0.0,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91056322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal playing strategies of a batsman against bowling type in limited-over cricket: An application of game theory 极限板球中击球手对保龄球类型的最优比赛策略:博弈论的应用
Q4 Mathematics Pub Date : 2022-10-02 DOI: 10.1080/23737484.2022.2133027
Dhruba Das, H. Saikia, Dibyojyoti Bhattacharjee
ABSTRACT Every player is expected to contribute to team’s batting effort irrespective of their batting position. The expected contribution of each batsman is to score runs as quickly as possible without getting dismissed. It has often seen that based on different situations within a game, a batsman must play either carefully to defend his wicket or strike out aggressively to score runs quickly. Based on the expertise of the batsman, the captain of the fielding team arrange the fielders in such a way that the batsman couldn’t maximize his score. However, the arrangement of fielders is also dependent on the type of bowler (spin/fast) as well as team’s bowling strategies. Therefore, this study tries to find out the optimal playing strategies of a batsman on the field against different bowling types through the approach of game theory. To substantiate the model with live data a batsman’s strategies against different types of bowlers are explained in this work.
每个球员都被期望为球队的打击努力做出贡献,无论他们的打击位置如何。每个击球手的预期贡献是在不被解雇的情况下尽可能快地得分。我们经常看到,根据比赛中的不同情况,击球手要么必须小心地防守他的三柱门,要么必须积极地三振出局以迅速得分。根据击球手的专业知识,外野队队长安排外野手,使击球手无法获得最大的得分。然而,外野手的安排也取决于投球手的类型(旋转/快速)以及球队的投球策略。因此,本研究试图通过博弈论的方法,找出击球手在球场上对抗不同类型保龄球的最优策略。为了证明该模型与现场数据,击球手的策略对不同类型的投球手解释在这项工作。
{"title":"Optimal playing strategies of a batsman against bowling type in limited-over cricket: An application of game theory","authors":"Dhruba Das, H. Saikia, Dibyojyoti Bhattacharjee","doi":"10.1080/23737484.2022.2133027","DOIUrl":"https://doi.org/10.1080/23737484.2022.2133027","url":null,"abstract":"ABSTRACT Every player is expected to contribute to team’s batting effort irrespective of their batting position. The expected contribution of each batsman is to score runs as quickly as possible without getting dismissed. It has often seen that based on different situations within a game, a batsman must play either carefully to defend his wicket or strike out aggressively to score runs quickly. Based on the expertise of the batsman, the captain of the fielding team arrange the fielders in such a way that the batsman couldn’t maximize his score. However, the arrangement of fielders is also dependent on the type of bowler (spin/fast) as well as team’s bowling strategies. Therefore, this study tries to find out the optimal playing strategies of a batsman on the field against different bowling types through the approach of game theory. To substantiate the model with live data a batsman’s strategies against different types of bowlers are explained in this work.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"106 1","pages":"738 - 751"},"PeriodicalIF":0.0,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86430009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extension of generalized Poisson log-linear regression models for analysing three-way contingency table: Application to malaria data 广义泊松对数线性回归模型在三元列联表分析中的推广:在疟疾数据中的应用
Q4 Mathematics Pub Date : 2022-10-02 DOI: 10.1080/23737484.2022.2133026
Shehu Bala, Usman Abubakar Umar
Abstract This study presents the extension of generalized Poisson (GP-1 and GP-2) models for three-way contingency table. We assume a mixed systematic component of the log-linear models for contingency tables to produce a linear transformation for the link function of Generalized Linear Models (GLMs). Maximum likelihood estimation method was derived for the parameters estimates of the models. An over-dispersed malaria data of 2019 was considered for the study. The GP-1 and GP-2 models for three-way contingency table was used to model the data. Based on Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC) goodness-of-fits measures, the GP-2 model outperformed the GP-1 model for three-way contingency table on malaria data. We found that some parameters of the full model were statistically significant as; malaria cases was sensitive to all ages considered in the study, and people were more infected with malaria in the month of April, June, and July 2019.
摘要研究了三向列联表的广义泊松(GP-1和GP-2)模型的推广。我们假设列联表的对数线性模型的混合系统成分,以产生广义线性模型(GLMs)的链接函数的线性变换。导出了模型参数估计的极大似然估计方法。该研究考虑了2019年过度分散的疟疾数据。采用三元列联表的GP-1和GP-2模型对数据进行建模。基于赤池信息准则(Akaike Information Criterion, AIC)和贝叶斯信息准则(Bayesian Information Criterion, BIC)拟合优度度量,GP-2模型在疟疾数据的三元列联表上优于GP-1模型。我们发现整个模型的一些参数在统计学上显著为;疟疾病例对研究中考虑的所有年龄段都敏感,2019年4月、6月和7月的人群感染疟疾较多。
{"title":"Extension of generalized Poisson log-linear regression models for analysing three-way contingency table: Application to malaria data","authors":"Shehu Bala, Usman Abubakar Umar","doi":"10.1080/23737484.2022.2133026","DOIUrl":"https://doi.org/10.1080/23737484.2022.2133026","url":null,"abstract":"Abstract This study presents the extension of generalized Poisson (GP-1 and GP-2) models for three-way contingency table. We assume a mixed systematic component of the log-linear models for contingency tables to produce a linear transformation for the link function of Generalized Linear Models (GLMs). Maximum likelihood estimation method was derived for the parameters estimates of the models. An over-dispersed malaria data of 2019 was considered for the study. The GP-1 and GP-2 models for three-way contingency table was used to model the data. Based on Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC) goodness-of-fits measures, the GP-2 model outperformed the GP-1 model for three-way contingency table on malaria data. We found that some parameters of the full model were statistically significant as; malaria cases was sensitive to all ages considered in the study, and people were more infected with malaria in the month of April, June, and July 2019.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"24 1","pages":"634 - 648"},"PeriodicalIF":0.0,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82572467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effects of dichotomizing continuous outcome on efficiencies of measures of explained variation in logistic regression: Simulation study and application 二分类连续结果对逻辑回归中解释变异测度效率的影响:模拟研究与应用
Q4 Mathematics Pub Date : 2022-10-02 DOI: 10.1080/23737484.2022.2139019
Suay Erees
Abstract Dichotomizing continuous outcome variables is a common procedure in medical sciences. When analyzing these variables using binary logistic regression, great attention should be paid to the choice of the measure of explained variation ( . Since there are many different R 2 in logistic regression, in order to make correct inferences about models, evaluating their performances has become more important. The purpose of this paper is to reveal asymptotically more efficient and reliable R 2 measure when analyzing the models with dichotomized outcome. The eight most recommended R 2 statistics and ordinary least squares R 2 associated with the underlying continuous outcome have been included. Their asymptotic distributions have been studied. They have also been compared under varying correlational conditions between outcome and covariate. Extensive simulations using the bootstrap method have been conducted under two modeling scenarios. A real data example is also presented. The findings provide support and important basis for making efficient decisions.
摘要连续结果变量的二分类是医学中常用的方法。在使用二元逻辑回归分析这些变量时,应非常注意选择被解释变异的度量()。由于在逻辑回归中有许多不同的r2,为了对模型做出正确的推断,评估它们的性能变得更加重要。本文的目的是在分析具有二分类结果的模型时揭示渐近更有效和可靠的r2度量。8个最推荐的r2统计量和与潜在连续结果相关的普通最小二乘r2已被纳入。研究了它们的渐近分布。在结果和协变量之间的不同相关条件下,也对它们进行了比较。利用自举法在两种建模情景下进行了广泛的模拟。并给出了一个实际的数据实例。研究结果为有效决策提供了支持和重要依据。
{"title":"Effects of dichotomizing continuous outcome on efficiencies of measures of explained variation in logistic regression: Simulation study and application","authors":"Suay Erees","doi":"10.1080/23737484.2022.2139019","DOIUrl":"https://doi.org/10.1080/23737484.2022.2139019","url":null,"abstract":"Abstract Dichotomizing continuous outcome variables is a common procedure in medical sciences. When analyzing these variables using binary logistic regression, great attention should be paid to the choice of the measure of explained variation ( . Since there are many different R 2 in logistic regression, in order to make correct inferences about models, evaluating their performances has become more important. The purpose of this paper is to reveal asymptotically more efficient and reliable R 2 measure when analyzing the models with dichotomized outcome. The eight most recommended R 2 statistics and ordinary least squares R 2 associated with the underlying continuous outcome have been included. Their asymptotic distributions have been studied. They have also been compared under varying correlational conditions between outcome and covariate. Extensive simulations using the bootstrap method have been conducted under two modeling scenarios. A real data example is also presented. The findings provide support and important basis for making efficient decisions.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"9 1","pages":"663 - 681"},"PeriodicalIF":0.0,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84257204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint design of control chart and maintenance policy under multiple assignable causes and random failures by considering the statistical constraints 考虑统计约束的多可分配原因和随机故障情况下的控制图和维修策略联合设计
Q4 Mathematics Pub Date : 2022-09-26 DOI: 10.1080/23737484.2022.2126413
A. Salmasnia, Ehsan Emamjomeh, M. Maleki
ABSTRACT In modern industries, statistical process monitoring (SPM) and maintenance management are extensively employed to increase the production rate of conforming items. Aiming at minimizing the expected total cost per time unit subject to some statistical constraints, this study proposes a hybrid quality-maintenance model for imperfect short-run process. To bring the proposed model closer to real short-run systems, it is considered that the process mean may shift to an out-of-control condition due to occurrence of several types of assignable causes. Furthermore, it is supposed that the time-to-failure follows a non-homogenous Poisson process implying that the system may suddenly fail with an increasing failure rate function. Moreover, a non-uniform sampling scheme is developed in order to improve the system reliability. Finally, the main advantages of the proposed model are highlighted by conducting two comparative studies. The first one illustrates the efficiency of the non-uniform sampling scheme in increasing the in-control time interval and decreasing the expected total cost. The second one confirms the importance of considering the system failure on both the expected total cost and in-control time interval.
在现代工业中,统计过程监控(SPM)和维护管理被广泛应用于提高合格品的生产率。在一定的统计约束下,以最小化单位时间内的期望总成本为目标,提出了不完全短时过程的混合质量维护模型。为了使所提出的模型更接近于实际的短期系统,考虑到过程均值可能由于几种可分配原因的发生而转向失控状态。此外,假定失效时间遵循非齐次泊松过程,这意味着系统可能会突然失效,且故障率函数增加。为了提高系统的可靠性,提出了一种非均匀采样方案。最后,通过两项比较研究,突出了所提模型的主要优势。第一个例子说明了非均匀采样方案在增加控制时间间隔和降低期望总成本方面的效率。第二个结果证实了考虑系统故障对预期总成本和控制时间间隔的重要性。
{"title":"Joint design of control chart and maintenance policy under multiple assignable causes and random failures by considering the statistical constraints","authors":"A. Salmasnia, Ehsan Emamjomeh, M. Maleki","doi":"10.1080/23737484.2022.2126413","DOIUrl":"https://doi.org/10.1080/23737484.2022.2126413","url":null,"abstract":"ABSTRACT In modern industries, statistical process monitoring (SPM) and maintenance management are extensively employed to increase the production rate of conforming items. Aiming at minimizing the expected total cost per time unit subject to some statistical constraints, this study proposes a hybrid quality-maintenance model for imperfect short-run process. To bring the proposed model closer to real short-run systems, it is considered that the process mean may shift to an out-of-control condition due to occurrence of several types of assignable causes. Furthermore, it is supposed that the time-to-failure follows a non-homogenous Poisson process implying that the system may suddenly fail with an increasing failure rate function. Moreover, a non-uniform sampling scheme is developed in order to improve the system reliability. Finally, the main advantages of the proposed model are highlighted by conducting two comparative studies. The first one illustrates the efficiency of the non-uniform sampling scheme in increasing the in-control time interval and decreasing the expected total cost. The second one confirms the importance of considering the system failure on both the expected total cost and in-control time interval.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"13 1","pages":"607 - 633"},"PeriodicalIF":0.0,"publicationDate":"2022-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86080797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation of area under the ROC curve in the framework of gamma mixtures 伽马混合框架下ROC曲线下面积的估计
Q4 Mathematics Pub Date : 2022-09-15 DOI: 10.1080/23737484.2022.2121947
Arunima S. Kannan, R. V. Vardhan
Abstract Receiver operating characteristic (ROC) curve is one of the well-known classification tools. There are several bi-distributional ROC models in the literature, which can be applied only when there is a prior knowledge on the class/status of the subject. If the predefined status of the subject is not known, then we need to administer a statistical methodology to identify the homogeneous components within it. Once this is done, modeling of ROC can be made, and here it is assumed that the data underlie non-normal distribution. In this paper, the need for handling non-normal data in the framework of mixture model is discussed and demonstrated using a real data set and simulation studies. It is shown that, the proposed mixGamma ROC model replaces the existing ROC models when the data is of non-normal and multi-mode.
接收者工作特征曲线(Receiver operating characteristic, ROC)是公认的分类工具之一。文献中有几个双分布ROC模型,只有在对受试者的类别/状态有先验知识时才能应用。如果不知道主题的预定义状态,那么我们需要管理一种统计方法来识别其中的同类组件。一旦完成,就可以进行ROC建模,这里假设数据是非正态分布。本文讨论了在混合模型框架下处理非正态数据的必要性,并通过实际数据集和仿真研究进行了论证。结果表明,当数据是非正态和多模态时,所提出的mixGamma ROC模型可以替代现有的ROC模型。
{"title":"Estimation of area under the ROC curve in the framework of gamma mixtures","authors":"Arunima S. Kannan, R. V. Vardhan","doi":"10.1080/23737484.2022.2121947","DOIUrl":"https://doi.org/10.1080/23737484.2022.2121947","url":null,"abstract":"Abstract Receiver operating characteristic (ROC) curve is one of the well-known classification tools. There are several bi-distributional ROC models in the literature, which can be applied only when there is a prior knowledge on the class/status of the subject. If the predefined status of the subject is not known, then we need to administer a statistical methodology to identify the homogeneous components within it. Once this is done, modeling of ROC can be made, and here it is assumed that the data underlie non-normal distribution. In this paper, the need for handling non-normal data in the framework of mixture model is discussed and demonstrated using a real data set and simulation studies. It is shown that, the proposed mixGamma ROC model replaces the existing ROC models when the data is of non-normal and multi-mode.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"61 1","pages":"714 - 727"},"PeriodicalIF":0.0,"publicationDate":"2022-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89181524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial and non-spatial clustering algorithms in the analysis of Brazilian educational data 巴西教育数据分析中的空间和非空间聚类算法
Q4 Mathematics Pub Date : 2022-09-13 DOI: 10.1080/23737484.2022.2117744
Daiane Chitko de Souza, C. Taconeli
Abstract Education is one of the pillars of human societies, such that achieving better indicators in this area is a common goal for different federate entities. In this context, identifying patterns on the results of such indicators, evaluated for different entities, as well as grouping them based on their similarities, can lead to a better understanding of the educational scenario of a population. This knowledge, moreover, might subsidize the formulation of public policies and allow the decision-making by the responsible managers. In the present work, we present an illustrative example of the application of spatial and non-spatial clustering algorithms in the analysis of data from six important indicators of basic education (middle and high school) evaluated for the municipalities of the state of Paraná, Brazil. Clusters provided by each method were evaluated according to their spatial distributions and educational features. The different clustering algorithms produced clusters with different levels of spatial contiguity and homogeneity regarding the educational indicators, reflecting the importance of choosing the appropriate clustering technique based on the research objectives.
教育是人类社会的支柱之一,因此在这一领域取得更好的指标是不同联邦实体的共同目标。在这方面,确定这些指标结果的模式,对不同实体进行评价,并根据它们的相似性对它们进行分组,可以使人们更好地了解人口的教育情况。此外,这种知识可能会补贴公共政策的制定,并使负责任的管理人员能够作出决策。在目前的工作中,我们提出了一个应用空间和非空间聚类算法分析巴西帕拉纳州市政当局评估的基础教育(初中和高中)六个重要指标数据的说明性示例。根据每种方法提供的聚类的空间分布和教育特征进行评价。不同的聚类算法对教育指标产生的聚类具有不同程度的空间连续性和均匀性,反映了根据研究目标选择合适的聚类技术的重要性。
{"title":"Spatial and non-spatial clustering algorithms in the analysis of Brazilian educational data","authors":"Daiane Chitko de Souza, C. Taconeli","doi":"10.1080/23737484.2022.2117744","DOIUrl":"https://doi.org/10.1080/23737484.2022.2117744","url":null,"abstract":"Abstract Education is one of the pillars of human societies, such that achieving better indicators in this area is a common goal for different federate entities. In this context, identifying patterns on the results of such indicators, evaluated for different entities, as well as grouping them based on their similarities, can lead to a better understanding of the educational scenario of a population. This knowledge, moreover, might subsidize the formulation of public policies and allow the decision-making by the responsible managers. In the present work, we present an illustrative example of the application of spatial and non-spatial clustering algorithms in the analysis of data from six important indicators of basic education (middle and high school) evaluated for the municipalities of the state of Paraná, Brazil. Clusters provided by each method were evaluated according to their spatial distributions and educational features. The different clustering algorithms produced clusters with different levels of spatial contiguity and homogeneity regarding the educational indicators, reflecting the importance of choosing the appropriate clustering technique based on the research objectives.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"1 1","pages":"588 - 606"},"PeriodicalIF":0.0,"publicationDate":"2022-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89321301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parameter estimation of structural equation models with misclassification: The MC-SIMEX approach 含错分类的结构方程模型参数估计:MC-SIMEX方法
Q4 Mathematics Pub Date : 2022-08-12 DOI: 10.1080/23737484.2022.2106324
Sahika Gokmen, J. Lyhagen
Abstract The random errors in the measurement process, called measurement error or misclassification, are inevitable and cause bias and inconsistent parameter estimates. Misclassification Simulation Extrapolation (MC-SIMEX) is a simulation based measurement error estimation method to obtain reduced parameter bias under misclassification. The main purpose of this study is an adaptation of MC-SIMEX method on Structural Equation Modeling (SEM). The effects of misclassification on the parameter estimates of a binary explanatory variables in SEM and the performance of MC-SIMEX method investigated with both Monte Carlo and an empirical study. According to the main results, finding the best extrapolant function is just as important as estimating the misclassification matrix although MC-SIMEX corrected a part of the bias.
测量过程中的随机误差,即测量误差或误分类,是不可避免的,会导致误差和参数估计不一致。误分类仿真外推法(MC-SIMEX)是一种基于仿真的测量误差估计方法,用于在误分类情况下获得较小的参数偏差。本研究的主要目的是将MC-SIMEX方法应用于结构方程建模(SEM)。通过蒙特卡罗和实证研究,研究了错误分类对SEM中二元解释变量参数估计的影响以及MC-SIMEX方法的性能。根据主要结果,尽管MC-SIMEX修正了部分偏差,但寻找最佳外推函数与估计错误分类矩阵同样重要。
{"title":"Parameter estimation of structural equation models with misclassification: The MC-SIMEX approach","authors":"Sahika Gokmen, J. Lyhagen","doi":"10.1080/23737484.2022.2106324","DOIUrl":"https://doi.org/10.1080/23737484.2022.2106324","url":null,"abstract":"Abstract The random errors in the measurement process, called measurement error or misclassification, are inevitable and cause bias and inconsistent parameter estimates. Misclassification Simulation Extrapolation (MC-SIMEX) is a simulation based measurement error estimation method to obtain reduced parameter bias under misclassification. The main purpose of this study is an adaptation of MC-SIMEX method on Structural Equation Modeling (SEM). The effects of misclassification on the parameter estimates of a binary explanatory variables in SEM and the performance of MC-SIMEX method investigated with both Monte Carlo and an empirical study. According to the main results, finding the best extrapolant function is just as important as estimating the misclassification matrix although MC-SIMEX corrected a part of the bias.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"1 1","pages":"545 - 558"},"PeriodicalIF":0.0,"publicationDate":"2022-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89812208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applications of a new process capability index to electronic industries 一种新的过程能力指标在电子工业中的应用
Q4 Mathematics Pub Date : 2022-08-08 DOI: 10.1080/23737484.2022.2107962
Mahendra Saha
ABSTRACT The process capability indices (PCIs) are frequently adopted to measure the performance of a process within the specifications. Although higher PCIs indicate higher process “quality,” yet it does not ascertain fewer rates of rejection. Thus, it is more appropriate to adopt a loss-based PCI for measuring the process capability. In this paper, our first objective is to introduce a new capability index called which is based on symmetric loss function for normal process which provides a tailored way of incorporating the loss in capability analysis. Next, we estimate the PCI when the process follows the normal distribution using method of moment (MOM) estimation and compare the performance of the MOM estimation in terms of their absolute biases and corresponding mean squared errors through simulation study in respect of sample sizes. Besides, generalized confidence interval (GCI) is employed for constructing the confidence intervals for the index . The performance of GCI is compared in terms of average widths and coverage probabilities using Monte Carlo simulation. Finally, for illustrating the effectiveness of the proposed method of estimation and GCI, three real data sets from electronic industries are analyzed.
过程能力指数(pci)经常被用来衡量过程在规范范围内的性能。虽然更高的pci表示更高的工艺“质量”,但它并不能确定更低的拒绝率。因此,采用基于损耗的PCI来度量流程能力更为合适。在本文中,我们的第一个目标是引入一种新的能力指标,称为标准过程的对称损失函数,它提供了一种将损失纳入能力分析的定制方法。接下来,我们使用矩估计法(MOM)估计过程服从正态分布时的PCI,并通过对样本量的模拟研究,比较MOM估计的绝对偏差和相应的均方误差的性能。此外,采用广义置信区间(GCI)构造了该指标的置信区间。通过蒙特卡罗模拟,比较了GCI的平均宽度和覆盖概率。最后,为了说明所提出的估计方法和GCI的有效性,分析了来自电子行业的三个真实数据集。
{"title":"Applications of a new process capability index to electronic industries","authors":"Mahendra Saha","doi":"10.1080/23737484.2022.2107962","DOIUrl":"https://doi.org/10.1080/23737484.2022.2107962","url":null,"abstract":"ABSTRACT The process capability indices (PCIs) are frequently adopted to measure the performance of a process within the specifications. Although higher PCIs indicate higher process “quality,” yet it does not ascertain fewer rates of rejection. Thus, it is more appropriate to adopt a loss-based PCI for measuring the process capability. In this paper, our first objective is to introduce a new capability index called which is based on symmetric loss function for normal process which provides a tailored way of incorporating the loss in capability analysis. Next, we estimate the PCI when the process follows the normal distribution using method of moment (MOM) estimation and compare the performance of the MOM estimation in terms of their absolute biases and corresponding mean squared errors through simulation study in respect of sample sizes. Besides, generalized confidence interval (GCI) is employed for constructing the confidence intervals for the index . The performance of GCI is compared in terms of average widths and coverage probabilities using Monte Carlo simulation. Finally, for illustrating the effectiveness of the proposed method of estimation and GCI, three real data sets from electronic industries are analyzed.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"104 1","pages":"574 - 587"},"PeriodicalIF":0.0,"publicationDate":"2022-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75930099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Regularized estimation of the Mahalanobis distance based on modified Cholesky decomposition 基于修正Cholesky分解的马氏距离正则化估计
Q4 Mathematics Pub Date : 2022-08-08 DOI: 10.1080/23737484.2022.2107961
D. Dai, Jianxin Pan, Yuli Liang
Abstract Estimating inverse covariance matrix is an essential part of many statistical methods. This paper proposes a regularized estimator for the inverse covariance matrix. Modified Cholesky decomposition (MCD) is utilized to construct positive definite estimators. Instead of directly regularizing the inverse covariance matrix itself, we impose regularization on the Cholesky factor. The estimated inverse covariance matrix is used to build Mahalanobis distance (MD). The proposed method is evaluated by detecting outliers through simulations and empirical studies.
摘要协方差逆矩阵的估计是许多统计方法的重要组成部分。本文提出了一种正则化的逆协方差矩阵估计。利用修正Cholesky分解(MCD)构造正定估计量。我们不是直接对逆协方差矩阵本身进行正则化,而是对Cholesky因子进行正则化。利用估计的逆协方差矩阵构建马氏距离(MD)。通过模拟和实证研究,对该方法进行了异常值检测。
{"title":"Regularized estimation of the Mahalanobis distance based on modified Cholesky decomposition","authors":"D. Dai, Jianxin Pan, Yuli Liang","doi":"10.1080/23737484.2022.2107961","DOIUrl":"https://doi.org/10.1080/23737484.2022.2107961","url":null,"abstract":"Abstract Estimating inverse covariance matrix is an essential part of many statistical methods. This paper proposes a regularized estimator for the inverse covariance matrix. Modified Cholesky decomposition (MCD) is utilized to construct positive definite estimators. Instead of directly regularizing the inverse covariance matrix itself, we impose regularization on the Cholesky factor. The estimated inverse covariance matrix is used to build Mahalanobis distance (MD). The proposed method is evaluated by detecting outliers through simulations and empirical studies.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"601 1","pages":"559 - 573"},"PeriodicalIF":0.0,"publicationDate":"2022-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77312934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Communications in Statistics Case Studies Data Analysis and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1