Pub Date: 2017-06-02 | DOI: 10.1027/1614-2241/A000123
Leonard Vanbrabant, R. Schoot, N. Loey, Y. Rosseel
Abstract. Researchers in the social and behavioral sciences often have clear expectations about the order and/or the sign of the parameters in their statistical model. For example, a researcher might expect that regression coefficient β1 is larger than regression coefficients β2 and β3. To test such a constrained hypothesis, special methods have been developed. However, the existing methods for structural equation models (SEM) are complex and computationally demanding, and a software routine is lacking. Therefore, in this paper we describe a general procedure for testing order/inequality constrained hypotheses in SEM using the R package lavaan. We use the likelihood ratio (LR) statistic to test constrained hypotheses, and the resulting plug-in p value is computed by either parametric or Bollen-Stine bootstrapping. Since the obtained plug-in p value can be biased, a double bootstrap approach is available. The procedure is illustrated by a real-life example about the psychosocial functioning in patients with fac...
{"title":"A general procedure for testing inequality constrained hypotheses in SEM","authors":"Leonard Vanbrabant, R. Schoot, N. Loey, Y. Rosseel","doi":"10.1027/1614-2241/A000123","DOIUrl":"https://doi.org/10.1027/1614-2241/A000123","url":null,"abstract":"Abstract. Researchers in the social and behavioral sciences often have clear expectations about the order and/or the sign of the parameters in their statistical model. For example, a researcher might expect that regression coefficient β1 is larger than regression coefficients β2 and β3. To test such a constrained hypothesis special methods have been developed. However, the existing methods for structural equation models (SEM) are complex, computationally demanding, and a software routine is lacking. Therefore, in this paper we describe a general procedure for testing order/inequality constrained hypotheses in SEM using the R package lavaan. We use the likelihood ratio (LR) statistic to test constrained hypotheses and the resulting plug-in p value is computed by either parametric or Bollen-Stine bootstrapping. Since the obtained plug-in p value can be biased, a double bootstrap approach is available. The procedure is illustrated by a real-life example about the psychosocial functioning in patients with fac...","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"13 1","pages":"61-70"},"PeriodicalIF":3.1,"publicationDate":"2017-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44064733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2017-03-22 | DOI: 10.1027/1614-2241/a000122
A. Counsell, R. Cribbie
Culpepper and Aguinis (2011) highlighted the benefit of using the errors-in-variables (EIV) method to control for measurement error and obtain unbiased regression estimates. The current study investigated the EIV method and compared it to change scores and analysis of covariance (ANCOVA) in a two-group pretest-posttest design. Results indicated that the EIV method’s estimates were unbiased under many conditions, but the EIV method consistently demonstrated lower power than the change score method. An additional risk with using the EIV method is that one must enter the covariate reliability into the EIV model, and results highlighted that estimates are biased if a researcher chooses a value that differs from the true covariate reliability. Obtaining unbiased results also depended on sample size. Our conclusion is that there is no additional benefit to using the EIV method over change score or ANCOVA methods for comparing the amount of change in pretest-posttest designs.
{"title":"Using the Errors-in-Variables Method in Two-Group Pretest-Posttest Designs","authors":"A. Counsell, R. Cribbie","doi":"10.1027/1614-2241/a000122","DOIUrl":"https://doi.org/10.1027/1614-2241/a000122","url":null,"abstract":"Culpepper and Aguinis (2011) highlighted the benefit of using the errors-in-variables (EIV) method to control for measurement error and obtain unbiased regression estimates. The current study investigated the EIV method and compared it to change scores and analysis of covariance (ANCOVA) in a two-group pretest-posttest design. Results indicated that the EIV method’s estimates were unbiased under many conditions, but the EIV method consistently demonstrated lower power than the change score method. An additional risk with using the EIV method is that one must enter the covariate reliability into the EIV model, and results highlighted that estimates are biased if a researcher chooses a value that differs from the true covariate reliability. Obtaining unbiased results also depended on sample size. Our conclusion is that there is no additional benefit to using the EIV method over change score or ANCOVA methods for comparing the amount of change in pretest-posttest designs.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"13 1","pages":"1–8"},"PeriodicalIF":3.1,"publicationDate":"2017-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41884541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2017-03-22 | DOI: 10.1027/1614-2241/a000124
Pablo Livacic-Rojas, G. Vallejo, P. Fernández, Ellián Tuero-Herrero
Low precision of inferences from data analyzed with univariate or multivariate analysis of variance (ANOVA) models in repeated-measures designs is associated with non-normally distributed data, nonspherical covariance structures with freely varying variances and covariances, lack of knowledge of the error structure underlying the data, and the wrong choice of covariance structure among the available selectors. In this study, the statistical power levels of the Modified Brown-Forsythe (MBF) procedure and of two Mixed-Model Approach procedures (covariance structure selection by Akaike's Information Criterion [AIC] and the Correctly Identified Model [CIM]) are compared. The data were analyzed using the Monte Carlo simulation method with the statistical package SAS 9.2, in a split-plot design, considering six manipulated variables. The results show that the procedures exhibit high statistical power for within-subjects and interaction effects, and moderate to low power for between-groups effects under the different conditions analyzed. For the latter, only the Modified Brown-Forsythe shows a high level of power, mainly for groups with 30 cases and Unstructured (UN) and Heterogeneous Autoregressive (ARH) matrices. For this reason, we recommend using this procedure, since it exhibits higher power for all effects and does not require specifying the covariance structure underlying the data. Future research is needed to compare power with corrected selectors using single-level and multilevel designs for fixed and random effects.
{"title":"Power of Modified Brown-Forsythe and Mixed-Model Approaches in Split-Plot Designs","authors":"Pablo Livacic-Rojas, G. Vallejo, P. Fernández, Ellián Tuero-Herrero","doi":"10.1027/1614-2241/a000124","DOIUrl":"https://doi.org/10.1027/1614-2241/a000124","url":null,"abstract":"Low precision of the inferences of data analyzed with univariate or multivariate models of the Analysis of Variance (ANOVA) in repeated-measures design is associated to the absence of normality distribution of data, nonspherical covariance structures and free variation of the variance and covariance, the lack of knowledge of the error structure underlying the data, and the wrong choice of covariance structure from different selectors. In this study, levels of statistical power presented the Modified Brown Forsythe (MBF) and two procedures with the Mixed-Model Approaches (the Akaike’s Criterion, the Correctly Identified Model [CIM]) are compared. The data were analyzed using Monte Carlo simulation method with the statistical package SAS 9.2, a split-plot design, and considering six manipulated variables. The results show that the procedures exhibit high statistical power levels for within and interactional effects, and moderate and low levels for the between-groups effects under the different conditions analyzed. For the latter, only the Modified Brown Forsythe shows high level of power mainly for groups with 30 cases and Unstructured (UN) and Autoregressive Heterogeneity (ARH) matrices. For this reason, we recommend using this procedure since it exhibits higher levels of power for all effects and does not require a matrix type that underlies the structure of the data. Future research needs to be done in order to compare the power with corrected selectors using single-level and multilevel designs for fixed and random effects.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"13 1","pages":"9–22"},"PeriodicalIF":3.1,"publicationDate":"2017-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42853409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2017-02-16 | DOI: 10.1027/1614-2241/A000121
S. Jolani, M. Safarkhani
Abstract. In randomized controlled trials (RCTs), a common strategy to increase the power to detect a treatment effect is adjustment for baseline covariates. However, adjustment with partly missing covariates, where only complete cases are used, is inefficient. We consider different alternatives in trials with discrete-time survival data, where subjects are measured in discrete time intervals although they may experience an event at any point in time. The results of a Monte Carlo simulation study, as well as a case study of randomized trials in smokers with attention deficit hyperactivity disorder (ADHD), indicated that single and multiple imputation methods outperform the other methods and increase precision in estimating the treatment effect. The missing indicator method, which adds a dummy variable to the statistical model to indicate whether the value of the covariate is missing and sets all missing values to the same constant, is comparable to the imputation methods. Nevertheless, the power level to detect the treatm...
{"title":"The Effect of Partly Missing Covariates on Statistical Power in Randomized Controlled Trials With Discrete-Time Survival Endpoints","authors":"S. Jolani, M. Safarkhani","doi":"10.1027/1614-2241/A000121","DOIUrl":"https://doi.org/10.1027/1614-2241/A000121","url":null,"abstract":"Abstract. In randomized controlled trials (RCTs), a common strategy to increase power to detect a treatment effect is adjustment for baseline covariates. However, adjustment with partly missing covariates, where complete cases are only used, is inefficient. We consider different alternatives in trials with discrete-time survival data, where subjects are measured in discrete-time intervals while they may experience an event at any point in time. The results of a Monte Carlo simulation study, as well as a case study of randomized trials in smokers with attention deficit hyperactivity disorder (ADHD), indicated that single and multiple imputation methods outperform the other methods and increase precision in estimating the treatment effect. Missing indicator method, which uses a dummy variable in the statistical model to indicate whether the value for that variable is missing and sets the same value to all missing values, is comparable to imputation methods. Nevertheless, the power level to detect the treatm...","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"13 1","pages":"41-60"},"PeriodicalIF":3.1,"publicationDate":"2017-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46326950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2017-02-16 | DOI: 10.1027/1614-2241/a000117
Anabela Marques, A. Ferreira, Margarida M. G. S. Cardoso
Diverse Discrete Discriminant Analysis (DDA) models perform differently in different samples. This fact has encouraged research on combined models, which seem particularly promising when the a priori classes are not well separated or when small or moderately sized samples are considered, as often occurs in practice. In this study, we evaluate the performance of a convex combination of two DDA models: the First-Order Independence Model (FOIM) and the Dependence Trees Model (DTM). We use simulated data sets with two classes and consider diverse data complexity factors that may influence performance of the combined model – the separation of classes, balance, and number of missing states, as well as sample size and the number of parameters to be estimated in DDA. We resort to cross-validation to evaluate the precision of classification. The results illustrate the advantage of the proposed combination over FOIM and DTM: it yields the best results, especially when very small samples are considered. The experimental study also provides a ranking of the data complexity factors according to their relative impact on classification performance, by means of a regression model. It leads to the conclusion that the separation of classes is the most influential factor in classification performance. The ratio between the number of degrees of freedom and sample size, along with the proportion of missing states in the minority class, also has a significant impact on classification performance. An additional gain of this study, also deriving from the estimated regression model, is the ability to predict the precision of classification in a real data set from its data complexity factors.
{"title":"Performance of Combined Models in Discrete Binary Classification","authors":"Anabela Marques, A. Ferreira, Margarida M. G. S. Cardoso","doi":"10.1027/1614-2241/a000117","DOIUrl":"https://doi.org/10.1027/1614-2241/a000117","url":null,"abstract":"Diverse Discrete Discriminant Analysis (DDA) models perform differently in different samples. This fact has encouraged research in combined models which seems particularly promising when the a priori classes are not well separated or when small or moderate sized samples are considered, which often occurs in practice. In this study, we evaluate the performance of a convex combination of two DDA models: the First-Order Independence Model (FOIM) and the Dependence Trees Model (DTM). We use simulated data sets with two classes and consider diverse data complexity factors which may influence performance of the combined model – the separation of classes, balance, and number of missing states, as well as sample size and also the number of parameters to be estimated in DDA. We resort to cross-validation to evaluate the precision of classification. The results obtained illustrate the advantage of the proposed combination when compared with FOIM and DTM: it yields the best results, especially when very small samples are considered. The experimental study also provides a ranking of the data complexity factors, according to their relative impact on classification performance, by means of a regression model. It leads to the conclusion that the separation of classes is the most influential factor in classification performance. The ratio between the number of degrees of freedom and sample size, along with the proportion of missing states in the minority class, also has significant impact on classification performance. An additional gain of this study, also deriving from the estimated regression model, is the ability to successfully predict the precision of classification in a real data set based on the data complexity factors.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"13 1","pages":"23–37"},"PeriodicalIF":3.1,"publicationDate":"2017-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41421439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2016-12-05 | DOI: 10.1027/1614-2241/A000115
J. Straat, L. V. D. Ark, K. Sijtsma
Abstract. The ordinal, unidimensional monotone latent variable model assumes unidimensionality, local independence, and monotonicity, and implies the observable property of conditional association....
{"title":"Using Conditional Association to Identify Locally Independent Item Sets","authors":"J. Straat, L. V. D. Ark, K. Sijtsma","doi":"10.1027/1614-2241/A000115","DOIUrl":"https://doi.org/10.1027/1614-2241/A000115","url":null,"abstract":"Abstract. The ordinal, unidimensional monotone latent variable model assumes unidimensionality, local independence, and monotonicity, and implies the observable property of conditional association....","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"12 1","pages":"117-123"},"PeriodicalIF":3.1,"publicationDate":"2016-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"57293444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2016-12-05 | DOI: 10.1027/1614-2241/A000116
Lianne Ippel, M. Kaptein, J. Vermunt
Abstract. Novel technological advances allow distributed and automatic measurement of human behavior. While these technologies provide exciting new research opportunities, they also pose challenges: datasets collected with new technologies grow increasingly large, and in many applications the collected data are continuously augmented. Such data streams make the standard computation of well-known estimators inefficient, as the computation has to be repeated each time a new data point enters. In this tutorial paper, we detail online learning, an analysis method that facilitates the efficient analysis of Big Data and continuous data streams. We illustrate how common analysis methods can be adapted for use with Big Data using an online, or “row-by-row,” processing approach. We present several simple (and exact) examples of online estimation and discuss Stochastic Gradient Descent as a general (approximate) approach to estimating more complex models. We end this article with a discussion of the methodolo...
{"title":"Dealing with data streams: An online, row-by-row, estimation tutorial","authors":"Lianne Ippel, M. Kaptein, J. Vermunt","doi":"10.1027/1614-2241/A000116","DOIUrl":"https://doi.org/10.1027/1614-2241/A000116","url":null,"abstract":"Abstract. Novel technological advances allow distributed and automatic measurement of human behavior. While these technologies provide exciting new research opportunities, they also provide challenges: datasets collected using new technologies grow increasingly large, and in many applications the collected data are continuously augmented. These data streams make the standard computation of well-known estimators inefficient as the computation has to be repeated each time a new data point enters. In this tutorial paper, we detail online learning, an analysis method that facilitates the efficient analysis of Big Data and continuous data streams. We illustrate how common analysis methods can be adapted for use with Big Data using an online, or “row-by-row,” processing approach. We present several simple (and exact) examples of the online estimation and discuss Stochastic Gradient Descent as a general (approximate) approach to estimate more complex models. We end this article with a discussion of the methodolo...","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"12 1","pages":"124-138"},"PeriodicalIF":3.1,"publicationDate":"2016-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"57293496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2016-10-05 | DOI: 10.1027/1614-2241/A000112
Tyler Hamby, R. Peterson
Abstract. Using two meta-analytic datasets, we investigated the effect that two scale-item characteristics – number of item response categories and item response-category label format – have on the reliability of multi-item rating scales. The first dataset contained 289 reliability coefficients harvested from 100 samples that measured Big Five traits. The second dataset contained 2,524 reliability coefficients harvested from 381 samples that measured a wide variety of constructs in psychology, marketing, management, and education. We performed moderator analyses on the two datasets with the two item characteristics and their interaction. As expected, as the number of item response categories increased, so did reliability, but more importantly, there was a significant interaction between the number of item response categories and item response-category label format. Increasing the number of response categories increased reliabilities for scale-items with all response categories labeled more so than for oth...
{"title":"A Meta-Analytic Investigation of the Relationship Between Scale-Item Length, Label Format, and Reliability","authors":"Tyler Hamby, R. Peterson","doi":"10.1027/1614-2241/A000112","DOIUrl":"https://doi.org/10.1027/1614-2241/A000112","url":null,"abstract":"Abstract. Using two meta-analytic datasets, we investigated the effect that two scale-item characteristics – number of item response categories and item response-category label format – have on the reliability of multi-item rating scales. The first dataset contained 289 reliability coefficients harvested from 100 samples that measured Big Five traits. The second dataset contained 2,524 reliability coefficients harvested from 381 samples that measured a wide variety of constructs in psychology, marketing, management, and education. We performed moderator analyses on the two datasets with the two item characteristics and their interaction. As expected, as the number of item response categories increased, so did reliability, but more importantly, there was a significant interaction between the number of item response categories and item response-category label format. Increasing the number of response categories increased reliabilities for scale-items with all response categories labeled more so than for oth...","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"12 1","pages":"89-96"},"PeriodicalIF":3.1,"publicationDate":"2016-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"57293435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2016-10-01 | Epub Date: 2016-12-05 | DOI: 10.1027/1614-2241/a000114
John J Dziak, Bethany C Bray, Jieting Zhang, Minqiang Zhang, Stephanie T Lanza
Several approaches are available for estimating the relationship of latent class membership to distal outcomes in latent profile analysis (LPA). A three-step approach is commonly used, but has problems with estimation bias and confidence interval coverage. Proposed improvements include the correction method of Bolck, Croon, and Hagenaars (BCH; 2004), Vermunt's (2010) maximum likelihood (ML) approach, and the inclusive three-step approach of Bray, Lanza, & Tan (2015). These methods have been studied in the related case of latent class analysis (LCA) with categorical indicators, but not as well studied for LPA with continuous indicators. We investigated the performance of these approaches in LPA with normally distributed indicators, under different conditions of distal outcome distribution, class measurement quality, relative latent class size, and strength of association between latent class and the distal outcome. The modified BCH implemented in Latent GOLD had excellent performance. The maximum likelihood and inclusive approaches were not robust to violations of distributional assumptions. These findings broadly agree with and extend the results presented by Bakk and Vermunt (2016) in the context of LCA with categorical indicators.
{"title":"Comparing the Performance of Improved Classify-Analyze Approaches For Distal Outcomes in Latent Profile Analysis.","authors":"John J Dziak, Bethany C Bray, Jieting Zhang, Minqiang Zhang, Stephanie T Lanza","doi":"10.1027/1614-2241/a000114","DOIUrl":"https://doi.org/10.1027/1614-2241/a000114","url":null,"abstract":"<p><p>Several approaches are available for estimating the relationship of latent class membership to distal outcomes in latent profile analysis (LPA). A three-step approach is commonly used, but has problems with estimation bias and confidence interval coverage. Proposed improvements include the correction method of Bolck, Croon, and Hagenaars (BCH; 2004), Vermunt's (2010) maximum likelihood (ML) approach, and the inclusive three-step approach of Bray, Lanza, & Tan (2015). These methods have been studied in the related case of latent class analysis (LCA) with categorical indicators, but not as well studied for LPA with continuous indicators. We investigated the performance of these approaches in LPA with normally distributed indicators, under different conditions of distal outcome distribution, class measurement quality, relative latent class size, and strength of association between latent class and the distal outcome. The modified BCH implemented in Latent GOLD had excellent performance. The maximum likelihood and inclusive approaches were not robust to violations of distributional assumptions. These findings broadly agree with and extend the results presented by Bakk and Vermunt (2016) in the context of LCA with categorical indicators.</p>","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"12 4","pages":"107-116"},"PeriodicalIF":3.1,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5473653/pdf/nihms-834564.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35102499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2016-06-20 | DOI: 10.1027/1614-2241/A000110
A. Poncet, D. Courvoisier, C. Combescure, T. Perneger
Abstract. Many applied researchers are taught to use the t-test when distributions appear normal and/or sample sizes are large, and nonparametric tests otherwise, and they fear inflated error rates if the “wrong” test is used. In a simulation study (four tests: t-test, Mann-Whitney test, robust t-test, permutation test; seven sample sizes between 2 × 10 and 2 × 500; four distributions: normal, uniform, log-normal, bimodal; under the null and alternative hypotheses), we show that Type I errors are well controlled in all conditions. The t-test is most powerful under the normal and the uniform distributions, the Mann-Whitney test under the log-normal distribution, and the robust t-test under the bimodal distribution. Importantly, even the t-test was more powerful under asymmetric distributions than under the normal distribution for the same effect size. It appears that normality and sample size do not matter for the selection of a test to compare two groups of the same size and variance. The researcher can opt for the t...
{"title":"Normality and Sample Size Do Not Matter for the Selection of an Appropriate Statistical Test for Two-Group Comparisons","authors":"A. Poncet, D. Courvoisier, C. Combescure, T. Perneger","doi":"10.1027/1614-2241/A000110","DOIUrl":"https://doi.org/10.1027/1614-2241/A000110","url":null,"abstract":"Abstract. Many applied researchers are taught to use the t-test when distributions appear normal and/or sample sizes are large and non-parametric tests otherwise, and fear inflated error rates if the “wrong” test is used. In a simulation study (four tests: t-test, Mann-Whitney test, Robust t-test, Permutation test; seven sample sizes between 2 × 10 and 2 × 500; four distributions: normal, uniform, log-normal, bimodal; under the null and alternate hypotheses), we show that type 1 errors are well controlled in all conditions. The t-test is most powerful under the normal and the uniform distributions, the Mann-Whitney test under the lognormal distribution, and the robust t-test under the bimodal distribution. Importantly, even the t-test was more powerful under asymmetric distributions than under the normal distribution for the same effect size. It appears that normality and sample size do not matter for the selection of a test to compare two groups of same size and variance. The researcher can opt for the t...","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"12 1","pages":"61-71"},"PeriodicalIF":3.1,"publicationDate":"2016-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"57293420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}