{"title":"Exploring potential unknown subgroups in your data: An introduction to finite mixture models for applied linguistics","authors":"Tove Larsson , Gregory R. Hancock","doi":"10.1016/j.rmal.2024.100117","DOIUrl":null,"url":null,"abstract":"<div><p>This article provides an introduction to finite mixture models in an applied linguistics context. Mixture models can be used to address questions relating to whether there are unknown subgroups in one's data, and if so, which participants/texts are likely to belong to which subgroup. Put differently, the technique enables us to assess whether our data might come from a heterogeneous population that is made up of latent classes. As such, mixture models offer a model-based framework to answer research questions for which the field previously has either attempted to use nonparametric heuristic techniques (e.g., cluster analysis) or has left entirely unanswered. An example of such research questions would be, ‘Does the treatment work equally well for all the participants, or are there unknown subgroups in the data that respond differently to the treatment?’ The article starts by introducing univariate mixture models and then broadens the scope to cover bivariate and multivariate mixture models. It also discusses some known pitfalls of the technique and how one might ameliorate these in practice.</p></div>","PeriodicalId":101075,"journal":{"name":"Research Methods in Applied Linguistics","volume":"3 2","pages":"Article 100117"},"PeriodicalIF":0.0000,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research Methods in Applied Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772766124000235","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This article provides an introduction to finite mixture models in an applied linguistics context. Mixture models can be used to address questions relating to whether there are unknown subgroups in one's data, and if so, which participants/texts are likely to belong to which subgroup. Put differently, the technique enables us to assess whether our data might come from a heterogeneous population that is made up of latent classes. As such, mixture models offer a model-based framework to answer research questions for which the field previously has either attempted to use nonparametric heuristic techniques (e.g., cluster analysis) or has left entirely unanswered. An example of such research questions would be, ‘Does the treatment work equally well for all the participants, or are there unknown subgroups in the data that respond differently to the treatment?’ The article starts by introducing univariate mixture models and then broadens the scope to cover bivariate and multivariate mixture models. It also discusses some known pitfalls of the technique and how one might ameliorate these in practice.