This second paper in a two-part methodological guide demonstrates how Generalised Linear Mixed Model (GLMM) tree analysis can be used to explore linguistic conditioning in sociolinguistic variation. Building on Part 1, which introduced the dataset and illustrated how GLMM trees reveal social patterning in (ING) variation, Part 2 focuses on the internal linguistic factors governing the alternation between [ɪn] and [ɪŋ] in Canadian Maritime English (CME). Using over 10,000 tokens from more than 300 speakers—the largest single-dialect (ING) dataset to date—this study shows how GLMM trees accommodate overlapping or collinear predictors such as grammatical category, phonological context, and lexical frequency. The analysis confirms the dominant morphological constraint (verbs favouring [ɪn], non-verbal forms favouring [ɪŋ]) and identifies additional phonological and frequency effects that emerge only in specific prosodic and lexical environments. These results demonstrate that GLMM trees offer a straightforward, statistically robust way to model complex sociolinguistic data while maintaining interpretability. The paper concludes by comparing hierarchical tree-based and ‘flat’ regression models, validating the stability of the GLMM trees through cross-validation, and highlighting how this approach can clarify the interplay of linguistic and social constraints in variationist research.
{"title":"A Guide to Build (ING) GLMM Trees in Canadian Maritime English: Part 2, Linguistic Factors","authors":"Matt Hunt Gardner","doi":"10.1111/lnc3.70037","DOIUrl":"10.1111/lnc3.70037","url":null,"abstract":"<p>This second paper in a two-part methodological guide demonstrates how Generalised Linear Mixed Model (GLMM) tree analysis can be used to explore linguistic conditioning in sociolinguistic variation. Building on Part 1, which introduced the dataset and illustrated how GLMM trees reveal social patterning in (ING) variation, Part 2 focuses on the internal linguistic factors governing the alternation between [ɪn] and [ɪŋ] in Canadian Maritime English (CME). Using over 10,000 tokens from more than 300 speakers—the largest single-dialect (ING) dataset to date—this study shows how GLMM trees accommodate overlapping or collinear predictors such as grammatical category, phonological context, and lexical frequency. The analysis confirms the dominant morphological constraint (verbs favouring [ɪn], non-verbal forms favouring [ɪŋ]) and identifies additional phonological and frequency effects that emerge only in specific prosodic and lexical environments. These results demonstrate that GLMM trees offer a straightforward, statistically robust way to model complex sociolinguistic data while maintaining interpretability. The paper concludes by comparing hierarchical tree-based and ‘flat’ regression models, validating the stability of the GLMM trees through cross-validation, and highlighting how this approach can clarify the interplay of linguistic and social constraints in variationist research.</p>","PeriodicalId":47472,"journal":{"name":"Language and Linguistics Compass","volume":"20 2","pages":""},"PeriodicalIF":2.8,"publicationDate":"2026-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://compass.onlinelibrary.wiley.com/doi/epdf/10.1111/lnc3.70037","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146224308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In recent years, much research has focused on what happens in the human brain when a perceptual stimulus, such as a picture, is converted into linguistic content, a word. This process is commonly referred to as object naming and is considered a crucial aspect of language processing, production, and cognition. It refers to the identification of an object with a word or phrase, as well as the psychometric method of investigating this human behavior to obtain insights into different aspects of human cognition and language, such as the organization of the mental lexicon, language acquisition, disorders, or cognitive aging. The ability to name objects is considered a fundamental skill in interpersonal communication and has long been studied in numerous disciplines, such as cognitive science, psycholinguistics, psychology, and, more recently, in computer vision and research on language and vision. In the latter two, object naming has become an extremely powerful tool, especially in the development and fine-tuning of multi-modal models, facilitating tasks such as visual question answering, image captioning tasks, object detection, or visual scene understanding. Our comprehensive, cross-linguistic review explores the key findings, commonly cited, and prominent datasets and their applications that establish object naming both in the past and now, as well as discusses its chances and challenges in future work.
{"title":"From Psycholinguistics to Computer Vision. A Comprehensive Review of Object Naming Data and Studies","authors":"Alžběta Kučerová, Johann-Mattis List","doi":"10.1111/lnc3.70034","DOIUrl":"10.1111/lnc3.70034","url":null,"abstract":"<p>In recent years, much research has focused on what happens in the human brain when a perceptual stimulus, such as a picture, is converted into linguistic content, a word. This process is commonly referred to as object naming and is considered a crucial aspect of language processing, production, and cognition. It refers to the identification of an object with a word or phrase, as well as the psychometric method of investigating this human behavior to obtain insights into different aspects of human cognition and language, such as the organization of the mental lexicon, language acquisition, disorders, or cognitive aging. The ability to name objects is considered a fundamental skill in interpersonal communication and has long been studied in numerous disciplines, such as cognitive science, psycholinguistics, psychology, and, more recently, in computer vision and research on language and vision. In the latter two, object naming has become an extremely powerful tool, especially in the development and fine-tuning of multi-modal models, facilitating tasks such as visual question answering, image captioning tasks, object detection, or visual scene understanding. Our comprehensive, cross-linguistic review explores the key findings, commonly cited, and prominent datasets and their applications that establish object naming both in the past and now, as well as discusses its chances and challenges in future work.</p>","PeriodicalId":47472,"journal":{"name":"Language and Linguistics Compass","volume":"20 2","pages":""},"PeriodicalIF":2.8,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://compass.onlinelibrary.wiley.com/doi/epdf/10.1111/lnc3.70034","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146217010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}