{"title":"Areal and phylogenetic dimensions of word order variation in Indo-European languages","authors":"Christian Ebert, Balthasar Bickel, Paul Widmer","doi":"10.1515/ling-2022-0146","DOIUrl":null,"url":null,"abstract":"\n Both areal and phylogenetic affiliation have been discussed as driving factors of the distribution of word order in the languages of the world. However, disentangling the interaction of these two factors is challenging. Here we take Indo-European as a test case. Word order in this family is largely homogeneous both within areas and within branches, which makes it difficult to assess which factor was more important in shaping the present-day distribution. To break out of this impasse we turn to corpus data and explicit statistical modeling. Building on a parallel corpus of movie subtitles, we investigate word order on the sentence level under stable pragmatic conditions. We measure the similarity of word order variation between pairs of languages with an information-theoretic distance metric. Using cluster analysis and variation partitioning methods these distance metrics show that phylogenetic distance predicts more variation than geographical distance, but the most important predictor is the shared fraction where phylogeny and area overlap. We conclude that word order has evolved along both dimensions and cannot be reduced to a single one.","PeriodicalId":47548,"journal":{"name":"Linguistics","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Linguistics","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1515/ling-2022-0146","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Both areal and phylogenetic affiliation have been discussed as driving factors of the distribution of word order in the languages of the world. However, disentangling the interaction of these two factors is challenging. Here we take Indo-European as a test case. Word order in this family is largely homogeneous both within areas and within branches, which makes it difficult to assess which factor was more important in shaping the present-day distribution. To break out of this impasse we turn to corpus data and explicit statistical modeling. Building on a parallel corpus of movie subtitles, we investigate word order on the sentence level under stable pragmatic conditions. We measure the similarity of word order variation between pairs of languages with an information-theoretic distance metric. Using cluster analysis and variation partitioning methods these distance metrics show that phylogenetic distance predicts more variation than geographical distance, but the most important predictor is the shared fraction where phylogeny and area overlap. We conclude that word order has evolved along both dimensions and cannot be reduced to a single one.
期刊介绍:
Linguistics publishes articles in the traditional subdisciplines of linguistics as well as in neighboring disciplines insofar as these are deemed to be of interest to linguists and other students of natural language. This includes grammar, both functional and formal, with a focus on morphology, syntax, and semantics, pragmatics and discourse, phonetics and phonology, psycholinguistics, and sociolinguistics. The focus may be on one or several languages, but studies with a wide crosslinguistic (typological) coverage are also welcome. The perspective may be synchronic or diachronic. Linguistics also publishes up to two special issues a year in these areas, for which it welcomes proposals.