Sofia Oskolskaya, Ezequiel Koile, Martine Robbeets
{"title":"A Bayesian approach to the classification of Tungusic languages","authors":"Sofia Oskolskaya, Ezequiel Koile, Martine Robbeets","doi":"10.1075/DIA.20010.OSK","DOIUrl":null,"url":null,"abstract":"\n The Tungusic language family is comprised of languages spoken in Siberia, the Russian Far East, Northeast China and\n Xinjiang. There is a general consensus that these languages are genealogically related and descend from a common ancestral language.\n Nevertheless, there is considerable disagreement with regard to the internal structure of the Tungusic family and the time depth of its\n separation into daughter languages. Here we use computational Bayesian phylogenetic methods to generate a phylogeny of Tungusic languages\n and estimate the time-depth of the family. Our analysis is based on the recently introduced Leipzig-Jakarta-Jena list, a dataset of 254\n basic vocabulary items collected for 21 Tungusic doculects. Our results are consistent with two basic classifications previously proposed in\n the literature, notably a Manchu-Tungusic classification, in which the break-up of Jurchenic constitutes the first split in the tree, as\n well as a North-South classification, which includes a Jurchenic-Nanaic and an Orochic-Ewenic branch. In addition, we obtain a time-depth\n for the age of Proto-Tungusic between the 8th century BC and the 12th century AD (95% highest posterior density interval). Previous\n classifications of Tungusic were based on both classical historical comparative linguistic and lexicostatistic approaches, but the\n application of Bayesian phylogenetic methods to the Tungusic languages has not so far been attempted. In contrast to previous approaches,\n our Bayesian analysis adds an understanding of the statistical robustness of the proposed branches and infers absolute divergence dates,\n allowing variation of rates of change across branches and cognate sets. In this way, our research provides a reliable quantitative basis for\n previous estimates based on classical historical linguistic and lexicostatistic approaches.","PeriodicalId":44637,"journal":{"name":"Diachronica","volume":" ","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2021-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diachronica","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1075/DIA.20010.OSK","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 3
Abstract
The Tungusic language family is comprised of languages spoken in Siberia, the Russian Far East, Northeast China and
Xinjiang. There is a general consensus that these languages are genealogically related and descend from a common ancestral language.
Nevertheless, there is considerable disagreement with regard to the internal structure of the Tungusic family and the time depth of its
separation into daughter languages. Here we use computational Bayesian phylogenetic methods to generate a phylogeny of Tungusic languages
and estimate the time-depth of the family. Our analysis is based on the recently introduced Leipzig-Jakarta-Jena list, a dataset of 254
basic vocabulary items collected for 21 Tungusic doculects. Our results are consistent with two basic classifications previously proposed in
the literature, notably a Manchu-Tungusic classification, in which the break-up of Jurchenic constitutes the first split in the tree, as
well as a North-South classification, which includes a Jurchenic-Nanaic and an Orochic-Ewenic branch. In addition, we obtain a time-depth
for the age of Proto-Tungusic between the 8th century BC and the 12th century AD (95% highest posterior density interval). Previous
classifications of Tungusic were based on both classical historical comparative linguistic and lexicostatistic approaches, but the
application of Bayesian phylogenetic methods to the Tungusic languages has not so far been attempted. In contrast to previous approaches,
our Bayesian analysis adds an understanding of the statistical robustness of the proposed branches and infers absolute divergence dates,
allowing variation of rates of change across branches and cognate sets. In this way, our research provides a reliable quantitative basis for
previous estimates based on classical historical linguistic and lexicostatistic approaches.
期刊介绍:
Diachronica provides a forum for the presentation and discussion of information concerning all aspects of language change in any and all languages of the globe. Contributions which combine theoretical interest and philological acumen are especially welcome. Diachronica appears three times per year, publishing articles, review articles, book reviews, and a miscellanea section including notes, reports and discussions.