原始印欧-欧亚语系假说的统计证据

IF 0.8 2区文学 0 LANGUAGE & LINGUISTICS Diachronica Pub Date : 2021-05-10 DOI:10.1075/DIA.19014.BLE

J. Blevins, R. Sproat

{"title":"原始印欧-欧亚语系假说的统计证据","authors":"J. Blevins, R. Sproat","doi":"10.1075/DIA.19014.BLE","DOIUrl":null,"url":null,"abstract":"\nBased on a new reconstruction of Proto-Basque, and regular sound correspondences between this Proto-Basque and Proto-Indo-European as standardly reconstructed, Blevins (2018) argues that Proto-Basque and Proto-Indo-European have a common ancestor that pre-dates the two proto-languages. Part of this argument is based on proposed Proto-Indo-European/Proto-Basque cognate sets that include basic vocabulary items. In this study we offer statistical support for Blevins’ conclusions by using a Monte Carlo simulation that allows us to estimate the probability that the proposed lexical correspondences could have arisen by chance. The method makes use of phonotactic language models to generate possible words in a pair of languages, and then attempts to discover consistent correspondences between the words, producing a list of possible “cognates”. The method differs from some previous approaches by considering matches between all segments in the word pairs. By running such a simulation a large number of times, we can estimate the probability that two languages with the given phonotactics could have produced the number of cognate pairs observed in the actual data. The method is independently assessed by comparing wordlists from 100 pairs of languages, related and unrelated, where relations are known. Our conclusion is that the proposed correspondences are unlikely to have arisen by chance, supporting a distant relationship between Proto-Basque as reconstructed by Blevins (2018) and Proto-Indo-European.","PeriodicalId":44637,"journal":{"name":"Diachronica","volume":" ","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2021-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Statistical evidence for the Proto-Indo-European-Euskarian hypothesis\",\"authors\":\"J. Blevins, R. Sproat\",\"doi\":\"10.1075/DIA.19014.BLE\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\nBased on a new reconstruction of Proto-Basque, and regular sound correspondences between this Proto-Basque and Proto-Indo-European as standardly reconstructed, Blevins (2018) argues that Proto-Basque and Proto-Indo-European have a common ancestor that pre-dates the two proto-languages. Part of this argument is based on proposed Proto-Indo-European/Proto-Basque cognate sets that include basic vocabulary items. In this study we offer statistical support for Blevins’ conclusions by using a Monte Carlo simulation that allows us to estimate the probability that the proposed lexical correspondences could have arisen by chance. The method makes use of phonotactic language models to generate possible words in a pair of languages, and then attempts to discover consistent correspondences between the words, producing a list of possible “cognates”. The method differs from some previous approaches by considering matches between all segments in the word pairs. By running such a simulation a large number of times, we can estimate the probability that two languages with the given phonotactics could have produced the number of cognate pairs observed in the actual data. The method is independently assessed by comparing wordlists from 100 pairs of languages, related and unrelated, where relations are known. Our conclusion is that the proposed correspondences are unlikely to have arisen by chance, supporting a distant relationship between Proto-Basque as reconstructed by Blevins (2018) and Proto-Indo-European.\",\"PeriodicalId\":44637,\"journal\":{\"name\":\"Diachronica\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2021-05-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Diachronica\",\"FirstCategoryId\":\"98\",\"ListUrlMain\":\"https://doi.org/10.1075/DIA.19014.BLE\",\"RegionNum\":2,\"RegionCategory\":\"文学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"LANGUAGE & LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diachronica","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1075/DIA.19014.BLE","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}

引用次数: 2

摘要

Blevins（2018）基于对原巴斯克语的新重建，以及标准重建的原巴斯克和原印欧语之间的规则发音对应，认为原巴斯克语言和原印欧语有一个共同的祖先，早于两种原语言。这一论点的一部分是基于所提出的包括基本词汇项目的原始印欧/原始巴斯克同源集合。在这项研究中，我们通过使用蒙特卡罗模拟为Blevins的结论提供了统计支持，该模拟使我们能够估计所提出的词汇对应可能是偶然出现的概率。该方法利用表音语言模型生成一对语言中可能的单词，然后试图发现单词之间的一致对应关系，生成一个可能的“同源词”列表。该方法与以前的一些方法不同，因为它考虑了单词对中所有分段之间的匹配。通过多次运行这样的模拟，我们可以估计具有给定表音策略的两种语言产生实际数据中观察到的同源对数量的概率。该方法通过比较100对已知关系的语言（相关和不相关）的单词表进行独立评估。我们的结论是，拟议的对应关系不太可能是偶然出现的，这支持了Blevins（2018）重建的原巴斯克和原印欧之间的遥远关系。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Statistical evidence for the Proto-Indo-European-Euskarian hypothesis

Based on a new reconstruction of Proto-Basque, and regular sound correspondences between this Proto-Basque and Proto-Indo-European as standardly reconstructed, Blevins (2018) argues that Proto-Basque and Proto-Indo-European have a common ancestor that pre-dates the two proto-languages. Part of this argument is based on proposed Proto-Indo-European/Proto-Basque cognate sets that include basic vocabulary items. In this study we offer statistical support for Blevins’ conclusions by using a Monte Carlo simulation that allows us to estimate the probability that the proposed lexical correspondences could have arisen by chance. The method makes use of phonotactic language models to generate possible words in a pair of languages, and then attempts to discover consistent correspondences between the words, producing a list of possible “cognates”. The method differs from some previous approaches by considering matches between all segments in the word pairs. By running such a simulation a large number of times, we can estimate the probability that two languages with the given phonotactics could have produced the number of cognate pairs observed in the actual data. The method is independently assessed by comparing wordlists from 100 pairs of languages, related and unrelated, where relations are known. Our conclusion is that the proposed correspondences are unlikely to have arisen by chance, supporting a distant relationship between Proto-Basque as reconstructed by Blevins (2018) and Proto-Indo-European.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Diachronica Multiple-

CiteScore

1.60

自引率

0.00%

发文量

期刊介绍： Diachronica provides a forum for the presentation and discussion of information concerning all aspects of language change in any and all languages of the globe. Contributions which combine theoretical interest and philological acumen are especially welcome. Diachronica appears three times per year, publishing articles, review articles, book reviews, and a miscellanea section including notes, reports and discussions.

期刊最新文献

Gender reduction in contact Diachrony and Diachronica Abrupt grammatical reorganization of an emergent sign language Linguistic mechanisms of colour term evolution Realis morphology and Chatino’s role in the diversification of Zapotec languages