Statistical evidence for the Proto-Indo-European-Euskarian hypothesis
J. Blevins, R. Sproat
Diachronica, published 2021-05-10. DOI: https://doi.org/10.1075/DIA.19014.BLE
Based on a new reconstruction of Proto-Basque, and regular sound correspondences between this Proto-Basque and Proto-Indo-European as standardly reconstructed, Blevins (2018) argues that Proto-Basque and Proto-Indo-European have a common ancestor that pre-dates the two proto-languages. Part of this argument is based on proposed Proto-Indo-European/Proto-Basque cognate sets that include basic vocabulary items. In this study we offer statistical support for Blevins’ conclusions by using a Monte Carlo simulation that allows us to estimate the probability that the proposed lexical correspondences could have arisen by chance. The method makes use of phonotactic language models to generate possible words in a pair of languages, and then attempts to discover consistent correspondences between the words, producing a list of possible “cognates”. The method differs from some previous approaches by considering matches between all segments in the word pairs. By running such a simulation a large number of times, we can estimate the probability that two languages with the given phonotactics could have produced the number of cognate pairs observed in the actual data. The method is independently assessed by comparing wordlists from 100 pairs of languages, related and unrelated, where relations are known. Our conclusion is that the proposed correspondences are unlikely to have arisen by chance, supporting a distant relationship between Proto-Basque as reconstructed by Blevins (2018) and Proto-Indo-European.
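The Monte Carlo procedure described in the abstract can be illustrated with a deliberately simplified sketch. It is not the authors' implementation. It assumes a bigram phonotactic model, one character per segment, position-by-position alignment of equal-length words only, and a hypothetical `min_support` threshold for calling a correspondence "consistent". All function names here are illustrative.

```python
import random
from collections import Counter, defaultdict

START, END = "^", "$"  # word-boundary symbols (assumed not to occur in the data)

def train_bigram(words):
    """Count segment-to-segment transitions, including word boundaries."""
    model = defaultdict(Counter)
    for word in words:
        symbols = [START] + list(word) + [END]
        for prev, nxt in zip(symbols, symbols[1:]):
            model[prev][nxt] += 1
    return model

def sample_word(model, rng, max_len=8):
    """Generate one pseudo-word by walking the bigram model."""
    word, prev = [], START
    while len(word) < max_len:
        choices = model[prev]
        nxt = rng.choices(list(choices), weights=list(choices.values()))[0]
        if nxt == END:
            break
        word.append(nxt)
        prev = nxt
    return "".join(word)

def count_matches(words_a, words_b, min_support=2):
    """Count word pairs in which every aligned segment pair recurs.

    Words at the same index share a meaning slot. Only equal-length pairs
    are aligned (a simplification); a pair counts as a candidate "cognate"
    if each of its segment correspondences is attested in at least
    min_support word pairs.
    """
    aligned = [list(zip(a, b)) for a, b in zip(words_a, words_b)
               if a and len(a) == len(b)]
    support = Counter()
    for pairs in aligned:
        support.update(set(pairs))  # one vote per word pair
    return sum(all(support[p] >= min_support for p in pairs)
               for pairs in aligned)

def chance_probability(model_a, model_b, n_slots, observed,
                       trials=1000, seed=0):
    """Estimate P(random wordlists yield >= observed matches)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        fake_a = [sample_word(model_a, rng) for _ in range(n_slots)]
        fake_b = [sample_word(model_b, rng) for _ in range(n_slots)]
        if count_matches(fake_a, fake_b) >= observed:
            hits += 1
    # Add-one estimate so the reported probability is never exactly zero.
    return (hits + 1) / (trials + 1)

if __name__ == "__main__":
    # Toy wordlists, invented purely for illustration.
    list_a = ["pata", "tika", "kapi", "pika", "taki"]
    list_b = ["bada", "diga", "gabi", "biga", "dagi"]
    observed = count_matches(list_a, list_b)
    p = chance_probability(train_bigram(list_a), train_bigram(list_b),
                           n_slots=len(list_a), observed=observed)
    print(observed, p)
```

A small estimated probability means that wordlists generated from the two phonotactic models rarely produce as many candidate matches as the real data. A realistic version would need proper sequence alignment, phonemic segmentation, and calibration against language pairs whose relationships are known, as the abstract describes.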
About the journal:
Diachronica provides a forum for the presentation and discussion of information concerning all aspects of language change in any and all languages of the globe. Contributions which combine theoretical interest and philological acumen are especially welcome. Diachronica appears three times per year, publishing articles, review articles, book reviews, and a miscellanea section including notes, reports and discussions.