Bayesian Identification of Cognates and Correspondences

Special Interest Group on Computational Morphology and Phonology Workshop Pub Date : 2007-06-28 DOI:10.3115/1626516.1626519

T. M. Ellison

引用次数: 12

Abstract

This paper presents a Bayesian approach to comparing languages: identifying cognates and the regular correspondences that compose them. A simple model of language is extended to include these notions in an account of parent languages. An expression is developed for the posterior probability of child language forms given a parent language. Bayes' Theorem offers a schema for evaluating choices of cognates and correspondences to explain semantically matched data. An implementation optimising this value with gradient descent is shown to distinguish cognates from non-cognates in data from Polish and Russian.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

同源词和对应词的贝叶斯识别

本文提出了一种贝叶斯方法来比较语言:识别同源词和构成它们的规则对应关系。一个简单的语言模型被扩展到包括这些概念在母体语言的帐户。给出了一种母语言的子语言形式的后验概率表达式。贝叶斯定理提供了一种模式来评估同源词和对应关系的选择，以解释语义匹配的数据。用梯度下降优化这个值的实现被证明可以区分波兰语和俄语数据中的同源词和非同源词。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Special Interest Group on Computational Morphology and Phonology Workshop

自引率

0.00%

发文量