The Initial Research of Mongolian Literary Corpus-Take the Text of Da.Nachugdorji’s Work for Instance

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI:10.1109/IALP48816.2019.9037660

Yin Hai

引用次数: 0

Abstract

Today, the Mongolian corpus is gradually developed from the basic resource construction stage to an in-depth research covering multi-level processing or authorcorpus-based quantitative analysis, and multi-functional electronic dictionary’s development. However, there are still many shortcomings and deficiencies in the collection, development and processing of literary corpus. In this paper, the author will introduces the corpus of Da.Nachugdorji’s Literature and will discusses its profound significance, and fulfill multi-level processing such as lexical, syntactic and semantic annotation, as well as dissertates the preliminary processing research of Mongolian literary corpus from the perspective of statistics on the POS, word and phrase frequency and computation of lexical richness.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

蒙古族文学语料库初探——以《达》文本为例。比如纳丘多吉的作品

如今，蒙古语语料库正逐步从基础资源建设阶段发展到多层次处理或基于作者语料库的定量分析的深入研究，以及多功能电子词典的开发。然而，在文学语料库的收集、开发和处理方面，还存在许多不足和不足。在本文中，作者将介绍语料库。纳楚克多吉的《文学与文学》论述了其深刻意义，完成了词汇、句法、语义标注等多层次处理，并从词性统计、词频统计、词汇丰富度计算等角度论述了蒙文文学语料库的初步处理研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 International Conference on Asian Language Processing (IALP)

自引率

0.00%

发文量