文献互助智能选刊最新文献

高级搜索发布求助登录注册

Chinese Word Segmentation as LMR Tagging

Workshop on Chinese Language Processing Pub Date : 2003-07-11 DOI:10.3115/1119250.1119278

Nianwen Xue, Libin Shen

引用次数: 151

Abstract

In this paper we present Chinese word segmentation algorithms based on the so-called LMR tagging. Our LMR taggers are implemented with the Maximum Entropy Markov Model and we then use Transformation-Based Learning to combine the results of the two LMR taggers that scan the input in opposite directions. Our system achieves F-scores of 95.9% and 91.6% on the Academia Sinica corpus and the Hong Kong City University corpus respectively.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于LMR标注的汉语分词

本文提出了一种基于LMR标注的中文分词算法。我们的LMR标记器是用最大熵马尔可夫模型实现的，然后我们使用基于转换的学习来组合两个LMR标记器的结果，这两个LMR标记器从相反的方向扫描输入。我们的系统在中央研究院语料库和香港城市大学语料库上分别获得95.9%和91.6%的f分。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Workshop on Chinese Language Processing

Workshop on Chinese Language Processing

自引率

0.00%

发文量

0

期刊最新文献

Building a Large Chinese Corpus Annotated with Semantic Dependency A Two-stage Statistical Word Segmentation System for Chinese Unsupervised Training for Overlapping Ambiguity Resolution in Chinese Word Segmentation Chinese Word Segmentation in MSR-NLP Annotating the Propositions in the Penn Chinese Treebank

0

微信

客服QQ

Book学术公众号

扫码关注我们

反馈

Book学术官方微信

Book学术文献互助

Book学术文献互助群
群号：604180095

文献互助智能选刊最新文献互助须知联系我们：info@booksci.cn

Book学术提供免费学术资源搜索服务，方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。

Copyright © 2023 Book学术 All rights reserved.

京公网安备 11010802042870号京ICP备2023020795号-1