A Deep Attention Network for Chinese Word Segment

International Conference on Machine Learning and Computing Pub Date : 2019-02-22 DOI:10.1145/3318299.3318351

Lanxin Li, Ping Gong, L. Ji

引用次数: 1

Abstract

Character-level sequence label tagging is the most efficient way to solve unknown words problem for Chinese word segment. But the most widely used model, Conditional Random Fields (CRF), needs a large amount of manual design features. So it is appropriate to combine CRF and neural networks such as recurrent neural network (RNN), which is adopted in many natural language processing (NLP) tasks. However, RNN is rather slow because of the timing dependence between computations and not good at capturing local information of the sentence. In order to solve this problem, we introduce a self-attention mechanism, which completes the calculation between the different positions of the sentence with the same distance, into CWS. And we propose a deep neural network, which combines convolution neural networks and self-attention mechanism. Then, we evaluate the model on the PKU dataset and the MSR dataset. The results show that our model perform much better.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

汉语分词的深度注意网络

字符级序列标注是解决汉语分词中未知词问题的最有效方法。但是使用最广泛的条件随机场(CRF)模型需要大量的手工设计特征。因此，将CRF与神经网络(如递归神经网络(RNN))相结合是非常合适的，而递归神经网络在许多自然语言处理(NLP)任务中都得到了应用。然而，由于计算之间的时间依赖，RNN的速度很慢，并且不擅长捕获句子的局部信息。为了解决这一问题，我们引入自注意机制，在CWS中完成相同距离句子的不同位置之间的计算。并提出了一种将卷积神经网络与自注意机制相结合的深度神经网络。然后，我们在PKU数据集和MSR数据集上对模型进行了评估。结果表明，该模型的性能要好得多。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Conference on Machine Learning and Computing

自引率

0.00%

发文量