Unified End-to-End Sentence Denoising

2020 International Conference on Computational Science and Computational Intelligence (CSCI) Pub Date : 2020-12-01 DOI:10.1109/CSCI51800.2020.00087

Zhantong Liang, A. Youssef

引用次数: 0

Abstract

It takes more than correct grammar to speak good English. In this paper, we describe the sentence denoising task that reduces the vagueness, redundancy, and irrationality of a grammatical sentence. We define a rich, linguistics-inspired noise taxonomy and establish the formal definition of the problem. A unified end-to-end model based on Transformer is proposed and an efficient algorithm for constructing the training data is given, together with a separate fine-tuning step to get the ideal model. Our method outperforms previous results and keeps good accuracy as the noise composition gets more complicated.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

统一的端到端句子去噪

说一口流利的英语需要的不仅仅是正确的语法。在本文中，我们描述了句子去噪任务，以减少语法句子的模糊，冗余和不合理。我们定义了一个丰富的，受语言学启发的噪声分类，并建立了问题的正式定义。提出了一种基于Transformer的统一的端到端模型，给出了一种高效的训练数据构造算法，并给出了一个单独的微调步骤，以获得理想的模型。该方法在噪声组成复杂的情况下仍能保持较好的精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 International Conference on Computational Science and Computational Intelligence (CSCI)

自引率

0.00%

发文量