Using LSTM Encoder-Decoder for Rhetorical Structure Prediction

2018 7th Brazilian Conference on Intelligent Systems (BRACIS) Pub Date : 2018-10-01 DOI:10.1109/bracis.2018.00055

Gustavo Bennemann de Moura, Valéria Delisandra Feltrim

引用次数: 2

Abstract

The importance of identifying rhetorical categories in texts has been widely acknowledged in the literature, since information regarding text organization or structure can be applied in a variety of scenarios, including genre-specific writing support and evaluation, both manually and automatically. In this paper we present a Long Short-Term Memory (LSTM) encoder-decoder classifier for scientific abstracts. As a large corpus of annotated abstracts was required to train our classifier, we built a corpus using abstracts extracted from PUBMED/MEDLINE. Using the proposed classifier we achieved approximately 3% improvement in per-abstract accuracy over the baselines and 1% improvement for both per-sentence accuracy and f1-score.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用LSTM编解码器进行修辞结构预测

识别语篇修辞类别的重要性已在文献中得到广泛认可，因为关于语篇组织或结构的信息可以应用于各种场景，包括特定体裁的写作支持和评估，无论是手动还是自动。本文提出了一种基于长短期记忆(LSTM)的科学摘要编码器分类器。由于需要大量带注释的摘要语料库来训练我们的分类器，我们使用从PUBMED/MEDLINE提取的摘要构建了一个语料库。使用提出的分类器，我们在每个抽象的准确率上比基线提高了大约3%，每个句子的准确率和f1-score都提高了1%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2018 7th Brazilian Conference on Intelligent Systems (BRACIS)

自引率

0.00%

发文量