用LSTM编解码器进行修辞结构预测

2018 7th Brazilian Conference on Intelligent Systems (BRACIS) Pub Date : 2018-10-01 DOI:10.1109/bracis.2018.00055

Gustavo Bennemann de Moura, Valéria Delisandra Feltrim

{"title":"用LSTM编解码器进行修辞结构预测","authors":"Gustavo Bennemann de Moura, Valéria Delisandra Feltrim","doi":"10.1109/bracis.2018.00055","DOIUrl":null,"url":null,"abstract":"The importance of identifying rhetorical categories in texts has been widely acknowledged in the literature, since information regarding text organization or structure can be applied in a variety of scenarios, including genre-specific writing support and evaluation, both manually and automatically. In this paper we present a Long Short-Term Memory (LSTM) encoder-decoder classifier for scientific abstracts. As a large corpus of annotated abstracts was required to train our classifier, we built a corpus using abstracts extracted from PUBMED/MEDLINE. Using the proposed classifier we achieved approximately 3% improvement in per-abstract accuracy over the baselines and 1% improvement for both per-sentence accuracy and f1-score.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Using LSTM Encoder-Decoder for Rhetorical Structure Prediction\",\"authors\":\"Gustavo Bennemann de Moura, Valéria Delisandra Feltrim\",\"doi\":\"10.1109/bracis.2018.00055\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The importance of identifying rhetorical categories in texts has been widely acknowledged in the literature, since information regarding text organization or structure can be applied in a variety of scenarios, including genre-specific writing support and evaluation, both manually and automatically. In this paper we present a Long Short-Term Memory (LSTM) encoder-decoder classifier for scientific abstracts. As a large corpus of annotated abstracts was required to train our classifier, we built a corpus using abstracts extracted from PUBMED/MEDLINE. Using the proposed classifier we achieved approximately 3% improvement in per-abstract accuracy over the baselines and 1% improvement for both per-sentence accuracy and f1-score.\",\"PeriodicalId\":405190,\"journal\":{\"name\":\"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)\",\"volume\":\"49 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/bracis.2018.00055\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/bracis.2018.00055","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

识别语篇修辞类别的重要性已在文献中得到广泛认可，因为关于语篇组织或结构的信息可以应用于各种场景，包括特定体裁的写作支持和评估，无论是手动还是自动。本文提出了一种基于长短期记忆(LSTM)的科学摘要编码器分类器。由于需要大量带注释的摘要语料库来训练我们的分类器，我们使用从PUBMED/MEDLINE提取的摘要构建了一个语料库。使用提出的分类器，我们在每个抽象的准确率上比基线提高了大约3%，每个句子的准确率和f1-score都提高了1%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Using LSTM Encoder-Decoder for Rhetorical Structure Prediction

The importance of identifying rhetorical categories in texts has been widely acknowledged in the literature, since information regarding text organization or structure can be applied in a variety of scenarios, including genre-specific writing support and evaluation, both manually and automatically. In this paper we present a Long Short-Term Memory (LSTM) encoder-decoder classifier for scientific abstracts. As a large corpus of annotated abstracts was required to train our classifier, we built a corpus using abstracts extracted from PUBMED/MEDLINE. Using the proposed classifier we achieved approximately 3% improvement in per-abstract accuracy over the baselines and 1% improvement for both per-sentence accuracy and f1-score.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 7th Brazilian Conference on Intelligent Systems (BRACIS)

自引率

0.00%

发文量