Fang Xie Fang Xie, Hao Li Fang Xie, Beiye Zhang Hao Li, Jianan He Beiye Zhang, Xincong Zhong Jianan He
{"title":"TA-Sum: The Extractive Summarization Research Based on Topic Information","authors":"Fang Xie Fang Xie, Hao Li Fang Xie, Beiye Zhang Hao Li, Jianan He Beiye Zhang, Xincong Zhong Jianan He","doi":"10.53106/160792642023122407010","DOIUrl":null,"url":null,"abstract":"Text summarization is divided into extractive summarization and abstractive summarization. The extractive summarization technology aims to extract some main phrases and sentences from the original text to form a short summary for people to read quickly. However, extractive summarization is faced with problems such as poor sentence coherence and incomplete information, which makes it difficult to screen out important sentences from the source text. DNN (Deep Neural Network) is widely used for text summarization task. This paper proposes a TA-Sum model based on the neural topic model. Introducing the topic information can help people understand the relevant main content of source text quickly. We obtain the topic information using the neural topic model and implement the attention mechanism to fuse the topic information with the text representation, which improves the semantic integrity and completeness of the summary. The experimental results on the large-scale English data sets CNN/Daily mail are improved by 0.37%, 0.11%, and 0.17% respectively compared with BertSum, which demonstrates the effectiveness of our method.","PeriodicalId":442331,"journal":{"name":"網際網路技術學刊","volume":"21 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"網際網路技術學刊","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.53106/160792642023122407010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Text summarization is divided into extractive summarization and abstractive summarization. The extractive summarization technology aims to extract some main phrases and sentences from the original text to form a short summary for people to read quickly. However, extractive summarization is faced with problems such as poor sentence coherence and incomplete information, which makes it difficult to screen out important sentences from the source text. DNN (Deep Neural Network) is widely used for text summarization task. This paper proposes a TA-Sum model based on the neural topic model. Introducing the topic information can help people understand the relevant main content of source text quickly. We obtain the topic information using the neural topic model and implement the attention mechanism to fuse the topic information with the text representation, which improves the semantic integrity and completeness of the summary. The experimental results on the large-scale English data sets CNN/Daily mail are improved by 0.37%, 0.11%, and 0.17% respectively compared with BertSum, which demonstrates the effectiveness of our method.