TA-Sum: The Extractive Summarization Research Based on Topic Information

Fang Xie Fang Xie, Hao Li Fang Xie, Beiye Zhang Hao Li, Jianan He Beiye Zhang, Xincong Zhong Jianan He
{"title":"TA-Sum: The Extractive Summarization Research Based on Topic Information","authors":"Fang Xie Fang Xie, Hao Li Fang Xie, Beiye Zhang Hao Li, Jianan He Beiye Zhang, Xincong Zhong Jianan He","doi":"10.53106/160792642023122407010","DOIUrl":null,"url":null,"abstract":"Text summarization is divided into extractive summarization and abstractive summarization. The extractive summarization technology aims to extract some main phrases and sentences from the original text to form a short summary for people to read quickly. However, extractive summarization is faced with problems such as poor sentence coherence and incomplete information, which makes it difficult to screen out important sentences from the source text. DNN (Deep Neural Network) is widely used for text summarization task. This paper proposes a TA-Sum model based on the neural topic model. Introducing the topic information can help people understand the relevant main content of source text quickly. We obtain the topic information using the neural topic model and implement the attention mechanism to fuse the topic information with the text representation, which improves the semantic integrity and completeness of the summary. The experimental results on the large-scale English data sets CNN/Daily mail are improved by 0.37%, 0.11%, and 0.17% respectively compared with BertSum, which demonstrates the effectiveness of our method.","PeriodicalId":442331,"journal":{"name":"網際網路技術學刊","volume":"21 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"網際網路技術學刊","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.53106/160792642023122407010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Text summarization is divided into extractive summarization and abstractive summarization. The extractive summarization technology aims to extract some main phrases and sentences from the original text to form a short summary for people to read quickly. However, extractive summarization is faced with problems such as poor sentence coherence and incomplete information, which makes it difficult to screen out important sentences from the source text. DNN (Deep Neural Network) is widely used for text summarization task. This paper proposes a TA-Sum model based on the neural topic model. Introducing the topic information can help people understand the relevant main content of source text quickly. We obtain the topic information using the neural topic model and implement the attention mechanism to fuse the topic information with the text representation, which improves the semantic integrity and completeness of the summary. The experimental results on the large-scale English data sets CNN/Daily mail are improved by 0.37%, 0.11%, and 0.17% respectively compared with BertSum, which demonstrates the effectiveness of our method.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
TA-Sum:基于主题信息的提取式总结研究
文本摘要分为提取式摘要和抽象式摘要。提取式摘要技术旨在从原文中提取一些主要短语和句子,形成简短的摘要供人们快速阅读。然而,抽取式摘要面临着句子连贯性差、信息不完整等问题,难以从源文本中筛选出重要句子。深度神经网络(DNN)被广泛应用于文本摘要任务。本文提出了一种基于神经主题模型的 TA-Sum 模型。引入主题信息可以帮助人们快速理解源文本的相关主要内容。我们利用神经主题模型获取主题信息,并实施关注机制将主题信息与文本表示融合,从而提高了摘要的语义完整性和完备性。在大规模英文数据集 CNN/Daily mail 上的实验结果与 BertSum 相比分别提高了 0.37%、0.11% 和 0.17%,这证明了我们方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Compact Depth Separable Convolutional Image Filter for Clinical Color Perception Test Hybrid Dynamic Analysis for Android Malware Protected by Anti-Analysis Techniques with DOOLDA An Improved SSD Model for Small Size Work-pieces Recognition in Automatic Production Line A Construction of Knowledge Graph for Semiconductor Industry Chain Based on Lattice-LSTM and PCNN Models Designing a Multi-Criteria Decision-Making Framework to Establish a Value Ranking System for the Quality Evaluation of Long-Term Care Services
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1