基于多文档查询的文本摘要冗余去除方法

Nazreena Rahman, B. Borah
{"title":"基于多文档查询的文本摘要冗余去除方法","authors":"Nazreena Rahman, B. Borah","doi":"10.1145/3459104.3459197","DOIUrl":null,"url":null,"abstract":"We present RedunWSD, a Word Sense Disambiguation (WSD) based redundancy removal method for multiple text documents query-based text summarization. Recognizing and identifying the redundancy from the text documents is a challenging task that has not been fully resolved yet. In the case of multiple text document summarization processes, redundancy removal helps in getting maximum content coverage by minimizing any redundant or repetitive sentences. Another novel contribution of our work is the disambiguating a word's sense, since it helps us in getting accurate semantic similarity score between two sentences. Along with the semantic similarity score, we also consider the order of words between two sentences. Our model has the additional advantage of detecting the redundant sentences based on its meaning. Our experimental results show that the proposed WSD method outperforms many existing and current WSD methods. We have evaluated the proposed RedunWSD method on Document Understanding Conference (DUC) benchmark datasets and show that it helps in getting better recall values than many top existing query-based text summarization methods.","PeriodicalId":142284,"journal":{"name":"2021 International Symposium on Electrical, Electronics and Information Engineering","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Redundancy Removal Method for Multi-Document Query-Based Text Summarization\",\"authors\":\"Nazreena Rahman, B. Borah\",\"doi\":\"10.1145/3459104.3459197\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present RedunWSD, a Word Sense Disambiguation (WSD) based redundancy removal method for multiple text documents query-based text summarization. Recognizing and identifying the redundancy from the text documents is a challenging task that has not been fully resolved yet. In the case of multiple text document summarization processes, redundancy removal helps in getting maximum content coverage by minimizing any redundant or repetitive sentences. Another novel contribution of our work is the disambiguating a word's sense, since it helps us in getting accurate semantic similarity score between two sentences. Along with the semantic similarity score, we also consider the order of words between two sentences. Our model has the additional advantage of detecting the redundant sentences based on its meaning. Our experimental results show that the proposed WSD method outperforms many existing and current WSD methods. We have evaluated the proposed RedunWSD method on Document Understanding Conference (DUC) benchmark datasets and show that it helps in getting better recall values than many top existing query-based text summarization methods.\",\"PeriodicalId\":142284,\"journal\":{\"name\":\"2021 International Symposium on Electrical, Electronics and Information Engineering\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-02-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Symposium on Electrical, Electronics and Information Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3459104.3459197\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Symposium on Electrical, Electronics and Information Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3459104.3459197","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

提出了一种基于语义消歧(WSD)的冗余消除方法,用于多文本文档基于查询的文本摘要。从文本文档中识别和识别冗余是一项具有挑战性的任务,目前尚未完全解决。在多个文本文档摘要过程的情况下,删除冗余有助于通过最小化任何冗余或重复的句子来获得最大的内容覆盖率。我们工作的另一个新颖贡献是消除单词的歧义,因为它可以帮助我们获得两个句子之间准确的语义相似度评分。除了语义相似度评分外,我们还考虑了两个句子之间的单词顺序。我们的模型还有一个额外的优点,就是可以根据句子的意思来检测多余的句子。实验结果表明,本文提出的WSD方法优于许多现有的WSD方法。我们在文档理解会议(DUC)基准数据集上评估了所提出的reunwsd方法,并表明它比许多现有的基于查询的文本摘要方法更有助于获得更好的召回值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Redundancy Removal Method for Multi-Document Query-Based Text Summarization
We present RedunWSD, a Word Sense Disambiguation (WSD) based redundancy removal method for multiple text documents query-based text summarization. Recognizing and identifying the redundancy from the text documents is a challenging task that has not been fully resolved yet. In the case of multiple text document summarization processes, redundancy removal helps in getting maximum content coverage by minimizing any redundant or repetitive sentences. Another novel contribution of our work is the disambiguating a word's sense, since it helps us in getting accurate semantic similarity score between two sentences. Along with the semantic similarity score, we also consider the order of words between two sentences. Our model has the additional advantage of detecting the redundant sentences based on its meaning. Our experimental results show that the proposed WSD method outperforms many existing and current WSD methods. We have evaluated the proposed RedunWSD method on Document Understanding Conference (DUC) benchmark datasets and show that it helps in getting better recall values than many top existing query-based text summarization methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Exploring the Integration of Blockchain Technology and IoT in a Smart University Application Architecture 3D Moving Rigid Body Localization in the Presence of Anchor Position Errors RANS/LES Simulation of Low-Frequency Flow Oscillations on a NACA0012 Airfoil Near Stall Tuning Language Representation Models for Classification of Turkish News Improving Consumer Experience for Medical Information Using Text Analytics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1