基于人工神经网络的多文档内聚摘要提取

Marwan B. Mohammed, W. Al-Hameed
{"title":"基于人工神经网络的多文档内聚摘要提取","authors":"Marwan B. Mohammed, W. Al-Hameed","doi":"10.1109/ICCITM53167.2021.9677647","DOIUrl":null,"url":null,"abstract":"Increasing growth in the volume of digital data from documents performed the difficulty of accessing important information. The solution is using automatic summarization systems that aim to extract important information in a short time. Usually, these systems work to extract a single summary from a single document or multi-documents. However, extracting a summary from a multi-document may encounter some obstacles. This work focuses on how to overcome these obstacles and extract an appropriate and cohesion summary by presenting and suggesting four important contributions with a qualitative leap in the process of extracting the important information that seeks to extract a candidate summary that matches the sentences of the golden summary. The first suggestion is a set of features to extract important sentences and easy to understand, the second, build a Backpropagation Multi-layer Perceptron Neural Network (BMPNN) based on these features input to extract the score for each sentence, the third, using the Random oversampling method and its effective role in rebalancing the data during the training process in BMPNN, and finally, solving the problem of reordering sentences in the candidate summary according to the importance of the sentence depending on one of the features. The results of the evaluation Rouge-1, Rouge-2, and Rouge-L measures showed that the candidate's summary is very close to the golden summary in terms of matching sentences, and it achieved very good results.","PeriodicalId":406104,"journal":{"name":"2021 7th International Conference on Contemporary Information Technology and Mathematics (ICCITM)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Cohesive Summary Extraction From Multi-Document Based On Artificial Neural Network\",\"authors\":\"Marwan B. Mohammed, W. Al-Hameed\",\"doi\":\"10.1109/ICCITM53167.2021.9677647\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Increasing growth in the volume of digital data from documents performed the difficulty of accessing important information. The solution is using automatic summarization systems that aim to extract important information in a short time. Usually, these systems work to extract a single summary from a single document or multi-documents. However, extracting a summary from a multi-document may encounter some obstacles. This work focuses on how to overcome these obstacles and extract an appropriate and cohesion summary by presenting and suggesting four important contributions with a qualitative leap in the process of extracting the important information that seeks to extract a candidate summary that matches the sentences of the golden summary. The first suggestion is a set of features to extract important sentences and easy to understand, the second, build a Backpropagation Multi-layer Perceptron Neural Network (BMPNN) based on these features input to extract the score for each sentence, the third, using the Random oversampling method and its effective role in rebalancing the data during the training process in BMPNN, and finally, solving the problem of reordering sentences in the candidate summary according to the importance of the sentence depending on one of the features. The results of the evaluation Rouge-1, Rouge-2, and Rouge-L measures showed that the candidate's summary is very close to the golden summary in terms of matching sentences, and it achieved very good results.\",\"PeriodicalId\":406104,\"journal\":{\"name\":\"2021 7th International Conference on Contemporary Information Technology and Mathematics (ICCITM)\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 7th International Conference on Contemporary Information Technology and Mathematics (ICCITM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCITM53167.2021.9677647\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th International Conference on Contemporary Information Technology and Mathematics (ICCITM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCITM53167.2021.9677647","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

来自文件的数字数据量的不断增长使得获取重要信息变得困难。解决方案是使用旨在在短时间内提取重要信息的自动摘要系统。通常,这些系统的工作是从单个文档或多个文档中提取单个摘要。但是,从多文档中提取摘要可能会遇到一些障碍。本研究的重点是如何克服这些障碍,通过在提取重要信息的过程中提出四个具有质的飞跃的重要贡献,从而提取出与黄金摘要的句子相匹配的候选摘要。第二,基于这些特征输入构建反向传播多层感知器神经网络(BMPNN)来提取每个句子的分数;第三,利用随机过采样方法及其在BMPNN训练过程中对数据进行再平衡的有效作用,最后,解决了根据某一特征对句子的重要性对候选摘要中的句子进行重新排序的问题。Rouge-1、Rouge-2和Rouge-L测量的评价结果表明,候选人的总结在句子匹配方面非常接近黄金总结,取得了很好的效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Cohesive Summary Extraction From Multi-Document Based On Artificial Neural Network
Increasing growth in the volume of digital data from documents performed the difficulty of accessing important information. The solution is using automatic summarization systems that aim to extract important information in a short time. Usually, these systems work to extract a single summary from a single document or multi-documents. However, extracting a summary from a multi-document may encounter some obstacles. This work focuses on how to overcome these obstacles and extract an appropriate and cohesion summary by presenting and suggesting four important contributions with a qualitative leap in the process of extracting the important information that seeks to extract a candidate summary that matches the sentences of the golden summary. The first suggestion is a set of features to extract important sentences and easy to understand, the second, build a Backpropagation Multi-layer Perceptron Neural Network (BMPNN) based on these features input to extract the score for each sentence, the third, using the Random oversampling method and its effective role in rebalancing the data during the training process in BMPNN, and finally, solving the problem of reordering sentences in the candidate summary according to the importance of the sentence depending on one of the features. The results of the evaluation Rouge-1, Rouge-2, and Rouge-L measures showed that the candidate's summary is very close to the golden summary in terms of matching sentences, and it achieved very good results.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Integration of Big Data, IoT and Cloud Computing Numerical Solution of Initial Value Problems of Time-Fractional Order via a Novel Fractional 4-Stage Runge-Kutta Method Multiclass Model for Quality of Service Using Machine Learning and Cloud Computing Study of the Factors Affecting the Incidence of COVID-19 Infection Using an Accelerrated Weibull Regression Model Contagious Patient Tracking Application Spotlight: Privacy and Security Rights
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1