{"title":"Cohesive Summary Extraction From Multi-Document Based On Artificial Neural Network","authors":"Marwan B. Mohammed, W. Al-Hameed","doi":"10.1109/ICCITM53167.2021.9677647","DOIUrl":null,"url":null,"abstract":"Increasing growth in the volume of digital data from documents performed the difficulty of accessing important information. The solution is using automatic summarization systems that aim to extract important information in a short time. Usually, these systems work to extract a single summary from a single document or multi-documents. However, extracting a summary from a multi-document may encounter some obstacles. This work focuses on how to overcome these obstacles and extract an appropriate and cohesion summary by presenting and suggesting four important contributions with a qualitative leap in the process of extracting the important information that seeks to extract a candidate summary that matches the sentences of the golden summary. The first suggestion is a set of features to extract important sentences and easy to understand, the second, build a Backpropagation Multi-layer Perceptron Neural Network (BMPNN) based on these features input to extract the score for each sentence, the third, using the Random oversampling method and its effective role in rebalancing the data during the training process in BMPNN, and finally, solving the problem of reordering sentences in the candidate summary according to the importance of the sentence depending on one of the features. The results of the evaluation Rouge-1, Rouge-2, and Rouge-L measures showed that the candidate's summary is very close to the golden summary in terms of matching sentences, and it achieved very good results.","PeriodicalId":406104,"journal":{"name":"2021 7th International Conference on Contemporary Information Technology and Mathematics (ICCITM)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th International Conference on Contemporary Information Technology and Mathematics (ICCITM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCITM53167.2021.9677647","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Increasing growth in the volume of digital data from documents performed the difficulty of accessing important information. The solution is using automatic summarization systems that aim to extract important information in a short time. Usually, these systems work to extract a single summary from a single document or multi-documents. However, extracting a summary from a multi-document may encounter some obstacles. This work focuses on how to overcome these obstacles and extract an appropriate and cohesion summary by presenting and suggesting four important contributions with a qualitative leap in the process of extracting the important information that seeks to extract a candidate summary that matches the sentences of the golden summary. The first suggestion is a set of features to extract important sentences and easy to understand, the second, build a Backpropagation Multi-layer Perceptron Neural Network (BMPNN) based on these features input to extract the score for each sentence, the third, using the Random oversampling method and its effective role in rebalancing the data during the training process in BMPNN, and finally, solving the problem of reordering sentences in the candidate summary according to the importance of the sentence depending on one of the features. The results of the evaluation Rouge-1, Rouge-2, and Rouge-L measures showed that the candidate's summary is very close to the golden summary in terms of matching sentences, and it achieved very good results.