基于自然语言处理的抽取摘要算法的比较研究

2020 IEEE Student Conference on Research and Development (SCOReD) Pub Date : 2020-09-27 DOI:10.1109/SCOReD50371.2020.9251032

W. H. Ong, K. Tay, C. C. Chew, A. Huong

{"title":"基于自然语言处理的抽取摘要算法的比较研究","authors":"W. H. Ong, K. Tay, C. C. Chew, A. Huong","doi":"10.1109/SCOReD50371.2020.9251032","DOIUrl":null,"url":null,"abstract":"Language, an important feature in our daily life. It is a tool in communication used by humans. Currently, there is an increasing number of articles and journals flooded on the Internet It is hard to read and study au the related articles to users’ research areas manually because there is limited time for each people. One of the solutions is to summarize texts in the article. Natural Language Processing (NLP) is one of the features in Machine Learning (ML) and it is used for summarization This study was tried to investigate the performances of the three different extractive algorithms from NLP. The results were evaluated with the ROUGE evaluation package using ROUGE-1, ROUGE-2, ROUGE-L, and ROUGE-SU4 methods. 3 $\\theta$ samples from the BBC dataset were used as the training data in the evaluation process. Results generated from ROUGE toolkit show the performance of the Barrios et al.’s works is the best among the other two.","PeriodicalId":142867,"journal":{"name":"2020 IEEE Student Conference on Research and Development (SCOReD)","volume":"204 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Comparative Study of Extractive Summary Algorithms Using Natural Language Processing\",\"authors\":\"W. H. Ong, K. Tay, C. C. Chew, A. Huong\",\"doi\":\"10.1109/SCOReD50371.2020.9251032\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Language, an important feature in our daily life. It is a tool in communication used by humans. Currently, there is an increasing number of articles and journals flooded on the Internet It is hard to read and study au the related articles to users’ research areas manually because there is limited time for each people. One of the solutions is to summarize texts in the article. Natural Language Processing (NLP) is one of the features in Machine Learning (ML) and it is used for summarization This study was tried to investigate the performances of the three different extractive algorithms from NLP. The results were evaluated with the ROUGE evaluation package using ROUGE-1, ROUGE-2, ROUGE-L, and ROUGE-SU4 methods. 3 $\\\\theta$ samples from the BBC dataset were used as the training data in the evaluation process. Results generated from ROUGE toolkit show the performance of the Barrios et al.’s works is the best among the other two.\",\"PeriodicalId\":142867,\"journal\":{\"name\":\"2020 IEEE Student Conference on Research and Development (SCOReD)\",\"volume\":\"204 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Student Conference on Research and Development (SCOReD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCOReD50371.2020.9251032\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Student Conference on Research and Development (SCOReD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCOReD50371.2020.9251032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

语言，是我们日常生活中的一个重要特征。它是人类使用的一种交流工具。目前，互联网上充斥着越来越多的文章和期刊，由于每个人的时间有限，很难手动阅读和研究用户研究领域的相关文章。解决方法之一是总结文章中的文本。自然语言处理(Natural Language Processing, NLP)是机器学习(Machine Learning, ML)的特征之一，用于总结。本研究试图探讨三种不同的NLP提取算法的性能。采用ROUGE评价包，采用ROUGE-1、ROUGE-2、ROUGE- l和ROUGE- su4方法对结果进行评价。在评估过程中，使用来自BBC数据集的3个$\theta$样本作为训练数据。ROUGE工具包生成的结果表明，Barrios等人的作品在其他两种作品中表现最好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A Comparative Study of Extractive Summary Algorithms Using Natural Language Processing

Language, an important feature in our daily life. It is a tool in communication used by humans. Currently, there is an increasing number of articles and journals flooded on the Internet It is hard to read and study au the related articles to users’ research areas manually because there is limited time for each people. One of the solutions is to summarize texts in the article. Natural Language Processing (NLP) is one of the features in Machine Learning (ML) and it is used for summarization This study was tried to investigate the performances of the three different extractive algorithms from NLP. The results were evaluated with the ROUGE evaluation package using ROUGE-1, ROUGE-2, ROUGE-L, and ROUGE-SU4 methods. 3 $\theta$ samples from the BBC dataset were used as the training data in the evaluation process. Results generated from ROUGE toolkit show the performance of the Barrios et al.’s works is the best among the other two.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE Student Conference on Research and Development (SCOReD)

自引率

0.00%

发文量

期刊最新文献

Assessing the Performance of Smart Inverter Functionalities in PV-Rich LV Distribution Networks Simulation of Temporal Correlation Detection using HfO2-Based ReRAM Arrays Design and Development of a Quadcopter for Landmine Detection A Waste Recycling System for a Better Living World Study for Microstrip Patch Antenna for 5G Networks