{"title":"基于自然语言处理的抽取摘要算法的比较研究","authors":"W. H. Ong, K. Tay, C. C. Chew, A. Huong","doi":"10.1109/SCOReD50371.2020.9251032","DOIUrl":null,"url":null,"abstract":"Language, an important feature in our daily life. It is a tool in communication used by humans. Currently, there is an increasing number of articles and journals flooded on the Internet It is hard to read and study au the related articles to users’ research areas manually because there is limited time for each people. One of the solutions is to summarize texts in the article. Natural Language Processing (NLP) is one of the features in Machine Learning (ML) and it is used for summarization This study was tried to investigate the performances of the three different extractive algorithms from NLP. The results were evaluated with the ROUGE evaluation package using ROUGE-1, ROUGE-2, ROUGE-L, and ROUGE-SU4 methods. 3 $\\theta$ samples from the BBC dataset were used as the training data in the evaluation process. Results generated from ROUGE toolkit show the performance of the Barrios et al.’s works is the best among the other two.","PeriodicalId":142867,"journal":{"name":"2020 IEEE Student Conference on Research and Development (SCOReD)","volume":"204 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Comparative Study of Extractive Summary Algorithms Using Natural Language Processing\",\"authors\":\"W. H. Ong, K. Tay, C. C. Chew, A. Huong\",\"doi\":\"10.1109/SCOReD50371.2020.9251032\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Language, an important feature in our daily life. It is a tool in communication used by humans. Currently, there is an increasing number of articles and journals flooded on the Internet It is hard to read and study au the related articles to users’ research areas manually because there is limited time for each people. One of the solutions is to summarize texts in the article. Natural Language Processing (NLP) is one of the features in Machine Learning (ML) and it is used for summarization This study was tried to investigate the performances of the three different extractive algorithms from NLP. The results were evaluated with the ROUGE evaluation package using ROUGE-1, ROUGE-2, ROUGE-L, and ROUGE-SU4 methods. 3 $\\\\theta$ samples from the BBC dataset were used as the training data in the evaluation process. Results generated from ROUGE toolkit show the performance of the Barrios et al.’s works is the best among the other two.\",\"PeriodicalId\":142867,\"journal\":{\"name\":\"2020 IEEE Student Conference on Research and Development (SCOReD)\",\"volume\":\"204 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Student Conference on Research and Development (SCOReD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCOReD50371.2020.9251032\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Student Conference on Research and Development (SCOReD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCOReD50371.2020.9251032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
语言,是我们日常生活中的一个重要特征。它是人类使用的一种交流工具。目前,互联网上充斥着越来越多的文章和期刊,由于每个人的时间有限,很难手动阅读和研究用户研究领域的相关文章。解决方法之一是总结文章中的文本。自然语言处理(Natural Language Processing, NLP)是机器学习(Machine Learning, ML)的特征之一,用于总结。本研究试图探讨三种不同的NLP提取算法的性能。采用ROUGE评价包,采用ROUGE-1、ROUGE-2、ROUGE- l和ROUGE- su4方法对结果进行评价。在评估过程中,使用来自BBC数据集的3个$\theta$样本作为训练数据。ROUGE工具包生成的结果表明,Barrios等人的作品在其他两种作品中表现最好。
A Comparative Study of Extractive Summary Algorithms Using Natural Language Processing
Language, an important feature in our daily life. It is a tool in communication used by humans. Currently, there is an increasing number of articles and journals flooded on the Internet It is hard to read and study au the related articles to users’ research areas manually because there is limited time for each people. One of the solutions is to summarize texts in the article. Natural Language Processing (NLP) is one of the features in Machine Learning (ML) and it is used for summarization This study was tried to investigate the performances of the three different extractive algorithms from NLP. The results were evaluated with the ROUGE evaluation package using ROUGE-1, ROUGE-2, ROUGE-L, and ROUGE-SU4 methods. 3 $\theta$ samples from the BBC dataset were used as the training data in the evaluation process. Results generated from ROUGE toolkit show the performance of the Barrios et al.’s works is the best among the other two.