Farrel Octavianus, Albert Wihardi, Muhamad Keenan Ario, Derwin Suhartono
{"title":"基于BART和SVM的新闻聚合系统的自动文本摘要和主题检测","authors":"Farrel Octavianus, Albert Wihardi, Muhamad Keenan Ario, Derwin Suhartono","doi":"10.1109/ISITDI55734.2022.9944521","DOIUrl":null,"url":null,"abstract":"With a large amount of news consumed by the public, it is impossible to digest all the available news. This paper developed an automated text summarization and topic detection algorithm for news articles, allowing the public to read summarized news without losing the essential points of the news. The algorithm will then be used to build and develop a system that has news aggregation technology. First, the system will scrape news articles from various sources, then topic detection and text summarization will be applied to each article before finally being displayed. The methodology used in this research can be divided into data gathering, topic detection, text summarization, and system development. The result of this research shows that the Support Vector Machine performed exceptionally well in topic detection tasks, better than other supervised learning algorithms used in this research, whereas Bidirectional and Auto-Regressive Transformer (BART) with the appropriate parameters performed relatively well in text summarization. To conclude, topic detection and automated text summarization can both be combined and used to develop a news aggregation system, with Support Vector Machine and BART both performing well in their respective tasks.","PeriodicalId":312644,"journal":{"name":"2022 International Symposium on Information Technology and Digital Innovation (ISITDI)","volume":"202 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated Text Summarization and Topic Detection on News Aggregation System Using BART and SVM\",\"authors\":\"Farrel Octavianus, Albert Wihardi, Muhamad Keenan Ario, Derwin Suhartono\",\"doi\":\"10.1109/ISITDI55734.2022.9944521\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With a large amount of news consumed by the public, it is impossible to digest all the available news. This paper developed an automated text summarization and topic detection algorithm for news articles, allowing the public to read summarized news without losing the essential points of the news. The algorithm will then be used to build and develop a system that has news aggregation technology. First, the system will scrape news articles from various sources, then topic detection and text summarization will be applied to each article before finally being displayed. The methodology used in this research can be divided into data gathering, topic detection, text summarization, and system development. The result of this research shows that the Support Vector Machine performed exceptionally well in topic detection tasks, better than other supervised learning algorithms used in this research, whereas Bidirectional and Auto-Regressive Transformer (BART) with the appropriate parameters performed relatively well in text summarization. To conclude, topic detection and automated text summarization can both be combined and used to develop a news aggregation system, with Support Vector Machine and BART both performing well in their respective tasks.\",\"PeriodicalId\":312644,\"journal\":{\"name\":\"2022 International Symposium on Information Technology and Digital Innovation (ISITDI)\",\"volume\":\"202 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Symposium on Information Technology and Digital Innovation (ISITDI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISITDI55734.2022.9944521\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Symposium on Information Technology and Digital Innovation (ISITDI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISITDI55734.2022.9944521","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automated Text Summarization and Topic Detection on News Aggregation System Using BART and SVM
With a large amount of news consumed by the public, it is impossible to digest all the available news. This paper developed an automated text summarization and topic detection algorithm for news articles, allowing the public to read summarized news without losing the essential points of the news. The algorithm will then be used to build and develop a system that has news aggregation technology. First, the system will scrape news articles from various sources, then topic detection and text summarization will be applied to each article before finally being displayed. The methodology used in this research can be divided into data gathering, topic detection, text summarization, and system development. The result of this research shows that the Support Vector Machine performed exceptionally well in topic detection tasks, better than other supervised learning algorithms used in this research, whereas Bidirectional and Auto-Regressive Transformer (BART) with the appropriate parameters performed relatively well in text summarization. To conclude, topic detection and automated text summarization can both be combined and used to develop a news aggregation system, with Support Vector Machine and BART both performing well in their respective tasks.