Juan Lee Atipa, Javin Javin, Fernando Bryan, V. Yesmaya, Rini Wongso
{"title":"Abstractive Text Summary with Transformer on Youtube Video Subtitle","authors":"Juan Lee Atipa, Javin Javin, Fernando Bryan, V. Yesmaya, Rini Wongso","doi":"10.46338/ijetae0223_01","DOIUrl":null,"url":null,"abstract":"Time limitation is one of the most important factors when consuming media. Longer duration makes it harder for users to watch the entirety of the video. Text summarization could be a way for users to acquire information swiftly and concisely. However, the extent to which the summary of the information made has really approached the main core of the information to be conveyed. In this study using YouTube video subtitles as the data that will be used to get a summary of the core information from the video. Consequently, this research focuses on abstractive summarization utilizing several Transformer models namely T5, BART, and PEGASUS, and using the video subtitle dataset to create a summary. The text data from the video subtitle is used as the main source of information in the learning process of the model, ultimately enhancing the model’s ability on this specific summarization task. In evaluating the models’ results, ROUGE is employed, specifically ROUGE-1, ROUGE-2, and ROUGE-L.","PeriodicalId":169403,"journal":{"name":"International Journal of Emerging Technology and Advanced Engineering","volume":"130 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Emerging Technology and Advanced Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46338/ijetae0223_01","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Time limitation is one of the most important factors when consuming media. Longer duration makes it harder for users to watch the entirety of the video. Text summarization could be a way for users to acquire information swiftly and concisely. However, the extent to which the summary of the information made has really approached the main core of the information to be conveyed. In this study using YouTube video subtitles as the data that will be used to get a summary of the core information from the video. Consequently, this research focuses on abstractive summarization utilizing several Transformer models namely T5, BART, and PEGASUS, and using the video subtitle dataset to create a summary. The text data from the video subtitle is used as the main source of information in the learning process of the model, ultimately enhancing the model’s ability on this specific summarization task. In evaluating the models’ results, ROUGE is employed, specifically ROUGE-1, ROUGE-2, and ROUGE-L.