Abstractive Text Summary with Transformer on Youtube Video Subtitle

Juan Lee Atipa, Javin Javin, Fernando Bryan, V. Yesmaya, Rini Wongso
{"title":"Abstractive Text Summary with Transformer on Youtube Video Subtitle","authors":"Juan Lee Atipa, Javin Javin, Fernando Bryan, V. Yesmaya, Rini Wongso","doi":"10.46338/ijetae0223_01","DOIUrl":null,"url":null,"abstract":"Time limitation is one of the most important factors when consuming media. Longer duration makes it harder for users to watch the entirety of the video. Text summarization could be a way for users to acquire information swiftly and concisely. However, the extent to which the summary of the information made has really approached the main core of the information to be conveyed. In this study using YouTube video subtitles as the data that will be used to get a summary of the core information from the video. Consequently, this research focuses on abstractive summarization utilizing several Transformer models namely T5, BART, and PEGASUS, and using the video subtitle dataset to create a summary. The text data from the video subtitle is used as the main source of information in the learning process of the model, ultimately enhancing the model’s ability on this specific summarization task. In evaluating the models’ results, ROUGE is employed, specifically ROUGE-1, ROUGE-2, and ROUGE-L.","PeriodicalId":169403,"journal":{"name":"International Journal of Emerging Technology and Advanced Engineering","volume":"130 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Emerging Technology and Advanced Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46338/ijetae0223_01","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Time limitation is one of the most important factors when consuming media. Longer duration makes it harder for users to watch the entirety of the video. Text summarization could be a way for users to acquire information swiftly and concisely. However, the extent to which the summary of the information made has really approached the main core of the information to be conveyed. In this study using YouTube video subtitles as the data that will be used to get a summary of the core information from the video. Consequently, this research focuses on abstractive summarization utilizing several Transformer models namely T5, BART, and PEGASUS, and using the video subtitle dataset to create a summary. The text data from the video subtitle is used as the main source of information in the learning process of the model, ultimately enhancing the model’s ability on this specific summarization task. In evaluating the models’ results, ROUGE is employed, specifically ROUGE-1, ROUGE-2, and ROUGE-L.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
抽象文本摘要与变压器在Youtube视频字幕
时间限制是消费媒体时最重要的因素之一。持续时间越长,用户就越难以完整地观看视频。文本摘要可以成为用户快速、简洁地获取信息的一种方式。然而,所做的信息总结的程度已经真正接近所要传达的信息的主要核心。在本研究中,使用YouTube视频字幕作为数据,将用于从视频中获得核心信息的总结。因此,本研究的重点是利用几个Transformer模型(T5、BART和PEGASUS)进行抽象摘要,并使用视频字幕数据集创建摘要。在模型的学习过程中,将视频字幕中的文本数据作为主要的信息来源,最终增强模型完成这一特定摘要任务的能力。在评估模型的结果时,使用了ROUGE,特别是ROUGE-1, ROUGE-2和ROUGE- l。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Impact of Climate Change on Fish Species Classification Using Machine Learning and Deep Learning Algorithms Bibliometric Analysis of the Influence of Artificial Intelligence on the Development of Education Wireless IoT Networks Security and Lightweight Encryption Schemes- Survey Challenges of Requirements Engineering in Agile Projects: A Systematic Review From Data to Design: An IoT-Based Novel Solution for Combating Distracted Driving and Speeding Events
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1