Hateful Meme Prediction Model Using Multimodal Deep Learning

Md. Rekib Ahmed, Neeraj Bhadani, I. Chakraborty
2021 International Conference on Computing, Communication and Green Engineering (CCGE), published 2021-09-23
DOI: 10.1109/CCGE50943.2021.9776440 (https://doi.org/10.1109/CCGE50943.2021.9776440)
Citations: 1

Abstract

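With the emergence of deep neural networks and of high-end computers capable of processing deep architectures, a large body of research now treats computer vision and natural language processing as a single fused problem. To help students and researchers explore multimodal deep learning, the Facebook AI Research team published a hateful meme classification dataset, "The Hateful Meme Challenge Dataset", in May 2020; it motivated us to test ourselves and gave us an opportunity to learn more about the dataset. As memes have become a common medium of communication on the internet, they have been used to convey incorrect information and political agendas, and have also led to cyberbullying, trolling, etc. This creates a need for an automated tool that can detect such hateful content published on the internet and remove it at the root level before it does any harm. This paper adopts unimodal text models (BERT, LSTM) and unimodal image models (VGG16, ResNet50, SE-ResNet50, XSE-ResNet) and combines them into multimodal models for effective prediction of hateful memes. It compares the various architectures, both unimodal and multimodal, on three evaluation metrics: AUC-ROC score, F1 score and accuracy score.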
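The three evaluation metrics named in the abstract can be illustrated with a minimal, dependency-free sketch. This is not the authors' evaluation code: the labels, the probabilities, the 0.5 decision threshold and the function names are all assumptions made for illustration, with label 1 standing for a hateful meme.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Harmonic mean of precision and recall for the positive class (label 1)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def auc_roc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Probability that a random positive is scored above a random negative.

    Ties count as half a win. This pairwise form is O(n^2), which is fine
    for a small illustration but not for a full dataset.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC-ROC needs at least one example of each class")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical labels (1 = hateful) and model probabilities, for illustration only.
labels = [1, 0, 1, 0, 1, 0]
probs = [0.9, 0.2, 0.4, 0.6, 0.8, 0.1]
preds = [int(p >= 0.5) for p in probs]  # the 0.5 threshold is an assumption

print("accuracy:", accuracy(labels, preds))
print("F1:", f1_score(labels, preds))
print("AUC-ROC:", auc_roc(labels, probs))
```

Accuracy and F1 depend on the chosen threshold, whereas AUC-ROC is computed from the raw scores, which is one reason the three are usually reported together when comparing classifiers.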