基于多模态深度学习的仇恨模因预测模型

2021 International Conference on Computing, Communication and Green Engineering (CCGE) Pub Date : 2021-09-23 DOI:10.1109/CCGE50943.2021.9776440

Md. Rekib Ahmed, Neeraj Bhadani, I. Chakraborty

{"title":"基于多模态深度学习的仇恨模因预测模型","authors":"Md. Rekib Ahmed, Neeraj Bhadani, I. Chakraborty","doi":"10.1109/CCGE50943.2021.9776440","DOIUrl":null,"url":null,"abstract":"With the emergence of deep neural networks along with high-end computers that can process deep architectures, there has been a lot of research when Computer Vision and Natural Language Processing has been fused into a single problem. To enable students and researchers to deep dive into multimodal deep learning Facebook AI Research team published a dataset on hateful meme classification “The Hateful Meme Challenge Dataset” in May 2020 that gave us the motivation to test ourselves and an opportunity to learn more about the dataset. The rise of communication on the internet with memes as a medium, they have been used to convey incorrect information, political agendas and also has led to cyberbullying, trolling etc. This results in the need of creating an automated tool that can detect such hateful content published on the internet and remove it at the root level before it does any harm. This paper intends to adopt Unimodal Text and Image models using Bert, LSTM and VGG16, Resnet50, SE-Resnet50, XSE-Resnet architectures and combining them into Multimodal models for effective prediction of a hateful meme. The paper compares various architectures both unimodal models and multimodal models on the evaluation metrics AUC-ROC score, F1 score and accuracy score.)","PeriodicalId":130452,"journal":{"name":"2021 International Conference on Computing, Communication and Green Engineering (CCGE)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Hateful Meme Prediction Model Using Multimodal Deep Learning\",\"authors\":\"Md. Rekib Ahmed, Neeraj Bhadani, I. Chakraborty\",\"doi\":\"10.1109/CCGE50943.2021.9776440\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the emergence of deep neural networks along with high-end computers that can process deep architectures, there has been a lot of research when Computer Vision and Natural Language Processing has been fused into a single problem. To enable students and researchers to deep dive into multimodal deep learning Facebook AI Research team published a dataset on hateful meme classification “The Hateful Meme Challenge Dataset” in May 2020 that gave us the motivation to test ourselves and an opportunity to learn more about the dataset. The rise of communication on the internet with memes as a medium, they have been used to convey incorrect information, political agendas and also has led to cyberbullying, trolling etc. This results in the need of creating an automated tool that can detect such hateful content published on the internet and remove it at the root level before it does any harm. This paper intends to adopt Unimodal Text and Image models using Bert, LSTM and VGG16, Resnet50, SE-Resnet50, XSE-Resnet architectures and combining them into Multimodal models for effective prediction of a hateful meme. The paper compares various architectures both unimodal models and multimodal models on the evaluation metrics AUC-ROC score, F1 score and accuracy score.)\",\"PeriodicalId\":130452,\"journal\":{\"name\":\"2021 International Conference on Computing, Communication and Green Engineering (CCGE)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Computing, Communication and Green Engineering (CCGE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCGE50943.2021.9776440\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computing, Communication and Green Engineering (CCGE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCGE50943.2021.9776440","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

随着深度神经网络的出现以及可以处理深度架构的高端计算机的出现，将计算机视觉和自然语言处理融合为一个问题的研究已经很多。为了让学生和研究人员深入研究多模态深度学习，Facebook人工智能研究团队于2020年5月发布了一个关于仇恨模因分类的数据集“仇恨模因挑战数据集”，这给了我们测试自己的动力，并有机会了解更多关于数据集的信息。以表情包为媒介的互联网交流的兴起，它们被用来传达不正确的信息、政治议程，也导致了网络欺凌、网络喷子等。这就需要创建一个自动化工具来检测互联网上发布的这种仇恨内容，并在其造成任何伤害之前从根本上将其删除。本文拟采用Bert、LSTM和VGG16、Resnet50、SE-Resnet50、XSE-Resnet架构的单模态文本和图像模型，并将它们组合成多模态模型，以有效预测仇恨模因。在评价指标AUC-ROC评分、F1评分和准确率评分上，比较了单模态模型和多模态模型的不同架构。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Hateful Meme Prediction Model Using Multimodal Deep Learning

With the emergence of deep neural networks along with high-end computers that can process deep architectures, there has been a lot of research when Computer Vision and Natural Language Processing has been fused into a single problem. To enable students and researchers to deep dive into multimodal deep learning Facebook AI Research team published a dataset on hateful meme classification “The Hateful Meme Challenge Dataset” in May 2020 that gave us the motivation to test ourselves and an opportunity to learn more about the dataset. The rise of communication on the internet with memes as a medium, they have been used to convey incorrect information, political agendas and also has led to cyberbullying, trolling etc. This results in the need of creating an automated tool that can detect such hateful content published on the internet and remove it at the root level before it does any harm. This paper intends to adopt Unimodal Text and Image models using Bert, LSTM and VGG16, Resnet50, SE-Resnet50, XSE-Resnet architectures and combining them into Multimodal models for effective prediction of a hateful meme. The paper compares various architectures both unimodal models and multimodal models on the evaluation metrics AUC-ROC score, F1 score and accuracy score.)

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 International Conference on Computing, Communication and Green Engineering (CCGE)

自引率

0.00%

发文量

期刊最新文献

Stock Market Analysis using Time Series Data Analytics Techniques [Agendas] Irrigation to Smart Irrigation and Tube Well Users A Feature Cum Intensity Based SSIM Optimised Hybrid Image Registration Technique Flood Level Control and Management Using Instrumentation and Control