UPB @ DANKMEMES: Italian Memes Analysis - Employing Visual Models and Graph Convolutional Networks for Meme Identification and Hate Speech Detection (short paper)
G. Vlad, George-Eduard Zaharia, Dumitru-Clementin Cercel, M. Dascalu
DOI: 10.4000/BOOKS.AACCADEMIA.7360
Published in: EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020
Citations: 7
Abstract
Certain events or political situations prompt online users to express themselves through different modalities. One such modality is the Internet meme, which combines text with a representative image to convey a wide range of emotions, from humor to sarcasm and even hate. In this paper, we describe our approach for the DANKMEMES competition from EVALITA 2020, a multimodal multi-task learning architecture based on two main components. The first is a Graph Convolutional Network combined with an Italian BERT for text encoding, while the second is varied between different image-based architectures (i.e., ResNet50, ResNet152, and VGG-16) for image representation. Our solution achieves good performance on the first two tasks of the competition, ranking 3rd on both Task 1 (.8437 macro-F1 score) and Task 2 (.8169 macro-F1 score), while exceeding the official baselines by wide margins.
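To illustrate the multimodal multi-task pattern the abstract describes, the sketch below fuses a text encoding (standing in for the BERT/GCN output) with an image encoding (standing in for ResNet50 features) and feeds the shared representation into two task heads, one per competition task. This is a minimal illustration, not the authors' implementation: the feature dimensions, hidden size, and random weights are all assumptions, and real encoders are replaced with random vectors.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical feature dimensions (the paper does not specify exact sizes):
TEXT_DIM = 768   # e.g., pooled output of an Italian BERT
IMG_DIM = 2048   # e.g., ResNet50 penultimate-layer features
HIDDEN = 256     # assumed shared-layer width

def relu(x):
    return np.maximum(x, 0.0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# A shared fusion layer followed by two task-specific heads is a common
# multi-task design; the weights here are random placeholders.
W_shared = rng.normal(0.0, 0.02, (TEXT_DIM + IMG_DIM, HIDDEN))
W_task1 = rng.normal(0.0, 0.02, (HIDDEN, 1))  # Task 1: meme identification
W_task2 = rng.normal(0.0, 0.02, (HIDDEN, 1))  # Task 2: hate speech detection

def forward(text_vec, img_vec):
    """Concatenate the two modality encodings and score both tasks."""
    fused = relu(np.concatenate([text_vec, img_vec]) @ W_shared)
    return sigmoid(fused @ W_task1).item(), sigmoid(fused @ W_task2).item()

# Random stand-ins for the text and image encoders' outputs.
text_vec = rng.normal(size=TEXT_DIM)
img_vec = rng.normal(size=IMG_DIM)
p_meme, p_hate = forward(text_vec, img_vec)
print(f"P(meme)={p_meme:.3f}  P(hate)={p_hate:.3f}")
```

Sharing the fusion layer while keeping separate output heads lets gradients from both tasks shape the joint representation, which is the usual motivation for training the two DANKMEMES tasks in a single architecture.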