首页 > 最新文献

EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020最新文献

英文 中文
KLUMSy @ KIPoS: Experiments on Part-of-Speech Tagging of Spoken Italian KLUMSy @ KIPoS:意大利语口语词性标注实验
Pub Date : 1900-01-01 DOI: 10.4000/BOOKS.AACCADEMIA.7780
Thomas Proisl, Gabriella Lapesa
In this paper, we describe experiments on part-of-speech tagging of spoken Italian that we conducted in the context of the EVALITA 2020 KIPoS shared task (Bosco et al., 2020). Our submission to the shared task is based on SoMeWeTa (Proisl, 2018), a tagger which supports domain adaptation and is designed to flexibly incorporate external resources. We document our approach and discuss our results in the shared task along with a statistical analysis of the factors which impact performance the most. Additionally, we report on a set of additional experiments involving the combination of neural language models with unsupervised HMMs, and compare its performance to that of our system.
在本文中,我们描述了我们在EVALITA 2020 KIPoS共享任务(Bosco et al., 2020)的背景下进行的意大利语口语词性标注实验。我们提交的共享任务是基于SoMeWeTa (Proisl, 2018),这是一个支持领域自适应的标记器,旨在灵活地整合外部资源。我们记录我们的方法,并在共享任务中讨论我们的结果,同时对影响性能最大的因素进行统计分析。此外,我们报告了一组涉及神经语言模型与无监督hmm相结合的附加实验,并将其性能与我们的系统进行了比较。
{"title":"KLUMSy @ KIPoS: Experiments on Part-of-Speech Tagging of Spoken Italian","authors":"Thomas Proisl, Gabriella Lapesa","doi":"10.4000/BOOKS.AACCADEMIA.7780","DOIUrl":"https://doi.org/10.4000/BOOKS.AACCADEMIA.7780","url":null,"abstract":"In this paper, we describe experiments on part-of-speech tagging of spoken Italian that we conducted in the context of the EVALITA 2020 KIPoS shared task (Bosco et al., 2020). Our submission to the shared task is based on SoMeWeTa (Proisl, 2018), a tagger which supports domain adaptation and is designed to flexibly incorporate external resources. We document our approach and discuss our results in the shared task along with a statistical analysis of the factors which impact performance the most. Additionally, we report on a set of additional experiments involving the combination of neural language models with unsupervised HMMs, and compare its performance to that of our system.","PeriodicalId":184564,"journal":{"name":"EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131931624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
AMI @ EVALITA2020: Automatic Misogyny Identification AMI @ EVALITA2020:厌女症自动识别
Pub Date : 1900-01-01 DOI: 10.4000/BOOKS.AACCADEMIA.6764
E. Fersini, Debora Nozza, Paolo Rosso
English. Automatic Misogyny Identification (AMI) is a shared task proposed at the Evalita 2020 evaluation campaign. The AMI challenge, based on Italian tweets, is organized into two subtasks: (1) Subtask A about misogyny and aggressiveness identification and (2) Subtask B about the fairness of the model. At the end of the evaluation phase, we received a total of 20 runs for Subtask A and 11 runs for Subtask B, submitted by 8 teams. In this paper, we present an overview of the AMI shared task, the datasets, the evaluation method-ology, the results obtained by the participants and a discussion about the method-ology adopted by the teams. Finally, we draw some conclusions and discuss future work.
英语。厌女症自动识别(AMI)是在Evalita 2020评估活动中提出的一项共享任务。AMI挑战基于意大利语推文,分为两个子任务:(1)关于厌女症和攻击性识别的子任务A和(2)关于模型公平性的子任务B。在评估阶段结束时,我们总共收到了8个团队提交的子任务a的20次运行和子任务B的11次运行。本文概述了AMI共享任务、数据集、评估方法、参与者获得的结果,并讨论了团队采用的方法。最后,提出了一些结论,并对今后的工作进行了讨论。
{"title":"AMI @ EVALITA2020: Automatic Misogyny Identification","authors":"E. Fersini, Debora Nozza, Paolo Rosso","doi":"10.4000/BOOKS.AACCADEMIA.6764","DOIUrl":"https://doi.org/10.4000/BOOKS.AACCADEMIA.6764","url":null,"abstract":"English. Automatic Misogyny Identification (AMI) is a shared task proposed at the Evalita 2020 evaluation campaign. The AMI challenge, based on Italian tweets, is organized into two subtasks: (1) Subtask A about misogyny and aggressiveness identification and (2) Subtask B about the fairness of the model. At the end of the evaluation phase, we received a total of 20 runs for Subtask A and 11 runs for Subtask B, submitted by 8 teams. In this paper, we present an overview of the AMI shared task, the datasets, the evaluation method-ology, the results obtained by the participants and a discussion about the method-ology adopted by the teams. Finally, we draw some conclusions and discuss future work.","PeriodicalId":184564,"journal":{"name":"EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132590575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 61
Jigsaw @ AMI and HaSpeeDe2: Fine-Tuning a Pre-Trained Comment-Domain BERT Model Jigsaw @ AMI和HaSpeeDe2:微调预训练的评论域BERT模型
Pub Date : 1900-01-01 DOI: 10.4000/BOOKS.AACCADEMIA.6789
Alyssa Lees, Jeffrey Scott Sorensen, I. Kivlichan
The Google Jigsaw team produced submissions for two of the EVALITA 2020 (Basile et al., 2020) shared tasks, based in part on the technology that powers the publicly available PerspectiveAPI comment evaluation service. We present a basic description of our submitted results and a review of the types of errors that our system made in these shared tasks.
b谷歌Jigsaw团队提交了两个EVALITA 2020 (Basile等人,2020)共享任务,部分基于支持公开可用的PerspectiveAPI评论评估服务的技术。我们对提交的结果进行了基本描述,并回顾了我们的系统在这些共享任务中所犯的错误类型。
{"title":"Jigsaw @ AMI and HaSpeeDe2: Fine-Tuning a Pre-Trained Comment-Domain BERT Model","authors":"Alyssa Lees, Jeffrey Scott Sorensen, I. Kivlichan","doi":"10.4000/BOOKS.AACCADEMIA.6789","DOIUrl":"https://doi.org/10.4000/BOOKS.AACCADEMIA.6789","url":null,"abstract":"The Google Jigsaw team produced submissions for two of the EVALITA 2020 (Basile et al., 2020) shared tasks, based in part on the technology that powers the publicly available PerspectiveAPI comment evaluation service. We present a basic description of our submitted results and a review of the types of errors that our system made in these shared tasks.","PeriodicalId":184564,"journal":{"name":"EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020","volume":"2676 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127487255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
fabsam @ AMI: A Convolutional Neural Network Approach (short paper) fabsam @ AMI:卷积神经网络方法(短文)
Pub Date : 1900-01-01 DOI: 10.4000/BOOKS.AACCADEMIA.6782
Samuel Fabrizi
The presence of misogynistic contents is one of the most crucial problems of social networks. In this paper we present our system for misogyny identification on Twitter. Our approach is based on a convolutional neural network that exploits pretrained word embeddings. We also experimented a comparison among different architectures to understand the effectiveness of our method. The paper also described our submissions to both subtasks A and B to Automatic Misogyny Identification competition at Evalita 2020.
厌恶女性内容的存在是社交网络最关键的问题之一。在本文中,我们介绍了我们在Twitter上识别厌女症的系统。我们的方法是基于卷积神经网络,利用预训练词嵌入。我们还在不同的体系结构之间进行了实验比较,以了解我们的方法的有效性。本文还描述了我们在Evalita 2020自动厌女症识别比赛中提交的子任务A和B。
{"title":"fabsam @ AMI: A Convolutional Neural Network Approach (short paper)","authors":"Samuel Fabrizi","doi":"10.4000/BOOKS.AACCADEMIA.6782","DOIUrl":"https://doi.org/10.4000/BOOKS.AACCADEMIA.6782","url":null,"abstract":"The presence of misogynistic contents is one of the most crucial problems of social networks. In this paper we present our system for misogyny identification on Twitter. Our approach is based on a convolutional neural network that exploits pretrained word embeddings. We also experimented a comparison among different architectures to understand the effectiveness of our method. The paper also described our submissions to both subtasks A and B to Automatic Misogyny Identification competition at Evalita 2020.","PeriodicalId":184564,"journal":{"name":"EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132594030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
UniBO @ AMI: A Multi-Class Approach to Misogyny and Aggressiveness Identification on Twitter Posts Using AlBERTo UniBO @ AMI:使用AlBERTo在Twitter帖子中识别厌女症和攻击性的多类方法
Pub Date : 1900-01-01 DOI: 10.4000/BOOKS.AACCADEMIA.6769
Arianna Muti, Alberto Barrón-Cedeño
We describe our participation in the EVALITA 2020 (Basile et al., 2020) shared task on Automatic Misogyny Identification. We focus on task A —Misogyny and Aggressive Behaviour Identification— which aims at detecting whether a tweet in Italian is misogynous and, if so, whether it is aggressive. Rather than building two different models, one for misogyny and one for aggressiveness identification, we handle the problem as one single multi-label classification task, considering three classes: nonmisogynous, non-aggressive misogynous, and aggressive misogynous. Our threeclass supervised model, built on top of AlBERTo, obtains an overall F1 score of 0.7438 on the task test set (F1 = 0.8102 for the misogyny and F1 = 0.6774 for the aggressiveness task), which outperforms the top submitted model (F1 = 0.7406).1
我们描述了我们参与EVALITA 2020 (Basile et al., 2020)关于厌女症自动识别的共享任务。我们专注于任务A——厌女症和攻击性行为识别——旨在检测意大利语的推文是否厌女症,如果是,是否具有攻击性。我们没有建立两个不同的模型,一个用于厌女症,一个用于攻击性识别,而是将这个问题作为一个单一的多标签分类任务来处理,考虑了三个类别:非厌女症、非攻击性厌女症和攻击性厌女症。我们基于AlBERTo构建的三类监督模型在任务测试集上获得了0.7438的总F1分数(厌女任务F1 = 0.8102,攻击性任务F1 = 0.6774),优于最高提交的模型(F1 = 0.7406)
{"title":"UniBO @ AMI: A Multi-Class Approach to Misogyny and Aggressiveness Identification on Twitter Posts Using AlBERTo","authors":"Arianna Muti, Alberto Barrón-Cedeño","doi":"10.4000/BOOKS.AACCADEMIA.6769","DOIUrl":"https://doi.org/10.4000/BOOKS.AACCADEMIA.6769","url":null,"abstract":"We describe our participation in the EVALITA 2020 (Basile et al., 2020) shared task on Automatic Misogyny Identification. We focus on task A —Misogyny and Aggressive Behaviour Identification— which aims at detecting whether a tweet in Italian is misogynous and, if so, whether it is aggressive. Rather than building two different models, one for misogyny and one for aggressiveness identification, we handle the problem as one single multi-label classification task, considering three classes: nonmisogynous, non-aggressive misogynous, and aggressive misogynous. Our threeclass supervised model, built on top of AlBERTo, obtains an overall F1 score of 0.7438 on the task test set (F1 = 0.8102 for the misogyny and F1 = 0.6774 for the aggressiveness task), which outperforms the top submitted model (F1 = 0.7406).1","PeriodicalId":184564,"journal":{"name":"EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115180762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
"Il Mago della Ghigliottina" @ GhigliottinAI: When Linguistics meets Artificial Intelligence “断头台魔术师”@“当语言遇到人工智能
Pub Date : 1900-01-01 DOI: 10.4000/BOOKS.AACCADEMIA.7498
Federico Sangati, Antonio Pascucci, J. Monti
English. This paper describes Il mago della Ghigliottina , a bot which took part in the Ghigliottin-AI task of the Evalita 2020 evaluation campaign. The aim is to build a system able to solve the TV game “La Ghigliottina”. Our system has already participated in the Evalita 2018 task NLP4FUN . Compared to that occasion, it improved its accuracy from 61% to 68.6%. Italiano. Questo contributo descrive Il mago della Ghigliottina, un bot che ha partecipato a Ghigliottin-AI, uno dei task di Evalita 2020. Scopo del task è mettere in piedi un sistema automatico capace di risolvere il gioco televisivo “La Ghigliot-tina”. Il nostro sistema ha già parteci-pato all’edizione del 2018 di Evalita al task NLP4FUN. Rispetto all’edizione del 2018 di NLP4FUN, l’accuratezza è salita dal 61% al 68.6%.
English。这篇论文描述了断头台魔术师,一个机器人,他在断头台中扮演了一个角色目标是建立一个可以解决电视游戏“断头台”的系统。我们的系统已经参与了eveta 2018 p4fun任务组。与此相比,它的准确性从61%提高到68.6%。意大利。这篇文章描述了断头台魔术师,一个参与了断头台- ai的机器人,evita 2020任务之一。该小组的目标是建立一个能够解决游戏游戏“断头台-tina”的自动系统。我们的系统已经参与了2018年版的evita在专责小组p4fun。与NLP4FUN 2018版相比,精度从61%上升到68.6%。
{"title":"\"Il Mago della Ghigliottina\" @ GhigliottinAI: When Linguistics meets Artificial Intelligence","authors":"Federico Sangati, Antonio Pascucci, J. Monti","doi":"10.4000/BOOKS.AACCADEMIA.7498","DOIUrl":"https://doi.org/10.4000/BOOKS.AACCADEMIA.7498","url":null,"abstract":"English. This paper describes Il mago della Ghigliottina , a bot which took part in the Ghigliottin-AI task of the Evalita 2020 evaluation campaign. The aim is to build a system able to solve the TV game “La Ghigliottina”. Our system has already participated in the Evalita 2018 task NLP4FUN . Compared to that occasion, it improved its accuracy from 61% to 68.6%. Italiano. Questo contributo descrive Il mago della Ghigliottina, un bot che ha partecipato a Ghigliottin-AI, uno dei task di Evalita 2020. Scopo del task è mettere in piedi un sistema automatico capace di risolvere il gioco televisivo “La Ghigliot-tina”. Il nostro sistema ha già parteci-pato all’edizione del 2018 di Evalita al task NLP4FUN. Rispetto all’edizione del 2018 di NLP4FUN, l’accuratezza è salita dal 61% al 68.6%.","PeriodicalId":184564,"journal":{"name":"EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122440107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
No Place For Hate Speech @ HaSpeeDe 2: Ensemble to Identify Hate Speech in Italian (short paper) 没有仇恨言论的地方@ HaSpeeDe 2:集体识别意大利语中的仇恨言论(短文)
Pub Date : 1900-01-01 DOI: 10.4000/BOOKS.AACCADEMIA.7027
Adriano dos S. R. da Silva, N. T. Roman
English. In this article, we present the results of applying a Stacking Ensemble method to the problem of hate speech classification proposed in the main task of HaSpeeDe 2 at EVALITA 2020. The model was then compared to a Logistic Regression classifier, along with two other benchmarks defined by the competition’s organising committee (an SVM with a linear kernel and a majority class classifier). Results showed our Ensemble to outperform the benchmarks to various degrees, both when testing in the same domain as training and in a different domain. Italiano. In questo articolo, ci presentiamo i risultati dell’applicazione di un modello di Stacking Ensemble al problema della classificazione dei discorsi di incitamento all’odio nel compito A di EVALITA (HaSpeeDe 2). Il modello è stato quindi confrontato con un modello di regressione logistica, insieme ad altri due benchmark definiti dal comitato organizzatore della competizione (un SVM con un kernel lineare e un classificatore di classe maggioritaria). I risultati hanno mostrato che il nostro Ensemble supera i benchmark a vari livelli, sia durante i test nello stesso dominio di sviluppo che in un dominio di-
English。在这篇文章中,我们展示了在2020年HaSpeeDe 2的主要任务中提出的对仇恨言论分类问题的组合方法的结果。然后将该模型与竞争组织委员会定义的另外两个基准进行比较。结果展示了我们的合奏,打破了对不同学位的基准,同时在同一个领域进行训练和不同领域的测试。意大利。在这篇文章中,我们执行的结果那乐团模式分类问题的任务的仇恨言论EVALITA (HaSpeeDe 2)。因此,该模型是与一个logistic回归分析的模式,再加上另外两个竞争的组委会确定的基准(与一个内核和线性分类器,多数类)。结果显示,在同一开发领域和同一开发领域的测试中,我们的合奏在不同层次上都超过了基准
{"title":"No Place For Hate Speech @ HaSpeeDe 2: Ensemble to Identify Hate Speech in Italian (short paper)","authors":"Adriano dos S. R. da Silva, N. T. Roman","doi":"10.4000/BOOKS.AACCADEMIA.7027","DOIUrl":"https://doi.org/10.4000/BOOKS.AACCADEMIA.7027","url":null,"abstract":"English. In this article, we present the results of applying a Stacking Ensemble method to the problem of hate speech classification proposed in the main task of HaSpeeDe 2 at EVALITA 2020. The model was then compared to a Logistic Regression classifier, along with two other benchmarks defined by the competition’s organising committee (an SVM with a linear kernel and a majority class classifier). Results showed our Ensemble to outperform the benchmarks to various degrees, both when testing in the same domain as training and in a different domain. Italiano. In questo articolo, ci presentiamo i risultati dell’applicazione di un modello di Stacking Ensemble al problema della classificazione dei discorsi di incitamento all’odio nel compito A di EVALITA (HaSpeeDe 2). Il modello è stato quindi confrontato con un modello di regressione logistica, insieme ad altri due benchmark definiti dal comitato organizzatore della competizione (un SVM con un kernel lineare e un classificatore di classe maggioritaria). I risultati hanno mostrato che il nostro Ensemble supera i benchmark a vari livelli, sia durante i test nello stesso dominio di sviluppo che in un dominio di-","PeriodicalId":184564,"journal":{"name":"EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020","volume":"316 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120871381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
TheNorth @ HaSpeeDe 2: BERT-based Language Model Fine-tuning for Italian Hate Speech Detection (short paper) TheNorth @ HaSpeeDe 2:基于bert的语言模型微调意大利语仇恨语音检测(短论文)
Pub Date : 1900-01-01 DOI: 10.4000/BOOKS.AACCADEMIA.6989
Eric Lavergne, Rajkumar Saini, György Kovács, Killian Murphy
English. This report was written to describe the systems that were submitted by the team “TheNorth” for the HaSpeeDe 2 shared task organised within EVALITA 2020. To address the main task which is hate speech detection, we fine-tuned BERT-based models. We evaluated both multilingual and Italian language models trained with the data provided and additional data. We also studied the contributions of multitask learning considering both hate speech detection and stereotype detection tasks.
英语。本报告描述了由“the north”团队为EVALITA 2020组织的HaSpeeDe 2共享任务提交的系统。为了解决仇恨语音检测的主要任务,我们对基于bert的模型进行了微调。我们评估了用所提供的数据和附加数据训练的多语言和意大利语模型。我们还研究了多任务学习对仇恨言论检测和刻板印象检测任务的贡献。
{"title":"TheNorth @ HaSpeeDe 2: BERT-based Language Model Fine-tuning for Italian Hate Speech Detection (short paper)","authors":"Eric Lavergne, Rajkumar Saini, György Kovács, Killian Murphy","doi":"10.4000/BOOKS.AACCADEMIA.6989","DOIUrl":"https://doi.org/10.4000/BOOKS.AACCADEMIA.6989","url":null,"abstract":"English. This report was written to describe the systems that were submitted by the team “TheNorth” for the HaSpeeDe 2 shared task organised within EVALITA 2020. To address the main task which is hate speech detection, we fine-tuned BERT-based models. We evaluated both multilingual and Italian language models trained with the data provided and additional data. We also studied the contributions of multitask learning considering both hate speech detection and stereotype detection tasks.","PeriodicalId":184564,"journal":{"name":"EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133029326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
UPB @ DANKMEMES: Italian Memes Analysis - Employing Visual Models and Graph Convolutional Networks for Meme Identification and Hate Speech Detection (short paper) UPB @ DANKMEMES:意大利模因分析-使用视觉模型和图形卷积网络进行模因识别和仇恨言论检测(短论文)
Pub Date : 1900-01-01 DOI: 10.4000/BOOKS.AACCADEMIA.7360
G. Vlad, George-Eduard Zaharia, Dumitru-Clementin Cercel, M. Dascalu
Certain events or political situations determine users from the online environment to express themselves by using different modalities. One of them is represented by Internet memes, which combine text with a representative image to entail a wide range of emotions, from humor to sarcasm and even hate. In this paper, we describe our approach for the DANKMEMES competition from EVALITA 2020 consisting of a multimodal multi-task learning architecture based on two main components. The first one is a Graph Convolutional Network combined with an Italian BERT for text encoding, while the second is varied between different image-based architectures (i.e., ResNet50, ResNet152, and VGG-16) for image representation. Our solution achieves good performance on the first two tasks of the current competition, ranking 3rd for both Task 1 (.8437 macroF1 score) and Task 2 (.8169 macro-F1 score), while exceeding by high margins the official baselines.
某些事件或政治局势决定了网络环境中的用户通过使用不同的方式来表达自己。其中一种以网络表情包为代表,它将文字与具有代表性的图像结合起来,包含了从幽默到讽刺甚至仇恨的各种情绪。在本文中,我们描述了我们在EVALITA 2020的DANKMEMES竞赛中的方法,该方法由基于两个主要组件的多模态多任务学习架构组成。第一个是结合了意大利BERT的图形卷积网络,用于文本编码,而第二个是不同的基于图像的架构(即ResNet50, ResNet152和VGG-16),用于图像表示。我们的解决方案在本次比赛的前两个任务上都取得了很好的成绩,在任务1(1)和任务2(2)上都获得了第3名。8437 macroF1分数)和任务2(。8169宏观f1得分),同时大大超出了官方基线。
{"title":"UPB @ DANKMEMES: Italian Memes Analysis - Employing Visual Models and Graph Convolutional Networks for Meme Identification and Hate Speech Detection (short paper)","authors":"G. Vlad, George-Eduard Zaharia, Dumitru-Clementin Cercel, M. Dascalu","doi":"10.4000/BOOKS.AACCADEMIA.7360","DOIUrl":"https://doi.org/10.4000/BOOKS.AACCADEMIA.7360","url":null,"abstract":"Certain events or political situations determine users from the online environment to express themselves by using different modalities. One of them is represented by Internet memes, which combine text with a representative image to entail a wide range of emotions, from humor to sarcasm and even hate. In this paper, we describe our approach for the DANKMEMES competition from EVALITA 2020 consisting of a multimodal multi-task learning architecture based on two main components. The first one is a Graph Convolutional Network combined with an Italian BERT for text encoding, while the second is varied between different image-based architectures (i.e., ResNet50, ResNet152, and VGG-16) for image representation. Our solution achieves good performance on the first two tasks of the current competition, ranking 3rd for both Task 1 (.8437 macroF1 score) and Task 2 (.8169 macro-F1 score), while exceeding by high margins the official baselines.","PeriodicalId":184564,"journal":{"name":"EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129114313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
TextWiller @ SardiStance, HaSpeede2: Text or Con-text? A Smart Use of Social Network Data in Predicting Polarization (short paper) texttwiller @ SardiStance, HaSpeede2:文本或上下文文本?社会网络数据在极化预测中的智能应用(短文)
Pub Date : 1900-01-01 DOI: 10.4000/BOOKS.AACCADEMIA.7152
Federico Ferraccioli, Andrea Sciandra, Mattia Da Pont, P. Girardi, Dario Solari, L. Finos
In this contribution we describe the system (i.e. a statistical model) used to participate in Evalita conference 2020, SardiStance (Tasks A and B) and Haspeede2 (Tasks A and B). We first developed a classifier by extracting features from the texts and the social network of users. Then, we fit the data through an extreme gradient boosting, with cross-validation tuning of the hyper-parameters. A key factor for a good performance in SardiStance Task B was the features extraction by using Multidimensional Scaling of the distance matrix (minimum path, undirected graph) applied on each network. The second system exploits the same features above, but it trains and performs predictions in twosteps. The performances proved to be lower than those of the single-step model.
在这篇文章中,我们描述了用于参加Evalita会议2020、SardiStance(任务a和B)和Haspeede2(任务a和B)的系统(即统计模型)。我们首先通过从文本和用户的社交网络中提取特征开发了一个分类器。然后,我们通过极端梯度增强拟合数据,并对超参数进行交叉验证调优。SardiStance Task B中性能良好的一个关键因素是通过在每个网络上使用距离矩阵(最小路径,无向图)的多维缩放来提取特征。第二个系统利用了上述相同的特征,但它分两步训练和执行预测。结果表明,该模型的性能低于单步模型。
{"title":"TextWiller @ SardiStance, HaSpeede2: Text or Con-text? A Smart Use of Social Network Data in Predicting Polarization (short paper)","authors":"Federico Ferraccioli, Andrea Sciandra, Mattia Da Pont, P. Girardi, Dario Solari, L. Finos","doi":"10.4000/BOOKS.AACCADEMIA.7152","DOIUrl":"https://doi.org/10.4000/BOOKS.AACCADEMIA.7152","url":null,"abstract":"In this contribution we describe the system (i.e. a statistical model) used to participate in Evalita conference 2020, SardiStance (Tasks A and B) and Haspeede2 (Tasks A and B). We first developed a classifier by extracting features from the texts and the social network of users. Then, we fit the data through an extreme gradient boosting, with cross-validation tuning of the hyper-parameters. A key factor for a good performance in SardiStance Task B was the features extraction by using Multidimensional Scaling of the distance matrix (minimum path, undirected graph) applied on each network. The second system exploits the same features above, but it trains and performs predictions in twosteps. The performances proved to be lower than those of the single-step model.","PeriodicalId":184564,"journal":{"name":"EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123682179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1