A Machine Learning approach for Sentiment Analysis for Italian Reviews in Healthcare

Proceedings of the Seventh Italian Conference on Computational Linguistics CLiC-it 2020 Pub Date : 1900-01-01 DOI:10.4000/books.aaccademia.8225

Luca Bacco, Andrea Cimino, L. Paulon, M. Merone, F. Dell’Orletta

引用次数: 4

Abstract

In this paper, we present our approach to the task of binary sentiment classification for Italian reviews in healthcare domain. We first collected a new dataset for such domain. Then, we compared the results obtained by two different systems, one including a Support Vector Machine and one with BERT. For the first one, we linguistic pre–processed the dataset to extract hand-crafted features exploited by the classifier. For the second one, we oversampled the dataset to achieve better results. Our results show that the SVMbased system, without the worry of having to oversample, has better performance than the BERT-based one, achieving an F1-score of 91.21%.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

医疗保健领域意大利语评论情感分析的机器学习方法

在本文中，我们提出了我们的方法来二元情感分类任务的意大利评论在医疗保健领域。我们首先为该领域收集了一个新的数据集。然后，我们比较了两种不同系统的结果，一种是包含支持向量机的系统，一种是包含BERT的系统。对于第一个，我们对数据集进行语言预处理，以提取分类器利用的手工特征。对于第二个，我们对数据集进行过采样以获得更好的结果。我们的研究结果表明，基于svm的系统在不需要过采样的情况下，比基于bert的系统有更好的性能，达到了91.21%的f1分数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the Seventh Italian Conference on Computational Linguistics CLiC-it 2020

自引率

0.00%

发文量