Hierarchical Deep Learning for Arabic Dialect Identification

WANLP@ACL 2019. Pub Date: 2019-08-01. DOI: 10.18653/v1/W19-4631

Gael de Francony, Victor Guichard, Praveen Joshi, Haithem Afli, Abdessalam Bouchekif

Citations: 8

Abstract

In this paper, we present two approaches to fine-grained Arabic dialect identification. The first is based on recurrent neural networks (BLSTM, BGRU) with hierarchical classification: the classification of a sentence is split into two stages, a coarse classification into 8 classes followed by a fine-grained classification into 26 classes. The second is a voting system based on Naive Bayes and Random Forest. Our system achieves an F1 score of 63.02% on the subtask evaluation dataset.
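The two-stage idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the paper uses BLSTM/BGRU networks, whereas the sketch swaps in a tiny character-bigram Naive Bayes model so that it runs with the standard library alone. The class names, the `fine_to_coarse` mapping, the placeholder labels (`A`, `a1`, ...) and the toy strings are all assumptions made for illustration; they are not taken from the task data.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(text, n=2):
    """Return the character n-grams of a space-padded sentence."""
    padded = f" {text} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


class NaiveBayes:
    """Tiny multinomial Naive Bayes over character bigrams (stand-in model)."""

    def fit(self, texts, labels):
        self.label_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()
        for text, label in zip(texts, labels):
            grams = char_ngrams(text)
            self.feature_counts[label].update(grams)
            self.vocab.update(grams)
        return self

    def predict(self, text):
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.label_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(count / total)  # log prior
            for gram in char_ngrams(text):
                # Add-one smoothing keeps unseen bigrams from zeroing the score.
                score += math.log((counts[gram] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


class HierarchicalClassifier:
    """Stage 1 picks a coarse class; stage 2 picks a fine class within it."""

    def __init__(self, fine_to_coarse):
        self.fine_to_coarse = fine_to_coarse  # hypothetical label mapping

    def fit(self, texts, fine_labels):
        coarse_labels = [self.fine_to_coarse[f] for f in fine_labels]
        self.coarse_model = NaiveBayes().fit(texts, coarse_labels)
        # Train one fine-grained model per coarse class, on that class only.
        grouped = defaultdict(lambda: ([], []))
        for text, fine, coarse in zip(texts, fine_labels, coarse_labels):
            grouped[coarse][0].append(text)
            grouped[coarse][1].append(fine)
        self.fine_models = {
            coarse: NaiveBayes().fit(t, f) for coarse, (t, f) in grouped.items()
        }
        return self

    def predict(self, text):
        coarse = self.coarse_model.predict(text)
        return coarse, self.fine_models[coarse].predict(text)


# Toy placeholder data (not real dialect sentences).
fine_to_coarse = {"a1": "A", "a2": "A", "b1": "B", "b2": "B"}
texts = ["xx xy xx", "xx xz xx", "qq qr qq", "qq qs qq"]
fine_labels = ["a1", "a2", "b1", "b2"]

model = HierarchicalClassifier(fine_to_coarse).fit(texts, fine_labels)
print(model.predict("xx xz"))  # ('A', 'a2')
```

In the setting the abstract describes, the coarse stage would distinguish 8 classes and the fine stage 26. One consequence of this design is that an error in the first stage cannot be corrected by the second, since each fine-grained model only knows the labels of its own coarse class.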
