{"title":"Arabic Multi-Topic Labelling using Bidirectional Long Short-Term Memory","authors":"Sireen Abuqran","doi":"10.1109/ICICS52457.2021.9464581","DOIUrl":null,"url":null,"abstract":"The number of text documents on the internet is rapidly increasing. As a result, there is a growing demand for methods that can automatically organize and identify electronic documents (instances). The multi-label classification task has been used in a variety of applications and is commonly used in real-world problems. It simultaneously assigns multiple labels to each text. In the Arabic language, there have been few and inadequate research studies on the multi-label text classification issue. In this paper, I proposed a deep learning model using bidirectional long short-term memory (BiLSTM) for multi-class topic classification using Mowjaz Multi-Topic Labelling Task dataset. The BiLSTM model consists of 4 layers only which can be considered as light weight model, these layers are input layer, bidirectional LSTM layer, and two dense layers. The results show that the model successfully to classify topics with F1-Socre of 0.8089 on the testing dataset.","PeriodicalId":421803,"journal":{"name":"2021 12th International Conference on Information and Communication Systems (ICICS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Information and Communication Systems (ICICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICS52457.2021.9464581","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The number of text documents on the internet is rapidly increasing. As a result, there is a growing demand for methods that can automatically organize and identify electronic documents (instances). The multi-label classification task has been used in a variety of applications and is commonly used in real-world problems. It simultaneously assigns multiple labels to each text. In the Arabic language, there have been few and inadequate research studies on the multi-label text classification issue. In this paper, I proposed a deep learning model using bidirectional long short-term memory (BiLSTM) for multi-class topic classification using Mowjaz Multi-Topic Labelling Task dataset. The BiLSTM model consists of 4 layers only which can be considered as light weight model, these layers are input layer, bidirectional LSTM layer, and two dense layers. The results show that the model successfully to classify topics with F1-Socre of 0.8089 on the testing dataset.