{"title":"判别多标称的阿拉伯语主题检测Naïve贝叶斯和频率变换","authors":"Ahmed Alsanad","doi":"10.1145/3297067.3297095","DOIUrl":null,"url":null,"abstract":"Arabic topic detection (ATD) has become an attractive research field. It is used in many applications, such as Arabic documents classification, web search, social media, and security. ATD uses machine learning algorithms with ultimate aim to classify Arabic documents based on text contents. Arabic text classification require a complicated process. The Arabic words have unlimited variation in the meaning, which add more complexity and ambiguity to the process Arabic text classification. There are some studies have been proposed for Arabic text classification in recent years. However, these previous studies need improvements to rise accuracy and efficiency. Therefore, this paper proposes an effective approach for Arabic text classification and topic detection using discriminative multi nominal naïve Bayes (DMNB) classifier and frequency transform. The proposed approach includes three main steps: Arabic text preprocessing, Arabic text feature extraction and normalization, and Arabic text classification. A dataset of 1500 Arabic documents collected from Arabic articles corpus in 5 different topics is used to evaluate the proposed approach. The experimental results of 10-folds cross-validation show that the proposed approach performs competitively better than the state-of-the-art approaches.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"184 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Arabic Topic Detection Using Discriminative Multi nominal Naïve Bayes and Frequency Transforms\",\"authors\":\"Ahmed Alsanad\",\"doi\":\"10.1145/3297067.3297095\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Arabic topic detection (ATD) has become an attractive research field. It is used in many applications, such as Arabic documents classification, web search, social media, and security. ATD uses machine learning algorithms with ultimate aim to classify Arabic documents based on text contents. Arabic text classification require a complicated process. The Arabic words have unlimited variation in the meaning, which add more complexity and ambiguity to the process Arabic text classification. There are some studies have been proposed for Arabic text classification in recent years. However, these previous studies need improvements to rise accuracy and efficiency. Therefore, this paper proposes an effective approach for Arabic text classification and topic detection using discriminative multi nominal naïve Bayes (DMNB) classifier and frequency transform. The proposed approach includes three main steps: Arabic text preprocessing, Arabic text feature extraction and normalization, and Arabic text classification. A dataset of 1500 Arabic documents collected from Arabic articles corpus in 5 different topics is used to evaluate the proposed approach. The experimental results of 10-folds cross-validation show that the proposed approach performs competitively better than the state-of-the-art approaches.\",\"PeriodicalId\":340004,\"journal\":{\"name\":\"International Conference on Signal Processing and Machine Learning\",\"volume\":\"184 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Signal Processing and Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3297067.3297095\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Signal Processing and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3297067.3297095","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Arabic Topic Detection Using Discriminative Multi nominal Naïve Bayes and Frequency Transforms
Arabic topic detection (ATD) has become an attractive research field. It is used in many applications, such as Arabic documents classification, web search, social media, and security. ATD uses machine learning algorithms with ultimate aim to classify Arabic documents based on text contents. Arabic text classification require a complicated process. The Arabic words have unlimited variation in the meaning, which add more complexity and ambiguity to the process Arabic text classification. There are some studies have been proposed for Arabic text classification in recent years. However, these previous studies need improvements to rise accuracy and efficiency. Therefore, this paper proposes an effective approach for Arabic text classification and topic detection using discriminative multi nominal naïve Bayes (DMNB) classifier and frequency transform. The proposed approach includes three main steps: Arabic text preprocessing, Arabic text feature extraction and normalization, and Arabic text classification. A dataset of 1500 Arabic documents collected from Arabic articles corpus in 5 different topics is used to evaluate the proposed approach. The experimental results of 10-folds cross-validation show that the proposed approach performs competitively better than the state-of-the-art approaches.