M. Fraiwan, Bayan Al-Younes, O. Al-Jarrah, N. Khasawneh
{"title":"阿拉伯语众包新闻报道的分析与分类:以叙利亚危机为例","authors":"M. Fraiwan, Bayan Al-Younes, O. Al-Jarrah, N. Khasawneh","doi":"10.1109/INNOVATIONS.2016.7880033","DOIUrl":null,"url":null,"abstract":"The prevalence of social media, in the whole world and the Arab region in particular, has fueled the active engagement and participation of large swaths of the Arabic society in current events. Social media has been used to rally public opinion, increase awareness, spread information/misinformation, and organize large events. Data analysis is necessary to drive decision making, advertisement, political campaigning, counter-intelligence, etc. The sheer volume of data and number of users calls for automated methods for analysis and classification of Arabic text. In this paper, the problem of analysis and classification of Arabic news reports was studied. Innovative methods, based on lexical analysis and machine learning, were employed to tame the complexity of the Arabic language. Different classification algorithms were compared and the classification accuracy results are promising. This research presents seminal steps toward specialized analysis of Arabic crowd-sourced data and social media.","PeriodicalId":412653,"journal":{"name":"2016 12th International Conference on Innovations in Information Technology (IIT)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Analysis and classification of Arabic crowd-sourced news reports: A case study of the Syrian crisis\",\"authors\":\"M. Fraiwan, Bayan Al-Younes, O. Al-Jarrah, N. Khasawneh\",\"doi\":\"10.1109/INNOVATIONS.2016.7880033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The prevalence of social media, in the whole world and the Arab region in particular, has fueled the active engagement and participation of large swaths of the Arabic society in current events. Social media has been used to rally public opinion, increase awareness, spread information/misinformation, and organize large events. Data analysis is necessary to drive decision making, advertisement, political campaigning, counter-intelligence, etc. The sheer volume of data and number of users calls for automated methods for analysis and classification of Arabic text. In this paper, the problem of analysis and classification of Arabic news reports was studied. Innovative methods, based on lexical analysis and machine learning, were employed to tame the complexity of the Arabic language. Different classification algorithms were compared and the classification accuracy results are promising. This research presents seminal steps toward specialized analysis of Arabic crowd-sourced data and social media.\",\"PeriodicalId\":412653,\"journal\":{\"name\":\"2016 12th International Conference on Innovations in Information Technology (IIT)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 12th International Conference on Innovations in Information Technology (IIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INNOVATIONS.2016.7880033\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 12th International Conference on Innovations in Information Technology (IIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INNOVATIONS.2016.7880033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis and classification of Arabic crowd-sourced news reports: A case study of the Syrian crisis
The prevalence of social media, in the whole world and the Arab region in particular, has fueled the active engagement and participation of large swaths of the Arabic society in current events. Social media has been used to rally public opinion, increase awareness, spread information/misinformation, and organize large events. Data analysis is necessary to drive decision making, advertisement, political campaigning, counter-intelligence, etc. The sheer volume of data and number of users calls for automated methods for analysis and classification of Arabic text. In this paper, the problem of analysis and classification of Arabic news reports was studied. Innovative methods, based on lexical analysis and machine learning, were employed to tame the complexity of the Arabic language. Different classification algorithms were compared and the classification accuracy results are promising. This research presents seminal steps toward specialized analysis of Arabic crowd-sourced data and social media.