阿拉伯语众包新闻报道的分析与分类:以叙利亚危机为例

M. Fraiwan, Bayan Al-Younes, O. Al-Jarrah, N. Khasawneh
{"title":"阿拉伯语众包新闻报道的分析与分类:以叙利亚危机为例","authors":"M. Fraiwan, Bayan Al-Younes, O. Al-Jarrah, N. Khasawneh","doi":"10.1109/INNOVATIONS.2016.7880033","DOIUrl":null,"url":null,"abstract":"The prevalence of social media, in the whole world and the Arab region in particular, has fueled the active engagement and participation of large swaths of the Arabic society in current events. Social media has been used to rally public opinion, increase awareness, spread information/misinformation, and organize large events. Data analysis is necessary to drive decision making, advertisement, political campaigning, counter-intelligence, etc. The sheer volume of data and number of users calls for automated methods for analysis and classification of Arabic text. In this paper, the problem of analysis and classification of Arabic news reports was studied. Innovative methods, based on lexical analysis and machine learning, were employed to tame the complexity of the Arabic language. Different classification algorithms were compared and the classification accuracy results are promising. This research presents seminal steps toward specialized analysis of Arabic crowd-sourced data and social media.","PeriodicalId":412653,"journal":{"name":"2016 12th International Conference on Innovations in Information Technology (IIT)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Analysis and classification of Arabic crowd-sourced news reports: A case study of the Syrian crisis\",\"authors\":\"M. Fraiwan, Bayan Al-Younes, O. Al-Jarrah, N. Khasawneh\",\"doi\":\"10.1109/INNOVATIONS.2016.7880033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The prevalence of social media, in the whole world and the Arab region in particular, has fueled the active engagement and participation of large swaths of the Arabic society in current events. Social media has been used to rally public opinion, increase awareness, spread information/misinformation, and organize large events. Data analysis is necessary to drive decision making, advertisement, political campaigning, counter-intelligence, etc. The sheer volume of data and number of users calls for automated methods for analysis and classification of Arabic text. In this paper, the problem of analysis and classification of Arabic news reports was studied. Innovative methods, based on lexical analysis and machine learning, were employed to tame the complexity of the Arabic language. Different classification algorithms were compared and the classification accuracy results are promising. This research presents seminal steps toward specialized analysis of Arabic crowd-sourced data and social media.\",\"PeriodicalId\":412653,\"journal\":{\"name\":\"2016 12th International Conference on Innovations in Information Technology (IIT)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 12th International Conference on Innovations in Information Technology (IIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INNOVATIONS.2016.7880033\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 12th International Conference on Innovations in Information Technology (IIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INNOVATIONS.2016.7880033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

社交媒体在全球特别是阿拉伯地区的普及,推动了阿拉伯社会广大民众积极参与时事。社交媒体已被用于凝聚公众舆论、提高认识、传播信息/错误信息以及组织大型活动。数据分析对于推动决策、广告、政治竞选、反情报等都是必要的。庞大的数据量和用户数量要求对阿拉伯文本进行分析和分类的自动化方法。本文对阿拉伯语新闻报道的分析与分类问题进行了研究。基于词法分析和机器学习的创新方法被用来驯服阿拉伯语的复杂性。对不同的分类算法进行了比较,结果表明分类精度较高。这项研究为阿拉伯众包数据和社交媒体的专业分析提供了开创性的步骤。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Analysis and classification of Arabic crowd-sourced news reports: A case study of the Syrian crisis
The prevalence of social media, in the whole world and the Arab region in particular, has fueled the active engagement and participation of large swaths of the Arabic society in current events. Social media has been used to rally public opinion, increase awareness, spread information/misinformation, and organize large events. Data analysis is necessary to drive decision making, advertisement, political campaigning, counter-intelligence, etc. The sheer volume of data and number of users calls for automated methods for analysis and classification of Arabic text. In this paper, the problem of analysis and classification of Arabic news reports was studied. Innovative methods, based on lexical analysis and machine learning, were employed to tame the complexity of the Arabic language. Different classification algorithms were compared and the classification accuracy results are promising. This research presents seminal steps toward specialized analysis of Arabic crowd-sourced data and social media.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Candidate document retrieval for Arabic-based text reuse detection on the web Two dimensional filters for improving the resolution of up-sampled video files Identifying roles of fishing ports using multi-source data aggregation Lightweight encryption algorithm in wireless body area network for e-health monitoring Review of personalized language learning systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1