Kamran Amjad, Maria Ishtiaq, Samar Firdous, M. Mehmood
{"title":"使用基于乌尔都语的情感词典探索Twitter新闻偏见","authors":"Kamran Amjad, Maria Ishtiaq, Samar Firdous, M. Mehmood","doi":"10.1109/ICOSST.2017.8279004","DOIUrl":null,"url":null,"abstract":"Social media has become a tremendous success in recent times. It has enabled people to keep in touch with each other anytime and anywhere around the globe. People share their opinions and experiences publically through such platforms. Twitter is one of the significant social networking platform that is used by many news media to disseminate breaking news instantaneously. Sentiment analysis of social media is successfully used to gain insights regarding collective behavior of the society. In addition, sentiment analysis is used to detect positive or negative content posted on the social media for various purposes such as riots detection. In this paper, we focus on the sentiment analysis of news tweets in Urdu language by major news sources in Pakistan. By gathering tweets data over the period of 10 months, we built a sentiment lexicon in Urdu language. Moreover, we devise an algorithm that classifies Urdu text into positive, negative, or neutral classes based on the cumulative sentiment score of the text. Our sentiment analysis algorithm achieves 77% accuracy. Furthermore, we have done perspective analysis in which we estimated the bias in the news reporting through tweets with respect to the government with 77.45% accuracy.","PeriodicalId":414131,"journal":{"name":"2017 International Conference on Open Source Systems & Technologies (ICOSST)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Exploring Twitter news biases using urdu-based sentiment lexicon\",\"authors\":\"Kamran Amjad, Maria Ishtiaq, Samar Firdous, M. Mehmood\",\"doi\":\"10.1109/ICOSST.2017.8279004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Social media has become a tremendous success in recent times. It has enabled people to keep in touch with each other anytime and anywhere around the globe. People share their opinions and experiences publically through such platforms. Twitter is one of the significant social networking platform that is used by many news media to disseminate breaking news instantaneously. Sentiment analysis of social media is successfully used to gain insights regarding collective behavior of the society. In addition, sentiment analysis is used to detect positive or negative content posted on the social media for various purposes such as riots detection. In this paper, we focus on the sentiment analysis of news tweets in Urdu language by major news sources in Pakistan. By gathering tweets data over the period of 10 months, we built a sentiment lexicon in Urdu language. Moreover, we devise an algorithm that classifies Urdu text into positive, negative, or neutral classes based on the cumulative sentiment score of the text. Our sentiment analysis algorithm achieves 77% accuracy. Furthermore, we have done perspective analysis in which we estimated the bias in the news reporting through tweets with respect to the government with 77.45% accuracy.\",\"PeriodicalId\":414131,\"journal\":{\"name\":\"2017 International Conference on Open Source Systems & Technologies (ICOSST)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Open Source Systems & Technologies (ICOSST)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOSST.2017.8279004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Open Source Systems & Technologies (ICOSST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSST.2017.8279004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploring Twitter news biases using urdu-based sentiment lexicon
Social media has become a tremendous success in recent times. It has enabled people to keep in touch with each other anytime and anywhere around the globe. People share their opinions and experiences publically through such platforms. Twitter is one of the significant social networking platform that is used by many news media to disseminate breaking news instantaneously. Sentiment analysis of social media is successfully used to gain insights regarding collective behavior of the society. In addition, sentiment analysis is used to detect positive or negative content posted on the social media for various purposes such as riots detection. In this paper, we focus on the sentiment analysis of news tweets in Urdu language by major news sources in Pakistan. By gathering tweets data over the period of 10 months, we built a sentiment lexicon in Urdu language. Moreover, we devise an algorithm that classifies Urdu text into positive, negative, or neutral classes based on the cumulative sentiment score of the text. Our sentiment analysis algorithm achieves 77% accuracy. Furthermore, we have done perspective analysis in which we estimated the bias in the news reporting through tweets with respect to the government with 77.45% accuracy.