{"title":"A polarity calculation approach for lexicon-based Turkish sentiment analysis","authors":"Gökhan Yurtalan, Murat Koyuncu, Ç. Turhan","doi":"10.3906/ELK-1803-92","DOIUrl":null,"url":null,"abstract":"Sentiment analysis attempts to resolve the senses or emotions that a writer or speaker intends to send across to the people about an object or event. It generally uses natural language processing and/or artificial intelligence techniques for processing electronic documents and mining the opinion specified in the content. In recent years, researchers have conducted many successful sentiment analysis studies for the English language which consider many words and word groups that set emotion polarities arising from the English grammar structure, and then use datasets to test their performance. However, there are only a limited number of studies for the Turkish language, and these studies have lower performance results compared to those studies for English. The reasons for this can be incorrect translation of datasets from English into Turkish and ignoring the special grammar structures in the latter. In this study, special Turkish words and linguistic constructs which affect the polarity of a sentence are determined with the aid of a Turkish linguist, and an appropriate lexicon-based polarity determination and calculation approach is introduced for this language. The proposed methodology is tested using different datasets collected from Twitter, and the test results show that the proposed system achieves better accuracy than the previously developed lexical-based sentiment analysis systems for Turkish. The authors conclude that especially analysis of word groups increases the overall performance of the system significantly.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"31 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Turkish Journal of Electrical Engineering and Computer Sciences","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.3906/ELK-1803-92","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 11
Abstract
Sentiment analysis attempts to resolve the senses or emotions that a writer or speaker intends to send across to the people about an object or event. It generally uses natural language processing and/or artificial intelligence techniques for processing electronic documents and mining the opinion specified in the content. In recent years, researchers have conducted many successful sentiment analysis studies for the English language which consider many words and word groups that set emotion polarities arising from the English grammar structure, and then use datasets to test their performance. However, there are only a limited number of studies for the Turkish language, and these studies have lower performance results compared to those studies for English. The reasons for this can be incorrect translation of datasets from English into Turkish and ignoring the special grammar structures in the latter. In this study, special Turkish words and linguistic constructs which affect the polarity of a sentence are determined with the aid of a Turkish linguist, and an appropriate lexicon-based polarity determination and calculation approach is introduced for this language. The proposed methodology is tested using different datasets collected from Twitter, and the test results show that the proposed system achieves better accuracy than the previously developed lexical-based sentiment analysis systems for Turkish. The authors conclude that especially analysis of word groups increases the overall performance of the system significantly.
期刊介绍:
The Turkish Journal of Electrical Engineering & Computer Sciences is published electronically 6 times a year by the Scientific and Technological Research Council of Turkey (TÜBİTAK)
Accepts English-language manuscripts in the areas of power and energy, environmental sustainability and energy efficiency, electronics, industry applications, control systems, information and systems, applied electromagnetics, communications, signal and image processing, tomographic image reconstruction, face recognition, biometrics, speech processing, video processing and analysis, object recognition, classification, feature extraction, parallel and distributed computing, cognitive systems, interaction, robotics, digital libraries and content, personalized healthcare, ICT for mobility, sensors, and artificial intelligence.
Contribution is open to researchers of all nationalities.