Kyriakos Skoularikis, I. Savvas, G. Garani, George Kakarontzas
{"title":"为希腊语开发一个特定领域的词典","authors":"Kyriakos Skoularikis, I. Savvas, G. Garani, George Kakarontzas","doi":"10.1145/3575879.3576004","DOIUrl":null,"url":null,"abstract":"We live in a society where a massive quantity of data is generated daily on online social network platforms. This enormous data contains vital opinion-related information that many companies and other scientific and commercial industries are trying to exploit for their benefits. For that purpose, sentiment analysis is required. Sentiment analysis or opinion mining is the branch of data analytics for extracting sentiments from messages expressed by users on a particular subject. Although, in the past years a considerable research has been made for the English language, the works of Sentiment Analysis in Greek language is not so popular, due to smaller user base. In this work, we provide a method to create domain-specific dictionaries given a corpus of tweets in the Greek language. In those Lexicons, we take into consideration the significance of each word for the specific domain, by introducing a new attribute Weightw. Also, we deploy a hybrid framework which utilizes the newly created domain-specific Lexicon with the Naïve Bayes classifier to analyze and predict the sentiment of each tweet. Our framework has the ability to merge the better of the two basic concepts, the Lexicon and Machine Learning method, and demonstrates the significance of the words for domain-specific Lexicon, for achieving optimal results when performing Sentiment Analysis.","PeriodicalId":164036,"journal":{"name":"Proceedings of the 26th Pan-Hellenic Conference on Informatics","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Developing a Domain-Specific Lexicon for the Greek Language\",\"authors\":\"Kyriakos Skoularikis, I. Savvas, G. Garani, George Kakarontzas\",\"doi\":\"10.1145/3575879.3576004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We live in a society where a massive quantity of data is generated daily on online social network platforms. This enormous data contains vital opinion-related information that many companies and other scientific and commercial industries are trying to exploit for their benefits. For that purpose, sentiment analysis is required. Sentiment analysis or opinion mining is the branch of data analytics for extracting sentiments from messages expressed by users on a particular subject. Although, in the past years a considerable research has been made for the English language, the works of Sentiment Analysis in Greek language is not so popular, due to smaller user base. In this work, we provide a method to create domain-specific dictionaries given a corpus of tweets in the Greek language. In those Lexicons, we take into consideration the significance of each word for the specific domain, by introducing a new attribute Weightw. Also, we deploy a hybrid framework which utilizes the newly created domain-specific Lexicon with the Naïve Bayes classifier to analyze and predict the sentiment of each tweet. Our framework has the ability to merge the better of the two basic concepts, the Lexicon and Machine Learning method, and demonstrates the significance of the words for domain-specific Lexicon, for achieving optimal results when performing Sentiment Analysis.\",\"PeriodicalId\":164036,\"journal\":{\"name\":\"Proceedings of the 26th Pan-Hellenic Conference on Informatics\",\"volume\":\"115 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 26th Pan-Hellenic Conference on Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3575879.3576004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 26th Pan-Hellenic Conference on Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3575879.3576004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Developing a Domain-Specific Lexicon for the Greek Language
We live in a society where a massive quantity of data is generated daily on online social network platforms. This enormous data contains vital opinion-related information that many companies and other scientific and commercial industries are trying to exploit for their benefits. For that purpose, sentiment analysis is required. Sentiment analysis or opinion mining is the branch of data analytics for extracting sentiments from messages expressed by users on a particular subject. Although, in the past years a considerable research has been made for the English language, the works of Sentiment Analysis in Greek language is not so popular, due to smaller user base. In this work, we provide a method to create domain-specific dictionaries given a corpus of tweets in the Greek language. In those Lexicons, we take into consideration the significance of each word for the specific domain, by introducing a new attribute Weightw. Also, we deploy a hybrid framework which utilizes the newly created domain-specific Lexicon with the Naïve Bayes classifier to analyze and predict the sentiment of each tweet. Our framework has the ability to merge the better of the two basic concepts, the Lexicon and Machine Learning method, and demonstrates the significance of the words for domain-specific Lexicon, for achieving optimal results when performing Sentiment Analysis.