{"title":"Sentiment Analysis of Code-Mixed Text: A Comprehensive Review","authors":"Anne Perera, Amitha Caldera","doi":"10.3897/jucs.98708","DOIUrl":null,"url":null,"abstract":"Sentiment Analysis is the task of identifying and extracting the opinion expressed in a text to determine the writer's perception of an entity. Due to globalization, people often mix two or more languages and use phonetic typing and lexical borrowing in web communication. This concept is known as code-mixing. Although extracting the opinion of text written in monolingual languages is simple and straightforward, Sentiment Analysis of code-mixed text is challenging. Classifiers fail within the context of the code-mixed text as text may consist of creative writing, spelling variations, grammatical errors, and different word orders. Hence, SA of code-mixed text is an interesting, challenging, and popular research area. This paper presents the state-of-the-art in Sentiment Analysis of code-mixed text by discussing each concept in detail. The paper also discusses the focused areas, techniques used, limitations, and performances of the studies related to code-mixing.","PeriodicalId":14652,"journal":{"name":"J. Univers. Comput. Sci.","volume":"67 2","pages":"242-261"},"PeriodicalIF":0.0000,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Univers. Comput. Sci.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3897/jucs.98708","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Sentiment Analysis is the task of identifying and extracting the opinion expressed in a text to determine the writer's perception of an entity. Due to globalization, people often mix two or more languages and use phonetic typing and lexical borrowing in web communication. This concept is known as code-mixing. Although extracting the opinion of text written in monolingual languages is simple and straightforward, Sentiment Analysis of code-mixed text is challenging. Classifiers fail within the context of the code-mixed text as text may consist of creative writing, spelling variations, grammatical errors, and different word orders. Hence, SA of code-mixed text is an interesting, challenging, and popular research area. This paper presents the state-of-the-art in Sentiment Analysis of code-mixed text by discussing each concept in detail. The paper also discusses the focused areas, techniques used, limitations, and performances of the studies related to code-mixing.
情感分析是指识别和提取文本中表达的观点,以确定作者对某一实体的看法。由于全球化,人们经常在网络交流中混合使用两种或两种以上的语言,并使用拼音输入和词汇借用。这一概念被称为代码混合。虽然提取用单语撰写的文本的观点既简单又直接,但对代码混合文本进行情感分析却极具挑战性。由于文本可能包含创意写作、拼写错误、语法错误和不同的词序,分类器在代码混合文本的语境中会失效。因此,代码混合文本的 SA 是一个有趣、具有挑战性的热门研究领域。本文通过详细讨论每个概念,介绍了代码混合文本情感分析的最新进展。本文还讨论了与代码混合相关研究的重点领域、使用的技术、局限性和性能。