深度学习在印尼语小说评论中的情感分析

Rifqi Fauzi Rahmadzani, Widyawan, T. B. Adji
{"title":"深度学习在印尼语小说评论中的情感分析","authors":"Rifqi Fauzi Rahmadzani, Widyawan, T. B. Adji","doi":"10.1109/ICST50505.2020.9732785","DOIUrl":null,"url":null,"abstract":"The rapid development of technology, especially in the internet field, is influencing the increasing number of texts available. In recent years, there has been an increase in research on the internet or social media to find out the sentiments in the review text. Sentiment analysis is a part of Natural Language Processing (NLP), which can help to show whether certain opinions tend to contain positive opinions or negative opinions. In this study, three sentiment polarities were studied using an Indonesian novel review dataset. Data was classified using the Long Short-Term Memory (LSTM) approach, one of the deep learning methods. To increase success rate, we used pre-trained word embedding to represent words into vectors. The analysis was performed by comparing the word embedding model using GloVe, Word2Vec i.e. Continuous Bag of Words and Skip-gram, and FastText i.e. Continuous Bag of Words and Skip-gram. The experimental results showed that sentiment analysis using the FastText Continuous Bag of Words model reached the highest accuracy of 80% while the Word2Vec Skip-gram model had the lowest accuracy of 78.3%. So, it can be concluded that the implementation of the FastText CBOW model is accurately used as a word representation to analyze sentiments on Indonesian novel review.","PeriodicalId":125807,"journal":{"name":"2020 6th International Conference on Science and Technology (ICST)","volume":"164 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Learning for Sentiment Analysis in Indonesian Novel Review\",\"authors\":\"Rifqi Fauzi Rahmadzani, Widyawan, T. B. Adji\",\"doi\":\"10.1109/ICST50505.2020.9732785\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The rapid development of technology, especially in the internet field, is influencing the increasing number of texts available. In recent years, there has been an increase in research on the internet or social media to find out the sentiments in the review text. Sentiment analysis is a part of Natural Language Processing (NLP), which can help to show whether certain opinions tend to contain positive opinions or negative opinions. In this study, three sentiment polarities were studied using an Indonesian novel review dataset. Data was classified using the Long Short-Term Memory (LSTM) approach, one of the deep learning methods. To increase success rate, we used pre-trained word embedding to represent words into vectors. The analysis was performed by comparing the word embedding model using GloVe, Word2Vec i.e. Continuous Bag of Words and Skip-gram, and FastText i.e. Continuous Bag of Words and Skip-gram. The experimental results showed that sentiment analysis using the FastText Continuous Bag of Words model reached the highest accuracy of 80% while the Word2Vec Skip-gram model had the lowest accuracy of 78.3%. So, it can be concluded that the implementation of the FastText CBOW model is accurately used as a word representation to analyze sentiments on Indonesian novel review.\",\"PeriodicalId\":125807,\"journal\":{\"name\":\"2020 6th International Conference on Science and Technology (ICST)\",\"volume\":\"164 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 6th International Conference on Science and Technology (ICST)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICST50505.2020.9732785\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 6th International Conference on Science and Technology (ICST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICST50505.2020.9732785","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

技术的快速发展,特别是在互联网领域,正在影响越来越多的文本可用。近年来,在互联网或社交媒体上寻找评论文本中情感的研究有所增加。情感分析是自然语言处理(NLP)的一部分,它可以帮助显示某些观点是否倾向于包含积极的观点或消极的观点。在本研究中,使用印度尼西亚小说评论数据集研究了三种情绪极性。使用长短期记忆(LSTM)方法对数据进行分类,这是深度学习方法之一。为了提高成功率,我们使用预训练的词嵌入将词表示成向量。通过对比GloVe、Word2Vec(连续词袋和Skip-gram)和FastText(连续词袋和Skip-gram)的词嵌入模型进行分析。实验结果表明,使用FastText连续词袋模型进行情感分析的准确率最高,为80%,而使用Word2Vec Skip-gram模型的准确率最低,为78.3%。因此,可以得出结论,FastText CBOW模型的实现可以准确地用作单词表示来分析印尼语小说评论的情感。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Deep Learning for Sentiment Analysis in Indonesian Novel Review
The rapid development of technology, especially in the internet field, is influencing the increasing number of texts available. In recent years, there has been an increase in research on the internet or social media to find out the sentiments in the review text. Sentiment analysis is a part of Natural Language Processing (NLP), which can help to show whether certain opinions tend to contain positive opinions or negative opinions. In this study, three sentiment polarities were studied using an Indonesian novel review dataset. Data was classified using the Long Short-Term Memory (LSTM) approach, one of the deep learning methods. To increase success rate, we used pre-trained word embedding to represent words into vectors. The analysis was performed by comparing the word embedding model using GloVe, Word2Vec i.e. Continuous Bag of Words and Skip-gram, and FastText i.e. Continuous Bag of Words and Skip-gram. The experimental results showed that sentiment analysis using the FastText Continuous Bag of Words model reached the highest accuracy of 80% while the Word2Vec Skip-gram model had the lowest accuracy of 78.3%. So, it can be concluded that the implementation of the FastText CBOW model is accurately used as a word representation to analyze sentiments on Indonesian novel review.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Review on Battery Energy Storage System for Power System with Grid Connected Wind Farm Identification of Reef Characteristics Using Remote Sensing Technology in Ayau Islands, Indonesia Cardinality Single Column Analysis for Data Profiling using an Open Source Platform Techno-Economic Analysis of Implementation IEEE 802.11ah Standard for Smart Meter Application in Bandung Area Performance Analysis of On-Off Keying Modulation on Underwater Visible Light Communication
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1