在电影评论的向量空间中寻找情感维度:一种无监督的方法

Youngsam Kim, Hyopil Shin
{"title":"在电影评论的向量空间中寻找情感维度:一种无监督的方法","authors":"Youngsam Kim, Hyopil Shin","doi":"10.17791/JCS.2017.18.1.85","DOIUrl":null,"url":null,"abstract":"This study suggests an unsupervised method to find sentiment orienations of the words in Korean movie reviews. The orientations are represented as real values on a sentiment domain, which is derived from high-dimensional vector space for the movie reviews. To search for the dimension, the Pointwise Mutual Information is first used to select a set of words that are close to common modifiers; The phrases comprised of these words often form good/ bad associations (e.g., “good acting”, “terrible acting”). A neural language model (Word2Vec) is then used to calculate the point-wise similarity distances between the chosen words and, dimensionality reduction algorithms (e.g., PCA, MDS) are employed to find the axis of the sentiment orientations. Finally, the performance of our method is measured by unsupervised classification of the two movie reviews based on the orientation values. According to the results, the best accuracy achieves 66% and 76% for the two datasets.","PeriodicalId":135438,"journal":{"name":"The Journal of Cognitive Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2017-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Finding Sentiment Dimension in Vector Space of Movie Reviews: An Unsupervised Approach\",\"authors\":\"Youngsam Kim, Hyopil Shin\",\"doi\":\"10.17791/JCS.2017.18.1.85\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study suggests an unsupervised method to find sentiment orienations of the words in Korean movie reviews. The orientations are represented as real values on a sentiment domain, which is derived from high-dimensional vector space for the movie reviews. To search for the dimension, the Pointwise Mutual Information is first used to select a set of words that are close to common modifiers; The phrases comprised of these words often form good/ bad associations (e.g., “good acting”, “terrible acting”). A neural language model (Word2Vec) is then used to calculate the point-wise similarity distances between the chosen words and, dimensionality reduction algorithms (e.g., PCA, MDS) are employed to find the axis of the sentiment orientations. Finally, the performance of our method is measured by unsupervised classification of the two movie reviews based on the orientation values. According to the results, the best accuracy achieves 66% and 76% for the two datasets.\",\"PeriodicalId\":135438,\"journal\":{\"name\":\"The Journal of Cognitive Science\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Journal of Cognitive Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.17791/JCS.2017.18.1.85\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Cognitive Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17791/JCS.2017.18.1.85","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

本研究提出了一种无监督的方法来寻找韩国电影评论中词语的情感倾向。方向表示为情感域上的实值,该情感域来自电影评论的高维向量空间。为了搜索维度,首先使用点互信息选择一组与常用修饰词相近的词;由这些词组成的短语通常会形成好/坏的联想(例如,“good acting”,“terrible acting”)。然后使用神经语言模型(Word2Vec)来计算所选单词之间逐点的相似距离,并使用降维算法(例如PCA, MDS)来找到情感取向的轴。最后,通过基于取向值对两篇电影评论进行无监督分类来衡量我们方法的性能。结果表明,两个数据集的最佳准确率分别达到66%和76%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Finding Sentiment Dimension in Vector Space of Movie Reviews: An Unsupervised Approach
This study suggests an unsupervised method to find sentiment orienations of the words in Korean movie reviews. The orientations are represented as real values on a sentiment domain, which is derived from high-dimensional vector space for the movie reviews. To search for the dimension, the Pointwise Mutual Information is first used to select a set of words that are close to common modifiers; The phrases comprised of these words often form good/ bad associations (e.g., “good acting”, “terrible acting”). A neural language model (Word2Vec) is then used to calculate the point-wise similarity distances between the chosen words and, dimensionality reduction algorithms (e.g., PCA, MDS) are employed to find the axis of the sentiment orientations. Finally, the performance of our method is measured by unsupervised classification of the two movie reviews based on the orientation values. According to the results, the best accuracy achieves 66% and 76% for the two datasets.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The Wraths of Fire: Contagion Heuristic and Intuitive Personification Processes in Early Modern Finnish-Karelian Conceptions of Illness Peering into the Minds of Gods: What Cross-Cultural Variation in Gods’ Concerns Can Tell Us about the Evolution of Religion Transcranial magnetic stimulation in cognitive neuroscience: Methodological basis and safety Effect of Speaker’s Fatigue on Features of Spoken Discourse Adaptation of the Aphasia Bedside Check for Russian
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1