在电影评论的向量空间中寻找情感维度:一种无监督的方法

The Journal of Cognitive Science Pub Date : 2017-03-01 DOI:10.17791/JCS.2017.18.1.85

Youngsam Kim, Hyopil Shin

{"title":"在电影评论的向量空间中寻找情感维度:一种无监督的方法","authors":"Youngsam Kim, Hyopil Shin","doi":"10.17791/JCS.2017.18.1.85","DOIUrl":null,"url":null,"abstract":"This study suggests an unsupervised method to find sentiment orienations of the words in Korean movie reviews. The orientations are represented as real values on a sentiment domain, which is derived from high-dimensional vector space for the movie reviews. To search for the dimension, the Pointwise Mutual Information is first used to select a set of words that are close to common modifiers; The phrases comprised of these words often form good/ bad associations (e.g., “good acting”, “terrible acting”). A neural language model (Word2Vec) is then used to calculate the point-wise similarity distances between the chosen words and, dimensionality reduction algorithms (e.g., PCA, MDS) are employed to find the axis of the sentiment orientations. Finally, the performance of our method is measured by unsupervised classification of the two movie reviews based on the orientation values. According to the results, the best accuracy achieves 66% and 76% for the two datasets.","PeriodicalId":135438,"journal":{"name":"The Journal of Cognitive Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2017-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Finding Sentiment Dimension in Vector Space of Movie Reviews: An Unsupervised Approach\",\"authors\":\"Youngsam Kim, Hyopil Shin\",\"doi\":\"10.17791/JCS.2017.18.1.85\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study suggests an unsupervised method to find sentiment orienations of the words in Korean movie reviews. The orientations are represented as real values on a sentiment domain, which is derived from high-dimensional vector space for the movie reviews. To search for the dimension, the Pointwise Mutual Information is first used to select a set of words that are close to common modifiers; The phrases comprised of these words often form good/ bad associations (e.g., “good acting”, “terrible acting”). A neural language model (Word2Vec) is then used to calculate the point-wise similarity distances between the chosen words and, dimensionality reduction algorithms (e.g., PCA, MDS) are employed to find the axis of the sentiment orientations. Finally, the performance of our method is measured by unsupervised classification of the two movie reviews based on the orientation values. According to the results, the best accuracy achieves 66% and 76% for the two datasets.\",\"PeriodicalId\":135438,\"journal\":{\"name\":\"The Journal of Cognitive Science\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Journal of Cognitive Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.17791/JCS.2017.18.1.85\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Cognitive Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17791/JCS.2017.18.1.85","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

本研究提出了一种无监督的方法来寻找韩国电影评论中词语的情感倾向。方向表示为情感域上的实值，该情感域来自电影评论的高维向量空间。为了搜索维度，首先使用点互信息选择一组与常用修饰词相近的词;由这些词组成的短语通常会形成好/坏的联想(例如，“good acting”，“terrible acting”)。然后使用神经语言模型(Word2Vec)来计算所选单词之间逐点的相似距离，并使用降维算法(例如PCA, MDS)来找到情感取向的轴。最后，通过基于取向值对两篇电影评论进行无监督分类来衡量我们方法的性能。结果表明，两个数据集的最佳准确率分别达到66%和76%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Finding Sentiment Dimension in Vector Space of Movie Reviews: An Unsupervised Approach

This study suggests an unsupervised method to find sentiment orienations of the words in Korean movie reviews. The orientations are represented as real values on a sentiment domain, which is derived from high-dimensional vector space for the movie reviews. To search for the dimension, the Pointwise Mutual Information is first used to select a set of words that are close to common modifiers; The phrases comprised of these words often form good/ bad associations (e.g., “good acting”, “terrible acting”). A neural language model (Word2Vec) is then used to calculate the point-wise similarity distances between the chosen words and, dimensionality reduction algorithms (e.g., PCA, MDS) are employed to find the axis of the sentiment orientations. Finally, the performance of our method is measured by unsupervised classification of the two movie reviews based on the orientation values. According to the results, the best accuracy achieves 66% and 76% for the two datasets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

The Journal of Cognitive Science

自引率

0.00%

发文量