{"title":"通过不同的词选择改进情感分类","authors":"Heeryon Cho, S. Yoon","doi":"10.1109/HSI.2017.8005029","DOIUrl":null,"url":null,"abstract":"While the performance of sentiment classification has steadily risen through the introduction of various feature-based methods and distributed representation-based approaches, less attention was given to the qualitative aspect of classification, for instance, the identification of useful words in individual opinion texts. We present an approach using set operations for identifying useful words for sentiment classification, and employ truncated singular value decomposition (SVD), a classic low-rank matrix decomposition technique for document retrieval, in order to tackle the issue of both synonymy and noise removal. The sentiment classification performance of our approach, which concatenates three kinds of features, outperforms the existing word-based and distributed word representation-based methods and is comparable to the existing state of the art distributed document representation-based approaches.","PeriodicalId":355011,"journal":{"name":"2017 10th International Conference on Human System Interactions (HSI)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improving sentiment classification through distinct word selection\",\"authors\":\"Heeryon Cho, S. Yoon\",\"doi\":\"10.1109/HSI.2017.8005029\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"While the performance of sentiment classification has steadily risen through the introduction of various feature-based methods and distributed representation-based approaches, less attention was given to the qualitative aspect of classification, for instance, the identification of useful words in individual opinion texts. We present an approach using set operations for identifying useful words for sentiment classification, and employ truncated singular value decomposition (SVD), a classic low-rank matrix decomposition technique for document retrieval, in order to tackle the issue of both synonymy and noise removal. The sentiment classification performance of our approach, which concatenates three kinds of features, outperforms the existing word-based and distributed word representation-based methods and is comparable to the existing state of the art distributed document representation-based approaches.\",\"PeriodicalId\":355011,\"journal\":{\"name\":\"2017 10th International Conference on Human System Interactions (HSI)\",\"volume\":\"91 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 10th International Conference on Human System Interactions (HSI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HSI.2017.8005029\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 10th International Conference on Human System Interactions (HSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HSI.2017.8005029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving sentiment classification through distinct word selection
While the performance of sentiment classification has steadily risen through the introduction of various feature-based methods and distributed representation-based approaches, less attention was given to the qualitative aspect of classification, for instance, the identification of useful words in individual opinion texts. We present an approach using set operations for identifying useful words for sentiment classification, and employ truncated singular value decomposition (SVD), a classic low-rank matrix decomposition technique for document retrieval, in order to tackle the issue of both synonymy and noise removal. The sentiment classification performance of our approach, which concatenates three kinds of features, outperforms the existing word-based and distributed word representation-based methods and is comparable to the existing state of the art distributed document representation-based approaches.