Jingxuan Yang, Qinjie Lyu, Sheng Gao, Lin Qiu, Jun Guo
{"title":"Review aspect extraction based on character-enhanced embedding models","authors":"Jingxuan Yang, Qinjie Lyu, Sheng Gao, Lin Qiu, Jun Guo","doi":"10.1109/ICNIDC.2016.7974568","DOIUrl":null,"url":null,"abstract":"User reviews, in the form of short unstructured natural texts, often provide rich information to benefit product adoption or service improvement. Aspect can be extracted as the abstract meaning from the reviews. Traditional methods have employed either rule-based templates or bag-of-words features for aspect extraction from text. However, these models cannot effectively handle short texts, especially in Chinese reviews. In this paper, we address the issue by learning the character embeddings as the basic semantic unit and incorporating the compositional sentence-level representation into a neural network approach for review aspect classification. For that, the character embeddings from the reviews are learned in position-based and clustered-based fashions, and then combined into sentence vectors to yield better text representations. Extensive experiments on real world data set suggest that our proposed model highly outperforms the state-of-the-art methods for review aspect extraction task.","PeriodicalId":439987,"journal":{"name":"2016 IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNIDC.2016.7974568","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
User reviews, in the form of short unstructured natural texts, often provide rich information to benefit product adoption or service improvement. Aspect can be extracted as the abstract meaning from the reviews. Traditional methods have employed either rule-based templates or bag-of-words features for aspect extraction from text. However, these models cannot effectively handle short texts, especially in Chinese reviews. In this paper, we address the issue by learning the character embeddings as the basic semantic unit and incorporating the compositional sentence-level representation into a neural network approach for review aspect classification. For that, the character embeddings from the reviews are learned in position-based and clustered-based fashions, and then combined into sentence vectors to yield better text representations. Extensive experiments on real world data set suggest that our proposed model highly outperforms the state-of-the-art methods for review aspect extraction task.