{"title":"Improving the Accuracy and Diversity of Feature Extraction From Online Reviews Using Keyword Embedding and Two Clustering Methods","authors":"Seyoung Park, Harrison M. Kim","doi":"10.1115/detc2020-22642","DOIUrl":null,"url":null,"abstract":"\n In product design, it is essential to understand customers’ preferences for product features. Traditional methods including the survey and interview are time-consuming and costly. As an alternative, research on utilizing online data for user analysis has been actively conducted. Although various methods have been proposed in this domain, most of them focus on the main features or usages of the product. However, from the manufacturer’s perspective, sub-features are as crucial as main features or usages, because the preference for sub-features is necessary for component configuration in actual product development. As the first step to solve this problem, this paper proposes a methodology to extract and cluster sub-features by incorporating phrase embedding into the previous word embedding. Also, the presented methodology increases the accuracy and diversity of the clustering result by using X-means clustering as a noise filter and adopting spectral clustering.","PeriodicalId":415040,"journal":{"name":"Volume 11A: 46th Design Automation Conference (DAC)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 11A: 46th Design Automation Conference (DAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2020-22642","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
In product design, it is essential to understand customers’ preferences for product features. Traditional methods including the survey and interview are time-consuming and costly. As an alternative, research on utilizing online data for user analysis has been actively conducted. Although various methods have been proposed in this domain, most of them focus on the main features or usages of the product. However, from the manufacturer’s perspective, sub-features are as crucial as main features or usages, because the preference for sub-features is necessary for component configuration in actual product development. As the first step to solve this problem, this paper proposes a methodology to extract and cluster sub-features by incorporating phrase embedding into the previous word embedding. Also, the presented methodology increases the accuracy and diversity of the clustering result by using X-means clustering as a noise filter and adopting spectral clustering.