Improving the Accuracy and Diversity of Feature Extraction From Online Reviews Using Keyword Embedding and Two Clustering Methods

Seyoung Park, Harrison M. Kim
{"title":"Improving the Accuracy and Diversity of Feature Extraction From Online Reviews Using Keyword Embedding and Two Clustering Methods","authors":"Seyoung Park, Harrison M. Kim","doi":"10.1115/detc2020-22642","DOIUrl":null,"url":null,"abstract":"\n In product design, it is essential to understand customers’ preferences for product features. Traditional methods including the survey and interview are time-consuming and costly. As an alternative, research on utilizing online data for user analysis has been actively conducted. Although various methods have been proposed in this domain, most of them focus on the main features or usages of the product. However, from the manufacturer’s perspective, sub-features are as crucial as main features or usages, because the preference for sub-features is necessary for component configuration in actual product development. As the first step to solve this problem, this paper proposes a methodology to extract and cluster sub-features by incorporating phrase embedding into the previous word embedding. Also, the presented methodology increases the accuracy and diversity of the clustering result by using X-means clustering as a noise filter and adopting spectral clustering.","PeriodicalId":415040,"journal":{"name":"Volume 11A: 46th Design Automation Conference (DAC)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 11A: 46th Design Automation Conference (DAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2020-22642","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

In product design, it is essential to understand customers’ preferences for product features. Traditional methods including the survey and interview are time-consuming and costly. As an alternative, research on utilizing online data for user analysis has been actively conducted. Although various methods have been proposed in this domain, most of them focus on the main features or usages of the product. However, from the manufacturer’s perspective, sub-features are as crucial as main features or usages, because the preference for sub-features is necessary for component configuration in actual product development. As the first step to solve this problem, this paper proposes a methodology to extract and cluster sub-features by incorporating phrase embedding into the previous word embedding. Also, the presented methodology increases the accuracy and diversity of the clustering result by using X-means clustering as a noise filter and adopting spectral clustering.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用关键词嵌入和两种聚类方法提高在线评论特征提取的准确性和多样性
在产品设计中,了解客户对产品功能的偏好是至关重要的。包括调查和访谈在内的传统方法既耗时又昂贵。作为替代方案,利用在线数据进行用户分析的研究已经积极开展。虽然在这一领域已经提出了各种方法,但大多数方法都集中在产品的主要特征或用途上。然而,从制造商的角度来看,子功能与主要功能或用途一样重要,因为对子功能的偏好对于实际产品开发中的组件配置是必要的。作为解决这一问题的第一步,本文提出了一种将短语嵌入到之前的词嵌入中提取和聚类子特征的方法。该方法采用x均值聚类作为噪声滤波器,采用谱聚类,提高了聚类结果的准确性和多样性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Extrapolation With Gaussian Random Processes and Evolutionary Programming Design and Optimization of Functionally Graded Superelastic NiTi Stents Topic Modeling and Sentiment Analysis of Social Media Data to Drive Experiential Redesign Risk-Averse Optimization for Resilience Enhancement Under Uncertainty Multi-Objective Design Exploration of a Canine Ventriculoperitoneal Shunt for Hydrocephalus
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1