Unsupervised Aspect Extraction Algorithm for opinion mining using topic modeling

Azizkhan F Pathan, Chetana Prakash
Global Transitions Proceedings, Volume 2, Issue 2, Pages 492-499. Published 2021-11-01. Journal Article.
DOI: 10.1016/j.gltp.2021.08.005
URL: https://www.sciencedirect.com/science/article/pii/S2666285X21000339
Citations: 3

Abstract

With the widespread use of electronic devices and the growing popularity of web-based media, text is being produced at an unprecedented rate. It is not feasible for people to follow everything that is produced and find out what is being investigated in their area of interest. Topic modeling is used to determine the topics in large collections of text documents. Topic modeling algorithms are unsupervised machine learning approaches that are widely used and have proven successful in aspect-based opinion mining, where they extract ‘latent’ topics that correspond to aspects of interest. In this paper, widely used topic modeling approaches are examined and compared on how well they detect topics, using metrics such as perplexity and coherence. The comparison indicates that Latent Dirichlet Allocation compares favorably with Latent Semantic Analysis and the Hierarchical Dirichlet Process for the aspect extraction step of aspect-based opinion mining. We also propose an unsupervised aspect extraction algorithm based on topic models for aspect-based opinion mining.
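
The abstract names perplexity and coherence as the comparison metrics but does not say how they were computed. As a rough illustration only, the sketch below implements the textbook definition of perplexity and the UMass variant of topic coherence in plain Python. The choice of UMass, the function names `perplexity` and `umass_coherence`, and the toy review corpus are all assumptions made for this example, not details taken from the paper.

```python
import math
from itertools import combinations


def perplexity(token_log_probs):
    """Perplexity = exp of the negative mean natural-log probability per token.

    `token_log_probs` is a hypothetical input: one log-probability per token,
    as assigned by whatever topic model is being evaluated. Lower is better.
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


def umass_coherence(top_words, documents):
    """UMass coherence of one topic (assumed variant; the paper may use another).

    Sums log((D(later, earlier) + 1) / D(earlier)) over all pairs of top words,
    where D counts the documents containing every word given. `top_words` must
    be ordered from most to least probable. Values closer to zero are better.
    """
    doc_sets = [set(doc) for doc in documents]

    def doc_freq(*words):
        return sum(1 for d in doc_sets if all(w in d for w in words))

    score = 0.0
    for a, b in combinations(range(len(top_words)), 2):  # a < b
        earlier, later = top_words[a], top_words[b]
        base = doc_freq(earlier)
        if base == 0:
            raise ValueError(f"top word {earlier!r} never occurs in the corpus")
        score += math.log((doc_freq(later, earlier) + 1) / base)
    return score


# Toy tokenized product reviews, invented purely for illustration.
reviews = [
    ["battery", "life", "long"],
    ["battery", "charge", "fast"],
    ["screen", "bright", "sharp"],
    ["screen", "battery", "good"],
]

# A model that spreads probability uniformly over 4 words has perplexity 4
# (up to floating-point error).
uniform_ppl = perplexity([math.log(0.25)] * 8)

# Words that co-occur in reviews score higher than words that never do.
related = umass_coherence(["battery", "charge"], reviews)
unrelated = umass_coherence(["battery", "bright"], reviews)
```

In a comparison like the one the abstract describes, each candidate model's topics would be scored this way on the same corpus, with lower perplexity and higher coherence favoring a model. In practice a topic modeling library would supply both the models and the metrics.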
