sDTM: A Supervised Bayesian Deep Topic Model for Text Analytics

Yi Yang, Kunpeng Zhang, Yangyang Fan
Information Systems & Economics eJournal. Published 2020-05-20. DOI: 10.2139/ssrn.3612168 (https://doi.org/10.2139/ssrn.3612168)
Citations: 12

Abstract

This study proposes a novel supervised deep topic modeling approach for effective text analysis. This approach leverages the auxiliary data associated with text, such as ratings in consumer reviews or categories of posts in online forums, to enhance the discovery of latent topics in text. The proposed approach can effectively improve topic modeling performance in several ways. First, the learned latent topics are more meaningful and distinguishable, which helps text data exploration. Second, the latent topics discovered by the novel supervised deep topic model are more accurate, which improves the performance of downstream econometrics and predictive analytics that utilize latent topics as inputs. Given the prevalence of auxiliary data in real-world text analysis tasks and the wide adoption of topic modeling in business research and practice, the study offers an effective solution for extracting insights from text data.