Keyword-Assisted Topic Models

IF 5 1区 社会学 Q1 POLITICAL SCIENCE American Journal of Political Science Pub Date : 2023-04-01 DOI:10.1111/ajps.12779
Shusei Eshima, Kosuke Imai, Tomoya Sasaki
{"title":"Keyword-Assisted Topic Models","authors":"Shusei Eshima,&nbsp;Kosuke Imai,&nbsp;Tomoya Sasaki","doi":"10.1111/ajps.12779","DOIUrl":null,"url":null,"abstract":"<p>In recent years, fully automated content analysis based on probabilistic topic models has become popular among social scientists because of their scalability. However, researchers find that these models often fail to measure specific concepts of substantive interest by inadvertently creating multiple topics with similar content and combining distinct themes into a single topic. In this article, we empirically demonstrate that providing a small number of keywords can substantially enhance the measurement performance of topic models. An important advantage of the proposed keyword-assisted topic model (keyATM) is that the specification of keywords requires researchers to label topics prior to fitting a model to the data. This contrasts with a widespread practice of post hoc topic interpretation and adjustments that compromises the objectivity of empirical findings. In our application, we find that keyATM provides more interpretable results, has better document classification performance, and is less sensitive to the number of topics.</p>","PeriodicalId":48447,"journal":{"name":"American Journal of Political Science","volume":"68 2","pages":"730-750"},"PeriodicalIF":5.0000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Political Science","FirstCategoryId":"90","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/ajps.12779","RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"POLITICAL SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, fully automated content analysis based on probabilistic topic models has become popular among social scientists because of their scalability. However, researchers find that these models often fail to measure specific concepts of substantive interest by inadvertently creating multiple topics with similar content and combining distinct themes into a single topic. In this article, we empirically demonstrate that providing a small number of keywords can substantially enhance the measurement performance of topic models. An important advantage of the proposed keyword-assisted topic model (keyATM) is that the specification of keywords requires researchers to label topics prior to fitting a model to the data. This contrasts with a widespread practice of post hoc topic interpretation and adjustments that compromises the objectivity of empirical findings. In our application, we find that keyATM provides more interpretable results, has better document classification performance, and is less sensitive to the number of topics.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
关键词辅助主题模型
近年来,基于概率主题模型的全自动内容分析因其可扩展性而受到社会科学家的青睐。然而,研究人员发现,这些模型经常会无意中创建多个内容相似的主题,并将不同的主题合并为一个主题,从而无法衡量实质性的特定概念。在本文中,我们通过实证证明,提供少量关键词就能大大提高主题模型的测量性能。所提出的关键词辅助主题模型(keyATM)的一个重要优势是,关键词的指定要求研究人员在对数据拟合模型之前标注主题。这与普遍存在的事后对主题进行解释和调整的做法形成了鲜明对比,这种做法损害了实证研究结果的客观性。在我们的应用中,我们发现 keyATM 提供了更多可解释的结果,具有更好的文档分类性能,而且对主题数量的敏感度较低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
9.30
自引率
2.40%
发文量
61
期刊介绍: The American Journal of Political Science (AJPS) publishes research in all major areas of political science including American politics, public policy, international relations, comparative politics, political methodology, and political theory. Founded in 1956, the AJPS publishes articles that make outstanding contributions to scholarly knowledge about notable theoretical concerns, puzzles or controversies in any subfield of political science.
期刊最新文献
Issue Information Correction to Skill specificity and attitudes toward immigration Issue Information Issue Information - Table of Contents Unsubscribed and undemanding: Partisanship and the minimal effects of a field experiment encouraging local news consumption
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1