利用深度学习模型划分政治党派的两步法

IF 3 2区 社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Social Science Computer Review Pub Date : 2023-12-06 DOI:10.1177/08944393231219685
Lingshu Hu
{"title":"利用深度学习模型划分政治党派的两步法","authors":"Lingshu Hu","doi":"10.1177/08944393231219685","DOIUrl":null,"url":null,"abstract":"Political partisanship constitutes a pivotal group identity that significantly influences individuals’ voting behaviors and shapes their ideological and cultural perspectives. While traditional surveys and experimental studies can directly capture political identity by asking the participants, this task has become intricate when employing digital trace data sourced from social media. Previous classification methods, attempting to infer political identity from users’ networks or textual content, suffered from limited efficiency or generalizability. In response, this study introduces a two-step method that utilizes deep learning models to enhance classification efficiency, generalizability, and interpretability. In the first step, two deep learning models, trained on 2.5 million tweets from 825 Congressional politicians in the U.S., achieved accuracy rates of 87.71% and 89.54%, respectively, in detecting politicians’ partisanships based on their individual tweets. Subsequently, in the second step, by employing a simple machine learning model that leverages the aggregated predicted values derived from the first-step models, accuracy rates of 94.92% and 96.61% were attained for identifying non-politician users’ political identities based off their 50 and 200 tweets, respectively. In addition, an attention mechanism was integrated into the deep learning model to assess the contribution of each word in the classification process.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":null,"pages":null},"PeriodicalIF":3.0000,"publicationDate":"2023-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Two-Step Method for Classifying Political Partisanship Using Deep Learning Models\",\"authors\":\"Lingshu Hu\",\"doi\":\"10.1177/08944393231219685\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Political partisanship constitutes a pivotal group identity that significantly influences individuals’ voting behaviors and shapes their ideological and cultural perspectives. While traditional surveys and experimental studies can directly capture political identity by asking the participants, this task has become intricate when employing digital trace data sourced from social media. Previous classification methods, attempting to infer political identity from users’ networks or textual content, suffered from limited efficiency or generalizability. In response, this study introduces a two-step method that utilizes deep learning models to enhance classification efficiency, generalizability, and interpretability. In the first step, two deep learning models, trained on 2.5 million tweets from 825 Congressional politicians in the U.S., achieved accuracy rates of 87.71% and 89.54%, respectively, in detecting politicians’ partisanships based on their individual tweets. Subsequently, in the second step, by employing a simple machine learning model that leverages the aggregated predicted values derived from the first-step models, accuracy rates of 94.92% and 96.61% were attained for identifying non-politician users’ political identities based off their 50 and 200 tweets, respectively. In addition, an attention mechanism was integrated into the deep learning model to assess the contribution of each word in the classification process.\",\"PeriodicalId\":49509,\"journal\":{\"name\":\"Social Science Computer Review\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2023-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Social Science Computer Review\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://doi.org/10.1177/08944393231219685\",\"RegionNum\":2,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Social Science Computer Review","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1177/08944393231219685","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

政治党派关系是一种重要的群体认同,它对个人的投票行为产生重大影响,并塑造了个人的思想和文化观点。虽然传统的调查和实验研究可以通过询问参与者来直接捕捉政治身份,但当使用来自社交媒体的数字跟踪数据时,这项任务变得复杂起来。以往的分类方法试图从用户的网络或文本内容中推断政治身份,其效率和概括性有限。为此,本研究引入了一种利用深度学习模型来提高分类效率、泛化性和可解释性的两步方法。在第一步中,两个深度学习模型接受了来自美国825名国会议员的250万条推文的训练,在基于个人推文检测政客的党派倾向方面,准确率分别达到了87.71%和89.54%。随后,在第二步中,通过使用一个简单的机器学习模型,利用从第一步模型中获得的汇总预测值,基于50条和200条推文识别非政治家用户的政治身份的准确率分别达到了94.92%和96.61%。此外,将注意力机制集成到深度学习模型中,以评估每个单词在分类过程中的贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Two-Step Method for Classifying Political Partisanship Using Deep Learning Models
Political partisanship constitutes a pivotal group identity that significantly influences individuals’ voting behaviors and shapes their ideological and cultural perspectives. While traditional surveys and experimental studies can directly capture political identity by asking the participants, this task has become intricate when employing digital trace data sourced from social media. Previous classification methods, attempting to infer political identity from users’ networks or textual content, suffered from limited efficiency or generalizability. In response, this study introduces a two-step method that utilizes deep learning models to enhance classification efficiency, generalizability, and interpretability. In the first step, two deep learning models, trained on 2.5 million tweets from 825 Congressional politicians in the U.S., achieved accuracy rates of 87.71% and 89.54%, respectively, in detecting politicians’ partisanships based on their individual tweets. Subsequently, in the second step, by employing a simple machine learning model that leverages the aggregated predicted values derived from the first-step models, accuracy rates of 94.92% and 96.61% were attained for identifying non-politician users’ political identities based off their 50 and 200 tweets, respectively. In addition, an attention mechanism was integrated into the deep learning model to assess the contribution of each word in the classification process.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Social Science Computer Review
Social Science Computer Review 社会科学-计算机:跨学科应用
CiteScore
9.00
自引率
4.90%
发文量
95
审稿时长
>12 weeks
期刊介绍: Unique Scope Social Science Computer Review is an interdisciplinary journal covering social science instructional and research applications of computing, as well as societal impacts of informational technology. Topics included: artificial intelligence, business, computational social science theory, computer-assisted survey research, computer-based qualitative analysis, computer simulation, economic modeling, electronic modeling, electronic publishing, geographic information systems, instrumentation and research tools, public administration, social impacts of computing and telecommunications, software evaluation, world-wide web resources for social scientists. Interdisciplinary Nature Because the Uses and impacts of computing are interdisciplinary, so is Social Science Computer Review. The journal is of direct relevance to scholars and scientists in a wide variety of disciplines. In its pages you''ll find work in the following areas: sociology, anthropology, political science, economics, psychology, computer literacy, computer applications, and methodology.
期刊最新文献
Can AI Lie? Chabot Technologies, the Subject, and the Importance of Lying Improving the Quality of Individual-Level Web Tracking: Challenges of Existing Approaches and Introduction of a New Content and Long-Tail Sensitive Academic Solution Using Google Trends Data to Study High-Frequency Search Terms: Evidence for a Reliability-Frequency Continuum Large Language Models Outperform Expert Coders and Supervised Classifiers at Annotating Political Social Media Messages Status Spill-Over in Cryptomarket for Illegal Goods
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1