Political Analysis: Latest Articles

An Improved Method of Automated Nonparametric Content Analysis for Social Science

IF 5.4, Zone 2 (Sociology), Q1 POLITICAL SCIENCE

Political Analysis

Pub Date : 2022-01-07 DOI: 10.1017/pan.2021.36

Gary King, Connor Jerzak, Anton Strezhnev

Abstract: Some scholars build models to classify documents into chosen categories. Others, especially social scientists who tend to focus on population characteristics, instead usually estimate the proportion of documents in each category, using either parametric “classify-and-count” methods or “direct” nonparametric estimation of proportions without individual classification. Unfortunately, classify-and-count methods can be highly model-dependent or generate more bias in the proportions even as the percent of documents correctly classified increases. Direct estimation avoids these problems, but can suffer when the meaning of language changes between training and test sets or is too similar across categories. We develop an improved direct estimation approach without these issues by including and optimizing continuous text features, along with a form of matching adapted from the causal inference literature. Our approach substantially improves performance in a diverse collection of 73 datasets. We also offer easy-to-use software that implements all ideas discussed herein.

Citations: 5
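
The abstract's claim that classify-and-count can become more biased even as classification accuracy rises can be checked with simple arithmetic. The sketch below is not the authors' method or software; it only works through the expected accuracy and the expected classify-and-count estimate for a binary classifier. The function name `classify_and_count` and all numeric values (a true share of 0.20 and the two classifiers' error rates) are assumptions invented for illustration.

```python
def classify_and_count(p, tpr, fpr):
    """Expected accuracy and estimated proportion for a binary classifier.

    p   : true share of documents in the category (hypothetical value)
    tpr : probability a document truly in the category is classified into it
    fpr : probability a document outside the category is classified into it
    """
    # Correct decisions: true positives plus true negatives.
    accuracy = p * tpr + (1 - p) * (1 - fpr)
    # Classify-and-count reports the share of documents labeled "in",
    # which mixes true positives with false positives.
    estimate = p * tpr + (1 - p) * fpr
    return accuracy, estimate


true_share = 0.20  # assumed for illustration

# Two hypothetical classifiers: B is more accurate than A overall.
for name, tpr, fpr in [("A", 0.6, 0.1), ("B", 0.4, 0.0)]:
    acc, est = classify_and_count(true_share, tpr, fpr)
    print(f"classifier {name}: accuracy={acc:.2f}, estimated share={est:.2f}")
```

In this constructed case classifier B is more accurate (about 0.88 versus 0.84), yet its estimated share is about 0.08 against a true share of 0.20, while the less accurate classifier A recovers roughly 0.20 because its two kinds of error happen to offset each other.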

PAN volume 30 issue 1 Cover and Back matter

IF 5.4, Zone 2 (Sociology), Q1 POLITICAL SCIENCE

Political Analysis

Pub Date : 2022-01-01 DOI: 10.1017/pan.2021.45

Citations: 0

Introduction to the Special Issue: Innovations and Current Challenges in Experimental Methods

IF 5.4, Zone 2 (Sociology), Q1 POLITICAL SCIENCE

Political Analysis

Pub Date : 2022-01-01 DOI: 10.1017/pan.2021.26

Libby Jenke

Political science has increasingly embraced the experimental method to establish causal relationships suggested by theories and observational studies, from experiments’ traditional sub-disciplinary home, political psychology, to international relations. The most frequently voiced concern with experiments, qualms about their external validity in terms of sample, has been well documented and addressed (Coppock, Leeper, and Mullinix 2018; Krupnikov and Levine 2014; Krupnikov, Nam, and Style 2021; Lupton 2019; McDermott 2011; Mutz 2021). Experiments uniquely provide scholars with the internal validity necessary to confidently identify causal effects, and issues with specific experiments tend to arise through errors of application by individual scholars rather than through any broad problems with the methodology.

Citations: 1

PAN volume 30 issue 1 Cover and Front matter

IF 5.4, Zone 2 (Sociology), Q1 POLITICAL SCIENCE

Political Analysis

Pub Date : 2021-12-22 DOI: 10.1017/pan.2021.44

Citations: 0

Human Rights Violations in Space: Assessing the External Validity of Machine-Geocoded versus Human-Geocoded Data

IF 5.4, Zone 2 (Sociology), Q1 POLITICAL SCIENCE

Political Analysis

Pub Date : 2021-12-15 DOI: 10.1017/pan.2021.40

Logan Stundal, Benjamin E. Bagozzi, John R. Freeman, J. Holmes

Abstract: Political event data are widely used in studies of political violence. Recent years have seen notable advances in the automated coding of political event data from international news sources. Yet, the validity of machine-coded event data remains disputed, especially in the context of event geolocation. We analyze the frequencies of human- and machine-geocoded event data agreement in relation to an independent (ground truth) source. The events are human rights violations in Colombia. We perform our evaluation for a key, 8-year period of the Colombian conflict and in three 2-year subperiods as well as for a selected set of (non)journalistically remote municipalities. As a complement to this analysis, we estimate spatial probit models based on the three datasets. These models assume Gaussian Markov Random Field error processes; they are constructed using a stochastic partial differential equation and estimated with integrated nested Laplace approximation. The estimated models tell us whether the three datasets produce comparable predictions, underreport events in relation to the same covariates, and have similar patterns of prediction error. Together the two analyses show that, for this subnational conflict, the machine- and human-geocoded datasets are comparable in terms of external validity but, according to the geostatistical models, produce prediction errors that differ in important respects.

Volume 31, pp. 81-97

Citations: 2

Rejoinder: Concluding Remarks on Scholarly Communications

IF 5.4, Zone 2 (Sociology), Q1 POLITICAL SCIENCE

Political Analysis

Pub Date : 2021-12-02 DOI: 10.1017/pan.2021.48

Jonathan Katz, G. King, E. Rosenblatt

Abstract: We are grateful to DeFord et al. for the continued attention to our work and the crucial issues of fair representation in democratic electoral systems. Our response (Katz, King, and Rosenblatt Forthcoming) was designed to help readers avoid being misled by mistaken claims in DeFord et al. (Forthcoming-a), and does not address other literature or uses of our prior work. As it happens, none of our corrections were addressed (or contradicted) in the most recent submission (DeFord et al. Forthcoming-b).

Citations: 0

The Essential Role of Statistical Inference in Evaluating Electoral Systems: A Response to DeFord et al.

IF 5.4, Zone 2 (Sociology), Q1 POLITICAL SCIENCE

Political Analysis

Pub Date : 2021-12-02 DOI: 10.1017/pan.2021.46

Jonathan N. Katz, Gary King, E. Rosenblatt

Abstract: Katz, King, and Rosenblatt (2020, American Political Science Review 114, 164–178) introduces a theoretical framework for understanding redistricting and electoral systems, built on basic statistical and social science principles of inference. DeFord et al. (2021, Political Analysis, this issue) instead focuses solely on descriptive measures, which lead to the problems identified in our article. In this article, we illustrate the essential role of these basic principles and then offer statistical, mathematical, and substantive corrections required to apply DeFord et al.’s calculations to social science questions of interest, while also showing how to easily resolve all claimed paradoxes and problems. We are grateful to the authors for their interest in our work and for this opportunity to clarify these principles and our theoretical framework.

Citations: 1

Implementing Partisan Symmetry: A Response to a Response

IF 5.4, Zone 2 (Sociology), Q1 POLITICAL SCIENCE

Political Analysis

Pub Date : 2021-12-02 DOI: 10.1017/pan.2021.47

Daryl R. DeFord, Natasha Dhamankar, M. Duchin, Varun Gupta, Mackenzie McPike, Gabe Schoenbach, Ki Wan Sim

Abstract: Katz, King, and Rosenblatt recently wrote a broad survey developing and extending the theory of partisan symmetry. Our paper reviewed the implementability of the theory, focusing on simplified scores of symmetry, seemingly compatible with their formulation, that are in wide use. We analyzed these simplified scores and concluded that they are not suited for redistricting reform. By our reading of their response, Katz, King, and Rosenblatt agree.

Citations: 1

Minmaxing of Bayesian Improved Surname Geocoding and Geography Level Ups in Predicting Race

IF 5.4, Zone 2 (Sociology), Q1 POLITICAL SCIENCE

Political Analysis

Pub Date : 2021-11-29 DOI: 10.1017/pan.2021.31

Jesse T. Clark, John A. Curiel, T. Steelman

Abstract: Racial identification is a critical factor in understanding a multitude of important outcomes in many fields. However, inferring an individual’s race from ecological data is prone to bias and error. This process was only recently improved via Bayesian improved surname geocoding (BISG). With surname and geographic-based demographic data, it is possible to more accurately estimate individual racial identification than ever before. However, the level of geography used in this process varies widely. Whereas some existing work makes use of geocoding to place individuals in precise census blocks, a substantial portion either skips geocoding altogether or relies on estimation using surname or county-level analyses. Presently, the trade-offs of such variation are unknown. In this letter, we quantify those trade-offs through a validation of BISG on Georgia’s voter file using both geocoded and nongeocoded processes and introduce a new level of geography, ZIP codes, to this method. We find that when estimating the racial identification of White and Black voters, nongeocoded ZIP code-based estimates are acceptable alternatives. However, census blocks provide the most accurate estimations when imputing racial identification for Asian and Hispanic voters. Our results document the most efficient means to sequentially conduct BISG analysis to maximize racial identification estimation while simultaneously minimizing data missingness and bias.

Volume 30, pp. 456-462

Citations: 5
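
The BISG idea named in the abstract above, combining surname-based and geography-based demographic information, rests on a Bayes' rule update. The sketch below is a minimal illustration of that update, not the authors' implementation or validation procedure. It assumes surname and location are conditionally independent given race. The function name `bisg_posterior` and every probability in the example are hypothetical and not taken from any census or voter-file data.

```python
def bisg_posterior(p_race_given_surname, p_geo_given_race):
    """Combine surname and geography evidence with Bayes' rule.

    Assumes surname and location are independent given race, so
    P(race | surname, geo) is proportional to
    P(race | surname) * P(geo | race).
    """
    unnormalized = {
        race: p_race_given_surname[race] * p_geo_given_race[race]
        for race in p_race_given_surname
    }
    total = sum(unnormalized.values())
    # Normalize so the posterior probabilities sum to one.
    return {race: value / total for race, value in unnormalized.items()}


# Hypothetical inputs, invented for illustration only.
# P(race | surname): share of people with this surname in each group.
surname_prior = {"White": 0.60, "Black": 0.30, "Hispanic": 0.05, "Asian": 0.05}
# P(geo | race): share of each group living in this geographic unit
# (for example a census block, ZIP code, or county).
geo_likelihood = {"White": 0.001, "Black": 0.004, "Hispanic": 0.002, "Asian": 0.002}

posterior = bisg_posterior(surname_prior, geo_likelihood)
for race, prob in posterior.items():
    print(f"{race}: {prob:.2f}")
```

With these made-up numbers, the geography term moves the most probable category from White (0.60 under the surname prior alone) to Black (about 0.60 after the update). The choice of geographic unit changes only the `geo_likelihood` input, which is the level of the trade-off the abstract examines.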

Cross-Domain Topic Classification for Political Texts

IF 5.4, Zone 2 (Sociology), Q1 POLITICAL SCIENCE

Political Analysis

Pub Date : 2021-10-21 DOI: 10.1017/pan.2021.37

Moritz Osnabrügge, Elliott Ash, M. Morelli

Abstract: We introduce and assess the use of supervised learning in cross-domain topic classification. In this approach, an algorithm learns to classify topics in a labeled source corpus and then extrapolates topics in an unlabeled target corpus from another domain. The ability to use existing training data makes this method significantly more efficient than within-domain supervised learning. It also has three advantages over unsupervised topic models: the method can be more specifically targeted to a research question and the resulting topics are easier to validate and interpret. We demonstrate the method using the case of labeled party platforms (source corpus) and unlabeled parliamentary speeches (target corpus). In addition to the standard within-domain error metrics, we further validate the cross-domain performance by labeling a subset of target-corpus documents. We find that the classifier accurately assigns topics in the parliamentary speeches, although accuracy varies substantially by topic. We also propose tools for diagnosing cross-domain classification. To illustrate the usefulness of the method, we present two case studies on how electoral rules and the gender of parliamentarians influence the choice of speech topics.

Volume 31, pp. 59-80

Citations: 17
