首页 > 最新文献

Political Analysis最新文献

英文 中文
Balance as a Pre-Estimation Test for Time Series Analysis 平衡作为时间序列分析的预估计检验
IF 5.4 2区 社会学 Q1 POLITICAL SCIENCE Pub Date : 2022-04-20 DOI: 10.1017/pan.2022.4
Mark Pickup, Paul M. Kellstedt
Abstract It is understood that ensuring equation balance is a necessary condition for a valid model of times series data. Yet, the definition of balance provided so far has been incomplete and there has not been a consistent understanding of exactly why balance is important or how it can be applied. The discussion to date has focused on the estimates produced by the general error correction model (GECM). In this paper, we go beyond the GECM and beyond model estimates. We treat equation balance as a theoretical matter, not merely an empirical one, and describe how to use the concept of balance to test theoretical propositions before longitudinal data have been gathered. We explain how equation balance can be used to check if your theoretical or empirical model is either wrong or incomplete in a way that will prevent a meaningful interpretation of the model. We also raise the issue of “ $I(0)$ balance” and its importance.
摘要保证方程平衡是建立有效的时间序列数据模型的必要条件。然而,迄今为止对平衡的定义并不完整,对平衡为什么重要或如何应用平衡的确切理解也不一致。迄今为止的讨论主要集中在一般误差校正模型(GECM)产生的估计上。在本文中,我们超越了GECM和模型估计。我们将方程平衡视为一个理论问题,而不仅仅是一个经验问题,并描述了如何在收集纵向数据之前使用平衡的概念来检验理论命题。我们解释了如何使用方程平衡来检查你的理论或经验模型是否错误或不完整,从而阻止对模型进行有意义的解释。我们还提出了“$I(0)$余额”及其重要性的问题。
{"title":"Balance as a Pre-Estimation Test for Time Series Analysis","authors":"Mark Pickup, Paul M. Kellstedt","doi":"10.1017/pan.2022.4","DOIUrl":"https://doi.org/10.1017/pan.2022.4","url":null,"abstract":"Abstract It is understood that ensuring equation balance is a necessary condition for a valid model of times series data. Yet, the definition of balance provided so far has been incomplete and there has not been a consistent understanding of exactly why balance is important or how it can be applied. The discussion to date has focused on the estimates produced by the general error correction model (GECM). In this paper, we go beyond the GECM and beyond model estimates. We treat equation balance as a theoretical matter, not merely an empirical one, and describe how to use the concept of balance to test theoretical propositions before longitudinal data have been gathered. We explain how equation balance can be used to check if your theoretical or empirical model is either wrong or incomplete in a way that will prevent a meaningful interpretation of the model. We also raise the issue of “ \u0000$I(0)$\u0000 balance” and its importance.","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":"31 1","pages":"295 - 304"},"PeriodicalIF":5.4,"publicationDate":"2022-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44251727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Using Multiple Pretreatment Periods to Improve Difference-in-Differences and Staggered Adoption Designs 使用多个预处理周期改善差异中的差异和交错采用设计
IF 5.4 2区 社会学 Q1 POLITICAL SCIENCE Pub Date : 2022-03-30 DOI: 10.1017/pan.2022.8
Naoki Egami, S. Yamauchi
Abstract While a difference-in-differences (DID) design was originally developed with one pre- and one posttreatment period, data from additional pretreatment periods are often available. How can researchers improve the DID design with such multiple pretreatment periods under what conditions? We first use potential outcomes to clarify three benefits of multiple pretreatment periods: (1) assessing the parallel trends assumption, (2) improving estimation accuracy, and (3) allowing for a more flexible parallel trends assumption. We then propose a new estimator, double DID, which combines all the benefits through the generalized method of moments and contains the two-way fixed effects regression as a special case. We show that the double DID requires a weaker assumption about outcome trends and is more efficient than existing DID estimators. We also generalize the double DID to the staggered adoption design where different units can receive the treatment in different time periods. We illustrate the proposed method with two empirical applications, covering both the basic DID and staggered adoption designs. We offer an open-source R package that implements the proposed methodologies.
摘要虽然差异设计最初是在一个治疗前和一个治疗后阶段开发的,但通常可以获得额外治疗期的数据。在什么条件下,研究人员如何改进具有如此多个预处理期的DID设计?我们首先使用潜在结果来阐明多个预处理期的三个好处:(1)评估平行趋势假设,(2)提高估计精度,以及(3)允许更灵活的平行趋势假设。然后,我们提出了一种新的估计量,双DID,它通过广义矩方法结合了所有的优点,并将双向固定效应回归作为特例。我们表明,双重DID需要对结果趋势的较弱假设,并且比现有的DID估计更有效。我们还将双重DID推广到交错采用设计,其中不同的单元可以在不同的时间段接受治疗。我们用两个经验应用来说明所提出的方法,包括基本的DID和交错采用设计。我们提供了一个开源的R包来实现所提出的方法。
{"title":"Using Multiple Pretreatment Periods to Improve Difference-in-Differences and Staggered Adoption Designs","authors":"Naoki Egami, S. Yamauchi","doi":"10.1017/pan.2022.8","DOIUrl":"https://doi.org/10.1017/pan.2022.8","url":null,"abstract":"Abstract While a difference-in-differences (DID) design was originally developed with one pre- and one posttreatment period, data from additional pretreatment periods are often available. How can researchers improve the DID design with such multiple pretreatment periods under what conditions? We first use potential outcomes to clarify three benefits of multiple pretreatment periods: (1) assessing the parallel trends assumption, (2) improving estimation accuracy, and (3) allowing for a more flexible parallel trends assumption. We then propose a new estimator, double DID, which combines all the benefits through the generalized method of moments and contains the two-way fixed effects regression as a special case. We show that the double DID requires a weaker assumption about outcome trends and is more efficient than existing DID estimators. We also generalize the double DID to the staggered adoption design where different units can receive the treatment in different time periods. We illustrate the proposed method with two empirical applications, covering both the basic DID and staggered adoption designs. We offer an open-source R package that implements the proposed methodologies.","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":"31 1","pages":"195 - 212"},"PeriodicalIF":5.4,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42521787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
An Information-theoretic Approach to Prompt Engineering Without Ground Truth Labels 一种无地面实况标签的提示工程信息论方法
IF 5.4 2区 社会学 Q1 POLITICAL SCIENCE Pub Date : 2022-03-21 DOI: 10.1017/pan.2023.2
Lisa P. Argyle, E. Busby, Nancy Fulda, Joshua R Gubler, Christopher Rytting, Taylor Sorensen, D. Wingate
Pre-trained language models derive substantial linguistic and factual knowledge from the massive corpora on which they are trained, and prompt engineering seeks to align these models to specific tasks. Unfortunately, existing prompt engineering methods require significant amounts of labeled data, access to model parameters, or both. We introduce a new method for selecting prompt templates without labeled examples and without direct access to the model. Specifically, over a set of candidate templates, we choose the template that maximizes the mutual information between the input and the corresponding model output. Across 8 datasets representing 7 distinct NLP tasks, we show that when a template has high mutual information, it also has high accuracy on the task. On the largest model, selecting prompts with our method gets 90% of the way from the average prompt accuracy to the best prompt accuracy and requires no ground truth labels.
经过预训练的语言模型从其训练的大量语料库中获得大量的语言和事实知识,而即时工程则试图将这些模型与特定任务相结合。不幸的是,现有的快速工程方法需要大量的标记数据、对模型参数的访问,或者两者兼而有之。我们介绍了一种新的方法来选择提示模板,无需标记示例,也无需直接访问模型。具体来说,在一组候选模板上,我们选择最大化输入和相应模型输出之间相互信息的模板。在代表7个不同NLP任务的8个数据集中,我们表明,当模板具有高互信息时,它在任务上也具有高准确性。在最大的模型上,用我们的方法选择提示可以获得从平均提示准确度到最佳提示准确度的90%,并且不需要地面实况标签。
{"title":"An Information-theoretic Approach to Prompt Engineering Without Ground Truth Labels","authors":"Lisa P. Argyle, E. Busby, Nancy Fulda, Joshua R Gubler, Christopher Rytting, Taylor Sorensen, D. Wingate","doi":"10.1017/pan.2023.2","DOIUrl":"https://doi.org/10.1017/pan.2023.2","url":null,"abstract":"Pre-trained language models derive substantial linguistic and factual knowledge from the massive corpora on which they are trained, and prompt engineering seeks to align these models to specific tasks. Unfortunately, existing prompt engineering methods require significant amounts of labeled data, access to model parameters, or both. We introduce a new method for selecting prompt templates without labeled examples and without direct access to the model. Specifically, over a set of candidate templates, we choose the template that maximizes the mutual information between the input and the corresponding model output. Across 8 datasets representing 7 distinct NLP tasks, we show that when a template has high mutual information, it also has high accuracy on the task. On the largest model, selecting prompts with our method gets 90% of the way from the average prompt accuracy to the best prompt accuracy and requires no ground truth labels.","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":"31 1","pages":"337 - 351"},"PeriodicalIF":5.4,"publicationDate":"2022-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45581735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 93
Taking Distributions Seriously: On the Interpretation of the Estimates of Interactive Nonlinear Models 认真对待分布:关于相互作用非线性模型估计的解释
IF 5.4 2区 社会学 Q1 POLITICAL SCIENCE Pub Date : 2022-03-18 DOI: 10.1017/pan.2022.9
A. Zhirnov, Mert Moral, Evgeny Sedashov
Abstract In recent decades, political science literature has experienced significant growth in the popularity of nonlinear models with multiplicative interaction terms. When one or more constitutive variables are not binary, most studies report the marginal effect of the variable of interest at its sample mean while allowing the other constitutive variable/s to vary along its range and holding all other covariates constant at their means, modes, or medians. In this article, we argue that this conventional approach is not always the most suitable since the marginal effect of a variable at its sample mean might not be sufficiently representative of its prevalent effect at a specific value of the conditioning variable and might produce excessively model-dependent predictions. We propose two procedures to help researchers gain a better understanding of how the typical effect of the variable of interest varies as a function of the conditioning variable: (1) computing and plotting the marginal effects at all in-sample combinations of the values of the constitutive variables and (2) computing and plotting what we call the “Distribution-Weighted Average Marginal Effect” over the values of the conditioning variable.
摘要近几十年来,具有乘法交互项的非线性模型在政治学文献中得到了显著的普及。当一个或多个本构变量不是二元的时,大多数研究报告了感兴趣变量在其样本均值处的边际效应,同时允许另一个本构变数沿其范围变化,并使所有其他协变量在其均值、模式或中值处保持不变。在这篇文章中,我们认为这种传统的方法并不总是最合适的,因为变量在其样本均值处的边际效应可能不能充分代表其在条件变量的特定值处的普遍效应,并且可能产生过度依赖模型的预测。我们提出了两个程序来帮助研究人员更好地理解感兴趣变量的典型效应是如何作为条件变量的函数变化的:(1)计算和绘制本构变量值的样本组合中的边际效应;(2)计算和绘图我们称之为“分布加权平均边际效应”超过调节变量的值。
{"title":"Taking Distributions Seriously: On the Interpretation of the Estimates of Interactive Nonlinear Models","authors":"A. Zhirnov, Mert Moral, Evgeny Sedashov","doi":"10.1017/pan.2022.9","DOIUrl":"https://doi.org/10.1017/pan.2022.9","url":null,"abstract":"Abstract In recent decades, political science literature has experienced significant growth in the popularity of nonlinear models with multiplicative interaction terms. When one or more constitutive variables are not binary, most studies report the marginal effect of the variable of interest at its sample mean while allowing the other constitutive variable/s to vary along its range and holding all other covariates constant at their means, modes, or medians. In this article, we argue that this conventional approach is not always the most suitable since the marginal effect of a variable at its sample mean might not be sufficiently representative of its prevalent effect at a specific value of the conditioning variable and might produce excessively model-dependent predictions. We propose two procedures to help researchers gain a better understanding of how the typical effect of the variable of interest varies as a function of the conditioning variable: (1) computing and plotting the marginal effects at all in-sample combinations of the values of the constitutive variables and (2) computing and plotting what we call the “Distribution-Weighted Average Marginal Effect” over the values of the conditioning variable.","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":"31 1","pages":"213 - 234"},"PeriodicalIF":5.4,"publicationDate":"2022-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41839871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
PAN volume 30 issue 2 Cover and Front matter PAN第30卷第2期封面和封面
IF 5.4 2区 社会学 Q1 POLITICAL SCIENCE Pub Date : 2022-03-10 DOI: 10.1017/pan.2022.6
Emily M. Farris
V o lu m e 28• N u m b er • A p rl20 20 V o l u m e 2 8•Number2•April2020 V o lu m e 28• N u m b er • A p rl20 20 ARICLES Dcrete Coice Data w ith U noerved H etereneity:A CondnalBinary Q untile M odel Xiao Lu M eauring he om petienessof Eltions Gary W .Cx,Jon H .iva nd DnielM Sm ith U neected Eveduring Srvey Dign: Prom se nd PitfallsfusalInference JrdiM uñoz,rtFalcó-Gim no nd Erique H erndez M eauring Sugroup Preencesin ConjtExperim ents Thom asJ.per,Sara .H obltand am esilley
V o lu m e 28•N u m b er•A p rl20 20 V o l u m e 2 8•数字2•2020年4月V o lu m e 28•N u m b r•A p rl20 20 ARICLES Dcrete Coice数据与u无服务的H确定性:一个CondnalBinary Q,直到m模型Xiao lu m获得了Gary w.Cx、Jon H.iva和DnielM Sm与u Neeted Evevide Srvey Dign的合作伙伴关系:Prom se nd Pitfallsfusal推理JrdiM uñoz,rtFalcó-Gim no nd Erique H erndez M eauring Sugroup Preencesin联合专家Thom asJ.per,Sara.H obltand am esilley
{"title":"PAN volume 30 issue 2 Cover and Front matter","authors":"Emily M. Farris","doi":"10.1017/pan.2022.6","DOIUrl":"https://doi.org/10.1017/pan.2022.6","url":null,"abstract":"V o lu m e 28• N u m b er • A p rl20 20 V o l u m e 2 8•Number2•April2020 V o lu m e 28• N u m b er • A p rl20 20 ARICLES Dcrete Coice Data w ith U noerved H etereneity:A CondnalBinary Q untile M odel Xiao Lu M eauring he om petienessof Eltions Gary W .Cx,Jon H .iva nd DnielM Sm ith U neected Eveduring Srvey Dign: Prom se nd PitfallsfusalInference JrdiM uñoz,rtFalcó-Gim no nd Erique H erndez M eauring Sugroup Preencesin ConjtExperim ents Thom asJ.per,Sara .H obltand am esilley","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":" ","pages":"f1 - f3"},"PeriodicalIF":5.4,"publicationDate":"2022-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47064427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PAN volume 30 issue 2 Cover and Back matter PAN第30卷第2期封面和封底
IF 5.4 2区 社会学 Q1 POLITICAL SCIENCE Pub Date : 2022-03-10 DOI: 10.1017/pan.2022.7
{"title":"PAN volume 30 issue 2 Cover and Back matter","authors":"","doi":"10.1017/pan.2022.7","DOIUrl":"https://doi.org/10.1017/pan.2022.7","url":null,"abstract":"","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":"30 1","pages":"b1 - b2"},"PeriodicalIF":5.4,"publicationDate":"2022-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44298732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Geographic Boundaries and Local Economic Conditions Matter for Views of the Economy 地理边界和当地经济条件对经济观至关重要
IF 5.4 2区 社会学 Q1 POLITICAL SCIENCE Pub Date : 2022-02-22 DOI: 10.1017/pan.2021.50
James Bisbee, J. Zilinsky
Abstract The link between objective facts and politically relevant beliefs is an essential mechanism for democratic accountability. Yet the bulk of empirical work on this topic measures objective facts at whatever geographic units are readily available. We investigate the implications of these largely arbitrary choices for predicting individual-level opinions. We show that varying the geographic resolution—namely aggregating economic data to different geographic units—influences the strength of the relationship between economic evaluations and local economic conditions. Finding that unemployment claims are the best predictor of economic evaluations, especially when aggregated at the commuting zone or media market level, we underscore the importance of the modifiable areal unit problem. Our methods provide an example of how applied scholars might investigate the importance of geography in their own research going forward.
摘要客观事实和政治相关信念之间的联系是民主问责制的一个重要机制。然而,关于这一主题的大部分实证工作都是以现成的地理单位来衡量客观事实。我们研究了这些在很大程度上武断的选择对预测个人层面意见的影响。我们表明,地理分辨率的变化——即将经济数据聚合到不同的地理单元——会影响经济评估与当地经济条件之间关系的强度。发现失业申请人数是经济评估的最佳预测指标,尤其是在通勤区或媒体市场层面进行汇总时,我们强调了可修改面积单位问题的重要性。我们的方法提供了一个例子,说明应用学者如何在未来的研究中调查地理的重要性。
{"title":"Geographic Boundaries and Local Economic Conditions Matter for Views of the Economy","authors":"James Bisbee, J. Zilinsky","doi":"10.1017/pan.2021.50","DOIUrl":"https://doi.org/10.1017/pan.2021.50","url":null,"abstract":"Abstract The link between objective facts and politically relevant beliefs is an essential mechanism for democratic accountability. Yet the bulk of empirical work on this topic measures objective facts at whatever geographic units are readily available. We investigate the implications of these largely arbitrary choices for predicting individual-level opinions. We show that varying the geographic resolution—namely aggregating economic data to different geographic units—influences the strength of the relationship between economic evaluations and local economic conditions. Finding that unemployment claims are the best predictor of economic evaluations, especially when aggregated at the commuting zone or media market level, we underscore the importance of the modifiable areal unit problem. Our methods provide an example of how applied scholars might investigate the importance of geography in their own research going forward.","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":"31 1","pages":"288 - 294"},"PeriodicalIF":5.4,"publicationDate":"2022-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49474478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantifying Bias from Measurable and Unmeasurable Confounders Across Three Domains of Individual Determinants of Political Preferences 量化政治偏好个体决定因素三个领域中可测量和不可测量的混淆者的偏差
IF 5.4 2区 社会学 Q1 POLITICAL SCIENCE Pub Date : 2022-02-22 DOI: 10.1017/pan.2022.2
Rafael Ahlskog, Sven Oskarsson
Abstract A core part of political research is to identify how political preferences are shaped. The nature of these questions is such that robust causal identification is often difficult to achieve, and we are not seldom stuck with observational methods that we know have limited causal validity. The purpose of this paper is to measure the magnitude of bias stemming from both measurable and unmeasurable confounders across three broad domains of individual determinants of political preferences: socio-economic factors, moral values, and psychological constructs. We leverage a unique combination of rich Swedish registry data for a large sample of identical twins, with a comprehensive battery of 34 political preference measures, and build a meta-analytical model comparing our most conservative observational (naive) estimates with discordant twin estimates. This allows us to infer the amount of bias from unobserved genetic and shared environmental factors that remains in the naive models for our predictors, while avoiding precision issues common in family-based designs. The results are sobering: in most cases, substantial bias remains in naive models. A rough heuristic is that about half of the effect size even in conservative observational estimates is composed of confounding.
政治研究的一个核心部分是确定政治偏好是如何形成的。这些问题的本质是这样的,强大的因果识别往往很难实现,我们很少被我们知道因果有效性有限的观察方法所困住。本文的目的是在三个广泛的政治偏好的个人决定因素领域中测量可测量和不可测量的混杂因素所产生的偏见的程度:社会经济因素、道德价值观和心理结构。我们利用大量同卵双胞胎样本中丰富的瑞典注册数据的独特组合,以及34种政治偏好措施的综合组合,并建立了一个元分析模型,比较我们最保守的观察(朴素)估计与不一致的双胞胎估计。这使我们能够从未观察到的遗传和共享的环境因素中推断出偏差的数量,这些因素仍然存在于我们的预测器的幼稚模型中,同时避免了基于家庭的设计中常见的精度问题。结果是发人深省的:在大多数情况下,幼稚模型中仍然存在大量的偏见。一个粗略的启发是,即使在保守的观察估计中,约有一半的效应大小是由混淆组成的。
{"title":"Quantifying Bias from Measurable and Unmeasurable Confounders Across Three Domains of Individual Determinants of Political Preferences","authors":"Rafael Ahlskog, Sven Oskarsson","doi":"10.1017/pan.2022.2","DOIUrl":"https://doi.org/10.1017/pan.2022.2","url":null,"abstract":"Abstract A core part of political research is to identify how political preferences are shaped. The nature of these questions is such that robust causal identification is often difficult to achieve, and we are not seldom stuck with observational methods that we know have limited causal validity. The purpose of this paper is to measure the magnitude of bias stemming from both measurable and unmeasurable confounders across three broad domains of individual determinants of political preferences: socio-economic factors, moral values, and psychological constructs. We leverage a unique combination of rich Swedish registry data for a large sample of identical twins, with a comprehensive battery of 34 political preference measures, and build a meta-analytical model comparing our most conservative observational (naive) estimates with discordant twin estimates. This allows us to infer the amount of bias from unobserved genetic and shared environmental factors that remains in the naive models for our predictors, while avoiding precision issues common in family-based designs. The results are sobering: in most cases, substantial bias remains in naive models. A rough heuristic is that about half of the effect size even in conservative observational estimates is composed of confounding.","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":"31 1","pages":"181 - 194"},"PeriodicalIF":5.4,"publicationDate":"2022-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47468189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Polls, Context, and Time: A Dynamic Hierarchical Bayesian Forecasting Model for US Senate Elections 民调、背景和时间:美国参议院选举的动态分层贝叶斯预测模型
IF 5.4 2区 社会学 Q1 POLITICAL SCIENCE Pub Date : 2022-01-18 DOI: 10.1017/pan.2021.42
Yehua Chen, R. Garnett, J. Montgomery
Abstract We present a hierarchical Dirichlet regression model with Gaussian process priors that enables accurate and well-calibrated forecasts for U.S. Senate elections at varying time horizons. This Bayesian model provides a balance between predictions based on time-dependent opinion polls and those made based on fundamentals. It also provides uncertainty estimates that arise naturally from historical data on elections and polls. Experiments show that our model is highly accurate and has a well calibrated coverage rate for vote share predictions at various forecasting horizons. We validate the model with a retrospective forecast of the 2018 cycle as well as a true out-of-sample forecast for 2020. We show that our approach achieves state-of-the art accuracy and coverage despite relying on few covariates.
我们提出了一个具有高斯过程先验的分层Dirichlet回归模型,该模型能够在不同的时间范围内对美国参议院选举进行准确和精心校准的预测。贝叶斯模型在基于时间依赖的民意调查和基于基本面的预测之间提供了一种平衡。它还提供了从选举和民意调查的历史数据中自然产生的不确定性估计。实验表明,我们的模型具有很高的准确性,并且在不同的预测范围内对投票份额预测具有很好的校准覆盖率。我们通过对2018年周期的回顾性预测以及对2020年的真实样本外预测来验证模型。我们表明,尽管依赖于少数协变量,我们的方法实现了最先进的准确性和覆盖率。
{"title":"Polls, Context, and Time: A Dynamic Hierarchical Bayesian Forecasting Model for US Senate Elections","authors":"Yehua Chen, R. Garnett, J. Montgomery","doi":"10.1017/pan.2021.42","DOIUrl":"https://doi.org/10.1017/pan.2021.42","url":null,"abstract":"Abstract We present a hierarchical Dirichlet regression model with Gaussian process priors that enables accurate and well-calibrated forecasts for U.S. Senate elections at varying time horizons. This Bayesian model provides a balance between predictions based on time-dependent opinion polls and those made based on fundamentals. It also provides uncertainty estimates that arise naturally from historical data on elections and polls. Experiments show that our model is highly accurate and has a well calibrated coverage rate for vote share predictions at various forecasting horizons. We validate the model with a retrospective forecast of the 2018 cycle as well as a true out-of-sample forecast for 2020. We show that our approach achieves state-of-the art accuracy and coverage despite relying on few covariates.","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":"31 1","pages":"113 - 133"},"PeriodicalIF":5.4,"publicationDate":"2022-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41362685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An Improved Method of Automated Nonparametric Content Analysis for Social Science 一种改进的社会科学非参数内容自动分析方法
IF 5.4 2区 社会学 Q1 POLITICAL SCIENCE Pub Date : 2022-01-07 DOI: 10.1017/pan.2021.36
Gary King, Connor Jerzak, Anton Strezhnev
Abstract Some scholars build models to classify documents into chosen categories. Others, especially social scientists who tend to focus on population characteristics, instead usually estimate the proportion of documents in each category—using either parametric “classify-and-count” methods or “direct” nonparametric estimation of proportions without individual classification. Unfortunately, classify-and-count methods can be highly model-dependent or generate more bias in the proportions even as the percent of documents correctly classified increases. Direct estimation avoids these problems, but can suffer when the meaning of language changes between training and test sets or is too similar across categories. We develop an improved direct estimation approach without these issues by including and optimizing continuous text features, along with a form of matching adapted from the causal inference literature. Our approach substantially improves performance in a diverse collection of 73 datasets. We also offer easy-to-use software that implements all ideas discussed herein.
摘要一些学者建立模型将文档分类到选定的类别中。其他人,尤其是倾向于关注人群特征的社会科学家,通常会估计每个类别中文件的比例——使用参数“分类和计数”方法,或在没有单独分类的情况下“直接”非参数估计比例。不幸的是,即使正确分类的文档百分比增加,分类和计数方法也可能高度依赖于模型,或者在比例上产生更多偏差。直接估计可以避免这些问题,但当语言的含义在训练集和测试集之间发生变化或在不同类别之间过于相似时,可能会受到影响。我们通过包括和优化连续文本特征,以及根据因果推理文献改编的匹配形式,开发了一种改进的直接估计方法,而没有这些问题。我们的方法大大提高了73个数据集的性能。我们还提供易于使用的软件,实现这里讨论的所有想法。
{"title":"An Improved Method of Automated Nonparametric Content Analysis for Social Science","authors":"Gary King, Connor Jerzak, Anton Strezhnev","doi":"10.1017/pan.2021.36","DOIUrl":"https://doi.org/10.1017/pan.2021.36","url":null,"abstract":"Abstract Some scholars build models to classify documents into chosen categories. Others, especially social scientists who tend to focus on population characteristics, instead usually estimate the proportion of documents in each category—using either parametric “classify-and-count” methods or “direct” nonparametric estimation of proportions without individual classification. Unfortunately, classify-and-count methods can be highly model-dependent or generate more bias in the proportions even as the percent of documents correctly classified increases. Direct estimation avoids these problems, but can suffer when the meaning of language changes between training and test sets or is too similar across categories. We develop an improved direct estimation approach without these issues by including and optimizing continuous text features, along with a form of matching adapted from the causal inference literature. Our approach substantially improves performance in a diverse collection of 73 datasets. We also offer easy-to-use software that implements all ideas discussed herein.","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":"31 1","pages":"42 - 58"},"PeriodicalIF":5.4,"publicationDate":"2022-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44479486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
Political Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1