Sociological Methodology最新文献_第4页

Decomposing Ethnic Achievement Gaps across Multiple Levels of Analysis and for Multiple Ethnic Groups 跨多个层次分析和多民族的民族成就差距分解

IF 3 2区社会学 Q1 SOCIOLOGY

Sociological Methodology

Pub Date : 2022-06-08 DOI: 10.1177/00811750221099503

Beatriz Gallo Cordoba, G. Leckie, W. Browne

Ethnic achievement gaps are often explained in terms of student and school factors. The decomposition of these gaps into their within- and between-school components has therefore been applied as a strategy to quantify the overall influence of each set of factors. Three competing approaches have previously been proposed, but each is limited to the study of student-school decompositions of the gap between two ethnic groups (e.g., White and Black). The authors show that these approaches can be reformulated as mediation models facilitating new extensions to allow additional levels in the school system (e.g., classrooms, school districts, geographic areas) and multiple ethnic groups (e.g., White, Black, Hispanic, Asian). The authors illustrate these extensions using administrative data for high school students in Colombia and highlight the increased substantive insights and nuanced policy implications they afford.

种族成就差距通常是从学生和学校因素的角度来解释的。因此，将这些差距分解为学校内部和学校之间的组成部分，作为量化每组因素的总体影响的策略。之前已经提出了三种相互竞争的方法，但每种方法都仅限于研究两个种族群体（如白人和黑人）之间差距的学生-学校分解。作者表明，这些方法可以重新制定为调解模式，促进新的扩展，以允许学校系统（如教室、学区、地理区域）和多个种族群体（如白人、黑人、西班牙裔、亚裔）的额外级别。作者使用哥伦比亚高中生的行政数据说明了这些扩展，并强调了它们提供的更多实质性见解和微妙的政策影响。

引用次数: 0

Lorenz Interpolation: A Method for Estimating Income Inequality from Grouped Income Data 洛伦兹插值：一种从分组收入数据中估计收入不平等的方法

IF 3 2区社会学 Q1 SOCIOLOGY

Sociological Methodology

Pub Date : 2022-04-07 DOI: 10.1177/00811750221085586

Andrew Carr

To understand how income inequality affects individuals and communities, researchers must have accurate measures of income inequality at lower geographic levels, such as counties, school districts, and census tracts. Studies of income inequality, however, are constrained by the tabular format in which censuses publish income data. In this article, the author proposes a new method, Lorenz interpolation, for estimating income inequality from binned income data. Using public microsample data from the American Community Survey (ACS), the author shows that Lorenz interpolation produces more accurate and reliable income inequality estimates than do alternative estimation methods. Then, using restricted ACS income data obtained through a Federal Statistical Research Data Center, the author evaluates the accuracy of Lorenz interpolation at the census tract and school district levels. Lorenz interpolation produces reliable school district–level estimates, but the method produces less reliable estimates for some income inequality measures at the tract level. These findings indicate that researchers should refrain from estimating tract-level income inequality measures from tabular data. They also show that aggregating tract income distributions to higher geographic levels can produce valid estimates of income inequality.

为了了解收入不平等如何影响个人和社区，研究人员必须准确衡量较低地理水平的收入不平等，如县、学区和人口普查区。然而，对收入不平等的研究受到人口普查公布收入数据的表格格式的限制。在这篇文章中，作者提出了一种新的方法，洛伦兹插值，以估计收入不平等的合并收入数据。利用美国社区调查（ACS）的公共微观样本数据，作者表明洛伦茨插值比其他估计方法产生了更准确、更可靠的收入不平等估计。然后，使用通过联邦统计研究数据中心获得的限制性ACS收入数据，作者评估了Lorenz插值在人口普查区和学区层面的准确性。洛伦兹插值产生了可靠的学区水平估计，但该方法对地区水平的一些收入不平等指标产生了不太可靠的估计。这些发现表明，研究人员应该避免从表格数据中估计地区层面的收入不平等指标。他们还表明，将地区收入分配汇总到更高的地理水平可以产生对收入不平等的有效估计。

{"title":"Lorenz Interpolation: A Method for Estimating Income Inequality from Grouped Income Data","authors":"Andrew Carr","doi":"10.1177/00811750221085586","DOIUrl":"https://doi.org/10.1177/00811750221085586","url":null,"abstract":"To understand how income inequality affects individuals and communities, researchers must have accurate measures of income inequality at lower geographic levels, such as counties, school districts, and census tracts. Studies of income inequality, however, are constrained by the tabular format in which censuses publish income data. In this article, the author proposes a new method, Lorenz interpolation, for estimating income inequality from binned income data. Using public microsample data from the American Community Survey (ACS), the author shows that Lorenz interpolation produces more accurate and reliable income inequality estimates than do alternative estimation methods. Then, using restricted ACS income data obtained through a Federal Statistical Research Data Center, the author evaluates the accuracy of Lorenz interpolation at the census tract and school district levels. Lorenz interpolation produces reliable school district–level estimates, but the method produces less reliable estimates for some income inequality measures at the tract level. These findings indicate that researchers should refrain from estimating tract-level income inequality measures from tabular data. They also show that aggregating tract income distributions to higher geographic levels can produce valid estimates of income inequality.","PeriodicalId":48140,"journal":{"name":"Sociological Methodology","volume":"52 1","pages":"141 - 161"},"PeriodicalIF":3.0,"publicationDate":"2022-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41707373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Clinical severity of, and effectiveness of mRNA vaccines against, covid-19 from omicron, delta, and alpha SARS-CoV-2 variants in the United States: prospective observational study. 美国 SARS-CoV-2（Omicron、delta 和 alpha）变种 covid-19 的临床严重性和 mRNA 疫苗的有效性：前瞻性观察研究。

2区社会学 Q1 SOCIOLOGY

Sociological Methodology

Pub Date : 2022-03-09 DOI: 10.1136/bmj-2021-069761

Adam S Lauring, Mark W Tenforde, James D Chappell, Manjusha Gaglani, Adit A Ginde, Tresa McNeal, Shekhar Ghamande, David J Douin, H Keipp Talbot, Jonathan D Casey, Nicholas M Mohr, Anne Zepeski, Nathan I Shapiro, Kevin W Gibbs, D Clark Files, David N Hager, Arber Shehu, Matthew E Prekker, Heidi L Erickson, Matthew C Exline, Michelle N Gong, Amira Mohamed, Nicholas J Johnson, Vasisht Srinivasan, Jay S Steingrub, Ithan D Peltan, Samuel M Brown, Emily T Martin, Arnold S Monto, Akram Khan, Catherine L Hough, Laurence W Busse, Caitlin C Ten Lohuis, Abhijit Duggal, Jennifer G Wilson, Alexandra June Gordon, Nida Qadir, Steven Y Chang, Christopher Mallow, Carolina Rivas, Hilary M Babcock, Jennie H Kwon, Natasha Halasa, Carlos G Grijalva, Todd W Rice, William B Stubblefield, Adrienne Baughman, Kelsey N Womack, Jillian P Rhoads, Christopher J Lindsell, Kimberly W Hart, Yuwei Zhu, Katherine Adams, Stephanie J Schrag, Samantha M Olson, Miwako Kobayashi, Jennifer R Verani, Manish M Patel, Wesley H Self

Objectives: To characterize the clinical severity of covid-19 associated with the alpha, delta, and omicron SARS-CoV-2 variants among adults admitted to hospital and to compare the effectiveness of mRNA vaccines to prevent hospital admissions related to each variant.

Design: Case-control study.

Setting: 21 hospitals across the United States.

Participants: 11 690 adults (≥18 years) admitted to hospital: 5728 with covid-19 (cases) and 5962 without covid-19 (controls). Patients were classified into SARS-CoV-2 variant groups based on viral whole genome sequencing, and, if sequencing did not reveal a lineage, by the predominant circulating variant at the time of hospital admission: alpha (11 March to 3 July 2021), delta (4 July to 25 December 2021), and omicron (26 December 2021 to 14 January 2022).

Main outcome measures: Vaccine effectiveness calculated using a test negative design for mRNA vaccines to prevent covid-19 related hospital admissions by each variant (alpha, delta, omicron). Among patients admitted to hospital with covid-19, disease severity on the World Health Organization's clinical progression scale was compared among variants using proportional odds regression.

Results: Effectiveness of the mRNA vaccines to prevent covid-19 associated hospital admissions was 85% (95% confidence interval 82% to 88%) for two vaccine doses against the alpha variant, 85% (83% to 87%) for two doses against the delta variant, 94% (92% to 95%) for three doses against the delta variant, 65% (51% to 75%) for two doses against the omicron variant; and 86% (77% to 91%) for three doses against the omicron variant. In-hospital mortality was 7.6% (81/1060) for alpha, 12.2% (461/3788) for delta, and 7.1% (40/565) for omicron. Among unvaccinated patients with covid-19 admitted to hospital, severity on the WHO clinical progression scale was higher for the delta versus alpha variant (adjusted proportional odds ratio 1.28, 95% confidence interval 1.11 to 1.46), and lower for the omicron versus delta variant (0.61, 0.49 to 0.77). Compared with unvaccinated patients, severity was lower for vaccinated patients for each variant, including alpha (adjusted proportional odds ratio 0.33, 0.23 to 0.49), delta (0.44, 0.37 to 0.51), and omicron (0.61, 0.44 to 0.85).

Conclusions: mRNA vaccines were found to be highly effective in preventing covid-19 associated hospital admissions related to the alpha, delta, and omicron variants, but three vaccine doses were required to achieve protection against omicron similar to the protection that two doses provided against the delta and alpha variants. Among adults admitted to hospital with covid-19, the omicron variant was associated with less severe disease than the delta variant but still resulted in substantial morbidity and mortality. Vaccinated patients admitted to hospital with cov

目的描述入院成人中与α、δ和ΩSARS-CoV-2变异体相关的covid-19的临床严重程度，并比较mRNA疫苗在预防与每种变异体相关的入院治疗方面的效果：病例对照研究。研究地点：全美 21 家医院：入院的 11 690 名成人（≥18 岁）：5728 人接种了 covid-19（病例），5962 人未接种 covid-19（对照）。根据病毒全基因组测序结果将患者分为SARS-CoV-2变异体组，如果测序结果未显示血统，则根据入院时的主要循环变异体将患者分为阿尔法组（2021年3月11日至7月3日）、德尔塔组（2021年7月4日至12月25日）和奥米克隆组（2021年12月26日至2022年1月14日）：采用 mRNA 疫苗的阴性试验设计计算疫苗有效性，以预防各变体（α、δ、ocmicron）与 covid-19 相关的入院治疗。在因covid-19入院的患者中，采用比例几率回归法比较了不同变异株在世界卫生组织临床进展量表中的疾病严重程度：结果：mRNA疫苗预防covid-19相关住院病例的效果为：接种两剂疫苗预防α变异株的效果为85%（95%置信区间为82%至88%）；接种两剂疫苗预防δ变异株的效果为85%（83%至87%）；接种三剂疫苗预防δ变异株的效果为94%（92%至95%）；接种两剂疫苗预防Ω变异株的效果为65%（51%至75%）；接种三剂疫苗预防Ω变异株的效果为86%（77%至91%）。阿尔法型的院内死亡率为 7.6%（81/1060），德尔塔型为 12.2%（461/3788），奥米克龙型为 7.1%（40/565）。在入院的未接种疫苗的covid-19患者中，δ变异型与α变异型相比，在世界卫生组织临床进展量表中的严重程度更高（调整后比例几率比1.28，95%置信区间为1.11至1.46），而ocmicron变异型与δ变异型相比，严重程度更低（0.61，0.49至0.77）。与未接种疫苗的患者相比，接种疫苗的患者每个变体的严重程度都较低，包括α变体（调整后比例几率比0.33，0.23至0.49）、δ变体（0.44，0.37至0.51）和奥米克隆变体（0.61，0.44至0.85）。结论：研究发现，mRNA疫苗对预防与covid-19相关的α、δ和ogicron变异型入院治疗非常有效，但需要接种三剂疫苗才能达到预防ogicron的效果，这与接种两剂疫苗预防δ和α变异型的效果相似。在因感染 covid-19 而入院的成年人中，与 delta 变体相比，ocmicron 变体的病情较轻，但仍会导致大量的发病率和死亡率。就所有变异株而言，接种了covid-19疫苗的住院病人的疾病严重程度明显低于未接种疫苗的病人。

{"title":"Clinical severity of, and effectiveness of mRNA vaccines against, covid-19 from omicron, delta, and alpha SARS-CoV-2 variants in the United States: prospective observational study.","authors":"Adam S Lauring, Mark W Tenforde, James D Chappell, Manjusha Gaglani, Adit A Ginde, Tresa McNeal, Shekhar Ghamande, David J Douin, H Keipp Talbot, Jonathan D Casey, Nicholas M Mohr, Anne Zepeski, Nathan I Shapiro, Kevin W Gibbs, D Clark Files, David N Hager, Arber Shehu, Matthew E Prekker, Heidi L Erickson, Matthew C Exline, Michelle N Gong, Amira Mohamed, Nicholas J Johnson, Vasisht Srinivasan, Jay S Steingrub, Ithan D Peltan, Samuel M Brown, Emily T Martin, Arnold S Monto, Akram Khan, Catherine L Hough, Laurence W Busse, Caitlin C Ten Lohuis, Abhijit Duggal, Jennifer G Wilson, Alexandra June Gordon, Nida Qadir, Steven Y Chang, Christopher Mallow, Carolina Rivas, Hilary M Babcock, Jennie H Kwon, Natasha Halasa, Carlos G Grijalva, Todd W Rice, William B Stubblefield, Adrienne Baughman, Kelsey N Womack, Jillian P Rhoads, Christopher J Lindsell, Kimberly W Hart, Yuwei Zhu, Katherine Adams, Stephanie J Schrag, Samantha M Olson, Miwako Kobayashi, Jennifer R Verani, Manish M Patel, Wesley H Self","doi":"10.1136/bmj-2021-069761","DOIUrl":"10.1136/bmj-2021-069761","url":null,"abstract":"Objectives: To characterize the clinical severity of covid-19 associated with the alpha, delta, and omicron SARS-CoV-2 variants among adults admitted to hospital and to compare the effectiveness of mRNA vaccines to prevent hospital admissions related to each variant.Design: Case-control study.Setting: 21 hospitals across the United States.Participants: 11 690 adults (≥18 years) admitted to hospital: 5728 with covid-19 (cases) and 5962 without covid-19 (controls). Patients were classified into SARS-CoV-2 variant groups based on viral whole genome sequencing, and, if sequencing did not reveal a lineage, by the predominant circulating variant at the time of hospital admission: alpha (11 March to 3 July 2021), delta (4 July to 25 December 2021), and omicron (26 December 2021 to 14 January 2022).Main outcome measures: Vaccine effectiveness calculated using a test negative design for mRNA vaccines to prevent covid-19 related hospital admissions by each variant (alpha, delta, omicron). Among patients admitted to hospital with covid-19, disease severity on the World Health Organization's clinical progression scale was compared among variants using proportional odds regression.Results: Effectiveness of the mRNA vaccines to prevent covid-19 associated hospital admissions was 85% (95% confidence interval 82% to 88%) for two vaccine doses against the alpha variant, 85% (83% to 87%) for two doses against the delta variant, 94% (92% to 95%) for three doses against the delta variant, 65% (51% to 75%) for two doses against the omicron variant; and 86% (77% to 91%) for three doses against the omicron variant. In-hospital mortality was 7.6% (81/1060) for alpha, 12.2% (461/3788) for delta, and 7.1% (40/565) for omicron. Among unvaccinated patients with covid-19 admitted to hospital, severity on the WHO clinical progression scale was higher for the delta versus alpha variant (adjusted proportional odds ratio 1.28, 95% confidence interval 1.11 to 1.46), and lower for the omicron versus delta variant (0.61, 0.49 to 0.77). Compared with unvaccinated patients, severity was lower for vaccinated patients for each variant, including alpha (adjusted proportional odds ratio 0.33, 0.23 to 0.49), delta (0.44, 0.37 to 0.51), and omicron (0.61, 0.44 to 0.85).Conclusions: mRNA vaccines were found to be highly effective in preventing covid-19 associated hospital admissions related to the alpha, delta, and omicron variants, but three vaccine doses were required to achieve protection against omicron similar to the protection that two doses provided against the delta and alpha variants. Among adults admitted to hospital with covid-19, the omicron variant was associated with less severe disease than the delta variant but still resulted in substantial morbidity and mortality. Vaccinated patients admitted to hospital with cov","PeriodicalId":48140,"journal":{"name":"Sociological Methodology","volume":"43 1","pages":"e069761"},"PeriodicalIF":0.0,"publicationDate":"2022-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8905308/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86913270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Asking about the Worst First: An Examination of Contextual Effects in Factorial Vignettes 先问最坏的:析因小片段中语境效应的检验

IF 3 2区社会学 Q1 SOCIOLOGY

Sociological Methodology

Pub Date : 2022-02-01 DOI: 10.1177/00811750211071129

Amelie Pedneault, Dale W. Willits

Contextual effects refer to the process by which responses given to survey questions can be affected by question order. Generally, contextual effects harm data measurement validity by introducing bias and increasing measurement error; the risk is that responses to a survey’s later questions are partly affected not only by the substance of the question but also by the preceding questions. Two opposite effects are possible: a carryover effect refers to the assimilation of later questions into those previously asked, and a backfire effect refers to the contrasting of earlier and later questions. In the case where a stereotype is activated in earlier questions of a survey, the previous literature suggests a carryover effect is more likely. The present study tests whether this is also the case in factorial vignette research by examining the influence of first presenting a vignette that corresponds more closely to a stereotypical view of sexual abuse. Results indicate a backfire effect, pointing to the distinctively different way in which vignette scenarios activate stereotypes compared to general survey questions. The results also highlight the need for researchers to control for contextual ordering effects when modeling factorial vignette data.

上下文效应是指对调查问题的回答会受到问题顺序的影响。一般来说，背景效应通过引入偏倚和增加测量误差来损害数据的测量效度;这样做的风险在于，对调查后几个问题的回答不仅部分地受到问题实质的影响，而且还受到前几个问题的影响。有两种相反的影响是可能存在的:一种是结转效应，指的是把后面的问题同化到前面的问题中，另一种是反作用，指的是把前面和后面的问题进行对比。如果刻板印象在调查的早期问题中被激活，先前的文献表明，结转效应更有可能发生。本研究通过检查首先呈现一个与性虐待的刻板印象更接近的小插曲的影响，来测试在析因小插曲研究中是否也是如此。结果表明了一种适得其反的效果，指出了与一般调查问题相比，小情节情景激活刻板印象的明显不同的方式。结果还强调需要研究人员控制上下文排序的影响时，建模因子小插曲数据。

{"title":"Asking about the Worst First: An Examination of Contextual Effects in Factorial Vignettes","authors":"Amelie Pedneault, Dale W. Willits","doi":"10.1177/00811750211071129","DOIUrl":"https://doi.org/10.1177/00811750211071129","url":null,"abstract":"Contextual effects refer to the process by which responses given to survey questions can be affected by question order. Generally, contextual effects harm data measurement validity by introducing bias and increasing measurement error; the risk is that responses to a survey’s later questions are partly affected not only by the substance of the question but also by the preceding questions. Two opposite effects are possible: a carryover effect refers to the assimilation of later questions into those previously asked, and a backfire effect refers to the contrasting of earlier and later questions. In the case where a stereotype is activated in earlier questions of a survey, the previous literature suggests a carryover effect is more likely. The present study tests whether this is also the case in factorial vignette research by examining the influence of first presenting a vignette that corresponds more closely to a stereotypical view of sexual abuse. Results indicate a backfire effect, pointing to the distinctively different way in which vignette scenarios activate stereotypes compared to general survey questions. The results also highlight the need for researchers to control for contextual ordering effects when modeling factorial vignette data.","PeriodicalId":48140,"journal":{"name":"Sociological Methodology","volume":"52 1","pages":"103 - 118"},"PeriodicalIF":3.0,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42651125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Surveying Spontaneous Mass Protests: Mixed-mode Sampling and Field Methods 调查自发的群众抗议:混合模式采样和现场方法

IF 3 2区社会学 Q1 SOCIOLOGY

Sociological Methodology

Pub Date : 2022-02-01 DOI: 10.1177/00811750211071130

S. Yuen, Gary Tang, Francis L. F. Lee, Edmund W. Cheng

Protest survey is a standard tool for scholars to understand protests. However, although protest survey methods are well established, the occurrence of spontaneous and leaderless protests has created new challenges for researchers. Not only do their unpredictable occurrences hinder planning, their fluidity also creates problems in obtaining representative samples. This article addresses these challenges based on our research during Hong Kong’s Anti-Extradition Law Amendment Bill Movement. We propose a mixed-mode sampling method combining face-to-face survey and smartphone-based online survey (onsite and post hoc), which can maximize sample sizes while ensuring representativeness in a cost-effective manner. Test results indicate that key variables from the survey modes are not statistically different in a consistent manner, except for age. Our findings show mixed-mode sampling can better capture protesters’ characteristics in contemporary protests and is replicable in other contexts.

抗议调查是学者了解抗议活动的标准工具。然而，尽管抗议调查方法已经确立，但自发和无领导抗议的发生给研究人员带来了新的挑战。它们不可预测的出现不仅阻碍了计划，而且它们的流动性也给获得具有代表性的样本带来了问题。本文通过对香港反萃取法修订法案运动的研究，探讨了这些挑战。我们提出了一种将面对面调查和基于智能手机的在线调查（现场和事后）相结合的混合模式抽样方法，该方法可以最大限度地扩大样本量，同时以经济高效的方式确保代表性。测试结果表明，除了年龄之外，调查模式中的关键变量在统计上没有一致的差异。我们的研究结果表明，混合模式抽样可以更好地捕捉抗议者在当代抗议活动中的特征，并在其他情况下可复制。

引用次数: 10

Language Models in Sociological Research: An Application to Classifying Large Administrative Data and Measuring Religiosity 社会学研究中的语言模型：在大型行政数据分类和宗教信仰测量中的应用

IF 3 2区社会学 Q1 SOCIOLOGY

Sociological Methodology

Pub Date : 2021-10-25 DOI: 10.1177/00811750211053370

Jeffrey L. Jensen, Daniel Karell, Cole Tanigawa-Lau, Nizar Habash, Mai Oudah, Dhia Fairus Shofia Fani

Computational methods have become widespread in the social sciences, but probabilistic language models remain relatively underused. We introduce language models to a general social science readership. First, we offer an accessible explanation of language models, detailing how they estimate the probability of a piece of language, such as a word or sentence, on the basis of the linguistic context. Second, we apply language models in an illustrative analysis to demonstrate the mechanics of using these models in social science research. The example application uses language models to classify names in a large administrative database; the classifications are then used to measure a sociologically important phenomenon: the spatial variation of religiosity. This application highlights several advantages of language models, including their effectiveness in classifying text that contains variation around the base structures, as is often the case with localized naming conventions and dialects. We conclude by discussing language models’ potential to contribute to sociological research beyond classification through their ability to generate language.

计算方法在社会科学中已经普及，但概率语言模型仍然相对未得到充分利用。我们向一般社会科学读者介绍语言模型。首先，我们对语言模型提供了一个易于理解的解释，详细说明了它们如何根据语境来估计一段语言（如单词或句子）的概率。其次，我们将语言模型应用于例证分析中，以展示在社会科学研究中使用这些模型的机制。示例应用程序使用语言模型对大型管理数据库中的名称进行分类；然后，这些分类被用来衡量一个社会学上重要的现象：宗教信仰的空间变异。该应用程序突出了语言模型的几个优点，包括它们在对包含基本结构变化的文本进行分类方面的有效性，本地化命名约定和方言通常就是这样。最后，我们讨论了语言模型通过其生成语言的能力，在分类之外为社会学研究做出贡献的潜力。

{"title":"Language Models in Sociological Research: An Application to Classifying Large Administrative Data and Measuring Religiosity","authors":"Jeffrey L. Jensen, Daniel Karell, Cole Tanigawa-Lau, Nizar Habash, Mai Oudah, Dhia Fairus Shofia Fani","doi":"10.1177/00811750211053370","DOIUrl":"https://doi.org/10.1177/00811750211053370","url":null,"abstract":"Computational methods have become widespread in the social sciences, but probabilistic language models remain relatively underused. We introduce language models to a general social science readership. First, we offer an accessible explanation of language models, detailing how they estimate the probability of a piece of language, such as a word or sentence, on the basis of the linguistic context. Second, we apply language models in an illustrative analysis to demonstrate the mechanics of using these models in social science research. The example application uses language models to classify names in a large administrative database; the classifications are then used to measure a sociologically important phenomenon: the spatial variation of religiosity. This application highlights several advantages of language models, including their effectiveness in classifying text that contains variation around the base structures, as is often the case with localized naming conventions and dialects. We conclude by discussing language models’ potential to contribute to sociological research beyond classification through their ability to generate language.","PeriodicalId":48140,"journal":{"name":"Sociological Methodology","volume":"52 1","pages":"30 - 52"},"PeriodicalIF":3.0,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48332990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

What Goes Up Might Not Come Down: Modeling Directional Asymmetry with Large-N, Large-T Data 上升的东西可能不会下降：用大N、大T数据建模方向不对称

IF 3 2区社会学 Q1 SOCIOLOGY

Sociological Methodology

Pub Date : 2021-09-28 DOI: 10.1177/00811750211046307

Ryan P. Thombs, Xiaorui Huang, Jared Berry Fitzgerald

Modeling asymmetric relationships is an emerging subject of interest among sociologists. York and Light advanced a method to estimate asymmetric models with panel data, which was further developed by Allison. However, little attention has been given to the large-N, large-T case, wherein autoregression, slope heterogeneity, and cross-sectional dependence are important issues to consider. The authors fill this gap by conducting Monte Carlo experiments comparing the bias and power of the fixed-effects estimator to a set of heterogeneous panel estimators. The authors find that dynamic misspecification can produce substantial biases in the coefficients. Furthermore, even when the dynamics are correctly specified, the fixed-effects estimator will produce inconsistent and unstable estimates of the long-run effects in the presence of slope heterogeneity. The authors demonstrate these findings by testing for directional asymmetry in the economic development–CO2 emissions relationship, a key question in macro sociology, using data for 66 countries from 1971 to 2015. The authors conclude with a set of methodological recommendations on modeling directional asymmetry.

不对称关系建模是社会学家感兴趣的新兴课题。York和Light提出了一种利用面板数据估计不对称模型的方法，Allison对此进行了进一步的开发。然而，很少有人关注大N、大T的情况，其中自回归、斜率非均质性和横截面依赖性是需要考虑的重要问题。作者通过进行蒙特卡洛实验来填补这一空白，将固定效应估计器的偏差和功率与一组异质面板估计器进行比较。作者发现，动态错误指定会在系数中产生很大的偏差。此外，即使正确指定了动力学，在存在边坡异质性的情况下，固定效应估计器也会对长期效应产生不一致和不稳定的估计。作者使用1971年至2015年66个国家的数据，通过测试经济发展与二氧化碳排放关系的方向不对称来证明这些发现，这是宏观社会学中的一个关键问题。最后，作者提出了一系列关于建模方向不对称的方法论建议。

{"title":"What Goes Up Might Not Come Down: Modeling Directional Asymmetry with Large-N, Large-T Data","authors":"Ryan P. Thombs, Xiaorui Huang, Jared Berry Fitzgerald","doi":"10.1177/00811750211046307","DOIUrl":"https://doi.org/10.1177/00811750211046307","url":null,"abstract":"Modeling asymmetric relationships is an emerging subject of interest among sociologists. York and Light advanced a method to estimate asymmetric models with panel data, which was further developed by Allison. However, little attention has been given to the large-N, large-T case, wherein autoregression, slope heterogeneity, and cross-sectional dependence are important issues to consider. The authors fill this gap by conducting Monte Carlo experiments comparing the bias and power of the fixed-effects estimator to a set of heterogeneous panel estimators. The authors find that dynamic misspecification can produce substantial biases in the coefficients. Furthermore, even when the dynamics are correctly specified, the fixed-effects estimator will produce inconsistent and unstable estimates of the long-run effects in the presence of slope heterogeneity. The authors demonstrate these findings by testing for directional asymmetry in the economic development–CO2 emissions relationship, a key question in macro sociology, using data for 66 countries from 1971 to 2015. The authors conclude with a set of methodological recommendations on modeling directional asymmetry.","PeriodicalId":48140,"journal":{"name":"Sociological Methodology","volume":"52 1","pages":"1 - 29"},"PeriodicalIF":3.0,"publicationDate":"2021-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42559507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

From sequences to variables – Rethinking the relationship between sequences and outcomes 从序列到变量——重新思考序列和结果之间的关系

IF 3 2区社会学 Q1 SOCIOLOGY

Sociological Methodology

Pub Date : 2021-09-16 DOI: 10.31235/osf.io/srxag

S. Helske, Jouni Helske, Guilherme Kenji Chihaya

Sequence analysis (SA) has gained increasing interest in social sciences for theholistic analysis of life course and other longitudinal data. The usual approach isto construct sequences, calculate dissimilarities, group similar sequences with clusteranalysis, and use cluster membership as a dependent or independent variable in a linear or nonlinear regression model.This approach may be problematic as the cluster memberships are assumed to befixed known characteristics of the subjects in subsequent analysis. Furthermore, often it is more reasonable to assume that individual sequences are mixtures of multiple ideal types rather than equal members of some group. Failing to account for these issues may lead to wrong conclusions about the nature of the studied relationships.In this paper, we bring forward and discuss the problems of the "traditional" useof SA clusters and compare four approaches for different types of data. We conduct a simulation study and an empirical study, demonstrating the importance of considering how sequences and outcomes are related and the need to adjust the analysis accordingly. In many typical social science applications, the traditional approach is prone to result in wrong conclusions and so-called position-dependent approaches such as representativeness should be preferred.

序列分析（SA）在社会科学中对生命历程和其他纵向数据的精细分析越来越感兴趣。通常的方法是构建序列，计算相异性，用聚类分析对相似序列进行分组，并在线性或非线性回归模型中使用聚类隶属度作为因变量或自变量。这种方法可能会有问题，因为在随后的分析中，假设聚类成员资格适合受试者的已知特征。此外，通常更合理的假设是，单个序列是多个理想类型的混合物，而不是某个群的相等成员。不考虑这些问题可能会导致对所研究关系的性质得出错误的结论。在本文中，我们提出并讨论了SA聚类的“传统”使用问题，并对不同类型数据的四种方法进行了比较。我们进行了一项模拟研究和一项实证研究，证明了考虑序列和结果如何相关的重要性，以及相应调整分析的必要性。在许多典型的社会科学应用中，传统的方法容易导致错误的结论，应该首选所谓的立场依赖方法，如代表性。

{"title":"From sequences to variables – Rethinking the relationship between sequences and outcomes","authors":"S. Helske, Jouni Helske, Guilherme Kenji Chihaya","doi":"10.31235/osf.io/srxag","DOIUrl":"https://doi.org/10.31235/osf.io/srxag","url":null,"abstract":"Sequence analysis (SA) has gained increasing interest in social sciences for theholistic analysis of life course and other longitudinal data. The usual approach isto construct sequences, calculate dissimilarities, group similar sequences with clusteranalysis, and use cluster membership as a dependent or independent variable in a linear or nonlinear regression model.This approach may be problematic as the cluster memberships are assumed to befixed known characteristics of the subjects in subsequent analysis. Furthermore, often it is more reasonable to assume that individual sequences are mixtures of multiple ideal types rather than equal members of some group. Failing to account for these issues may lead to wrong conclusions about the nature of the studied relationships.In this paper, we bring forward and discuss the problems of the \"traditional\" useof SA clusters and compare four approaches for different types of data. We conduct a simulation study and an empirical study, demonstrating the importance of considering how sequences and outcomes are related and the need to adjust the analysis accordingly. In many typical social science applications, the traditional approach is prone to result in wrong conclusions and so-called position-dependent approaches such as representativeness should be preferred.","PeriodicalId":48140,"journal":{"name":"Sociological Methodology","volume":"1 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2021-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45045791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Uncovering Sociological Effect Heterogeneity Using Tree-Based Machine Learning. 利用树型机器学习揭示社会学效应异质性。

IF 2.4 2区社会学 Q1 SOCIOLOGY

Sociological Methodology

Pub Date : 2021-08-01 Epub Date: 2021-03-04 DOI: 10.1177/0081175021993503

Jennie E Brand, Jiahui Xu, Bernard Koch, Pablo Geraldo

Individuals do not respond uniformly to treatments, such as events or interventions. Sociologists routinely partition samples into subgroups to explore how the effects of treatments vary by selected covariates, such as race and gender, on the basis of theoretical priors. Data-driven discoveries are also routine, yet the analyses by which sociologists typically go about them are often problematic and seldom move us beyond our biases to explore new meaningful subgroups. Emerging machine learning methods based on decision trees allow researchers to explore sources of variation that they may not have previously considered or envisaged. In this article, the authors use tree-based machine learning, that is, causal trees, to recursively partition the sample to uncover sources of effect heterogeneity. Assessing a central topic in social inequality, college effects on wages, the authors compare what is learned from covariate and propensity score-based partitioning approaches with recursive partitioning based on causal trees. Decision trees, although superseded by forests for estimation, can be used to uncover subpopulations responsive to treatments. Using observational data, the authors expand on the existing causal tree literature by applying leaf-specific effect estimation strategies to adjust for observed confounding, including inverse propensity weighting, nearest neighbor matching, and doubly robust causal forests. We also assess localized balance metrics and sensitivity analyses to address the possibility of differential imbalance and unobserved confounding. The authors encourage researchers to follow similar data exploration practices in their work on variation in sociological effects and offer a straightforward framework by which to do so.

个人对事件或干预等处理方法的反应并不一致。社会学家通常根据理论先验，将样本划分为不同的子群体，以探讨不同的协变量（如种族和性别）对治疗效果的影响。数据驱动的发现也是例行工作，但社会学家通常采用的分析方法往往存在问题，很少能让我们超越偏见，探索新的有意义的亚群。基于决策树的新兴机器学习方法使研究人员能够探索他们以前可能未曾考虑或设想过的变异来源。在本文中，作者使用基于树的机器学习方法，即因果树，对样本进行递归分区，以发现效应异质性的来源。在评估社会不平等的一个核心主题--大学对工资的影响时，作者比较了基于协变量和倾向得分的分区方法与基于因果树的递归分区方法。决策树虽然在估算中被森林所取代，但仍可用于发现对治疗有反应的亚群。作者利用观察数据，对现有的因果树文献进行了扩展，采用叶片特异性效应估计策略来调整观察到的混杂因素，包括反倾向加权、近邻匹配和双重稳健因果森林。我们还评估了局部平衡指标和敏感性分析，以解决差异不平衡和未观察到的混杂的可能性。作者鼓励研究人员在社会学效应变异的工作中遵循类似的数据探索实践，并提供了一个简单明了的框架。

{"title":"Uncovering Sociological Effect Heterogeneity Using Tree-Based Machine Learning.","authors":"Jennie E Brand, Jiahui Xu, Bernard Koch, Pablo Geraldo","doi":"10.1177/0081175021993503","DOIUrl":"10.1177/0081175021993503","url":null,"abstract":"Individuals do not respond uniformly to treatments, such as events or interventions. Sociologists routinely partition samples into subgroups to explore how the effects of treatments vary by selected covariates, such as race and gender, on the basis of theoretical priors. Data-driven discoveries are also routine, yet the analyses by which sociologists typically go about them are often problematic and seldom move us beyond our biases to explore new meaningful subgroups. Emerging machine learning methods based on decision trees allow researchers to explore sources of variation that they may not have previously considered or envisaged. In this article, the authors use tree-based machine learning, that is, causal trees, to recursively partition the sample to uncover sources of effect heterogeneity. Assessing a central topic in social inequality, college effects on wages, the authors compare what is learned from covariate and propensity score-based partitioning approaches with recursive partitioning based on causal trees. Decision trees, although superseded by forests for estimation, can be used to uncover subpopulations responsive to treatments. Using observational data, the authors expand on the existing causal tree literature by applying leaf-specific effect estimation strategies to adjust for observed confounding, including inverse propensity weighting, nearest neighbor matching, and doubly robust causal forests. We also assess localized balance metrics and sensitivity analyses to address the possibility of differential imbalance and unobserved confounding. The authors encourage researchers to follow similar data exploration practices in their work on variation in sociological effects and offer a straightforward framework by which to do so.","PeriodicalId":48140,"journal":{"name":"Sociological Methodology","volume":"51 2","pages":"189-223"},"PeriodicalIF":2.4,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9897445/pdf/nihms-1849062.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10652104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A General Panel Model for Unobserved Time Heterogeneity with Application to the Politics of Mass Incarceration 非观测时间异质性的一般面板模型及其在大规模监禁政治中的应用

IF 3 2区社会学 Q1 SOCIOLOGY

Sociological Methodology

Pub Date : 2021-05-25 DOI: 10.1177/00811750211016033

Scott W. Duxbury

Panel data analysis is common in the social sciences. Fixed effects models are a favorite among sociologists because they control for unobserved heterogeneity (unexplained variation) among cross-sectional units, but estimates are biased when there is unobserved heterogeneity in the underlying time trends. Two-way fixed effects models adjust for unobserved time heterogeneity but are inefficient, cannot include unit-invariant variables, and eliminate common trends: the portion of variance in a time-varying variable that is invariant across cross-sectional units. This article introduces a general panel model that can include unit-invariant variables, corrects for unobserved time heterogeneity, and provides the effect of common trends while also allowing for unobserved unit heterogeneity, time-varying coefficients, and time-invariant variables. One-way and two-way fixed effects models are shown to be restrictive forms of this general model. Other restrictive forms are also derived that offer all the usual advantages of one-way and two-way fixed effects models but account for unobserved time heterogeneity. The author uses the models to examine the increase in state incarceration rates between 1970 and 2015.

面板数据分析在社会科学中很常见。固定效应模型是社会学家的最爱，因为它们控制了横截面单位之间未观察到的异质性（无法解释的变化），但当潜在时间趋势中存在未观察到异质性时，估计是有偏差的。双向固定效应模型针对未观察到的时间异质性进行调整，但效率低下，不能包括单位不变变量，并消除了常见趋势：时变变量中跨横截面单位不变的方差部分。本文介绍了一个通用面板模型，该模型可以包括单位不变变量，校正未观测到的时间异质性，并提供了共同趋势的影响，同时还考虑到未观测到单位异质性、时变系数和时间不变变量。单向和双向固定效应模型被证明是这种通用模型的限制形式。还导出了其他限制性形式，它们提供了单向和双向固定效应模型的所有常见优势，但考虑到了未观察到的时间异质性。作者使用这些模型来研究1970年至2015年间各州监禁率的上升情况。

{"title":"A General Panel Model for Unobserved Time Heterogeneity with Application to the Politics of Mass Incarceration","authors":"Scott W. Duxbury","doi":"10.1177/00811750211016033","DOIUrl":"https://doi.org/10.1177/00811750211016033","url":null,"abstract":"Panel data analysis is common in the social sciences. Fixed effects models are a favorite among sociologists because they control for unobserved heterogeneity (unexplained variation) among cross-sectional units, but estimates are biased when there is unobserved heterogeneity in the underlying time trends. Two-way fixed effects models adjust for unobserved time heterogeneity but are inefficient, cannot include unit-invariant variables, and eliminate common trends: the portion of variance in a time-varying variable that is invariant across cross-sectional units. This article introduces a general panel model that can include unit-invariant variables, corrects for unobserved time heterogeneity, and provides the effect of common trends while also allowing for unobserved unit heterogeneity, time-varying coefficients, and time-invariant variables. One-way and two-way fixed effects models are shown to be restrictive forms of this general model. Other restrictive forms are also derived that offer all the usual advantages of one-way and two-way fixed effects models but account for unobserved time heterogeneity. The author uses the models to examine the increase in state incarceration rates between 1970 and 2015.","PeriodicalId":48140,"journal":{"name":"Sociological Methodology","volume":"51 1","pages":"348 - 377"},"PeriodicalIF":3.0,"publicationDate":"2021-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/00811750211016033","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48076745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2