
Sociological Methodology: Latest Articles

Comparing the Incomparable? Issues of Lacking Common Support, Functional-Form Misspecification, and Insufficient Sample Size in Decompositions
IF 3; Zone 2 (Sociology); Q1 Social Sciences; Pub Date: 2023-05-20; DOI: 10.1177/00811750231169729
Maik Hamjediers, Maximilian Sprengholz
Decompositions make it possible to investigate whether gaps between groups in certain outcomes would remain if the groups had comparable characteristics. In practice, however, such counterfactual comparability is difficult to establish when common support is lacking, the functional form is misspecified, or the sample is too small. In this article, the authors show how these three interrelated issues can undermine decompositions by comparing the results of a regression-based Kitagawa-Blinder-Oaxaca decomposition and matching decompositions applied to simulated and real-world data. The results show that matching decompositions are robust to issues of common support and functional-form misspecification but demand a large number of observations. Kitagawa-Blinder-Oaxaca decompositions also provide consistent estimates for smaller samples but require assumptions for model specification and, when common support is lacking, for model-based extrapolation. The authors recommend starting any decomposition with a matching approach to assess potential problems of common support and misspecification.
Citations: 0
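The regression-based decomposition named in the abstract above can be illustrated with a minimal sketch. It is not the authors' code. It assumes a twofold Kitagawa-Blinder-Oaxaca decomposition with group B's coefficients as the reference, ordinary least squares fitted separately per group, and simulated data. The function name `kbo_twofold` and all parameter values are hypothetical.

```python
import numpy as np

def kbo_twofold(X_a, y_a, X_b, y_b):
    """Twofold decomposition of mean(y_a) - mean(y_b), reference = group B coefficients."""
    # Add an intercept column to each group's design matrix.
    Xa = np.column_stack([np.ones(len(X_a)), X_a])
    Xb = np.column_stack([np.ones(len(X_b)), X_b])
    # Fit OLS separately in each group.
    beta_a, *_ = np.linalg.lstsq(Xa, y_a, rcond=None)
    beta_b, *_ = np.linalg.lstsq(Xb, y_b, rcond=None)
    xbar_a, xbar_b = Xa.mean(axis=0), Xb.mean(axis=0)
    gap = y_a.mean() - y_b.mean()
    # Part of the gap attributed to differences in mean characteristics.
    explained = (xbar_a - xbar_b) @ beta_b
    # Part attributed to differences in coefficients (including intercepts).
    unexplained = xbar_a @ (beta_a - beta_b)
    return gap, explained, unexplained

# Simulated data (hypothetical values): group A has higher x and steeper returns.
rng = np.random.default_rng(0)
x_a = rng.normal(1.0, 1.0, size=(500, 1))
x_b = rng.normal(0.0, 1.0, size=(500, 1))
y_a = 2.0 + 1.5 * x_a[:, 0] + rng.normal(0.0, 0.5, 500)
y_b = 1.0 + 1.0 * x_b[:, 0] + rng.normal(0.0, 0.5, 500)

gap, explained, unexplained = kbo_twofold(x_a, y_a, x_b, y_b)
print(f"gap={gap:.3f} explained={explained:.3f} unexplained={unexplained:.3f}")
```

Because each group regression includes an intercept, the explained and unexplained parts sum to the raw gap. The sketch does not test for common support or functional-form misspecification, which the abstract treats as separate threats to such estimates.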
Multivariate Small Area Estimation of Social Indicators: The Case of Continuous and Binary Variables
IF 3; Zone 2 (Sociology); Q1 Social Sciences; Pub Date: 2023-05-11; DOI: 10.1177/00811750231169726
Angelo Moretti
Large-scale sample surveys are not designed to produce reliable estimates for small areas. Here, small area estimation methods can be applied to estimate population parameters of target variables at detailed geographic scales. Small area estimation for noncontinuous variables is a topic of great interest in the social sciences where such variables can be found. Generalized linear mixed models are widely adopted in the literature. Interestingly, the small area estimation literature shows that multivariate small area estimators, where correlations among outcome variables are taken into account, produce more efficient estimates than do the traditional univariate techniques. In this article, the author evaluates a multivariate small area estimator on the basis of a joint mixed model in which a small area proportion and the mean of a continuous variable are estimated simultaneously. Using this method, the author “borrows strength” across response variables. The author carried out a design-based simulation study to evaluate the approach, in which the indicators under study are income and a binary monetary poverty indicator. The author found that the multivariate approach produces more efficient small area estimates than does the univariate modeling approach. The method can be extended to a large variety of indicators on the basis of social surveys.
Citations: 0
Strategies for Multidomain Sequence Analysis in Social Research
IF 3; Zone 2 (Sociology); Q1 Social Sciences; Pub Date: 2023-04-25; DOI: 10.1177/00811750231163833
G. Ritschard, T. Liao, E. Struffolino
Multidomain/multichannel sequence analysis has become widely used in social science research to uncover the underlying relationships between two or more observed trajectories in parallel. For example, life-course researchers use multidomain sequence analysis to study the parallel unfolding of multiple life-course domains. In this article, the authors conduct a critical review of the approaches most used in multidomain sequence analysis. The parallel unfolding of trajectories in multiple domains is typically analyzed by building a joint multidomain typology and by examining how domain-specific sequence patterns combine with one another within the multidomain groups. The authors identify four strategies to construct the joint multidomain typology: proceeding independently of domain costs and distances between domain sequences, deriving multidomain costs from domain costs, deriving distances between multidomain sequences from within-domain distances, and combining typologies constructed for each domain. The second and third strategies are prevalent in the literature and typically proceed additively. The authors show that these additive procedures assume between-domain independence, and they make explicit the constraints these procedures impose on between-multidomain costs and distances. Regarding the fourth strategy, the authors propose a merging algorithm to avoid scarce combined types. As regards the first strategy, the authors demonstrate, with a real example based on data from the Swiss Household Panel, that using edit distances with data-driven costs at the multidomain level (i.e., independent of domain costs) remains easily manageable with more than 200 different multidomain combined states. In addition, the authors introduce strategies to enhance visualization by types and domains.
Citations: 0
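The additive strategies mentioned in the abstract above can be illustrated with a small sketch. It is a simplified illustration rather than the authors' procedure. It assumes two domains (family and work), equal-length sequences, Hamming distance with unit substitution costs, and hypothetical state labels. Under these assumptions, summing the within-domain distances gives the same result as computing one distance on combined states whose substitution cost is the sum of the domain costs.

```python
def hamming(seq_x, seq_y, cost):
    """Hamming distance: sum of substitution costs over aligned positions."""
    if len(seq_x) != len(seq_y):
        raise ValueError("Hamming distance needs sequences of equal length")
    return sum(cost(a, b) for a, b in zip(seq_x, seq_y))

def unit_cost(a, b):
    """Domain-level substitution cost: 0 for identical states, 1 otherwise."""
    return 0 if a == b else 1

def additive_md_cost(state_x, state_y):
    """Multidomain cost derived additively from the domain costs.
    A multidomain state is a tuple (family_state, work_state)."""
    return sum(unit_cost(a, b) for a, b in zip(state_x, state_y))

# Hypothetical trajectories: S = single, M = married, E = employed, U = unemployed.
family_1 = ["S", "S", "M", "M"]
family_2 = ["S", "M", "M", "M"]
work_1 = ["E", "E", "U", "E"]
work_2 = ["E", "U", "U", "U"]

# Sum of the within-domain distances.
d_sum = hamming(family_1, family_2, unit_cost) + hamming(work_1, work_2, unit_cost)

# Distance on combined states with additively derived costs.
md_1 = list(zip(family_1, work_1))
md_2 = list(zip(family_2, work_2))
d_md = hamming(md_1, md_2, additive_md_cost)

print(d_sum, d_md)
```

The equality shown here relies on the position-by-position alignment of Hamming distance. With edit distances that allow insertions and deletions, the two additive strategies need not coincide.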
Systematic Social Observation at Scale: Using Crowdsourcing and Computer Vision to Measure Visible Neighborhood Conditions
IF 3; Zone 2 (Sociology); Q1 Social Sciences; Pub Date: 2023-04-10; DOI: 10.1177/00811750231160781
Jackelyn Hwang, Nikhil Naik
Analysis of neighborhood environments is important for understanding inequality. Few studies, however, use direct measures of the visible characteristics of neighborhood conditions, despite their theorized importance in shaping individual and community well-being, because collecting data on the physical conditions of places across neighborhoods and cities and over time has required extensive time and labor. The authors introduce systematic social observation at scale (SSO@S), a pipeline for using visual data, crowdsourcing, and computer vision to identify visible characteristics of neighborhoods at a large scale. The authors implement SSO@S on millions of street-level images across three physically distinct cities—Boston, Detroit, and Los Angeles—from 2007 to 2020 to identify trash across space and over time. The authors evaluate the extent to which this approach can be used to assist with systematic coding of street-level imagery through cross-validation and out-of-sample validation, class-activation mapping, and comparisons with other sources of observed neighborhood characteristics. The SSO@S approach produces estimates with high reliability that correlate with some expected demographic characteristics but not others, depending on the city. The authors conclude with an assessment of this approach for measuring visible characteristics of neighborhoods and the implications for methods and research.
Citations: 1
The Anatomy of Cohort Analysis: Decomposing Comparative Cohort Careers
IF 3; Zone 2 (Sociology); Q1 Social Sciences; Pub Date: 2023-03-28; DOI: 10.1177/00811750231151949
E. Fosse, Christopher Winship
In a widely influential essay, Ryder argued that to understand social change, researchers should compare cohort careers, contrasting how different cohorts change over the life cycle with respect to some outcome. Ryder, however, provided few technical details on how to actually conduct a cohort analysis. In this article, the authors develop a framework for analyzing temporally structured data grounded in the construction, comparison, and decomposition of cohort careers. The authors begin by illustrating how one can analyze age-period-cohort (APC) data by constructing graphs of cohort careers. Although a useful starting point, the major problem with this approach is that the graphs are typically of sufficient complexity that it can be difficult, if not impossible, to discern the underlying trends and patterns in the data. To provide a more useful foundation for cohort analysis, the authors therefore introduce three distinct improvements over the purely graphical approach. First, they provide a mathematical definition of a cohort career, demonstrating how the underlying parameters of interest can be estimated using a reparameterized version of the conventional APC model. The authors call this the life cycle and social change (LC-SC) model. Second, they contrast the proposed model with two alternative three-factor APC models and all logically possible two-factor models, showing that none of these other models are adequate for fully representing Ryder’s ideas. 
Third, the authors present the article’s major accomplishment: using the LC-SC model, they show how a collection of cohort careers can be decomposed into just four basic components: a curve representing an overall intracohort trend (or life cycle change); a curve representing an overall intercohort trend (or social change); a set of common cross-period temporal fluctuations that permit variability across cohort careers; and, finally, a set of terms representing cell-specific heterogeneity (or, equivalently, interactions among age, period, and/or cohort). As the authors demonstrate, these parts can be reassembled into simpler versions of cohort careers, revealing underlying trends and patterns that may not be evident otherwise. The authors illustrate this approach by analyzing trends in political party strength in the General Social Survey.
Citations: 1
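The first step described in the abstract above, constructing cohort careers from age-period-cohort data, can be sketched as follows. This is an illustration only, not the LC-SC model itself. It assumes an age-by-period table of cell means and uses the identity cohort = period - age; the table values are hypothetical.

```python
import numpy as np

ages = np.array([20, 30, 40])
periods = np.array([1990, 2000, 2010])

# Hypothetical cell means of some outcome; rows are ages, columns are periods.
table = np.array([
    [0.30, 0.32, 0.35],
    [0.40, 0.41, 0.44],
    [0.50, 0.52, 0.53],
])

# A cohort career is the sequence of (age, outcome) pairs for one birth cohort,
# i.e. one diagonal of the age-by-period table.
careers = {}
for i, age in enumerate(ages):
    for j, period in enumerate(periods):
        cohort = int(period - age)
        careers.setdefault(cohort, []).append((int(age), float(table[i, j])))

# Order each career by age so it reads as a trajectory over the life cycle.
for cohort in careers:
    careers[cohort].sort()

for cohort in sorted(careers):
    print(cohort, careers[cohort])
```

Only the 1970 cohort is observed at all three ages in this small table; the other careers are truncated, which is one reason graphs of raw cohort careers can be hard to read.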
Evaluating Substitution as a Strategy for Handling U.S. Postal Service Drop Points in Self-Administered Address-Based Sampling Frame Surveys
IF 3; Zone 2 (Sociology); Q1 Social Sciences; Pub Date: 2023-01-13; DOI: 10.1177/00811750221147525
Taylor Lewis, Joseph McMichael, Charlotte Looby
Most addresses on modern address-based sampling frames derived from the U.S. Postal Service’s Computerized Delivery Sequence file have a one-to-one relationship with a household. Some addresses, however, are associated with multiple households. These addresses are referred to as drop points, and the households therein are referred to as drop point units (DPUs). DPUs pose a challenge for self-administered surveys because no apartment number or unit designation is available, making it impossible to send targeted correspondence. The authors evaluate a method for substituting sampled DPUs with similar non-DPUs, which was implemented in the 2021 Healthy Chicago Survey alongside a concurrent survey of the originally sampled DPUs. Comparing aggregate distributions of DPUs and the non-DPU substitutes, the authors observe certain differences with respect to age, employment status, marital status, and housing tenure but no substantive differences in key health outcomes measured by the survey.
Citations: 0
Sparse Data Reconstruction, Missing Value and Multiple Imputation through Matrix Factorization
IF 3; Zone 2 (Sociology); Q1 Social Sciences; Pub Date: 2022-10-22; DOI: 10.1177/00811750221125799
Nandana Sengupta, Madeleine Udell, N. Srebro, James Evans
Social science approaches to missing values predict avoided, unrequested, or lost information from dense data sets, typically surveys. The authors propose a matrix factorization approach to missing data imputation that (1) identifies underlying factors to model similarities across respondents and responses and (2) regularizes across factors to reduce their overinfluence for optimal data reconstruction. This approach may enable social scientists to draw new conclusions from sparse data sets with a large number of features, for example, historical or archival sources, online surveys with high attrition rates, or data sets created from Web scraping, which confound traditional imputation techniques. The authors introduce matrix factorization techniques and detail their probabilistic interpretation, and they demonstrate these techniques’ consistency with Rubin’s multiple imputation framework. Using simulations with artificial data and with real-world subsets of the General Social Survey and National Longitudinal Study of Youth, the authors identify cases for which matrix factorization techniques may be preferred. These findings recommend the use of matrix factorization for data reconstruction in several settings, particularly when data are Boolean and categorical and when large proportions of the data are missing.
Citations: 3
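The general idea in the abstract above, imputing missing entries through a regularized low-rank factorization, can be sketched as follows. This is a generic illustration, not the authors' estimator. It assumes squared-error loss, L2 regularization, alternating least squares, and continuous simulated data, and it returns a single point imputation rather than multiple imputations. The function name `als_impute` and all settings are hypothetical.

```python
import numpy as np

def als_impute(Y, mask, rank=2, lam=0.1, n_iter=50, seed=0):
    """Fill unobserved entries of Y using Y ~ U @ V.T fitted on observed cells.
    mask[i, j] is True where Y[i, j] is observed."""
    rng = np.random.default_rng(seed)
    n, m = Y.shape
    U = rng.normal(scale=0.1, size=(n, rank))
    V = rng.normal(scale=0.1, size=(m, rank))
    ridge = lam * np.eye(rank)  # L2 penalty keeps factors from overfitting
    for _ in range(n_iter):
        # Update each respondent factor by ridge regression on observed items.
        for i in range(n):
            Vo = V[mask[i]]
            U[i] = np.linalg.solve(Vo.T @ Vo + ridge, Vo.T @ Y[i, mask[i]])
        # Update each item factor by ridge regression on observed respondents.
        for j in range(m):
            Uo = U[mask[:, j]]
            V[j] = np.linalg.solve(Uo.T @ Uo + ridge, Uo.T @ Y[mask[:, j], j])
    # Keep observed values; use the low-rank fit only where data are missing.
    return np.where(mask, Y, U @ V.T)

# Simulated rank-2 data with roughly 30 percent of cells missing at random.
rng = np.random.default_rng(1)
Y_true = rng.normal(size=(40, 2)) @ rng.normal(size=(2, 12))
mask = rng.random(Y_true.shape) > 0.3
Y_obs = np.where(mask, Y_true, np.nan)

Y_filled = als_impute(Y_obs, mask)

# Baseline for comparison: replace missing cells with the observed column mean.
Y_mean = np.where(mask, Y_obs, np.nanmean(Y_obs, axis=0))

miss = ~mask
rmse_mf = np.sqrt(np.mean((Y_filled[miss] - Y_true[miss]) ** 2))
rmse_mean = np.sqrt(np.mean((Y_mean[miss] - Y_true[miss]) ** 2))
print(f"RMSE factorization={rmse_mf:.3f}  RMSE column mean={rmse_mean:.3f}")
```

Squared-error loss suits the continuous toy data used here. Boolean or categorical responses, which the abstract highlights, would call for a different loss function.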
Hyperscanning and the Future of Neurosociology
IF 3; Zone 2 (Sociology); Q1 Social Sciences; Pub Date: 2022-10-15; DOI: 10.1177/00811750221128790
W. TenHouten, L. Schussel, Maria Gritsch, C. D. Kaplan
Because all aspects of social life have a mental component, sociology’s focus is not society alone but mind and society. Insofar as mind is an emergent level of brainwork, the description and measurement of mindwork amidst social interaction can be accomplished by neurometric measurement methodology. The authors’ topic, hyperscanning, involves the simultaneous recording of either hemodynamic or neuroelectric measurement of brain activity in two (or more) interacting individuals. The authors consider two hyperscanning methods, functional magnetic resonance imaging and electroencephalography (EEG). Although functional magnetic resonance imaging provides excellent spatial resolution of brain-region activation, the temporal resolution of EEG is unmatched. EEG’s low spatial resolution has been overcome by low-resolution electromagnetic tomography. Hyperscanning studies show that interpersonal coordination of action includes mutual entrainment or synchronization of neural dynamics, flow of information between brains, and causal effects of one brain upon another with respect to social-signaling processes involving fairness, reciprocity, trust, competition, cooperation, and leadership.
Citations: 2
Data Quality and Recall Bias in Time-Diary Research: The Effects of Prolonged Recall Periods in Self-Administered Online Time-Use Surveys
IF 3, Zone 2 (Sociology), Q1 Social Sciences, Pub Date: 2022-10-05, DOI: 10.1177/00811750221126499
Petrus te Braak, T. P. van Tienoven, Joeri Minnen, I. Glorieux
Previous research has shown that a prolonged recall period is associated with lower data quality in time-diary research. In these studies, the recall period is roughly estimated on the basis of the period between the assigned diary day and the agreed collection day. Because this is so rudimentary, little is known about the duration of the mean recall period and its consequences for data quality. Recent advances in online methodology now allow a better investigation of the recall period using time stamps. Using a refined indicator, the authors examine the duration of the recall period, to what extent this duration is related to socioeconomic characteristics, and how a prolonged recall period affects data quality. Using online time-diary data collected from 8,535 teachers in Belgium, the authors demonstrate that the mean recall period is less than 24 hr for most respondents, although respondents with many time constraints have extended recall periods. Additionally, a prolonged recall period indeed has negative consequences for data quality. Quality deterioration already arises several hours after an activity has been completed, much sooner than previous research has indicated.
Citations: 3
Bayesian Multistate Life Table Methods for Large and Complex State Spaces: Development and Illustration of a New Method.
IF 3, Zone 2 (Sociology), Q1 Social Sciences, Pub Date: 2022-08-01, DOI: 10.1177/00811750221112398
Scott M Lynch, Emma Zang

Multistate life table methods are an important tool for producing easily understood measures of population health. Most contemporary uses of these methods involve sample data, thus requiring techniques for capturing uncertainty in estimates. In recent decades, several methods have been developed to do so. Among these methods, the Bayesian approach proposed by Lynch and Brown has several unique advantages. However, the approach is limited to estimating years to be spent in only two living states, such as "healthy" and "unhealthy." In this article, the authors extend this method to allow for large state spaces with "quasi-absorbing" states. The authors illustrate the new method and show its advantages using data from the Health and Retirement Study to investigate U.S. regional differences in years of remaining life to be spent with diabetes, chronic conditions, and disabilities. The method works well and yields rich output for reporting and subsequent analyses. The expanded method also should facilitate the use of multi-state life tables to address a wider array of social science research questions.

Citations: 1