首页 > 最新文献

Sociological Methods & Research最新文献

英文 中文
Accounting for Individual-Specific Heterogeneity in Intergenerational Income Mobility 代际收入流动中个体特异性异质性的核算
IF 6.3 2区 社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2025-05-22 DOI: 10.1177/00491241251339654
Yoosoon Chang, Steven N. Durlauf, Bo Hu, Joon Y. Park
This article proposes a fully nonparametric model to investigate the dynamics of intergenerational income mobility for discrete outcomes. In our model, an individual’s income class probabilities depend on parental income in a manner that accommodates nonlinearities and interactions among various individual and parental characteristics, including race, education, and parental age at childbearing, and so generalizes Markov chain mobility models. We show how the model may be estimated using kernel techniques from machine learning. Utilizing data from the panel study of income dynamics, we show how race, parental education, and mother’s age at birth interact with family income to determine mobility between generations.
本文提出了一个完全非参数模型来研究离散结果的代际收入流动动态。在我们的模型中,个人的收入类别概率取决于父母的收入,以适应各种个人和父母特征(包括种族、教育程度和父母生育年龄)之间的非线性和相互作用的方式,从而推广了马尔可夫链流动性模型。我们展示了如何使用机器学习中的核技术来估计模型。利用收入动态面板研究的数据,我们展示了种族、父母教育程度和母亲出生年龄如何与家庭收入相互作用,以决定代际流动。
{"title":"Accounting for Individual-Specific Heterogeneity in Intergenerational Income Mobility","authors":"Yoosoon Chang, Steven N. Durlauf, Bo Hu, Joon Y. Park","doi":"10.1177/00491241251339654","DOIUrl":"https://doi.org/10.1177/00491241251339654","url":null,"abstract":"This article proposes a fully nonparametric model to investigate the dynamics of intergenerational income mobility for discrete outcomes. In our model, an individual’s income class probabilities depend on parental income in a manner that accommodates nonlinearities and interactions among various individual and parental characteristics, including race, education, and parental age at childbearing, and so generalizes Markov chain mobility models. We show how the model may be estimated using kernel techniques from machine learning. Utilizing data from the panel study of income dynamics, we show how race, parental education, and mother’s age at birth interact with family income to determine mobility between generations.","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"35 1","pages":""},"PeriodicalIF":6.3,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144113541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Unrealized Potential of Audits: Applicant-Side Inequalities in Effort, Opportunities, and Certainty 未实现的审计潜力:申请人在努力、机会和确定性方面的不平等
IF 6.3 2区 社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2025-05-22 DOI: 10.1177/00491241251338240
Mike Vuolo, Sadé L. Lindsay, Vincent J. Roscigno, Shawn D. Bushway
Randomized audits and correspondence studies are widely regarded as a “gold standard” for capturing discrimination and bias. However, gatekeepers (e.g., employers) are the analytic unit even though stated implications often center on group-level inequalities. Employing simple rules, we show that audits have the potential to uncover applicant-side inequalities and burdens beyond the gatekeeper biases standardly reported. Specifically, applicants from groups facing lower callback rates must submit more applications to ensure an eventual callback, have fewer opportunities to choose from, and face higher uncertainty regarding how many applications to submit. These results reflect several sequential and cumulative stratification processes “real-world” applicants face that warrant attention in conventional audit reporting. Our approach can be straightforwardly applied and, we show, is particularly pertinent for employment relative to other institutional domains (e.g., education, religion). We discuss the methodological and theoretical relevance of our suggested extensions and the implications for the study of inequality, discrimination, and social closure.
随机审计和函电研究被广泛认为是捕捉歧视和偏见的“黄金标准”。然而,守门人(如雇主)是分析单位,尽管所陈述的含义往往集中在群体层面的不平等上。采用简单的规则,我们表明审计有可能发现申请人方面的不平等和负担,而不是标准报道的看门人偏见。具体来说,来自回调率较低的群体的申请人必须提交更多的申请,以确保最终的回调,他们的选择机会更少,并且在提交多少申请方面面临更大的不确定性。这些结果反映了“现实世界”申请人面临的几个顺序和累积分层过程,这些过程在传统审计报告中值得注意。我们的方法可以直接应用,并且我们表明,与其他机构领域(例如,教育,宗教)相比,我们的方法特别适用于就业。我们讨论了我们建议的扩展的方法和理论相关性,以及对不平等、歧视和社会封闭研究的影响。
{"title":"The Unrealized Potential of Audits: Applicant-Side Inequalities in Effort, Opportunities, and Certainty","authors":"Mike Vuolo, Sadé L. Lindsay, Vincent J. Roscigno, Shawn D. Bushway","doi":"10.1177/00491241251338240","DOIUrl":"https://doi.org/10.1177/00491241251338240","url":null,"abstract":"Randomized audits and correspondence studies are widely regarded as a “gold standard” for capturing discrimination and bias. However, gatekeepers (e.g., employers) are the analytic unit even though stated implications often center on group-level inequalities. Employing simple rules, we show that audits have the potential to uncover applicant-side inequalities and burdens beyond the gatekeeper biases standardly reported. Specifically, applicants from groups facing lower callback rates must submit more applications to ensure an eventual callback, have fewer opportunities to choose from, and face higher uncertainty regarding how many applications to submit. These results reflect several sequential and cumulative stratification processes “real-world” applicants face that warrant attention in conventional audit reporting. Our approach can be straightforwardly applied and, we show, is particularly pertinent for employment relative to other institutional domains (e.g., education, religion). We discuss the methodological and theoretical relevance of our suggested extensions and the implications for the study of inequality, discrimination, and social closure.","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"57 1","pages":""},"PeriodicalIF":6.3,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144113542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Updating “The Future of Coding”: Qualitative Coding with Generative Large Language Models 更新“编码的未来”:用生成式大型语言模型进行定性编码
IF 6.3 2区 社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2025-05-21 DOI: 10.1177/00491241251339188
Nga Than, Leanne Fan, Tina Law, Laura K. Nelson, Leslie McCall
Over the past decade, social scientists have adapted computational methods for qualitative text analysis, with the hope that they can match the accuracy and reliability of hand coding. The emergence of GPT and open-source generative large language models (LLMs) has transformed this process by shifting from programming to engaging with models using natural language, potentially mimicking the in-depth, inductive, and/or iterative process of qualitative analysis. We test the ability of generative LLMs to replicate and augment traditional qualitative coding, experimenting with multiple prompt structures across four closed- and open-source generative LLMs and proposing a workflow for conducting qualitative coding with generative LLMs. We find that LLMs can perform nearly as well as prior supervised machine learning models in accurately matching hand-coding output. Moreover, using generative LLMs as a natural language interlocutor closely replicates traditional qualitative methods, indicating their potential to transform the qualitative research process, despite ongoing challenges.
在过去的十年里,社会科学家已经将计算方法应用于定性文本分析,希望它们能够与手工编码的准确性和可靠性相匹配。GPT和开源生成式大型语言模型(llm)的出现改变了这一过程,从编程转向使用自然语言的模型,潜在地模仿了定性分析的深入、归纳和/或迭代过程。我们测试了生成法学硕士复制和增强传统定性编码的能力,在四个封闭和开源的生成法学硕士中试验了多个提示结构,并提出了使用生成法学硕士进行定性编码的工作流程。我们发现llm在精确匹配手工编码输出方面的表现几乎与先验监督机器学习模型一样好。此外,使用生成法学硕士作为自然语言对话者密切复制了传统的定性方法,表明它们有潜力改变定性研究过程,尽管存在挑战。
{"title":"Updating “The Future of Coding”: Qualitative Coding with Generative Large Language Models","authors":"Nga Than, Leanne Fan, Tina Law, Laura K. Nelson, Leslie McCall","doi":"10.1177/00491241251339188","DOIUrl":"https://doi.org/10.1177/00491241251339188","url":null,"abstract":"Over the past decade, social scientists have adapted computational methods for qualitative text analysis, with the hope that they can match the accuracy and reliability of hand coding. The emergence of GPT and open-source generative large language models (LLMs) has transformed this process by shifting from programming to engaging with models using natural language, potentially mimicking the in-depth, inductive, and/or iterative process of qualitative analysis. We test the ability of generative LLMs to replicate and augment traditional qualitative coding, experimenting with multiple prompt structures across four closed- and open-source generative LLMs and proposing a workflow for conducting qualitative coding with generative LLMs. We find that LLMs can perform nearly as well as prior supervised machine learning models in accurately matching hand-coding output. Moreover, using generative LLMs as a natural language interlocutor closely replicates traditional qualitative methods, indicating their potential to transform the qualitative research process, despite ongoing challenges.","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"11 1","pages":""},"PeriodicalIF":6.3,"publicationDate":"2025-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144113548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Balancing Large Language Model Alignment and Algorithmic Fidelity in Social Science Research 社会科学研究中大语言模型一致性与算法保真度的平衡
IF 6.3 2区 社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2025-05-21 DOI: 10.1177/00491241251342008
Alex Lyman, Bryce Hepner, Lisa P. Argyle, Ethan C. Busby, Joshua R. Gubler, David Wingate
Generative artificial intelligence (AI) has the potential to revolutionize social science research. However, researchers face the difficult challenge of choosing a specific AI model, often without social science-specific guidance. To demonstrate the importance of this choice, we present an evaluation of the effect of alignment, or human-driven modification, on the ability of large language models (LLMs) to simulate the attitudes of human populations (sometimes called silicon sampling ). We benchmark aligned and unaligned versions of six open-source LLMs against each other and compare them to similar responses by humans. Our results suggest that model alignment impacts output in predictable ways, with implications for prompting, task completion, and the substantive content of LLM-based results. We conclude that researchers must be aware of the complex ways in which model training affects their research and carefully consider model choice for each project. We discuss future steps to improve how social scientists work with generative AI tools.
生成式人工智能(AI)有可能彻底改变社会科学研究。然而,研究人员面临着选择特定人工智能模型的艰巨挑战,通常没有社会科学的具体指导。为了证明这种选择的重要性,我们评估了对齐或人类驱动的修改对大型语言模型(llm)模拟人类群体态度的能力的影响(有时称为硅采样)。我们对六个开源llm的对齐和未对齐版本进行基准测试,并将它们与人类的类似响应进行比较。我们的结果表明,模型对齐以可预测的方式影响输出,对提示、任务完成和基于llm的结果的实质性内容都有影响。我们的结论是,研究人员必须意识到模型训练影响他们研究的复杂方式,并仔细考虑每个项目的模型选择。我们讨论了改善社会科学家如何使用生成式人工智能工具的未来步骤。
{"title":"Balancing Large Language Model Alignment and Algorithmic Fidelity in Social Science Research","authors":"Alex Lyman, Bryce Hepner, Lisa P. Argyle, Ethan C. Busby, Joshua R. Gubler, David Wingate","doi":"10.1177/00491241251342008","DOIUrl":"https://doi.org/10.1177/00491241251342008","url":null,"abstract":"Generative artificial intelligence (AI) has the potential to revolutionize social science research. However, researchers face the difficult challenge of choosing a specific AI model, often without social science-specific guidance. To demonstrate the importance of this choice, we present an evaluation of the effect of alignment, or human-driven modification, on the ability of large language models (LLMs) to simulate the attitudes of human populations (sometimes called <jats:italic>silicon sampling</jats:italic> ). We benchmark aligned and unaligned versions of six open-source LLMs against each other and compare them to similar responses by humans. Our results suggest that model alignment impacts output in predictable ways, with implications for prompting, task completion, and the substantive content of LLM-based results. We conclude that researchers must be aware of the complex ways in which model training affects their research and carefully consider model choice for each project. We discuss future steps to improve how social scientists work with generative AI tools.","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"16 1","pages":""},"PeriodicalIF":6.3,"publicationDate":"2025-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144113546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative AI Meets Open-Ended Survey Responses: Research Participant Use of AI and Homogenization 生成人工智能满足开放式调查回应:研究参与者使用人工智能和同质化
IF 6.3 2区 社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2025-05-07 DOI: 10.1177/00491241251327130
Simone Zhang, Janet Xu, AJ Alvero
The growing popularity of generative artificial intelligence (AI) tools presents new challenges for data quality in online surveys and experiments. This study examines participants’ use of large language models to answer open-ended survey questions and describes empirical tendencies in human versus large language model (LLM)-generated text responses. In an original survey of research participants recruited from a popular online platform for sourcing social science research subjects, 34 percent reported using LLMs to help them answer open-ended survey questions. Simulations comparing human-written responses from three pre-ChatGPT studies with LLM-generated text reveal that LLM responses are more homogeneous and positive, particularly when they describe social groups in sensitive questions. These homogenization patterns may mask important underlying social variation in attitudes and beliefs among human subjects, raising concerns about data validity. Our findings shed light on the scope and potential consequences of participants’ LLM use in online research.
生成式人工智能(AI)工具的日益普及对在线调查和实验的数据质量提出了新的挑战。本研究考察了参与者使用大型语言模型来回答开放式调查问题,并描述了人类与大型语言模型(LLM)生成的文本响应的经验趋势。在一项原始调查中,从一个流行的社会科学研究对象在线平台招募的研究参与者中,34%的人表示使用法学硕士来帮助他们回答开放式调查问题。将三个chatgpt前的人类书面回答与法学硕士生成的文本进行模拟比较,发现法学硕士的回答更加均匀和积极,特别是当他们在敏感问题中描述社会群体时。这些同质化模式可能掩盖了人类受试者之间态度和信仰的重要潜在社会差异,引起了对数据有效性的担忧。我们的研究结果揭示了参与者在在线研究中使用法学硕士的范围和潜在后果。
{"title":"Generative AI Meets Open-Ended Survey Responses: Research Participant Use of AI and Homogenization","authors":"Simone Zhang, Janet Xu, AJ Alvero","doi":"10.1177/00491241251327130","DOIUrl":"https://doi.org/10.1177/00491241251327130","url":null,"abstract":"The growing popularity of generative artificial intelligence (AI) tools presents new challenges for data quality in online surveys and experiments. This study examines participants’ use of large language models to answer open-ended survey questions and describes empirical tendencies in human versus large language model (LLM)-generated text responses. In an original survey of research participants recruited from a popular online platform for sourcing social science research subjects, 34 percent reported using LLMs to help them answer open-ended survey questions. Simulations comparing human-written responses from three pre-ChatGPT studies with LLM-generated text reveal that LLM responses are more homogeneous and positive, particularly when they describe social groups in sensitive questions. These homogenization patterns may mask important underlying social variation in attitudes and beliefs among human subjects, raising concerns about data validity. Our findings shed light on the scope and potential consequences of participants’ LLM use in online research.","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"74 1","pages":""},"PeriodicalIF":6.3,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143920428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating Generative Artificial Intelligence into Social Science Research: Measurement, Prompting, and Simulation 将生成式人工智能整合到社会科学研究:测量、提示和模拟
IF 6.3 2区 社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2025-05-07 DOI: 10.1177/00491241251339184
Thomas Davidson, Daniel Karell
Generative artificial intelligence (AI) offers new capabilities for analyzing data, creating synthetic media, and simulating realistic social interactions. This essay introduces a special issue that examines how these and other affordances of generative AI can advance social science research. We discuss three core themes that appear across the contributed articles: rigorous measurement and validation of AI-generated outputs, optimizing model performance and reproducibility via prompting, and novel uses of AI for the simulation of attitudes and behaviors. We highlight how generative AI enable new methodological innovations that complement and augment existing approaches. This essay and the special issue’s ten articles collectively provide a detailed roadmap for integrating generative AI into social science research in theoretically informed and methodologically rigorous ways. We conclude by reflecting on the implications of the ongoing advances in AI.
生成式人工智能(AI)为分析数据、创建合成媒体和模拟现实社会互动提供了新的能力。本文介绍了一个特别的问题,探讨了生成式人工智能的这些和其他功能如何推动社会科学研究。我们讨论了在贡献的文章中出现的三个核心主题:严格测量和验证人工智能生成的输出,通过提示优化模型性能和可重复性,以及人工智能在模拟态度和行为方面的新用途。我们强调生成式人工智能如何实现新的方法创新,以补充和增强现有方法。这篇文章和特刊的十篇文章共同提供了一个详细的路线图,将生成人工智能整合到社会科学研究中,以理论上知情和方法上严谨的方式。最后,我们反思了人工智能不断进步的影响。
{"title":"Integrating Generative Artificial Intelligence into Social Science Research: Measurement, Prompting, and Simulation","authors":"Thomas Davidson, Daniel Karell","doi":"10.1177/00491241251339184","DOIUrl":"https://doi.org/10.1177/00491241251339184","url":null,"abstract":"Generative artificial intelligence (AI) offers new capabilities for analyzing data, creating synthetic media, and simulating realistic social interactions. This essay introduces a special issue that examines how these and other affordances of generative AI can advance social science research. We discuss three core themes that appear across the contributed articles: rigorous measurement and validation of AI-generated outputs, optimizing model performance and reproducibility via prompting, and novel uses of AI for the simulation of attitudes and behaviors. We highlight how generative AI enable new methodological innovations that complement and augment existing approaches. This essay and the special issue’s ten articles collectively provide a detailed roadmap for integrating generative AI into social science research in theoretically informed and methodologically rigorous ways. We conclude by reflecting on the implications of the ongoing advances in AI.","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"15 1","pages":""},"PeriodicalIF":6.3,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143920458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large Language Models for Text Classification: From Zero-Shot Learning to Instruction-Tuning 用于文本分类的大型语言模型:从零学习到指令调整
IF 6.3 2区 社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2025-04-24 DOI: 10.1177/00491241251325243
Youngjin Chae, Thomas Davidson
Large language models (LLMs) have tremendous potential for social science research as they are trained on vast amounts of text and can generalize to many tasks. We explore the use of LLMs for supervised text classification, specifically the application to stance detection, which involves detecting attitudes and opinions in texts. We examine the performance of these models across different architectures, training regimes, and task specifications. We compare 10 models ranging in size from tens of millions to hundreds of billions of parameters and test four distinct training regimes: Prompt-based zero-shot learning and few-shot learning, fine-tuning, and instruction-tuning, which combines prompting and fine-tuning. The largest, most powerful models generally offer the best predictive performance even with little or no training examples, but fine-tuning smaller models is a competitive solution due to their relatively high accuracy and low cost. Instruction-tuning the latest generative LLMs expands the scope of text classification, enabling applications to more complex tasks than previously feasible. We offer practical recommendations on the use of LLMs for text classification in sociological research and discuss their limitations and challenges. Ultimately, LLMs can make text classification and other text analysis methods more accurate, accessible, and adaptable, opening new possibilities for computational social science.
大型语言模型(llm)在社会科学研究中具有巨大的潜力,因为它们是在大量文本上训练的,并且可以推广到许多任务。我们探索了llm在监督文本分类中的使用,特别是在立场检测中的应用,这涉及到检测文本中的态度和观点。我们在不同的体系结构、训练制度和任务规范中检查这些模型的性能。我们比较了10个模型,其规模从数千万到数千亿个参数不等,并测试了四种不同的训练机制:基于提示的零次学习和少次学习、微调和指令调整,后者结合了提示和微调。最大、最强大的模型通常即使在很少或没有训练样例的情况下也能提供最好的预测性能,但微调较小的模型是一种有竞争力的解决方案,因为它们相对较高的准确性和较低的成本。指令调优最新的生成法学硕士扩展了文本分类的范围,使应用程序能够执行比以前更复杂的任务。我们提供了在社会学研究中使用法学硕士进行文本分类的实用建议,并讨论了它们的局限性和挑战。最终,法学硕士可以使文本分类和其他文本分析方法更加准确、可访问和适应性强,为计算社会科学开辟了新的可能性。
{"title":"Large Language Models for Text Classification: From Zero-Shot Learning to Instruction-Tuning","authors":"Youngjin Chae, Thomas Davidson","doi":"10.1177/00491241251325243","DOIUrl":"https://doi.org/10.1177/00491241251325243","url":null,"abstract":"Large language models (LLMs) have tremendous potential for social science research as they are trained on vast amounts of text and can generalize to many tasks. We explore the use of LLMs for supervised text classification, specifically the application to stance detection, which involves detecting attitudes and opinions in texts. We examine the performance of these models across different architectures, training regimes, and task specifications. We compare 10 models ranging in size from tens of millions to hundreds of billions of parameters and test four distinct training regimes: Prompt-based zero-shot learning and few-shot learning, fine-tuning, and instruction-tuning, which combines prompting and fine-tuning. The largest, most powerful models generally offer the best predictive performance even with little or no training examples, but fine-tuning smaller models is a competitive solution due to their relatively high accuracy and low cost. Instruction-tuning the latest generative LLMs expands the scope of text classification, enabling applications to more complex tasks than previously feasible. We offer practical recommendations on the use of LLMs for text classification in sociological research and discuss their limitations and challenges. Ultimately, LLMs can make text classification and other text analysis methods more accurate, accessible, and adaptable, opening new possibilities for computational social science.","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"72 1","pages":""},"PeriodicalIF":6.3,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143866960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conceptualizing Job and Employment Concepts for Earnings Inequality Estimands With Linked Employer-Employee Data 1 概念化工作和就业概念与关联雇主-雇员数据的收入不平等估计1
IF 6.3 2区 社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2025-04-24 DOI: 10.1177/00491241251334124
Donald Tomaskovic-Devey, Chen-Shuo Hong
We examine variations in pay gap estimates and inferences associated with distinct conceptualizations of jobs and employment contexts under legal and comparable worth theories of pay bias. We find that job titles produce smaller estimates of within job pay gaps than job groups, but the inferential importance of job concepts differs across organizational, workplace, and job groups within workplace units of observation. Moving from more to less job concept detail, we find almost no inference differences when pay gaps are estimated at the organizational level. Tradeoffs at the workplace and job groups within workplace levels are more common, comprising around 10 percent to 20 percent of observations. A legal theoretical framework leads to fewer empirical estimates of significant pay disparities, while comparable worth estimates suggest higher levels of gender and racial bias at the job and workplace levels. This research has implications for future analyses of linked employer-employee data and for both scientific research and regulatory enforcement of equal opportunity law.
在薪酬偏见的法律和可比价值理论下,我们研究了与工作和就业背景的不同概念相关的薪酬差距估计和推论的变化。我们发现,职位名称对工作薪酬差距的估计比工作组要小,但工作概念的推论重要性在不同的组织、工作场所和工作组中是不同的。从更多到更少的工作概念细节,我们发现在组织层面估计薪酬差距时几乎没有推断差异。工作场所的权衡和工作场所级别内的工作群体更为常见,约占观察结果的10%至20%。法律理论框架导致对重大薪酬差异的经验估计较少,而可比价值估计表明,在工作和工作场所层面存在较高程度的性别和种族偏见。这项研究对未来有关雇主-雇员数据的分析,以及对平等机会法的科学研究和监管执行都有影响。
{"title":"Conceptualizing Job and Employment Concepts for Earnings Inequality Estimands With Linked Employer-Employee Data 1","authors":"Donald Tomaskovic-Devey, Chen-Shuo Hong","doi":"10.1177/00491241251334124","DOIUrl":"https://doi.org/10.1177/00491241251334124","url":null,"abstract":"We examine variations in pay gap estimates and inferences associated with distinct conceptualizations of jobs and employment contexts under legal and comparable worth theories of pay bias. We find that job titles produce smaller estimates of within job pay gaps than job groups, but the inferential importance of job concepts differs across organizational, workplace, and job groups within workplace units of observation. Moving from more to less job concept detail, we find almost no inference differences when pay gaps are estimated at the organizational level. Tradeoffs at the workplace and job groups within workplace levels are more common, comprising around 10 percent to 20 percent of observations. A legal theoretical framework leads to fewer empirical estimates of significant pay disparities, while comparable worth estimates suggest higher levels of gender and racial bias at the job and workplace levels. This research has implications for future analyses of linked employer-employee data and for both scientific research and regulatory enforcement of equal opportunity law.","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"17 1","pages":""},"PeriodicalIF":6.3,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143866959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Target Study: A Conceptual Model and Framework for Measuring Disparity 目标研究:衡量差异的概念模型和框架
IF 6.3 2区 社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2025-04-22 DOI: 10.1177/00491241251314037
John W. Jackson, Yea-Jen Hsu, Raquel C. Greer, Romsai T. Boonyasai, Chanelle J. Howe
We present a conceptual model to measure disparity—the target study—where social groups may be similarly situated (i.e., balanced) on allowable covariates. Our model, based on a sampling design, does not intervene to assign social group membership or alter allowable covariates. To address nonrandom sample selection, we extend our model to generalize or transport disparity or to assess disparity after an intervention on eligibility-related variables that eliminates forms of collider-stratification. To avoid bias from differential timing of enrollment, we aggregate time-specific study results by balancing calendar time of enrollment across social groups. To provide a framework for emulating our model, we discuss study designs, data structures, and G-computation and weighting estimators. We compare our sampling-based model to prominent decomposition-based models used in healthcare and algorithmic fairness. We provide R code for all estimators and apply our methods to measure health system disparities in hypertension control using electronic medical records.
我们提出了一个衡量差异的概念模型——目标研究——在允许的协变量上,社会群体可能处于相似的位置(即,平衡)。我们的模型,基于抽样设计,不干预分配社会群体成员或改变允许的协变量。为了解决非随机样本选择问题,我们扩展了我们的模型,以推广或转移差异,或在对排除碰撞分层形式的资格相关变量进行干预后评估差异。为了避免不同入组时间的偏差,我们通过平衡不同社会群体入组的日历时间来汇总特定时间的研究结果。为了提供一个模拟我们模型的框架,我们讨论了研究设计、数据结构、g计算和加权估计器。我们将我们的基于抽样的模型与医疗保健和算法公平中使用的基于分解的模型进行比较。我们为所有的估计器提供了R代码,并应用我们的方法来测量卫生系统在使用电子病历控制高血压方面的差异。
{"title":"The Target Study: A Conceptual Model and Framework for Measuring Disparity","authors":"John W. Jackson, Yea-Jen Hsu, Raquel C. Greer, Romsai T. Boonyasai, Chanelle J. Howe","doi":"10.1177/00491241251314037","DOIUrl":"https://doi.org/10.1177/00491241251314037","url":null,"abstract":"We present a conceptual model to measure disparity—the target study—where social groups may be similarly situated (i.e., balanced) on allowable covariates. Our model, based on a sampling design, does not intervene to assign social group membership or alter allowable covariates. To address nonrandom sample selection, we extend our model to generalize or transport disparity or to assess disparity after an intervention on eligibility-related variables that eliminates forms of collider-stratification. To avoid bias from differential timing of enrollment, we aggregate time-specific study results by balancing calendar time of enrollment across social groups. To provide a framework for emulating our model, we discuss study designs, data structures, and G-computation and weighting estimators. We compare our sampling-based model to prominent decomposition-based models used in healthcare and algorithmic fairness. We provide R code for all estimators and apply our methods to measure health system disparities in hypertension control using electronic medical records.","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"26 1","pages":""},"PeriodicalIF":6.3,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143863022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Networks Beyond Categories: A Computational Approach to Examining Gender Homophily 超越类别的网络:研究性别同源性的计算方法
IF 6.3 2区 社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2025-04-22 DOI: 10.1177/00491241251321152
Chen-Shuo Hong
Social networks literature has explored homophily, the tendency to associate with similar others, as a critical boundary-making process contributing to segregated networks along the lines of identities. Yet, social network research generally conceptualizes identities as sociodemographic categories and seldom considers the inherently continuous and heterogeneous nature of differences. Drawing upon the infracategorical model of inequality, this study demonstrates that a computational approach – combining machine learning and exponential random graph models (ERGMs) – can capture the role of categorical conformity in network structures. Through a case study of gender segregation in friendships, this study presents a workflow for developing a machine-learning-based gender conformity measure and applying it to guide the social network analysis of cultural matching. Results show that adolescents with similar gender conformity are more likely to form friendships, net of homophily based on categorical gender and other controls, and homophily by gender conformity mediates homophily by categorical gender. The study concludes by discussing the limitations of this computational approach and its unique strengths in enhancing theories on categories, boundaries, and stratification.
社会网络文学探讨了同质性,即与相似的人联系的倾向,作为一个关键的边界制定过程,有助于沿着身份的路线隔离网络。然而,社会网络研究通常将身份概念化为社会人口分类,很少考虑差异的内在连续性和异质性。利用不平等的次分类模型,本研究证明了一种计算方法-结合机器学习和指数随机图模型(ergm) -可以捕捉网络结构中分类一致性的作用。通过对友谊中性别隔离的案例研究,本研究提出了一个开发基于机器学习的性别一致性测量的工作流程,并将其应用于指导文化匹配的社会网络分析。结果表明,具有相似性别一致性的青少年更容易形成友谊、基于类别性别和其他控制的同质网络,性别一致性介导类别性别的同质。研究最后讨论了这种计算方法的局限性,以及它在加强分类、边界和分层理论方面的独特优势。
{"title":"Networks Beyond Categories: A Computational Approach to Examining Gender Homophily","authors":"Chen-Shuo Hong","doi":"10.1177/00491241251321152","DOIUrl":"https://doi.org/10.1177/00491241251321152","url":null,"abstract":"Social networks literature has explored homophily, the tendency to associate with similar others, as a critical boundary-making process contributing to segregated networks along the lines of identities. Yet, social network research generally conceptualizes identities as sociodemographic categories and seldom considers the inherently continuous and heterogeneous nature of differences. Drawing upon the infracategorical model of inequality, this study demonstrates that a computational approach – combining machine learning and exponential random graph models (ERGMs) – can capture the role of categorical conformity in network structures. Through a case study of gender segregation in friendships, this study presents a workflow for developing a machine-learning-based gender conformity measure and applying it to guide the social network analysis of cultural matching. Results show that adolescents with similar gender conformity are more likely to form friendships, net of homophily based on categorical gender and other controls, and homophily by gender conformity mediates homophily by categorical gender. The study concludes by discussing the limitations of this computational approach and its unique strengths in enhancing theories on categories, boundaries, and stratification.","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"32 1","pages":""},"PeriodicalIF":6.3,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143862884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Sociological Methods & Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1