Machine Learning as a Model for Cultural Learning: Teaching an Algorithm What it Means to be Fat.

IF 6.5 2区社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Sociological Methods & Research Pub Date : 2022-11-01 Epub Date: 2022-12-02 DOI:10.1177/00491241221122603

Alina Arseniev-Koehler, Jacob G Foster

{"title":"Machine Learning as a Model for Cultural Learning: Teaching an Algorithm What it Means to be Fat.","authors":"Alina Arseniev-Koehler, Jacob G Foster","doi":"10.1177/00491241221122603","DOIUrl":null,"url":null,"abstract":"<p><p>Public culture is a powerful source of cognitive socialization; for example, media language is full of meanings about body weight. Yet it remains unclear how individuals process meanings in public culture. We suggest that schema learning is a core mechanism by which public culture becomes personal culture. We propose that a burgeoning approach in computational text analysis - neural word embeddings - can be interpreted as a formal model for cultural learning. Embeddings allow us to empirically model schema learning and activation from natural language data. We illustrate our approach by extracting four lower-order schemas from news articles: the gender, moral, health, and class meanings of body weight. Using these lower-order schemas we quantify how words about body weight \"fill in the blanks\" about gender, morality, health, and class. Our findings reinforce ongoing concerns that machine-learning models (e.g., of natural language) can encode and reproduce harmful human biases.</p>","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"51 1","pages":"1484-1539"},"PeriodicalIF":6.5000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10653277/pdf/","citationCount":"27","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sociological Methods & Research","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1177/00491241221122603","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/12/2 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"SOCIAL SCIENCES, MATHEMATICAL METHODS","Score":null,"Total":0}

引用次数: 27

Abstract

Public culture is a powerful source of cognitive socialization; for example, media language is full of meanings about body weight. Yet it remains unclear how individuals process meanings in public culture. We suggest that schema learning is a core mechanism by which public culture becomes personal culture. We propose that a burgeoning approach in computational text analysis - neural word embeddings - can be interpreted as a formal model for cultural learning. Embeddings allow us to empirically model schema learning and activation from natural language data. We illustrate our approach by extracting four lower-order schemas from news articles: the gender, moral, health, and class meanings of body weight. Using these lower-order schemas we quantify how words about body weight "fill in the blanks" about gender, morality, health, and class. Our findings reinforce ongoing concerns that machine-learning models (e.g., of natural language) can encode and reproduce harmful human biases.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

机器学习作为文化学习的模式：教算法胖意味着什么

公共文化是认知社会化的强大源泉；例如，媒体语言充满了关于体重的含义。然而，目前尚不清楚个人在公共文化中是如何处理意义的。我们认为，图式学习是公共文化成为个人文化的核心机制。我们提出，计算文本分析中一种新兴的方法——神经单词嵌入——可以被解释为文化学习的形式模型。嵌入使我们能够根据自然语言数据对模式学习和激活进行实证建模。我们通过从新闻文章中提取四个低阶模式来说明我们的方法：体重的性别、道德、健康和阶级含义。使用这些低阶模式，我们量化了关于体重的单词如何“填补”关于性别、道德、健康和阶级的空白。我们的发现强化了人们一直以来的担忧，即机器学习模型（例如自然语言）可以编码和复制有害的人类偏见。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Sociological Methods & Research Multiple-

CiteScore

16.30

自引率

3.20%

发文量

期刊介绍： Sociological Methods & Research is a quarterly journal devoted to sociology as a cumulative empirical science. The objectives of SMR are multiple, but emphasis is placed on articles that advance the understanding of the field through systematic presentations that clarify methodological problems and assist in ordering the known facts in an area. Review articles will be published, particularly those that emphasize a critical analysis of the status of the arts, but original presentations that are broadly based and provide new research will also be published. Intrinsically, SMR is viewed as substantive journal but one that is highly focused on the assessment of the scientific status of sociology. The scope is broad and flexible, and authors are invited to correspond with the editors about the appropriateness of their articles.