America’s racial framework of superiority and Americanness embedded in natural language
Authors: Messi H.J. Lee, Jacob M Montgomery, Calvin K Lai
Journal: PNAS Nexus
Published: 2024-01-02
DOI: 10.1093/pnasnexus/pgad485 (https://doi.org/10.1093/pnasnexus/pgad485)
Citations: 0
Abstract
America’s racial framework can be summarized using two distinct dimensions: superiority/inferiority and Americanness/foreignness (Zou & Cheryan, 2017). We investigated this framework in a corpus of spoken and written language using word embeddings. Word embeddings place words in a low-dimensional space where words with similar meanings lie close together, allowing researchers to test whether the positions of group and attribute words in the semantic space reflect stereotypes. We trained a word embedding model on the Corpus of Contemporary American English, a one-billion-word corpus spanning thirty years and eight text categories, and compared the positions of racial/ethnic groups with respect to superiority and Americanness. We found that America’s racial framework is embedded in American English. We also captured an additional nuance: Asian people were stereotyped as more American than Hispanic people. These results provide empirical evidence that America’s racial framework is embedded in American English.
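The idea of comparing word positions along an attribute dimension can be illustrated with a minimal sketch. This is not the authors' procedure: the abstract does not specify how positions were measured, so the scoring rule here (mean cosine similarity to one pole minus mean cosine similarity to the other) is an assumption, and the names `axis_score`, `group_x`, `group_y`, `pole_a_word`, and `pole_b_word` are hypothetical. The vectors are synthetic and carry no empirical meaning.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mean_similarity(target, attribute_vectors):
    """Average cosine similarity between a target vector and a set of attribute vectors."""
    return sum(cosine(target, a) for a in attribute_vectors) / len(attribute_vectors)


def axis_score(target, pole_a, pole_b):
    """Assumed scoring rule: positive means closer to pole_a words, negative means closer to pole_b words."""
    return mean_similarity(target, pole_a) - mean_similarity(target, pole_b)


# Synthetic 3-dimensional vectors, invented purely for illustration.
# A trained model would supply real, higher-dimensional vectors instead.
toy = {
    "group_x": [1.0, 0.0, 0.0],
    "group_y": [0.0, 1.0, 0.0],
    "pole_a_word": [1.0, 0.0, 0.0],
    "pole_b_word": [0.0, 1.0, 0.0],
}

score_x = axis_score(toy["group_x"], [toy["pole_a_word"]], [toy["pole_b_word"]])
score_y = axis_score(toy["group_y"], [toy["pole_a_word"]], [toy["pole_b_word"]])
print(score_x, score_y)  # prints: 1.0 -1.0
```

With real embeddings, each pole would typically hold several attribute words, and the difference between two groups' scores would need a significance test before being read as evidence of a stereotype.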