Measuring diversity in Hollywood through the large-scale computational analysis of film.

IF 9.4 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Proceedings of the National Academy of Sciences of the United States of America Pub Date : 2024-11-12 Epub Date: 2024-11-04 DOI:10.1073/pnas.2409770121
David Bamman, Rachael Samberg, Richard Jean So, Naitian Zhou
{"title":"Measuring diversity in Hollywood through the large-scale computational analysis of film.","authors":"David Bamman, Rachael Samberg, Richard Jean So, Naitian Zhou","doi":"10.1073/pnas.2409770121","DOIUrl":null,"url":null,"abstract":"<p><p>Movies are a massively popular and influential form of media, but their computational study at scale has largely been off-limits to researchers in the United States due to the Digital Millennium Copyright Act. In this work, we illustrate use of a new regulatory framework to enable computational research on film that permits circumvention of technological protection measures on digital video discs (DVDs). We use this exemption to legally digitize a collection of 2,307 films representing the top 50 movies by U.S. box office over the period 1980 to 2022, along with award nominees. We design a computational pipeline for measuring the representation of gender and race/ethnicity in film, drawing on computer vision models for recognizing actors and human perceptions of gender and race/ethnicity. Doing so allows us to learn substantive facts about representation and diversity in Hollywood over this period, confirming earlier studies that see an increase in diversity over the past decade, while allowing us to use computational methods to uncover a range of ad hoc analytical findings. Our work illustrates the affordances of the data-driven analysis of film at a large scale.</p>","PeriodicalId":20548,"journal":{"name":"Proceedings of the National Academy of Sciences of the United States of America","volume":null,"pages":null},"PeriodicalIF":9.4000,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the National Academy of Sciences of the United States of America","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1073/pnas.2409770121","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Movies are a massively popular and influential form of media, but their computational study at scale has largely been off-limits to researchers in the United States due to the Digital Millennium Copyright Act. In this work, we illustrate use of a new regulatory framework to enable computational research on film that permits circumvention of technological protection measures on digital video discs (DVDs). We use this exemption to legally digitize a collection of 2,307 films representing the top 50 movies by U.S. box office over the period 1980 to 2022, along with award nominees. We design a computational pipeline for measuring the representation of gender and race/ethnicity in film, drawing on computer vision models for recognizing actors and human perceptions of gender and race/ethnicity. Doing so allows us to learn substantive facts about representation and diversity in Hollywood over this period, confirming earlier studies that see an increase in diversity over the past decade, while allowing us to use computational methods to uncover a range of ad hoc analytical findings. Our work illustrates the affordances of the data-driven analysis of film at a large scale.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过对电影进行大规模计算分析,衡量好莱坞的多样性。
电影是一种广受欢迎且极具影响力的媒体形式,但由于《数字千年版权法案》的限制,美国的研究人员在很大程度上无法对其进行大规模的计算研究。在这项工作中,我们说明了如何利用新的监管框架来开展电影计算研究,该框架允许规避数字视频光盘(DVD)上的技术保护措施。我们利用这一豁免权对 2,307 部电影进行了合法数字化处理,这些电影代表了 1980 年至 2022 年期间美国票房排名前 50 位的电影以及获奖提名电影。我们利用计算机视觉模型识别演员以及人类对性别和种族/族裔的感知,设计了一个用于测量电影中性别和种族/族裔代表性的计算管道。通过这种方法,我们可以了解到这一时期好莱坞代表性和多样性的实质性事实,证实了之前的研究认为过去十年中多样性有所提高,同时也让我们能够利用计算方法发现一系列特别的分析结果。我们的工作展示了大规模电影数据驱动分析的能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
19.00
自引率
0.90%
发文量
3575
审稿时长
2.5 months
期刊介绍: The Proceedings of the National Academy of Sciences (PNAS), a peer-reviewed journal of the National Academy of Sciences (NAS), serves as an authoritative source for high-impact, original research across the biological, physical, and social sciences. With a global scope, the journal welcomes submissions from researchers worldwide, making it an inclusive platform for advancing scientific knowledge.
期刊最新文献
Reply to Majer et al.: Negotiating policy action for transformation requires both sociopolitical and behavioral perspectives. The behavioral negotiation perspective can reveal how to navigate discord in sustainability transformations constructively. Deafness due to loss of a TRPV channel eliminates mating behavior in Aedes aegypti males. Extremely rapid, yet noncatastrophic, preservation of the flattened-feathered and 3D dinosaurs of the Early Cretaceous of China. Soft matter mechanics of baseball's Rubbing Mud.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1