首页 > 最新文献

Big Data & Society最新文献

英文 中文
European Search? How to counter-imagine and counteract hegemonic search with European search engine projects 欧洲搜索?如何用欧洲搜索引擎项目来反想象和对抗霸权搜索
IF 8.5 1区 社会学 Q1 Social Sciences Pub Date : 2023-01-01 DOI: 10.1177/20539517231163173
Astrid Mager
This article investigates how developers of alternative search engines challenge increasingly corporate imaginaries of digital futures by building out counter-imaginaries of search engines devoted to social values instead of mere profit maximization. Drawing on three in-depth case studies of European search engines, it analyzes how search engine developers counter-imagine hegemonic search, what social values support their imaginaries, and how they are intertwined with their sociotechnical practices. This analysis shows that notions like privacy, independence, and openness appear to be fluid, context-dependent, and changing over time, leading to a certain “value pragmatics” that allows the projects to scale beyond their own communities of practice. It further shows how European values, and broader notions of Europe as “unified or pluralistic,” are constructed and co-produced with developers’ attempts to counter-imagine and counteract hegemonic search. To conclude, I suggest three points of intervention that may help alternative search engine projects, and digital technologies more generally, to not only make their counter-imaginaries more powerful, but also acquire the necessary resources to build their technologies and infrastructures accordingly. I finally discuss how “European values,” in all their richness and diversity, can contribute to this undertaking.
本文研究了替代搜索引擎的开发人员如何通过构建致力于社会价值而不仅仅是利润最大化的搜索引擎的反想象来挑战越来越多的企业对数字未来的想象。通过对欧洲搜索引擎的三个深入案例研究,本文分析了搜索引擎开发者是如何反想象霸权式搜索的,什么样的社会价值观支持他们的想象,以及他们是如何与社会技术实践交织在一起的。这一分析表明,像隐私、独立性和开放性这样的概念似乎是流动的、依赖于上下文的,并且随着时间的推移而变化,从而导致某种“价值实用主义”,允许项目扩展到它们自己的实践社区之外。它进一步展示了欧洲价值观,以及更广泛的欧洲“统一或多元”概念,是如何与开发者试图反想象和抵制霸权搜索的努力共同构建和产生的。最后,我提出了三点干预建议,这些建议可以帮助替代搜索引擎项目,以及更广泛的数字技术,不仅使它们的反想象更强大,而且还获得必要的资源来相应地构建它们的技术和基础设施。最后,我讨论了“欧洲价值观”的丰富性和多样性如何为这一事业做出贡献。
{"title":"European Search? How to counter-imagine and counteract hegemonic search with European search engine projects","authors":"Astrid Mager","doi":"10.1177/20539517231163173","DOIUrl":"https://doi.org/10.1177/20539517231163173","url":null,"abstract":"This article investigates how developers of alternative search engines challenge increasingly corporate imaginaries of digital futures by building out counter-imaginaries of search engines devoted to social values instead of mere profit maximization. Drawing on three in-depth case studies of European search engines, it analyzes how search engine developers counter-imagine hegemonic search, what social values support their imaginaries, and how they are intertwined with their sociotechnical practices. This analysis shows that notions like privacy, independence, and openness appear to be fluid, context-dependent, and changing over time, leading to a certain “value pragmatics” that allows the projects to scale beyond their own communities of practice. It further shows how European values, and broader notions of Europe as “unified or pluralistic,” are constructed and co-produced with developers’ attempts to counter-imagine and counteract hegemonic search. To conclude, I suggest three points of intervention that may help alternative search engine projects, and digital technologies more generally, to not only make their counter-imaginaries more powerful, but also acquire the necessary resources to build their technologies and infrastructures accordingly. I finally discuss how “European values,” in all their richness and diversity, can contribute to this undertaking.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":null,"pages":null},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44558202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
On samples, data, and their mobility in biobanking: How imagined travels help to relate samples and data 关于样本、数据及其在生物银行中的流动性:想象的旅行如何帮助将样本和数据联系起来
IF 8.5 1区 社会学 Q1 Social Sciences Pub Date : 2023-01-01 DOI: 10.1177/20539517231158635
Ingrid Metzler, Lisa-Maria Ferent, U. Felt
Biobanking involves the assembling, curating, and distributing of samples and data. While relations between samples and data are often taken as defining properties of biobanking, several studies have pointed to the challenges in relating them in practice. This article investigates how samples and data are curated, connected, and made mobile in practice. Building on an analysis of data collected at five hospital-based biobanks in Austria, the article describes and compares biobanking in three types of biobank collections: ‘departmental collections’, ‘project-specific collections’ and ‘hospital-wide collections’. It draws attention to the invisible work going into this infrastructure and highlights the central role of visions to make samples and data travel to a different location and thus support biomedical research. It shows that while visions of future travels are often epistemologically uncertain, they are informed by social ties and relationships between the collectives involved in the curation of samples and data on the one hand and the imagined users on the other. Finally, we point to the importance that policy actors in this domain consider the aspects we identified—and, in particular, reflect the temporalities inherent in such a research infrastructure.
生物银行涉及样本和数据的收集、管理和分发。虽然样本和数据之间的关系通常被视为生物库的定义属性,但一些研究指出了在实践中将它们联系起来的挑战。本文研究了如何在实践中对样本和数据进行策划、连接和移动。在分析奥地利五家医院生物库收集的数据的基础上,文章描述并比较了三种类型的生物库收集中的生物库:“部门收集”、“特定项目收集”和“全医院收集”。它提请人们注意进入这一基础设施的无形工作,并强调了愿景的核心作用,即使样本和数据传播到不同的位置,从而支持生物医学研究。它表明,虽然未来旅行的愿景在认识论上往往是不确定的,但它们是由参与样本和数据管理的集体与想象中的用户之间的社会联系和关系所决定的。最后,我们指出了这一领域的政策参与者考虑我们确定的方面的重要性,特别是反映这种研究基础设施固有的时间性。
{"title":"On samples, data, and their mobility in biobanking: How imagined travels help to relate samples and data","authors":"Ingrid Metzler, Lisa-Maria Ferent, U. Felt","doi":"10.1177/20539517231158635","DOIUrl":"https://doi.org/10.1177/20539517231158635","url":null,"abstract":"Biobanking involves the assembling, curating, and distributing of samples and data. While relations between samples and data are often taken as defining properties of biobanking, several studies have pointed to the challenges in relating them in practice. This article investigates how samples and data are curated, connected, and made mobile in practice. Building on an analysis of data collected at five hospital-based biobanks in Austria, the article describes and compares biobanking in three types of biobank collections: ‘departmental collections’, ‘project-specific collections’ and ‘hospital-wide collections’. It draws attention to the invisible work going into this infrastructure and highlights the central role of visions to make samples and data travel to a different location and thus support biomedical research. It shows that while visions of future travels are often epistemologically uncertain, they are informed by social ties and relationships between the collectives involved in the curation of samples and data on the one hand and the imagined users on the other. Finally, we point to the importance that policy actors in this domain consider the aspects we identified—and, in particular, reflect the temporalities inherent in such a research infrastructure.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":null,"pages":null},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46303992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The visible body and the invisible organization: Information asymmetry and college athletics data 有形主体与无形组织:信息不对称与高校体育数据
IF 8.5 1区 社会学 Q1 Social Sciences Pub Date : 2023-01-01 DOI: 10.1177/20539517231179197
Daniel Greene, Nate Beard, Tamara L. Clegg, E. Weight
Elite athletes are constantly tracked, measured, scored, and sorted to improve their performance. Privacy is sacrificed in the name of improvement. Athletes frequently do not know why particular personal data are collected or to what end. Our interview study of 23 elite US college athletes and 26 staff members reveals that their sports play is governed through information asymmetries. These asymmetries look different for different sports with different levels of investment, different racial and gender makeups, and different performance metrics. As large, data-intensive organizations with highly differentiated subgroups, university athletics are an excellent site for theory building in critical data studies, especially given the most consequential data collected from us, with the greatest effect on our lives, is frequently a product of collective engagement with specific organizational contexts like workplaces and schools. Empirical analysis reveals two key tensions in this data regime: Athletes in high-status sports, more likely to be Black men, have relatively less freedom to see or dispute their personal data, while athletes in general are more comfortable sharing personal data with people further away from them. We build from these findings to develop a theory of collective informational harm in bounded institutional settings such as the workplace. The quantified organization, as we term it, is concerned not with monitoring individuals but building data collectives through processes of category creation and managerial data relations of coercion and consent.
优秀运动员被不断地跟踪、测量、评分和分类,以提高他们的表现。以改进的名义牺牲隐私。运动员经常不知道为什么某些个人数据被收集,或者是为了什么目的。我们对23名优秀的美国大学运动员和26名工作人员的访谈研究表明,他们的体育活动受到信息不对称的支配。这些不对称在不同的体育项目、不同的投入水平、不同的种族和性别构成以及不同的表现指标中表现不同。作为具有高度分化的子群体的大型数据密集型组织,大学体育是构建关键数据研究理论的绝佳场所,特别是考虑到从我们身上收集到的最重要的数据,对我们的生活影响最大,往往是集体参与特定组织环境(如工作场所和学校)的产物。实证分析揭示了这种数据机制中的两个关键矛盾:高地位运动的运动员(更有可能是黑人)在查看或质疑个人数据方面的自由相对较少,而运动员通常更愿意与远离他们的人分享个人数据。我们从这些发现出发,发展了一种有限制度环境(如工作场所)中的集体信息伤害理论。我们称之为量化的组织,它关注的不是监控个人,而是通过类别创建和强制与同意的管理数据关系的过程来构建数据集体。
{"title":"The visible body and the invisible organization: Information asymmetry and college athletics data","authors":"Daniel Greene, Nate Beard, Tamara L. Clegg, E. Weight","doi":"10.1177/20539517231179197","DOIUrl":"https://doi.org/10.1177/20539517231179197","url":null,"abstract":"Elite athletes are constantly tracked, measured, scored, and sorted to improve their performance. Privacy is sacrificed in the name of improvement. Athletes frequently do not know why particular personal data are collected or to what end. Our interview study of 23 elite US college athletes and 26 staff members reveals that their sports play is governed through information asymmetries. These asymmetries look different for different sports with different levels of investment, different racial and gender makeups, and different performance metrics. As large, data-intensive organizations with highly differentiated subgroups, university athletics are an excellent site for theory building in critical data studies, especially given the most consequential data collected from us, with the greatest effect on our lives, is frequently a product of collective engagement with specific organizational contexts like workplaces and schools. Empirical analysis reveals two key tensions in this data regime: Athletes in high-status sports, more likely to be Black men, have relatively less freedom to see or dispute their personal data, while athletes in general are more comfortable sharing personal data with people further away from them. We build from these findings to develop a theory of collective informational harm in bounded institutional settings such as the workplace. The quantified organization, as we term it, is concerned not with monitoring individuals but building data collectives through processes of category creation and managerial data relations of coercion and consent.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":null,"pages":null},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43057794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Algorithmic probing: Prompting offensive Google results and their moderation 算法探测:提示攻击性谷歌结果及其节制
IF 8.5 1区 社会学 Q1 Social Sciences Pub Date : 2023-01-01 DOI: 10.1177/20539517231176228
Richard A. Rogers
Google results have been scrutinized over the years for what they privilege, be it the surface web, the powerful, optimized webpages, the personalized and/or their own properties. For some time now, another type of Google returns also has been the source of attention: the offensive result. The following revisits a selection of offensive and other problematic results found by journalists and researchers alike. In a technique termed ‘algorithmic probing’, the prompting queries are re-run to study what has come of these results in Google Web and Image Search but mainly in Google Autocompletion. The question concerns a different kind of privileging – Google's hierarchy of concerns – or the extent to which certain categories as well as languages are moderated and others less so. In all, it was found that Google heavily moderates religion, ethnicities and sexualities (albeit with gaps) but leaves alone stereotypes of gendered professions as well as ageism. It also moderates to a greater degree in English compared to southern European and Balkan languages. The article concludes with a discussion of the stakes of Google's moderation, including its uneven coverage.
b谷歌的搜索结果多年来一直在仔细审查他们的特权,无论是表面的网页,强大的,优化的网页,个性化和/或他们自己的属性。一段时间以来,另一种类型的谷歌回击也引起了人们的关注:进攻结果。以下是记者和研究人员发现的一些令人反感的和其他有问题的结果。在一种称为“算法探测”的技术中,提示查询被重新运行,以研究谷歌Web和图像搜索中这些结果的结果,但主要是谷歌自动完成。这个问题涉及到另一种特权——b谷歌的关注层次——或者某些类别和语言被缓和的程度,而另一些则不那么缓和。总的来说,谷歌在很大程度上缓和了宗教、种族和性取向(尽管存在差距),但没有涉及性别职业的刻板印象和年龄歧视。与南欧和巴尔干地区的语言相比,英语语言也更温和。文章最后讨论了b谷歌节制的利害关系,包括其不均衡的覆盖范围。
{"title":"Algorithmic probing: Prompting offensive Google results and their moderation","authors":"Richard A. Rogers","doi":"10.1177/20539517231176228","DOIUrl":"https://doi.org/10.1177/20539517231176228","url":null,"abstract":"Google results have been scrutinized over the years for what they privilege, be it the surface web, the powerful, optimized webpages, the personalized and/or their own properties. For some time now, another type of Google returns also has been the source of attention: the offensive result. The following revisits a selection of offensive and other problematic results found by journalists and researchers alike. In a technique termed ‘algorithmic probing’, the prompting queries are re-run to study what has come of these results in Google Web and Image Search but mainly in Google Autocompletion. The question concerns a different kind of privileging – Google's hierarchy of concerns – or the extent to which certain categories as well as languages are moderated and others less so. In all, it was found that Google heavily moderates religion, ethnicities and sexualities (albeit with gaps) but leaves alone stereotypes of gendered professions as well as ageism. It also moderates to a greater degree in English compared to southern European and Balkan languages. The article concludes with a discussion of the stakes of Google's moderation, including its uneven coverage.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":null,"pages":null},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46166854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Editorial introduction: Towards a machinic anthropology 编辑简介:走向机器人类学
IF 8.5 1区 社会学 Q1 Social Sciences Pub Date : 2023-01-01 DOI: 10.1177/20539517231153803
M. Pedersen
Bringing together a motley crew of social scientists and data scientists, the aim of this special theme issue is to explore what an integration or even fusion between anthropology and data science might look like. Going beyond existing work on the complementarity between ‘thick’ qualitative and ‘big’ quantitative data, the ambition is to unsettle and push established disciplinary, methodological and epistemological boundaries by creatively and critically probing various computational methods for augmenting and automatizing the collection, processing and analysis of ethnographic data, and vice versa. Can ethnographic and other qualitative data and methods be integrated with natural language processing tools and other machine-learning techniques, and if so, to what effect? Does the rise of data science allow for the realization of Levi-Strauss’ old dream of a computational structuralism, and even if so, should it? Might one even go as far as saying that computers are now becoming agents of social scientific analysis or even thinking: are we about to witness the birth of distinctly anthropological forms of artificial intelligence? By exploring these questions, the hope is not only to introduce scholars and students to computational anthropological methods, but also to disrupt predominant norms and assumptions among computational social scientists and data science writ large.
本期专题将汇集社会科学家和数据科学家,旨在探讨人类学和数据科学之间的整合甚至融合可能是什么样子。超越现有的关于“厚”定性和“大”定量数据之间互补性的工作,我们的目标是通过创造性和批判性地探索各种计算方法来增加和自动化收集、处理和分析民族志数据,从而动摇和推动既定的学科、方法和认识论边界,反之亦然。民族志和其他定性数据和方法是否可以与自然语言处理工具和其他机器学习技术相结合,如果可以,会产生什么效果?数据科学的兴起是否允许实现列维-施特劳斯的计算结构主义的古老梦想,即使是这样,它应该吗?甚至有人可能会说,计算机现在正在成为社会科学分析甚至思考的代理人:我们是否即将见证独特的人类学形式的人工智能的诞生?通过探索这些问题,我们不仅希望向学者和学生介绍计算人类学方法,而且希望打破计算社会科学家和数据科学中的主流规范和假设。
{"title":"Editorial introduction: Towards a machinic anthropology","authors":"M. Pedersen","doi":"10.1177/20539517231153803","DOIUrl":"https://doi.org/10.1177/20539517231153803","url":null,"abstract":"Bringing together a motley crew of social scientists and data scientists, the aim of this special theme issue is to explore what an integration or even fusion between anthropology and data science might look like. Going beyond existing work on the complementarity between ‘thick’ qualitative and ‘big’ quantitative data, the ambition is to unsettle and push established disciplinary, methodological and epistemological boundaries by creatively and critically probing various computational methods for augmenting and automatizing the collection, processing and analysis of ethnographic data, and vice versa. Can ethnographic and other qualitative data and methods be integrated with natural language processing tools and other machine-learning techniques, and if so, to what effect? Does the rise of data science allow for the realization of Levi-Strauss’ old dream of a computational structuralism, and even if so, should it? Might one even go as far as saying that computers are now becoming agents of social scientific analysis or even thinking: are we about to witness the birth of distinctly anthropological forms of artificial intelligence? By exploring these questions, the hope is not only to introduce scholars and students to computational anthropological methods, but also to disrupt predominant norms and assumptions among computational social scientists and data science writ large.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":null,"pages":null},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44350115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Web3 as ‘self-infrastructuring’: The challenge is how Web3作为“自我基础设施”:挑战在于如何实现
IF 8.5 1区 社会学 Q1 Social Sciences Pub Date : 2023-01-01 DOI: 10.1177/20539517231159002
K. Nabben
The term ‘Web3’ refers to the practices of participating in digital infrastructures through the ability to read, write and coordinate digital assets. Web3 is hailed as an alternative to the failings of big tech, offering a participatory mode of digital self-organizing and shared ownership of digital infrastructure through software-encoded governance rules and participatory practices. Yet, very few analytical frameworks have been presented in academic literature by which to approach Web3. This piece draws on the theoretical lens of infrastructure studies to offer an analytical framework to approach the emergent field of Web3 as an exploration in ‘how to infrastructure’ through prefigurative self-infrastructuring. Drawing on qualitative examples from digital ethnographic methods, I demonstrate how the origins of Web3 reveal the intentions of its creators as a political tool of prefiguration, yet its practices reveal the inherent tension of expressing these ideals in coherent technical and institutional infrastructure. Thus, I argue that one of the fundamental challenges Web3 is negotiating through technical and governance experiments is ‘how to self-infrastructure?’.
“Web3”一词是指通过读写和协调数字资产的能力参与数字基础设施的实践。Web3被誉为大型科技公司失败的另一种选择,它提供了一种参与式的数字自组织模式,并通过软件编码的治理规则和参与式实践共享数字基础设施的所有权。然而,在学术文献中,很少有分析框架可以用来研究Web3。本文借鉴了基础设施研究的理论视角,提供了一个分析框架,来探讨Web3这个新兴领域,通过预言性的自我基础设施来探索“如何基础设施”。从数字人种学方法的定性例子中,我展示了Web3的起源如何揭示了它的创造者作为一种预示的政治工具的意图,然而它的实践揭示了在连贯的技术和制度基础设施中表达这些理想的内在张力。因此,我认为Web3正在通过技术和治理实验来解决的一个基本挑战是“如何自我基础设施?”
{"title":"Web3 as ‘self-infrastructuring’: The challenge is how","authors":"K. Nabben","doi":"10.1177/20539517231159002","DOIUrl":"https://doi.org/10.1177/20539517231159002","url":null,"abstract":"The term ‘Web3’ refers to the practices of participating in digital infrastructures through the ability to read, write and coordinate digital assets. Web3 is hailed as an alternative to the failings of big tech, offering a participatory mode of digital self-organizing and shared ownership of digital infrastructure through software-encoded governance rules and participatory practices. Yet, very few analytical frameworks have been presented in academic literature by which to approach Web3. This piece draws on the theoretical lens of infrastructure studies to offer an analytical framework to approach the emergent field of Web3 as an exploration in ‘how to infrastructure’ through prefigurative self-infrastructuring. Drawing on qualitative examples from digital ethnographic methods, I demonstrate how the origins of Web3 reveal the intentions of its creators as a political tool of prefiguration, yet its practices reveal the inherent tension of expressing these ideals in coherent technical and institutional infrastructure. Thus, I argue that one of the fundamental challenges Web3 is negotiating through technical and governance experiments is ‘how to self-infrastructure?’.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":null,"pages":null},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44627299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Ethical scaling for content moderation: Extreme speech and the (in)significance of artificial intelligence 内容节制的伦理尺度:极端言论和人工智能的意义
IF 8.5 1区 社会学 Q1 Social Sciences Pub Date : 2023-01-01 DOI: 10.1177/20539517231172424
Sahana Udupa, Antonis Maronikolakis, Axel Wisiorek
In this article, we present new empirical evidence to demonstrate the severe limitations of existing machine learning content moderation methods to keep pace with, let alone stay ahead of, hateful language online. Building on the collaborative coding project “AI4Dignity” we outline the ambiguities and complexities of annotating problematic text in AI-assisted moderation systems. We diagnose the shortcomings of the content moderation and natural language processing approach as emerging from a broader epistemological trapping wrapped in the liberal-modern idea of “the human”. Presenting a decolonial critique of the “human vs machine” conundrum and drawing attention to the structuring effects of coloniality on extreme speech, we propose “ethical scaling” to highlight moderation process as political praxis. As a normative framework for platform governance, ethical scaling calls for a transparent, reflexive, and replicable process of iteration for content moderation with community participation and global parity, which should evolve in conjunction with addressing algorithmic amplification of divisive content and resource allocation for content moderation.
在这篇文章中,我们提出了新的经验证据,以证明现有的机器学习内容调节方法在跟上,更不用说领先于网络仇恨语言方面的严重局限性。在协作编码项目“AI4Dimity”的基础上,我们概述了在人工智能辅助审核系统中注释问题文本的模糊性和复杂性。我们将内容节制和自然语言处理方法的缺点诊断为从自由主义现代“人”思想中包裹的更广泛的认识论陷阱中出现。提出了对“人与机器”难题的非殖民化批判,并提请人们注意殖民主义对极端言论的结构性影响,我们提出了“道德尺度”,以强调温和过程是政治实践。作为平台治理的一个规范框架,道德扩展需要一个透明、反射性和可复制的迭代过程,以实现社区参与和全球平等的内容审核,这应该与解决分裂性内容的算法放大和内容审核的资源分配相结合。
{"title":"Ethical scaling for content moderation: Extreme speech and the (in)significance of artificial intelligence","authors":"Sahana Udupa, Antonis Maronikolakis, Axel Wisiorek","doi":"10.1177/20539517231172424","DOIUrl":"https://doi.org/10.1177/20539517231172424","url":null,"abstract":"In this article, we present new empirical evidence to demonstrate the severe limitations of existing machine learning content moderation methods to keep pace with, let alone stay ahead of, hateful language online. Building on the collaborative coding project “AI4Dignity” we outline the ambiguities and complexities of annotating problematic text in AI-assisted moderation systems. We diagnose the shortcomings of the content moderation and natural language processing approach as emerging from a broader epistemological trapping wrapped in the liberal-modern idea of “the human”. Presenting a decolonial critique of the “human vs machine” conundrum and drawing attention to the structuring effects of coloniality on extreme speech, we propose “ethical scaling” to highlight moderation process as political praxis. As a normative framework for platform governance, ethical scaling calls for a transparent, reflexive, and replicable process of iteration for content moderation with community participation and global parity, which should evolve in conjunction with addressing algorithmic amplification of divisive content and resource allocation for content moderation.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":null,"pages":null},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43134080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
The multifaceted and situated data center imaginary of Dutch Twitter 荷兰推特想象的多方面、位置优越的数据中心
IF 8.5 1区 社会学 Q1 Social Sciences Pub Date : 2023-01-01 DOI: 10.1177/20539517231155064
Karin van Es, Daan van der Weijden, Jeroen Bakker
Data centers are material structures that take up space, use resources like water and energy, and possess a large carbon footprint. This paper examines the broader long-term discussion around data centers during the period 2020–2022 in the Dutch Twittersphere. Through an analysis of tweets and images, it identifies and reflects on the communities active in the discussion and the range of visions and imaginaries of data centers they produce. Unpacking these tweets and images over time traces not only the emergence of a ‘reactive imaginary’, critical of the promises of information technology (IT) industry and (local) governments, but also the blind spots of the discussion. It furthermore reveals an important role for journalism in the discussion by questioning the claims of the industry and contributing to a ‘visibility expansion’ of data center’s impact on Earth's resources. The paper shows the multifaceted and situated nature of imaginaries and their role in shaping decision-making and policy.
数据中心是一种占用空间、使用水和能源等资源、碳足迹大的材料结构。本文考察了荷兰推特领域2020-2022年期间围绕数据中心进行的更广泛的长期讨论。通过对推文和图像的分析,它确定并反思了活跃在讨论中的社区,以及他们产生的数据中心的愿景和想象。随着时间的推移,打开这些推文和图片不仅可以追溯到一种“反应性想象”的出现,这种想象批评了信息技术(IT)行业和(地方)政府的承诺,还可以追溯到讨论的盲点。它进一步揭示了新闻业在讨论中的重要作用,质疑了该行业的说法,并有助于数据中心对地球资源影响的“可见性扩展”。本文展示了想象的多面性和情境性,以及它们在决策和政策制定中的作用。
{"title":"The multifaceted and situated data center imaginary of Dutch Twitter","authors":"Karin van Es, Daan van der Weijden, Jeroen Bakker","doi":"10.1177/20539517231155064","DOIUrl":"https://doi.org/10.1177/20539517231155064","url":null,"abstract":"Data centers are material structures that take up space, use resources like water and energy, and possess a large carbon footprint. This paper examines the broader long-term discussion around data centers during the period 2020–2022 in the Dutch Twittersphere. Through an analysis of tweets and images, it identifies and reflects on the communities active in the discussion and the range of visions and imaginaries of data centers they produce. Unpacking these tweets and images over time traces not only the emergence of a ‘reactive imaginary’, critical of the promises of information technology (IT) industry and (local) governments, but also the blind spots of the discussion. It furthermore reveals an important role for journalism in the discussion by questioning the claims of the industry and contributing to a ‘visibility expansion’ of data center’s impact on Earth's resources. The paper shows the multifaceted and situated nature of imaginaries and their role in shaping decision-making and policy.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":null,"pages":null},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48577553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Judgments as bulk data 作为批量数据的判断
IF 8.5 1区 社会学 Q1 Social Sciences Pub Date : 2023-01-01 DOI: 10.1177/20539517231160527
V. Janeček
Should court judgments be publicly available for text and data mining purposes? This article shows that the arguments for and against access to judgments conflate different understandings of what judgments are. On one view, judgments are seen as a ‘jurisprudential’ category, whereas the other view regards them as something ‘factual’. Once it is understood that these views and the claims based on them do not fight over the same territory, it should be easier to make judgments more widely available, including for the purposes of computational analysis of judgments as bulk data. The purpose of this article is to help to clear the ground for the debate around access to judgments as bulk data and highlight some relevant considerations for the preferred licencing regime concerning judgments.
为了文本和数据挖掘的目的,法院的判决是否应该公开?这篇文章表明,支持和反对获得判决的论点混淆了对什么是判决的不同理解。一种观点认为,判决被视为“法理学”范畴,而另一种观点则认为它们是“事实”的东西。一旦了解到这些意见和根据这些意见提出的要求不是在同一领土上进行斗争,就应该更容易使判决更广泛地获得,包括将判决作为大量数据进行计算分析。本文的目的是帮助澄清关于将判决作为批量数据访问的争论,并强调有关判决的首选许可制度的一些相关考虑因素。
{"title":"Judgments as bulk data","authors":"V. Janeček","doi":"10.1177/20539517231160527","DOIUrl":"https://doi.org/10.1177/20539517231160527","url":null,"abstract":"Should court judgments be publicly available for text and data mining purposes? This article shows that the arguments for and against access to judgments conflate different understandings of what judgments are. On one view, judgments are seen as a ‘jurisprudential’ category, whereas the other view regards them as something ‘factual’. Once it is understood that these views and the claims based on them do not fight over the same territory, it should be easier to make judgments more widely available, including for the purposes of computational analysis of judgments as bulk data. The purpose of this article is to help to clear the ground for the debate around access to judgments as bulk data and highlight some relevant considerations for the preferred licencing regime concerning judgments.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":null,"pages":null},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45671842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Google, data voids, and the dynamics of the politics of exclusion 谷歌、数据空洞和排斥政治的动态
IF 8.5 1区 社会学 Q1 Social Sciences Pub Date : 2023-01-01 DOI: 10.1177/20539517221149099
Ov Cristian Norocel, D. Lewandowski
This study deploys a critical approach to big data analytics to gauge the tentative contours of data voids in Google searches that reflect extreme-right dynamics of exclusion in the aftermath of the 2015 humanitarian crisis in Europe. The study adds complexity to the analysis of data voids, expanding the framework of investigation outside the USA context by concentrating on Germany and Sweden. Building on previous big data analytics addressing the politics of exclusion, the study proposes a catalogue of queries concerning the issue of migration in both Germany and Sweden on a continuum from mainstream to extreme-right vocabularies. This catalogue of queries enables specific and localized queries to identify data voids. The results show that a search engine's reliance on source popularity may lead to extreme-right sources appearing in top positions. Furthermore, using platforms for user-generated content provides a way for localized queries to gain top positions.
这项研究采用了一种关键的大数据分析方法,以衡量谷歌搜索中数据空白的初步轮廓,这些数据空白反映了2015年欧洲人道主义危机后极右翼的排斥动态。这项研究增加了数据空白分析的复杂性,通过将重点放在德国和瑞典,将调查框架扩展到了美国之外。在之前针对排斥政治的大数据分析的基础上,该研究提出了一系列关于德国和瑞典移民问题的问题,从主流词汇到极右翼词汇。此查询目录使特定的本地化查询能够识别数据空白。结果表明,搜索引擎对来源受欢迎程度的依赖可能导致极右翼来源出现在最高位置。此外,使用用户生成内容的平台为本地化查询提供了一种获得最高职位的方式。
{"title":"Google, data voids, and the dynamics of the politics of exclusion","authors":"Ov Cristian Norocel, D. Lewandowski","doi":"10.1177/20539517221149099","DOIUrl":"https://doi.org/10.1177/20539517221149099","url":null,"abstract":"This study deploys a critical approach to big data analytics to gauge the tentative contours of data voids in Google searches that reflect extreme-right dynamics of exclusion in the aftermath of the 2015 humanitarian crisis in Europe. The study adds complexity to the analysis of data voids, expanding the framework of investigation outside the USA context by concentrating on Germany and Sweden. Building on previous big data analytics addressing the politics of exclusion, the study proposes a catalogue of queries concerning the issue of migration in both Germany and Sweden on a continuum from mainstream to extreme-right vocabularies. This catalogue of queries enables specific and localized queries to identify data voids. The results show that a search engine's reliance on source popularity may lead to extreme-right sources appearing in top positions. Furthermore, using platforms for user-generated content provides a way for localized queries to gain top positions.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":null,"pages":null},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49172525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Big Data & Society
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1