客观的眼光

IF 1.2 Q2 CULTURAL STUDIES Social Text Pub Date : 2023-09-01 DOI:10.1215/01642472-10613653

James E. Dobson

{"title":"客观的眼光","authors":"James E. Dobson","doi":"10.1215/01642472-10613653","DOIUrl":null,"url":null,"abstract":"Convolutional neural networks (CNNs) are a key technology powering the automated technologies of seeing known as computer vision. CNNs have been especially successful in systems that perform object recognition from visual data. This article examines the persistence of a mid-twentieth-century ontology of the digital image in these contemporary technologies. While CNNs are multidimensional, their ontology flattens distinctions between background and foreground, between subjects and objects, and even the relations established among the categories of information used to organize and train these models. This ontology enables the introduction and amplification of bias and troubling correlations and the transfer or slippage of learned associations between humans and objects found in the training image archives. Inspecting and interpreting what CNNs learn and index through their complex architectures can be difficult if not impossible because of how they encode and obfuscate quite human ways of seeing the world and the image repertoires used to train these algorithms that are rife with residues of prior representations.","PeriodicalId":47701,"journal":{"name":"Social Text","volume":"21 1","pages":"0"},"PeriodicalIF":1.2000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Objective Vision\",\"authors\":\"James E. Dobson\",\"doi\":\"10.1215/01642472-10613653\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional neural networks (CNNs) are a key technology powering the automated technologies of seeing known as computer vision. CNNs have been especially successful in systems that perform object recognition from visual data. This article examines the persistence of a mid-twentieth-century ontology of the digital image in these contemporary technologies. While CNNs are multidimensional, their ontology flattens distinctions between background and foreground, between subjects and objects, and even the relations established among the categories of information used to organize and train these models. This ontology enables the introduction and amplification of bias and troubling correlations and the transfer or slippage of learned associations between humans and objects found in the training image archives. Inspecting and interpreting what CNNs learn and index through their complex architectures can be difficult if not impossible because of how they encode and obfuscate quite human ways of seeing the world and the image repertoires used to train these algorithms that are rife with residues of prior representations.\",\"PeriodicalId\":47701,\"journal\":{\"name\":\"Social Text\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2023-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Social Text\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1215/01642472-10613653\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CULTURAL STUDIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Social Text","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1215/01642472-10613653","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CULTURAL STUDIES","Score":null,"Total":0}

引用次数: 1

摘要

卷积神经网络(cnn)是驱动自动视觉技术的关键技术，被称为计算机视觉。cnn在从视觉数据中进行物体识别的系统中尤其成功。这篇文章探讨了二十世纪中期的数字图像本体在这些当代技术中的持久性。虽然cnn是多维的，但它们的本体扁平化了背景和前景、主题和对象之间的区别，甚至是用于组织和训练这些模型的信息类别之间建立的关系。这个本体允许引入和放大偏见和令人不安的相关性，以及在训练图像档案中发现的人与对象之间的学习关联的转移或滑动。检查和解释cnn通过其复杂的架构学习和索引的内容可能是困难的，如果不是不可能的话，因为它们是如何编码和模糊人类看待世界的方式的，以及用于训练这些算法的图像库，这些算法充满了先验表示的残馀。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Objective Vision

Convolutional neural networks (CNNs) are a key technology powering the automated technologies of seeing known as computer vision. CNNs have been especially successful in systems that perform object recognition from visual data. This article examines the persistence of a mid-twentieth-century ontology of the digital image in these contemporary technologies. While CNNs are multidimensional, their ontology flattens distinctions between background and foreground, between subjects and objects, and even the relations established among the categories of information used to organize and train these models. This ontology enables the introduction and amplification of bias and troubling correlations and the transfer or slippage of learned associations between humans and objects found in the training image archives. Inspecting and interpreting what CNNs learn and index through their complex architectures can be difficult if not impossible because of how they encode and obfuscate quite human ways of seeing the world and the image repertoires used to train these algorithms that are rife with residues of prior representations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助