Algorithmic Ventriloquism: The Contested State of Voice in AI Speech Generators
Author: Ido Ramati
Journal: Social Media + Society (JCR Q1, Communication)
Publication date: 2024-01-01
DOI: 10.1177/20563051231224401 (https://doi.org/10.1177/20563051231224401)
This article explores the vocal human–machine relations embedded in text-to-speech (TTS) generators. Retracing the human sources behind the synthetic speech and tracking the remediation of the voice by the machine-learning algorithm, it argues that artificial intelligence (AI) speaking agents such as Siri and Alexa, as well as other TTS acts such as TikTok’s, are performing algorithmic ventriloquism. Speaking mechanically with the voices of professional voiceover artists, AI speech technologies algorithmically manipulate these voices, thus generating personas that hold an interconnected chain of tensions between the embodied and the virtual, the particular and the general, the human and the non-human, as well as between speech and writing. Algorithmic ventriloquism serves as an analytical framework to tie the techno-vocalic operation of the TTS system with its cultural, economic, philosophical, and sociolinguistic predicaments. The last section discusses the implications of algorithmic ventriloquism beyond the realm of the voice.
About the journal:
Social Media + Society is an open access, peer-reviewed scholarly journal that focuses on the socio-cultural, political, psychological, historical, economic, legal and policy dimensions of social media in societies past, contemporary and future. We publish interdisciplinary work that draws from the social sciences, humanities and computational social sciences, reaches out to the arts and natural sciences, and we endorse mixed methods and methodologies. The journal is open to a diversity of theoretical paradigms and methodologies. The editorial vision of Social Media + Society draws inspiration from research on social media to outline a field of study poised to reflexively grow as social technologies evolve. We foster the open-access sharing of research on the social properties of media, as they manifest themselves through the uses people make of networked platforms past and present, digital and non-digital. The journal presents a collaborative, open, and shared space, dedicated exclusively to the study of social media and their implications for societies. It facilitates state-of-the-art research on cutting-edge trends and allows scholars to focus on and track trends specific to this field of study.