Algorithmic Ventriloquism: The Contested State of Voice in AI Speech Generators

Social Media + Society; IF 5.5; Zone 1 (Literature); JCR Q1 (Communication); Pub Date: 2024-01-01; DOI: 10.1177/20563051231224401
Ido Ramati

Abstract

This article explores the vocal human–machine relations embedded in text-to-speech (TTS) generators. Retracing the human sources behind the synthetic speech and tracking the remediation of the voice by the machine-learning algorithm, it argues that artificial intelligence (AI) speaking agents such as Siri and Alexa, as well as other TTS acts such as TikTok’s, are performing algorithmic ventriloquism. Speaking mechanically with the voices of professional voiceover artists, AI speech technologies algorithmically manipulate these voices, thus generating personas that hold an interconnected chain of tensions between the embodied and the virtual, the particular and the general, the human and the non-human, as well as between speech and writing. Algorithmic ventriloquism serves as an analytical framework to tie the techno-vocalic operation of the TTS system with its cultural, economic, philosophical, and sociolinguistic predicaments. The last section discusses the implications of algorithmic ventriloquism beyond the realm of the voice.
Source journal: Social Media + Society
CiteScore: 9.20
Self-citation rate: 3.80%
Articles published: 111
Review time: 12 weeks
Journal description: Social Media + Society is an open access, peer-reviewed scholarly journal that focuses on the socio-cultural, political, psychological, historical, economic, legal and policy dimensions of social media in societies past, contemporary and future. We publish interdisciplinary work that draws from the social sciences, humanities and computational social sciences, reaches out to the arts and natural sciences, and we endorse mixed methods and methodologies. The journal is open to a diversity of theoretic paradigms and methodologies. The editorial vision of Social Media + Society draws inspiration from research on social media to outline a field of study poised to reflexively grow as social technologies evolve. We foster the open access of sharing of research on the social properties of media, as they manifest themselves through the uses people make of networked platforms past and present, digital and non. The journal presents a collaborative, open, and shared space, dedicated exclusively to the study of social media and their implications for societies. It facilitates state-of-the-art research on cutting-edge trends and allows scholars to focus and track trends specific to this field of study.