Formant manipulations in voice disguise by mimicry

2016 4th International Conference on Biometrics and Forensics (IWBF) Pub Date : 2016-03-03 DOI:10.1109/IWBF.2016.7449675

Rita Singh, D. Gençaga, B. Raj

{"title":"Formant manipulations in voice disguise by mimicry","authors":"Rita Singh, D. Gençaga, B. Raj","doi":"10.1109/IWBF.2016.7449675","DOIUrl":null,"url":null,"abstract":"The human voice can be disguised in many ways. The purpose of disguise could either be to impersonate another person, or to conceal the identity of the original speaker, or both. On the other hand, the goal of any biometric analysis on disguised voices could also be twofold: either to find out if the originator of the disguised voice is a given speaker, or to know how a speaker's voice can be manipulated so that the extent and type of disguise that the speaker can perform can be guessed a-priori. Any analysis toward the former goal must rely on the knowledge of what characteristics of a person's voice are least affected or unaffected by attempted disguise. Analysis towards the latter goal must use the knowledge of what sounds are typically most amenable to voluntary variation by the speaker, so that the extent to which given speakers can successfully disguise their voice can be estimated. Our paper attempts to establish a simple methodology for analysis of voice for both goals. We study the voice impersonations performed by an expert mimic, focusing specifically on formants and formant-related measurements, to find out the extent and type of formant manipulations that are performed by the expert at the level of individual phonemes. Expert mimicry is an extreme form of attempted disguise. Our study is presented with the expectation that non-expert attempts at voice disguise by mimicry will fall within the gold standard of manipulation patterns set by an expert mimic, and that it is therefore useful to establish this gold standard.","PeriodicalId":282164,"journal":{"name":"2016 4th International Conference on Biometrics and Forensics (IWBF)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 4th International Conference on Biometrics and Forensics (IWBF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWBF.2016.7449675","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

Abstract

The human voice can be disguised in many ways. The purpose of disguise could either be to impersonate another person, or to conceal the identity of the original speaker, or both. On the other hand, the goal of any biometric analysis on disguised voices could also be twofold: either to find out if the originator of the disguised voice is a given speaker, or to know how a speaker's voice can be manipulated so that the extent and type of disguise that the speaker can perform can be guessed a-priori. Any analysis toward the former goal must rely on the knowledge of what characteristics of a person's voice are least affected or unaffected by attempted disguise. Analysis towards the latter goal must use the knowledge of what sounds are typically most amenable to voluntary variation by the speaker, so that the extent to which given speakers can successfully disguise their voice can be estimated. Our paper attempts to establish a simple methodology for analysis of voice for both goals. We study the voice impersonations performed by an expert mimic, focusing specifically on formants and formant-related measurements, to find out the extent and type of formant manipulations that are performed by the expert at the level of individual phonemes. Expert mimicry is an extreme form of attempted disguise. Our study is presented with the expectation that non-expert attempts at voice disguise by mimicry will fall within the gold standard of manipulation patterns set by an expert mimic, and that it is therefore useful to establish this gold standard.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过模仿进行声音伪装的共振峰操纵

人的声音可以通过很多方式伪装。伪装的目的可能是模仿另一个人，或者隐藏原来说话人的身份，或者两者兼而有之。另一方面，对伪装的声音进行任何生物特征分析的目标也可能是双重的:要么找出伪装声音的发起者是否是给定的说话者，要么知道说话者的声音是如何被操纵的，以便可以先验地猜测说话者伪装的程度和类型。任何对前一个目标的分析都必须依赖于一个人的声音的哪些特征是最不受伪装影响的。对后一个目标的分析必须要用到什么样的声音最容易被说话者随意改变的知识，这样就可以估计出说话者成功伪装自己声音的程度。我们的论文试图建立一个简单的方法来分析这两个目标的声音。我们研究了由专家模仿的声音模仿，特别关注共振峰和共振峰相关的测量，以找出专家在单个音素水平上执行的共振峰操纵的程度和类型。熟练的模仿是一种极端的伪装。我们的研究期望通过模仿来伪装声音的非专家尝试将落入由专家模仿设定的操纵模式的黄金标准，因此建立这一黄金标准是有用的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 4th International Conference on Biometrics and Forensics (IWBF)

自引率

0.00%

发文量