TransVoice

The Adjunct Publication of the 32nd Annual ACM Symposium on User Interface Software and Technology
Pub Date: 2019-10-14
DOI: 10.1145/3332167.3357106

Riku Arakawa, Shinnosuke Takamichi, Hiroshi Saruwatari

Citations: 2

Abstract

Despite promising initial studies, a speaker's original voice poses a problem when real-time voice conversion (data-driven speaker conversion) technology is applied in daily life, particularly in near-field communication: the original speech overlaps with the converted speech and degrades the sense of immersion in it. We present TransVoice, a real-time voice conversion system that physically confines the original speech with a mask-shaped device. Our preliminary study shows that the proposed device significantly reduces the volume of the original speech, while an integrated filter that weakens the low-frequency range mitigates the degraded conversion quality of the deep neural network (DNN). We discuss novel applications of TransVoice that can augment our communication.
