{"title":"Depersonalization of Speech Using Speaker-Specific Transform Based on Long-Term Spectrum","authors":"M. Rujzl, M. Sigmund","doi":"10.13164/re.2023.0523","DOIUrl":null,"url":null,"abstract":". This paper introduces a novel approach for hiding personal information in speech signals. The proposed approach applied a transform warping function, which is obtained from a long-term linear prediction spectrum individually for each speaker. The depersonalized speech was compared with the often used technique based on vocal tract length normalization. The proposed approach performs wider manipulation of fundamental frequency and provides higher intelligibility by 5% in clean speech and by 8% for signal-to-noise ratio 5 dB. It also significantly alters the derived glottal pulses, making them difficult to use for personality analysis. Speech intelligibility index and glottal pulse distortion are new aspects in the field of voice depersonalization.","PeriodicalId":54514,"journal":{"name":"Radioengineering","volume":null,"pages":null},"PeriodicalIF":0.5000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radioengineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.13164/re.2023.0523","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
. This paper introduces a novel approach for hiding personal information in speech signals. The proposed approach applied a transform warping function, which is obtained from a long-term linear prediction spectrum individually for each speaker. The depersonalized speech was compared with the often used technique based on vocal tract length normalization. The proposed approach performs wider manipulation of fundamental frequency and provides higher intelligibility by 5% in clean speech and by 8% for signal-to-noise ratio 5 dB. It also significantly alters the derived glottal pulses, making them difficult to use for personality analysis. Speech intelligibility index and glottal pulse distortion are new aspects in the field of voice depersonalization.
期刊介绍:
Since 1992, the Radioengineering Journal has been publishing original scientific and engineering papers from the area of wireless communication and application of wireless technologies. The submitted papers are expected to deal with electromagnetics (antennas, propagation, microwaves), signals, circuits, optics and related fields.
Each issue of the Radioengineering Journal is started by a feature article. Feature articles are organized by members of the Editorial Board to present the latest development in the selected areas of radio engineering.
The Radioengineering Journal makes a maximum effort to publish submitted papers as quickly as possible. The first round of reviews should be completed within two months. Then, authors are expected to improve their manuscript within one month. If substantial changes are recommended and further reviews are requested by the reviewers, the publication time is prolonged.