一种高效的无监督语音活动检测器,用于清洁语音

Mamta Kumari, I. Ali
{"title":"一种高效的无监督语音活动检测器,用于清洁语音","authors":"Mamta Kumari, I. Ali","doi":"10.1109/CCINTELS.2015.7437913","DOIUrl":null,"url":null,"abstract":"In the recent years, the trend of speech signal processing becomes one of the most challenging fields. Now a days, several voice based biometry authentication systems are available to comply the requirements of our daily needs. The overall performances of these systems are totally based on the complexity of each part of the algorithm. Voice Activity Detector (VAD) is one of such relevant part. It can be designed in time domain as well as frequency domain. Time domain VAD have low complexity than the frequency domain or, other domain, like, wavelet transform domain based VAD. Normally, VAD is used to remove the silence portions from the speech signal and a suitable VAD must have high speech activity detection rate (HR1) and high non-speech detection rate (HR0). HP1 and HP0 are quantitative measurement of VAD accuracy to enumerate the speech, and silence part correctly from a speech signal. In this paper, we consider only time domain VAD for clean speech but, the existing time domain based VAD algorithms are not suitable to meet both of them simultaneously. Therefore, we design an unsupervised VAD algorithm which has high HR1, as well as, high HR0. The main advantage of unsupervised data segmentation is that no prior threshold information is required. Here, we use K-mean as an unsupervised clustering technique. In this paper, we compare different time domain voice activity detectors with our proposed VAD which we call it KVD (K-mean Voice-activity Detector). The results of that comparison show the highest performance of our proposed VAD.","PeriodicalId":131816,"journal":{"name":"2015 Communication, Control and Intelligent Systems (CCIS)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An efficient un-supervised Voice Activity Detector for clean speech\",\"authors\":\"Mamta Kumari, I. Ali\",\"doi\":\"10.1109/CCINTELS.2015.7437913\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the recent years, the trend of speech signal processing becomes one of the most challenging fields. Now a days, several voice based biometry authentication systems are available to comply the requirements of our daily needs. The overall performances of these systems are totally based on the complexity of each part of the algorithm. Voice Activity Detector (VAD) is one of such relevant part. It can be designed in time domain as well as frequency domain. Time domain VAD have low complexity than the frequency domain or, other domain, like, wavelet transform domain based VAD. Normally, VAD is used to remove the silence portions from the speech signal and a suitable VAD must have high speech activity detection rate (HR1) and high non-speech detection rate (HR0). HP1 and HP0 are quantitative measurement of VAD accuracy to enumerate the speech, and silence part correctly from a speech signal. In this paper, we consider only time domain VAD for clean speech but, the existing time domain based VAD algorithms are not suitable to meet both of them simultaneously. Therefore, we design an unsupervised VAD algorithm which has high HR1, as well as, high HR0. The main advantage of unsupervised data segmentation is that no prior threshold information is required. Here, we use K-mean as an unsupervised clustering technique. In this paper, we compare different time domain voice activity detectors with our proposed VAD which we call it KVD (K-mean Voice-activity Detector). The results of that comparison show the highest performance of our proposed VAD.\",\"PeriodicalId\":131816,\"journal\":{\"name\":\"2015 Communication, Control and Intelligent Systems (CCIS)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 Communication, Control and Intelligent Systems (CCIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCINTELS.2015.7437913\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Communication, Control and Intelligent Systems (CCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCINTELS.2015.7437913","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

近年来,语音信号处理的发展趋势成为最具挑战性的领域之一。目前,有几种基于语音的生物识别认证系统可以满足我们的日常需求。这些系统的整体性能完全取决于算法各部分的复杂度。语音活动检测器(VAD)就是其中一个重要组成部分。既可以在时域设计,也可以在频域设计。时域VAD比频域或其他域,如基于小波变换域的VAD具有较低的复杂性。通常,VAD用于去除语音信号中的沉默部分,合适的VAD必须具有高的语音活动检测率(HR1)和高的非语音检测率(HR0)。HP1和HP0是VAD精度的定量测量,用于从语音信号中枚举语音,并正确地沉默部分。在本文中,我们只考虑时域的VAD,但是现有的基于时域的VAD算法并不适合同时满足这两种情况。因此,我们设计了一种具有高HR1和高HR0的无监督VAD算法。无监督数据分割的主要优点是不需要先验阈值信息。在这里,我们使用K-mean作为一种无监督聚类技术。在本文中,我们比较了不同的时域语音活动检测器与我们提出的VAD,我们称之为KVD (K-mean voice -activity Detector)。比较的结果显示了我们所提出的VAD的最高性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
An efficient un-supervised Voice Activity Detector for clean speech
In the recent years, the trend of speech signal processing becomes one of the most challenging fields. Now a days, several voice based biometry authentication systems are available to comply the requirements of our daily needs. The overall performances of these systems are totally based on the complexity of each part of the algorithm. Voice Activity Detector (VAD) is one of such relevant part. It can be designed in time domain as well as frequency domain. Time domain VAD have low complexity than the frequency domain or, other domain, like, wavelet transform domain based VAD. Normally, VAD is used to remove the silence portions from the speech signal and a suitable VAD must have high speech activity detection rate (HR1) and high non-speech detection rate (HR0). HP1 and HP0 are quantitative measurement of VAD accuracy to enumerate the speech, and silence part correctly from a speech signal. In this paper, we consider only time domain VAD for clean speech but, the existing time domain based VAD algorithms are not suitable to meet both of them simultaneously. Therefore, we design an unsupervised VAD algorithm which has high HR1, as well as, high HR0. The main advantage of unsupervised data segmentation is that no prior threshold information is required. Here, we use K-mean as an unsupervised clustering technique. In this paper, we compare different time domain voice activity detectors with our proposed VAD which we call it KVD (K-mean Voice-activity Detector). The results of that comparison show the highest performance of our proposed VAD.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
EMG signal based finger movement recognition for prosthetic hand control Gain and directivity enhancement of microstrip patch array antenna with metallic ring for WLAN/Wi-Fi applications Analysis of prosody based automatic LID systems A novel framework for adaptive user interface Design of FIR band-pass digital filter using Heuristic Optimization Technique: A comparison
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1