Statistical Speaker Diarization Using Dependent Combination of Extracted Features

Hasan Almgotir Kadhim, L. Woo, S. Dlay
Published in: 2015 3rd International Conference on Artificial Intelligence, Modelling and Simulation (AIMS), 2015-12-01
DOI: 10.1109/AIMS.2015.53 (https://doi.org/10.1109/AIMS.2015.53)

Abstract

This paper describes a novel method that improves the procedure for supervised speaker diarization. The procedure assumes that a database of the speakers is available. First, the speaker database and the observation signal are prepared, and audio features are extracted from both. Instead of using only one of Mel Frequency Cepstral Coefficients (MFCC), Perceptual Linear Prediction (PLP), or Power Normalized Cepstral Coefficients (PNCC), a combination of all three is used. The features are combined independently, i.e., they are concatenated in the feature matrix. The features of the observation signal are then compared with the statistical properties of the database features. The comparison procedure is used to decide the logical mask of the comparison. Bottom-up and top-down scenarios work together to reach the final decisions. A Diarization Error Rate test indicates that the combination of features produces fewer errors than any single feature alone.
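
The concatenation and comparison steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature matrices are synthetic stand-ins for MFCC, PLP, and PNCC (13 dimensions each, an assumed size), the function names are hypothetical, the mean squared z-score distance is an illustrative choice, and the frame-level error rate is only a simplified stand-in for the Diarization Error Rate.

```python
import numpy as np


def concatenate_features(*feature_matrices):
    """Stack per-frame feature matrices (frames x dims) side by side."""
    frame_counts = {m.shape[0] for m in feature_matrices}
    if len(frame_counts) != 1:
        raise ValueError("all feature matrices must have the same number of frames")
    return np.hstack(feature_matrices)


def speaker_statistics(features):
    """Per-dimension mean and standard deviation of one speaker's database features."""
    # A small constant avoids division by zero for constant dimensions.
    return features.mean(axis=0), features.std(axis=0) + 1e-8


def assign_frames(observation, speaker_stats):
    """Label each observation frame with the statistically closest speaker.

    Distance is the mean squared z-score against each speaker's statistics
    (an assumption made for illustration, not taken from the paper).
    """
    names = list(speaker_stats)
    distances = np.stack(
        [
            (((observation - speaker_stats[n][0]) / speaker_stats[n][1]) ** 2).mean(axis=1)
            for n in names
        ],
        axis=1,
    )
    return [names[i] for i in distances.argmin(axis=1)]


def demo():
    rng = np.random.default_rng(0)

    def fake_speaker(offset, n_frames):
        # Synthetic stand-ins for three feature types of 13 dimensions each.
        mfcc = rng.normal(offset, 1.0, (n_frames, 13))
        plp = rng.normal(offset, 1.0, (n_frames, 13))
        pncc = rng.normal(offset, 1.0, (n_frames, 13))
        return concatenate_features(mfcc, plp, pncc)

    # "Database" of known speakers and their statistical properties.
    database = {"spk_a": fake_speaker(-3.0, 200), "spk_b": fake_speaker(3.0, 200)}
    stats = {name: speaker_statistics(feats) for name, feats in database.items()}

    # Observation signal: 50 frames of speaker A followed by 50 of speaker B.
    observation = np.vstack([fake_speaker(-3.0, 50), fake_speaker(3.0, 50)])
    reference = ["spk_a"] * 50 + ["spk_b"] * 50

    hypothesis = assign_frames(observation, stats)
    error_rate = float(np.mean([h != r for h, r in zip(hypothesis, reference)]))
    return observation.shape, hypothesis, error_rate


if __name__ == "__main__":
    shape, labels, err = demo()
    print("observation feature matrix:", shape)
    print("frame-level error rate:", err)
```

In this sketch the concatenated matrix has one row per frame and one column per coefficient from each feature type, so every frame is compared against the speaker statistics in the joint feature space rather than in any single one.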