Faster BIC segmentation using local speaker modeling

R. Travadi, G. Saha
{"title":"Faster BIC segmentation using local speaker modeling","authors":"R. Travadi, G. Saha","doi":"10.1109/NCC.2012.6176884","DOIUrl":null,"url":null,"abstract":"Segmentation is typically the most computationally expensive step involved in majority of speaker diarization systems. Bayesian Information Criterion (BIC) is a very widely adopted method for segmentation of audio data. While BIC returns fairly good results in terms of segmentation performance, it suffers from the problem of enormous complexity. Moreover, BIC based diarization systems encounter the worst case complexity when there is no change point in the input audio stream at all. Many audio streams contain fairly large segments separated by a very few change points. In such cases, it becomes impractical to employ BIC segmentation because of its complexity. In this paper, we have proposed a modification to the baseline BIC segmentation scheme, which makes use of local search information to reduce the overall complexity of the segmentation procedure. The results have been tested on several audio streams from broadcast news and the diarization runtime has been found to get reduced by a factor of 3.45, with a marginally better segmentation performance.","PeriodicalId":178278,"journal":{"name":"2012 National Conference on Communications (NCC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2012.6176884","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Segmentation is typically the most computationally expensive step involved in majority of speaker diarization systems. Bayesian Information Criterion (BIC) is a very widely adopted method for segmentation of audio data. While BIC returns fairly good results in terms of segmentation performance, it suffers from the problem of enormous complexity. Moreover, BIC based diarization systems encounter the worst case complexity when there is no change point in the input audio stream at all. Many audio streams contain fairly large segments separated by a very few change points. In such cases, it becomes impractical to employ BIC segmentation because of its complexity. In this paper, we have proposed a modification to the baseline BIC segmentation scheme, which makes use of local search information to reduce the overall complexity of the segmentation procedure. The results have been tested on several audio streams from broadcast news and the diarization runtime has been found to get reduced by a factor of 3.45, with a marginally better segmentation performance.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用本地说话人建模更快的BIC分割
在大多数说话人分界系统中,分割通常是计算成本最高的步骤。贝叶斯信息准则(BIC)是一种被广泛采用的音频数据分割方法。虽然BIC在分割性能方面返回了相当好的结果,但它受到巨大复杂性的问题的困扰。此外,当输入音频流中根本没有变化点时,基于BIC的码化系统会遇到最坏情况的复杂性。许多音频流包含相当大的片段,由很少的改变点分开。在这种情况下,由于BIC分割的复杂性,使用它变得不切实际。在本文中,我们提出了一种改进的基线BIC分割方案,该方案利用局部搜索信息来降低分割过程的整体复杂性。结果已经在来自广播新闻的几个音频流上进行了测试,发现分组运行时间减少了3.45倍,分割性能略有提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Quantized modulation diversity for 64-QAM IITKGP-MLILSC speech database for language identification Strip lined - Truncated ground plane for flat response of miniaturized UWB patch antenna On the underwater wireless network clustering Faster BIC segmentation using local speaker modeling
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1