首页 > 最新文献

The 2022 Far-field Speaker Verification Challenge (FFSVC2022)最新文献

英文 中文
Cross-Domain ArcFace:Learnging Robust Speaker Representation Under the Far-Field Speaker Verification 跨域ArcFace:远场说话人验证下的稳健说话人表征学习
Pub Date : 2022-09-17 DOI: 10.21437/ffsvc.2022-2
Yuke Lin, Xiaoyi Qin, Ming Li
The system of speaker verification system shows outstanding performance with the assistance of different types of loss functions with angular margin penalty, which can enforce the intra-class compactness and inter-class discrepancy. However, the power of classification may degrade largely when encountering the cross-domain problems, especially in far-field scenes. Thus, we propose a novel Cross-Domain ArcFace(CD-ArcFace) loss function. By adopting distinct margin penalty in different domain when conducting mix-data fine-tuning, the performance of various speaker verification system can be further improved. This experiment is carried on FFSVC2022. The final score level of our fusion system for the task1 achieves 4.028% and 4.368% EER on the development set and evaluation set.
在不同类型的损失函数的辅助下,具有角度余量惩罚的说话人验证系统表现出优异的性能,可以增强类内紧密性和类间差异。然而,当遇到跨域问题时,特别是在远场场景下,分类能力可能会大大降低。因此,我们提出了一种新的跨域ArcFace(CD-ArcFace)损失函数。在进行混合数据微调时,在不同的域采用不同的余量惩罚,可以进一步提高各种说话人验证系统的性能。本实验在FFSVC2022上进行。我们的融合系统在task1的最终得分水平在开发集和评估集上分别达到4.028%和4.368%的EER。
{"title":"Cross-Domain ArcFace:Learnging Robust Speaker Representation Under the Far-Field Speaker Verification","authors":"Yuke Lin, Xiaoyi Qin, Ming Li","doi":"10.21437/ffsvc.2022-2","DOIUrl":"https://doi.org/10.21437/ffsvc.2022-2","url":null,"abstract":"The system of speaker verification system shows outstanding performance with the assistance of different types of loss functions with angular margin penalty, which can enforce the intra-class compactness and inter-class discrepancy. However, the power of classification may degrade largely when encountering the cross-domain problems, especially in far-field scenes. Thus, we propose a novel Cross-Domain ArcFace(CD-ArcFace) loss function. By adopting distinct margin penalty in different domain when conducting mix-data fine-tuning, the performance of various speaker verification system can be further improved. This experiment is carried on FFSVC2022. The final score level of our fusion system for the task1 achieves 4.028% and 4.368% EER on the development set and evaluation set.","PeriodicalId":282527,"journal":{"name":"The 2022 Far-field Speaker Verification Challenge (FFSVC2022)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114488559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ZXIC Speaker Verification System for FFSVC 2022 Challenge 面向FFSVC 2022挑战赛的ZXIC扬声器验证系统
Pub Date : 2022-09-17 DOI: 10.21437/ffsvc.2022-1
Yuan Lei, Zhou Cao, Dehui Kong, Ke Xu
This paper presents the development of ZXIC speaker verification system submitted to the task 1 of Interspeech 2022 Far-Field Speaker Verification Challenge (FFSVC2022). Deep neural network based discriminative embeddings, such as x-vectors, have been shown to perform well in speaker verification tasks. In far-field speaker verification system, mismatch between training and testing data and mismatch between enrollment and authentication utterances impact the system performance a lot. To alleviate this mismatch and improve the system performance, in this paper we propose a novel multi-reader domain adaption learning framework based on asymmetric metric learning. In this challenge, we also explore advanced neural network based embedding extractor structures including ECAPA-TDNN and ResNet-SE. A number of experiments on these architectures show that our proposed method is effective and improves the systems performance a lot. The final submitted systems are the fusion of several models. In FFSVC2022, our best system achieves a minimum of the detection cost function (minDCF) of 0.511and an equal error rate (EER) of 4.409 % on the evaluation set.
本文介绍了提交Interspeech 2022远场演讲者验证挑战赛(FFSVC2022)任务1的ZXIC演讲者验证系统的开发。基于深度神经网络的判别嵌入,如x向量,已被证明在说话人验证任务中表现良好。在远场说话人验证系统中,训练数据与测试数据不匹配、注册语音与认证语音不匹配等问题严重影响系统性能。为了缓解这种不匹配,提高系统性能,本文提出了一种基于非对称度量学习的多阅读器域自适应学习框架。在这个挑战中,我们还探索了先进的基于神经网络的嵌入提取器结构,包括ECAPA-TDNN和ResNet-SE。在这些结构上的大量实验表明,我们提出的方法是有效的,并且大大提高了系统的性能。最后提交的系统是几个模型的融合。在FFSVC2022中,我们的最佳系统在评估集上实现了最小检测成本函数(minDCF) 0.511和相等错误率(EER) 4.409%。
{"title":"ZXIC Speaker Verification System for FFSVC 2022 Challenge","authors":"Yuan Lei, Zhou Cao, Dehui Kong, Ke Xu","doi":"10.21437/ffsvc.2022-1","DOIUrl":"https://doi.org/10.21437/ffsvc.2022-1","url":null,"abstract":"This paper presents the development of ZXIC speaker verification system submitted to the task 1 of Interspeech 2022 Far-Field Speaker Verification Challenge (FFSVC2022). Deep neural network based discriminative embeddings, such as x-vectors, have been shown to perform well in speaker verification tasks. In far-field speaker verification system, mismatch between training and testing data and mismatch between enrollment and authentication utterances impact the system performance a lot. To alleviate this mismatch and improve the system performance, in this paper we propose a novel multi-reader domain adaption learning framework based on asymmetric metric learning. In this challenge, we also explore advanced neural network based embedding extractor structures including ECAPA-TDNN and ResNet-SE. A number of experiments on these architectures show that our proposed method is effective and improves the systems performance a lot. The final submitted systems are the fusion of several models. In FFSVC2022, our best system achieves a minimum of the detection cost function (minDCF) of 0.511and an equal error rate (EER) of 4.409 % on the evaluation set.","PeriodicalId":282527,"journal":{"name":"The 2022 Far-field Speaker Verification Challenge (FFSVC2022)","volume":"2010 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129127975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The 2022 Far-field Speaker Verification Challenge: Exploring domain mismatch and semi-supervised learning under the far-field scenario 2022远场说话者验证挑战:探索远场场景下的领域不匹配和半监督学习
Pub Date : 2022-09-12 DOI: 10.21437/ffsvc.2022-3
Xiaoyi Qin, Ming Li, Hui Bu, Shrikanth S. Narayanan, Haizhou Li
FFSVC2022 is the second challenge of far-field speaker verification. FFSVC2022 provides the fully-supervised far-field speaker verification to further explore the far-field scenario and proposes semi-supervised far-field speaker verification. In contrast to FFSVC2020, FFSVC2022 focus on the single-channel scenario. In addition, a supplementary set for the FFSVC2020 dataset is released this year. The supplementary set consists of more recording devices and has the same data distribution as the FFSVC2022 evaluation set. This paper summarizes the FFSVC 2022, including tasks description, trial designing details, a baseline system and a summary of challenge results. The challenge results indicate substantial progress made in the field but also present that there are still difficulties with the far-field scenario.
FFSVC2022是远场扬声器验证的第二个挑战。FFSVC2022提供全监督远场扬声器验证,进一步探索远场场景,并提出半监督远场扬声器验证。与FFSVC2020相比,FFSVC2022专注于单通道场景。此外,今年还发布了FFSVC2020数据集的补充集。补充集由更多的记录设备组成,其数据分布与FFSVC2022评估集相同。本文对FFSVC 2022进行了总结,包括任务描述、试验设计细节、基线系统和挑战结果总结。挑战结果表明该领域取得了实质性进展,但也表明远场情景仍然存在困难。
{"title":"The 2022 Far-field Speaker Verification Challenge: Exploring domain mismatch and semi-supervised learning under the far-field scenario","authors":"Xiaoyi Qin, Ming Li, Hui Bu, Shrikanth S. Narayanan, Haizhou Li","doi":"10.21437/ffsvc.2022-3","DOIUrl":"https://doi.org/10.21437/ffsvc.2022-3","url":null,"abstract":"FFSVC2022 is the second challenge of far-field speaker verification. FFSVC2022 provides the fully-supervised far-field speaker verification to further explore the far-field scenario and proposes semi-supervised far-field speaker verification. In contrast to FFSVC2020, FFSVC2022 focus on the single-channel scenario. In addition, a supplementary set for the FFSVC2020 dataset is released this year. The supplementary set consists of more recording devices and has the same data distribution as the FFSVC2022 evaluation set. This paper summarizes the FFSVC 2022, including tasks description, trial designing details, a baseline system and a summary of challenge results. The challenge results indicate substantial progress made in the field but also present that there are still difficulties with the far-field scenario.","PeriodicalId":282527,"journal":{"name":"The 2022 Far-field Speaker Verification Challenge (FFSVC2022)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123750168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
The 2022 Far-field Speaker Verification Challenge (FFSVC2022)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1