通过深度学习技术识别说话人:全面回顾和研究挑战

Nirupam Shome, Anisha Sarkar, Arit Ghosh, R. Laskar, Richik Kashyap
{"title":"通过深度学习技术识别说话人:全面回顾和研究挑战","authors":"Nirupam Shome, Anisha Sarkar, Arit Ghosh, R. Laskar, Richik Kashyap","doi":"10.3311/ppee.20971","DOIUrl":null,"url":null,"abstract":"Deep learning has now become an integral part of today's world and advancement in the field of deep learning has gained a huge development. Due to the extensive use and fast growth of deep learning, it has captured the attention of researchers in the field of speaker recognition. A detailed investigation regarding the process becomes essential and helpful to the researchers for designing robust applications in the field of speaker recognition, both in speaker verification and identification. This paper reviews the field of speaker recognition taking into consideration of deep learning advancement in the present era that boosts up this technology. The paper continues with a systematic review by firstly giving a basic idea of deep learning and its architecture with its field of application, then entering into the high-lighted portion of our paper i.e., speaker recognition which is one of the important applications of deep learning. Here we have mentioned its types, different processing techniques, challenges that come across in this technology, performance evaluation criteria, deep learning implementation frameworks, and lastly various databases used in the field of speaker identification (SI) and Speaker Verification (SV).","PeriodicalId":37664,"journal":{"name":"Periodica polytechnica Electrical engineering and computer science","volume":"31 1","pages":"300-336"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speaker Recognition through Deep Learning Techniques: A Comprehensive Review and Research Challenges\",\"authors\":\"Nirupam Shome, Anisha Sarkar, Arit Ghosh, R. Laskar, Richik Kashyap\",\"doi\":\"10.3311/ppee.20971\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning has now become an integral part of today's world and advancement in the field of deep learning has gained a huge development. Due to the extensive use and fast growth of deep learning, it has captured the attention of researchers in the field of speaker recognition. A detailed investigation regarding the process becomes essential and helpful to the researchers for designing robust applications in the field of speaker recognition, both in speaker verification and identification. This paper reviews the field of speaker recognition taking into consideration of deep learning advancement in the present era that boosts up this technology. The paper continues with a systematic review by firstly giving a basic idea of deep learning and its architecture with its field of application, then entering into the high-lighted portion of our paper i.e., speaker recognition which is one of the important applications of deep learning. Here we have mentioned its types, different processing techniques, challenges that come across in this technology, performance evaluation criteria, deep learning implementation frameworks, and lastly various databases used in the field of speaker identification (SI) and Speaker Verification (SV).\",\"PeriodicalId\":37664,\"journal\":{\"name\":\"Periodica polytechnica Electrical engineering and computer science\",\"volume\":\"31 1\",\"pages\":\"300-336\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Periodica polytechnica Electrical engineering and computer science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3311/ppee.20971\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Periodica polytechnica Electrical engineering and computer science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3311/ppee.20971","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

摘要

深度学习已经成为当今世界不可分割的一部分,深度学习领域的进步已经取得了巨大的发展。由于深度学习的广泛应用和快速发展,它已经引起了说话人识别领域研究人员的关注。详细研究这一过程对于研究人员在说话人识别领域设计稳健的应用程序至关重要,无论是在说话人验证还是身份识别方面。本文回顾了当前时代深度学习的发展对说话人识别技术的推动作用。本文首先对深度学习的基本概念、体系结构及其应用领域进行了系统的综述,然后进入了本文的重点部分,即语音识别,这是深度学习的重要应用之一。在这里,我们提到了它的类型,不同的处理技术,在该技术中遇到的挑战,性能评估标准,深度学习实现框架,以及在说话人识别(SI)和说话人验证(SV)领域使用的各种数据库。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Speaker Recognition through Deep Learning Techniques: A Comprehensive Review and Research Challenges
Deep learning has now become an integral part of today's world and advancement in the field of deep learning has gained a huge development. Due to the extensive use and fast growth of deep learning, it has captured the attention of researchers in the field of speaker recognition. A detailed investigation regarding the process becomes essential and helpful to the researchers for designing robust applications in the field of speaker recognition, both in speaker verification and identification. This paper reviews the field of speaker recognition taking into consideration of deep learning advancement in the present era that boosts up this technology. The paper continues with a systematic review by firstly giving a basic idea of deep learning and its architecture with its field of application, then entering into the high-lighted portion of our paper i.e., speaker recognition which is one of the important applications of deep learning. Here we have mentioned its types, different processing techniques, challenges that come across in this technology, performance evaluation criteria, deep learning implementation frameworks, and lastly various databases used in the field of speaker identification (SI) and Speaker Verification (SV).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Periodica polytechnica Electrical engineering and computer science
Periodica polytechnica Electrical engineering and computer science Engineering-Electrical and Electronic Engineering
CiteScore
2.60
自引率
0.00%
发文量
36
期刊介绍: The main scope of the journal is to publish original research articles in the wide field of electrical engineering and informatics fitting into one of the following five Sections of the Journal: (i) Communication systems, networks and technology, (ii) Computer science and information theory, (iii) Control, signal processing and signal analysis, medical applications, (iv) Components, Microelectronics and Material Sciences, (v) Power engineering and mechatronics, (vi) Mobile Software, Internet of Things and Wearable Devices, (vii) Solid-state lighting and (viii) Vehicular Technology (land, airborne, and maritime mobile services; automotive, radar systems; antennas and radio wave propagation).
期刊最新文献
Modeling and Study of Different Magnet Topologies in Rotor of Low Rating IPMSMs Improving Reinforcement Learning Exploration by Autoencoders A Self-adapting Pixel Antenna - Substrate Lens System for Infrared Frequencies Palmprint Identification Using Dolphin Optimization Parasitic Loaded Shorting Pin Based Compact Multi-slot LoRa Antenna for Military Application
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1