Alexandre Costa Ferro Filho, Elisa Ayumi Masasi de Oliveira, Iago Alves Brito, Pedro Martins Bittencourt
{"title":"Implementation and Applications of WakeWords Integrated with Speaker Recognition: A Case Study","authors":"Alexandre Costa Ferro Filho, Elisa Ayumi Masasi de Oliveira, Iago Alves Brito, Pedro Martins Bittencourt","doi":"arxiv-2407.18985","DOIUrl":null,"url":null,"abstract":"This paper explores the application of artificial intelligence techniques in\naudio and voice processing, focusing on the integration of wake words and\nspeaker recognition for secure access in embedded systems. With the growing\nprevalence of voice-activated devices such as Amazon Alexa, ensuring secure and\nuser-specific interactions has become paramount. Our study aims to enhance the\nsecurity framework of these systems by leveraging wake words for initial\nactivation and speaker recognition to validate user permissions. By\nincorporating these AI-driven methodologies, we propose a robust solution that\nrestricts system usage to authorized individuals, thereby mitigating\nunauthorized access risks. This research delves into the algorithms and\ntechnologies underpinning wake word detection and speaker recognition,\nevaluates their effectiveness in real-world applications, and discusses the\npotential for their implementation in various embedded systems, emphasizing\nsecurity and user convenience. The findings underscore the feasibility and\nadvantages of employing these AI techniques to create secure, user-friendly\nvoice-activated systems.","PeriodicalId":501178,"journal":{"name":"arXiv - CS - Sound","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Sound","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.18985","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper explores the application of artificial intelligence techniques in
audio and voice processing, focusing on the integration of wake words and
speaker recognition for secure access in embedded systems. With the growing
prevalence of voice-activated devices such as Amazon Alexa, ensuring secure and
user-specific interactions has become paramount. Our study aims to enhance the
security framework of these systems by leveraging wake words for initial
activation and speaker recognition to validate user permissions. By
incorporating these AI-driven methodologies, we propose a robust solution that
restricts system usage to authorized individuals, thereby mitigating
unauthorized access risks. This research delves into the algorithms and
technologies underpinning wake word detection and speaker recognition,
evaluates their effectiveness in real-world applications, and discusses the
potential for their implementation in various embedded systems, emphasizing
security and user convenience. The findings underscore the feasibility and
advantages of employing these AI techniques to create secure, user-friendly
voice-activated systems.