Nguyen Nang An, Tran Thanh Trung, Tran Kim Hoan, Nguyen Tuan Anh, Pham Minh Doanh
{"title":"Integrate deep neural network and support vector machine to improve the quality of voice processing in internet of things devices","authors":"Nguyen Nang An, Tran Thanh Trung, Tran Kim Hoan, Nguyen Tuan Anh, Pham Minh Doanh","doi":"10.56824/vujs.2023a005","DOIUrl":null,"url":null,"abstract":"Along with the development of science and technology, especially internet of things (IOT), IOT-related products increasingly contribute to improving people’s life. Among those products, it is impossible not to mention smarts city, internet of Vehicle devices and especially smart home, which are usually voice controlled. Therefore, voice processing technology is also in need of improvement. The article mainly focuses on processing human voice independently of text. In particular, Convolutional network (CNN) and Support Vector Machine (SVM) will be integrated to create Feature Building Machine. SVMs are often used in voice and image classification, which accordingly is a critical and swift data sorter. The article analyzes the advantages of the combination Deep Neural Network (DNN) and SVMs in voice recognition and is the foundation to develop devices for smart home. The experimental results, which was used in the standard Voxceleb database, demonstrate the superiority in sound recognition compared to traditional i-vector methods or other CNN methods.","PeriodicalId":447825,"journal":{"name":"Vinh University Journal of Science","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vinh University Journal of Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56824/vujs.2023a005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Along with the development of science and technology, especially internet of things (IOT), IOT-related products increasingly contribute to improving people’s life. Among those products, it is impossible not to mention smarts city, internet of Vehicle devices and especially smart home, which are usually voice controlled. Therefore, voice processing technology is also in need of improvement. The article mainly focuses on processing human voice independently of text. In particular, Convolutional network (CNN) and Support Vector Machine (SVM) will be integrated to create Feature Building Machine. SVMs are often used in voice and image classification, which accordingly is a critical and swift data sorter. The article analyzes the advantages of the combination Deep Neural Network (DNN) and SVMs in voice recognition and is the foundation to develop devices for smart home. The experimental results, which was used in the standard Voxceleb database, demonstrate the superiority in sound recognition compared to traditional i-vector methods or other CNN methods.
随着科技尤其是物联网的发展,物联网相关产品越来越多地为人们的生活做出贡献。在这些产品中,不可能不提到智能城市,车联网设备,特别是智能家居,它们通常是语音控制的。因此,语音处理技术也需要改进。本文主要研究独立于文本的人类语音处理。特别是将卷积网络(CNN)和支持向量机(SVM)相结合,形成Feature Building Machine。支持向量机通常用于语音和图像分类,因此它是一种关键和快速的数据分类器。分析了深度神经网络(DNN)与支持向量机(svm)相结合在语音识别中的优势,为智能家居设备的开发奠定了基础。在标准Voxceleb数据库中使用的实验结果表明,与传统的i向量方法或其他CNN方法相比,该方法在声音识别方面具有优势。