基于隐马尔可夫模型/人工神经网络(HMM/ANN)混合关键字识别框架的小词汇量菲律宾语脏话自动抑制系统

2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM) Pub Date : 2014-11-01 DOI:10.1109/HNICEM.2014.7016183

Fernando I. Ablaza, Timothy Oliver D. Danganan, Bryan Paul L. Javier, Kevin S. Manalang, Denise Erica V. Montalvo, L. Ambata

{"title":"基于隐马尔可夫模型/人工神经网络(HMM/ANN)混合关键字识别框架的小词汇量菲律宾语脏话自动抑制系统","authors":"Fernando I. Ablaza, Timothy Oliver D. Danganan, Bryan Paul L. Javier, Kevin S. Manalang, Denise Erica V. Montalvo, L. Ambata","doi":"10.1109/HNICEM.2014.7016183","DOIUrl":null,"url":null,"abstract":"This paper describes an implementation of speech recognition that recognizes and suppresses ten (10) defined profane and vulgar Filipino words. The adapted speech recognition architecture was that of the Oregon Graduate Institute's (OGI) Center for Spoken Language and Learning (CSLU). It utilizes a hybrid Hidden Markov Model/ Artificial Neural Network (HMM/ANN) keyword spotting framework. The feature extraction method used was Mel-Frequency Cepstral Coefficients (MFCC). The ANN is a 3-layer feedforward neural network using Multi-Layer Perceptron (MLP). In recognizing the words, an HMM decoder was used which implemented the Viterbi Beam Search Algorithm. Whenever a profane word was recognized, it would be replaced with a constant frequency tone. The training and testing data (recordings) were gathered from 30 random (15 male and 15 female) Filipino speakers.","PeriodicalId":309548,"journal":{"name":"2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A small vocabulary automatic filipino speech profanity suppression system using hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) keyword spotting framework\",\"authors\":\"Fernando I. Ablaza, Timothy Oliver D. Danganan, Bryan Paul L. Javier, Kevin S. Manalang, Denise Erica V. Montalvo, L. Ambata\",\"doi\":\"10.1109/HNICEM.2014.7016183\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes an implementation of speech recognition that recognizes and suppresses ten (10) defined profane and vulgar Filipino words. The adapted speech recognition architecture was that of the Oregon Graduate Institute's (OGI) Center for Spoken Language and Learning (CSLU). It utilizes a hybrid Hidden Markov Model/ Artificial Neural Network (HMM/ANN) keyword spotting framework. The feature extraction method used was Mel-Frequency Cepstral Coefficients (MFCC). The ANN is a 3-layer feedforward neural network using Multi-Layer Perceptron (MLP). In recognizing the words, an HMM decoder was used which implemented the Viterbi Beam Search Algorithm. Whenever a profane word was recognized, it would be replaced with a constant frequency tone. The training and testing data (recordings) were gathered from 30 random (15 male and 15 female) Filipino speakers.\",\"PeriodicalId\":309548,\"journal\":{\"name\":\"2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)\",\"volume\":\"76 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HNICEM.2014.7016183\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HNICEM.2014.7016183","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

本文描述了一种语音识别的实现，它可以识别和抑制十(10)个定义为亵渎和粗俗的菲律宾词。改编的语音识别架构是俄勒冈研究生院(OGI)口语和学习中心(CSLU)的。它利用隐马尔可夫模型/人工神经网络(HMM/ANN)混合关键字识别框架。特征提取方法为Mel-Frequency Cepstral Coefficients (MFCC)。该神经网络是一个采用多层感知器(MLP)的三层前馈神经网络。在单词识别中，采用HMM解码器实现维特比波束搜索算法。每当一个亵渎的词被识别出来，它就会被一个恒定频率的音调所取代。训练和测试数据(录音)是随机从30名菲律宾人(15名男性和15名女性)中收集的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A small vocabulary automatic filipino speech profanity suppression system using hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) keyword spotting framework

This paper describes an implementation of speech recognition that recognizes and suppresses ten (10) defined profane and vulgar Filipino words. The adapted speech recognition architecture was that of the Oregon Graduate Institute's (OGI) Center for Spoken Language and Learning (CSLU). It utilizes a hybrid Hidden Markov Model/ Artificial Neural Network (HMM/ANN) keyword spotting framework. The feature extraction method used was Mel-Frequency Cepstral Coefficients (MFCC). The ANN is a 3-layer feedforward neural network using Multi-Layer Perceptron (MLP). In recognizing the words, an HMM decoder was used which implemented the Viterbi Beam Search Algorithm. Whenever a profane word was recognized, it would be replaced with a constant frequency tone. The training and testing data (recordings) were gathered from 30 random (15 male and 15 female) Filipino speakers.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)

自引率

0.00%

发文量