自动识别性别和口音的印地语口语话语与区域印度口音

Kamini Malhotra, A. Khosla
{"title":"自动识别性别和口音的印地语口语话语与区域印度口音","authors":"Kamini Malhotra, A. Khosla","doi":"10.1109/SLT.2008.4777902","DOIUrl":null,"url":null,"abstract":"In the past significant effort has been focused on automatic extraction of information from speech signals. Most techniques have aimed at automatic speech recognition or speaker identification. Automatic accent identification (AID) has received far less attention. This paper gives an approach to identify gender and accent of a speaker using Gaussian mixture modeling technique. The proposed approach is text independent and identifies accent among four regional Indian accents in spoken Hindi and also identifies the gender. The accents worked upon are Kashmiri, Manipuri, Bengali and neutral Hindi. The Gaussian mixture model (GMM) approach precludes the need of speech segmentation for training and makes the implementation of the system very simple. When gender dependent GMMs are used, the accent identification score is enhanced and gender is also correctly recognized. The results show that the GMMs lend themselves to accent and gender identification task very well. In this approach spectral features have been incorporated in the form of mel frequency cepstral coefficients (MFCC). The approach has a wide scope of expansion to incorporate other regional accents in a very simple way.","PeriodicalId":186876,"journal":{"name":"2008 IEEE Spoken Language Technology Workshop","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":"{\"title\":\"Automatic identification of gender & accent in spoken Hindi utterances with regional Indian accents\",\"authors\":\"Kamini Malhotra, A. Khosla\",\"doi\":\"10.1109/SLT.2008.4777902\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the past significant effort has been focused on automatic extraction of information from speech signals. Most techniques have aimed at automatic speech recognition or speaker identification. Automatic accent identification (AID) has received far less attention. This paper gives an approach to identify gender and accent of a speaker using Gaussian mixture modeling technique. The proposed approach is text independent and identifies accent among four regional Indian accents in spoken Hindi and also identifies the gender. The accents worked upon are Kashmiri, Manipuri, Bengali and neutral Hindi. The Gaussian mixture model (GMM) approach precludes the need of speech segmentation for training and makes the implementation of the system very simple. When gender dependent GMMs are used, the accent identification score is enhanced and gender is also correctly recognized. The results show that the GMMs lend themselves to accent and gender identification task very well. In this approach spectral features have been incorporated in the form of mel frequency cepstral coefficients (MFCC). The approach has a wide scope of expansion to incorporate other regional accents in a very simple way.\",\"PeriodicalId\":186876,\"journal\":{\"name\":\"2008 IEEE Spoken Language Technology Workshop\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"20\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE Spoken Language Technology Workshop\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SLT.2008.4777902\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Spoken Language Technology Workshop","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SLT.2008.4777902","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 20

摘要

过去,人们一直致力于从语音信号中自动提取信息。大多数技术的目标是自动语音识别或说话人识别。自动口音识别(AID)的研究很少受到关注。本文提出了一种利用高斯混合建模技术识别说话人性别和口音的方法。所提出的方法是文本独立的,可以识别印地语口语中四个地区印度口音中的口音,也可以识别性别。这些口音包括克什米尔语、曼尼普尔语、孟加拉语和中立的印地语。高斯混合模型(GMM)方法排除了训练中语音分割的需要,使得系统的实现非常简单。当使用性别依赖gmm时,口音识别得分提高,性别识别也正确。结果表明,gmm在口音和性别识别任务中表现良好。在这种方法中,频谱特征以mel频率倒谱系数(MFCC)的形式被纳入。这种方法有很大的扩展空间,可以用一种非常简单的方式融入其他地区的口音。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Automatic identification of gender & accent in spoken Hindi utterances with regional Indian accents
In the past significant effort has been focused on automatic extraction of information from speech signals. Most techniques have aimed at automatic speech recognition or speaker identification. Automatic accent identification (AID) has received far less attention. This paper gives an approach to identify gender and accent of a speaker using Gaussian mixture modeling technique. The proposed approach is text independent and identifies accent among four regional Indian accents in spoken Hindi and also identifies the gender. The accents worked upon are Kashmiri, Manipuri, Bengali and neutral Hindi. The Gaussian mixture model (GMM) approach precludes the need of speech segmentation for training and makes the implementation of the system very simple. When gender dependent GMMs are used, the accent identification score is enhanced and gender is also correctly recognized. The results show that the GMMs lend themselves to accent and gender identification task very well. In this approach spectral features have been incorporated in the form of mel frequency cepstral coefficients (MFCC). The approach has a wide scope of expansion to incorporate other regional accents in a very simple way.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
“Who is this” quiz dialogue system and users' evaluation Latent dirichlet language model for speech recognition Modelling user behaviour in the HIS-POMDP dialogue manager A syntactic language model based on incremental CCG parsing Improving word segmentation for Thai speech translation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1