Speaker distinction using vowel polygons: Experimental study
M. Stanek, M. Sigmund
2015 25th International Conference Radioelektronika (RADIOELEKTRONIKA), 2015-04-21
DOI: 10.1109/RADIOELEK.2015.7128973 (https://doi.org/10.1109/RADIOELEK.2015.7128973)
Citations: 1
Abstract
This paper presents an experimental study aimed at finding the polygon formed by Czech vowels that is most suitable for speaker recognition. For a speaker database of 37 native Czech speakers, inter-speaker variation was observed through the length of the vector between two different centers of gravity, for 16 different vowel polygons in 10 formant planes. Suitability is assessed by the dispersion coefficient of the resulting distance vectors and by their minimal value. The EIOU25 tetragon and the IOU34 vowel triangle were identified as the most suitable vowel polygons, achieving the best results on both criteria. The dispersion of individual centers of gravity is considerably more suitable for speaker recognition when the centers are generated from higher formants. The presented observations can further be used for recognizing a speaker's current emotional state or for alcohol/drug detection.
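
The quantities named in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not confirm: that a center of gravity is the mean of the polygon's vertices, that the "two different centers of gravity" belong to two different speakers, and that the dispersion coefficient is the standard deviation divided by the mean. The function names, speaker labels, and formant values are hypothetical and are not data from the paper.

```python
import math
import statistics
from itertools import combinations


def center_of_gravity(vertices):
    """Mean of the polygon vertices (assumed definition, not confirmed by the paper)."""
    n = len(vertices)
    return (sum(v[0] for v in vertices) / n, sum(v[1] for v in vertices) / n)


def pairwise_distances(centers):
    """Length of the vector between every pair of centers of gravity."""
    return [math.dist(a, b) for a, b in combinations(centers, 2)]


def dispersion_coefficient(values):
    """Standard deviation over mean (assumed definition of the dispersion coefficient)."""
    return statistics.pstdev(values) / statistics.fmean(values)


# Made-up (F3, F4) values in Hz for the vowels I, O, U of three hypothetical speakers.
speakers = {
    "spk_a": [(2900.0, 3600.0), (2500.0, 3400.0), (2400.0, 3300.0)],
    "spk_b": [(3000.0, 3900.0), (2600.0, 3500.0), (2450.0, 3450.0)],
    "spk_c": [(2750.0, 3500.0), (2400.0, 3250.0), (2300.0, 3200.0)],
}

centers = [center_of_gravity(v) for v in speakers.values()]
distances = pairwise_distances(centers)

print("minimal distance:", min(distances))
print("dispersion coefficient:", dispersion_coefficient(distances))
```

Under this reading, a polygon and formant plane would be considered better suited when the minimal distance between speakers is large, so that no two speakers nearly coincide; how the paper weighs the dispersion coefficient against that minimum is not stated in the abstract.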