{"title":"基于单位特征的大规模语音语料库决策树剪枝","authors":"Zhe Zhang, Lixing Huang, J. Tao","doi":"10.1109/ICOSP.2008.4697231","DOIUrl":null,"url":null,"abstract":"In this paper, we proposed and realized a corpus pruning method using decision tree. In the process of clustering, instead of conventional method, we measure the distance of pitch contours by feature vector composed by slope mean. The subjective and objective evaluation results showed that synthetic outputs based on corpus pruned through our method are close to outputs based on no-pruning corpus and are superior to conventional method with the same storage size.","PeriodicalId":445699,"journal":{"name":"2008 9th International Conference on Signal Processing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unit feature based pruning of large-scale speech corpus using decision tree\",\"authors\":\"Zhe Zhang, Lixing Huang, J. Tao\",\"doi\":\"10.1109/ICOSP.2008.4697231\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we proposed and realized a corpus pruning method using decision tree. In the process of clustering, instead of conventional method, we measure the distance of pitch contours by feature vector composed by slope mean. The subjective and objective evaluation results showed that synthetic outputs based on corpus pruned through our method are close to outputs based on no-pruning corpus and are superior to conventional method with the same storage size.\",\"PeriodicalId\":445699,\"journal\":{\"name\":\"2008 9th International Conference on Signal Processing\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 9th International Conference on Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOSP.2008.4697231\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 9th International Conference on Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSP.2008.4697231","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Unit feature based pruning of large-scale speech corpus using decision tree
In this paper, we proposed and realized a corpus pruning method using decision tree. In the process of clustering, instead of conventional method, we measure the distance of pitch contours by feature vector composed by slope mean. The subjective and objective evaluation results showed that synthetic outputs based on corpus pruned through our method are close to outputs based on no-pruning corpus and are superior to conventional method with the same storage size.