{"title":"Unit feature based pruning of large-scale speech corpus using decision tree","authors":"Zhe Zhang, Lixing Huang, J. Tao","doi":"10.1109/ICOSP.2008.4697231","DOIUrl":null,"url":null,"abstract":"In this paper, we proposed and realized a corpus pruning method using decision tree. In the process of clustering, instead of conventional method, we measure the distance of pitch contours by feature vector composed by slope mean. The subjective and objective evaluation results showed that synthetic outputs based on corpus pruned through our method are close to outputs based on no-pruning corpus and are superior to conventional method with the same storage size.","PeriodicalId":445699,"journal":{"name":"2008 9th International Conference on Signal Processing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 9th International Conference on Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSP.2008.4697231","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we proposed and realized a corpus pruning method using decision tree. In the process of clustering, instead of conventional method, we measure the distance of pitch contours by feature vector composed by slope mean. The subjective and objective evaluation results showed that synthetic outputs based on corpus pruned through our method are close to outputs based on no-pruning corpus and are superior to conventional method with the same storage size.