{"title":"基于I-CNN的自定义服务语音分类算法","authors":"Xuefeng Huang, Rongheng Lin","doi":"10.1109/SERVICES.2018.00030","DOIUrl":null,"url":null,"abstract":"Speech classification methods mainly focus on the content of the voice segment. To help better underestand the information in a segmented voice, the contents of other segments in the same paragraph should also be paid attention to. In our custom service speech classification problem, we are facing a problem of classification a series of voice segments in a conversation separately into category \"custom\" or \"custom service\". Sometimes the voice of both parties in the same conversation can be both sound like a \"custom service\" or both sound like \"custom\". In order to make the right prediction, the model needs to know not only the content of the voice segment that it's classifying, but both parties' voice in a conversation, the extra information can help the model to determine who is \"more likely\" to be a custom service in a conversation. We propose a method called I-CNN, which combines the info-feed layer with CNN. The Info-feed layer allows the CNN to use information from other samples in the same batch, which is helpful in improving the model's performance in our custom service speech classification problem.","PeriodicalId":130225,"journal":{"name":"2018 IEEE World Congress on Services (SERVICES)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An I-CNN Based Speech Classification Algorithm for Custom Service\",\"authors\":\"Xuefeng Huang, Rongheng Lin\",\"doi\":\"10.1109/SERVICES.2018.00030\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech classification methods mainly focus on the content of the voice segment. To help better underestand the information in a segmented voice, the contents of other segments in the same paragraph should also be paid attention to. In our custom service speech classification problem, we are facing a problem of classification a series of voice segments in a conversation separately into category \\\"custom\\\" or \\\"custom service\\\". Sometimes the voice of both parties in the same conversation can be both sound like a \\\"custom service\\\" or both sound like \\\"custom\\\". In order to make the right prediction, the model needs to know not only the content of the voice segment that it's classifying, but both parties' voice in a conversation, the extra information can help the model to determine who is \\\"more likely\\\" to be a custom service in a conversation. We propose a method called I-CNN, which combines the info-feed layer with CNN. The Info-feed layer allows the CNN to use information from other samples in the same batch, which is helpful in improving the model's performance in our custom service speech classification problem.\",\"PeriodicalId\":130225,\"journal\":{\"name\":\"2018 IEEE World Congress on Services (SERVICES)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE World Congress on Services (SERVICES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SERVICES.2018.00030\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE World Congress on Services (SERVICES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SERVICES.2018.00030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An I-CNN Based Speech Classification Algorithm for Custom Service
Speech classification methods mainly focus on the content of the voice segment. To help better underestand the information in a segmented voice, the contents of other segments in the same paragraph should also be paid attention to. In our custom service speech classification problem, we are facing a problem of classification a series of voice segments in a conversation separately into category "custom" or "custom service". Sometimes the voice of both parties in the same conversation can be both sound like a "custom service" or both sound like "custom". In order to make the right prediction, the model needs to know not only the content of the voice segment that it's classifying, but both parties' voice in a conversation, the extra information can help the model to determine who is "more likely" to be a custom service in a conversation. We propose a method called I-CNN, which combines the info-feed layer with CNN. The Info-feed layer allows the CNN to use information from other samples in the same batch, which is helpful in improving the model's performance in our custom service speech classification problem.