Chocko Valliappa, R. S. Sabeenian, M. Paramasivam, Eldho Paul, K. Manju, R. Pragadeesh
{"title":"基于深度学习方法的单侧声带麻痹患者语音支持系统","authors":"Chocko Valliappa, R. S. Sabeenian, M. Paramasivam, Eldho Paul, K. Manju, R. Pragadeesh","doi":"10.4103/2468-8827.330655","DOIUrl":null,"url":null,"abstract":"Vocal cord paralysis is a common problem faced by individuals, where the vocal cord fails to reverberate to produce sound waves. As a result, they are unable to speak out as they were speaking before. The proposed method is designed for aiding unilateral paralyzed peoples whose vocal cord fails to give the desired reverberations. The proposed system consists of voice-to-text and text-to-voice conversions. The voice of the paralyzed person is artificially reproduced by training a deep neural network with the unaffected voice of the patient. The confidence of the predicted output is improved by introducing voice-to-text conversion block along with the deep neural network. The performance metrics reveals the effectiveness of the proposed algorithm to reproduce natural sound. The similarity index is also high compared to that of other state-of-the-art techniques.","PeriodicalId":0,"journal":{"name":"","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Voice support system using deep learning approaches for unilateral vocal cord paralyzed patients\",\"authors\":\"Chocko Valliappa, R. S. Sabeenian, M. Paramasivam, Eldho Paul, K. Manju, R. Pragadeesh\",\"doi\":\"10.4103/2468-8827.330655\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Vocal cord paralysis is a common problem faced by individuals, where the vocal cord fails to reverberate to produce sound waves. As a result, they are unable to speak out as they were speaking before. The proposed method is designed for aiding unilateral paralyzed peoples whose vocal cord fails to give the desired reverberations. The proposed system consists of voice-to-text and text-to-voice conversions. The voice of the paralyzed person is artificially reproduced by training a deep neural network with the unaffected voice of the patient. The confidence of the predicted output is improved by introducing voice-to-text conversion block along with the deep neural network. The performance metrics reveals the effectiveness of the proposed algorithm to reproduce natural sound. The similarity index is also high compared to that of other state-of-the-art techniques.\",\"PeriodicalId\":0,\"journal\":{\"name\":\"\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4103/2468-8827.330655\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4103/2468-8827.330655","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Voice support system using deep learning approaches for unilateral vocal cord paralyzed patients
Vocal cord paralysis is a common problem faced by individuals, where the vocal cord fails to reverberate to produce sound waves. As a result, they are unable to speak out as they were speaking before. The proposed method is designed for aiding unilateral paralyzed peoples whose vocal cord fails to give the desired reverberations. The proposed system consists of voice-to-text and text-to-voice conversions. The voice of the paralyzed person is artificially reproduced by training a deep neural network with the unaffected voice of the patient. The confidence of the predicted output is improved by introducing voice-to-text conversion block along with the deep neural network. The performance metrics reveals the effectiveness of the proposed algorithm to reproduce natural sound. The similarity index is also high compared to that of other state-of-the-art techniques.