Survey on Visual Speech Recognition using Deep Learning Techniques
Ritika Chand, Pushpit Jain, Abhinav Mathur, Shiwansh Raj, Prashasti Kanikar
2023 International Conference on Communication System, Computing and IT Applications (CSCITA), published 2023-03-31
DOI: 10.1109/CSCITA55725.2023.10104811 (https://doi.org/10.1109/CSCITA55725.2023.10104811)
Lip reading began as an aid for deaf people and has gradually become a service that the digital entertainment industry has started to use. With the recent rise of AI, automated technologies have reached lip reading as well, and various algorithms based on neural networks have been devised. We observe that many of the reviewed algorithms explore different techniques at each stage, from detecting lip features to the text generation process itself. Given the amount of research in the field, better and more optimized lip detection can be expected. This study focuses on how machine learning and deep learning technologies are used, and thus gives a broad view of the integration of AI into the visual lip reading domain.