{"title":"基于卷积神经网络的实时视觉孟加拉语手语检测","authors":"Riyad Bin Rafiq, S. Hakim, Thasina Tabashum","doi":"10.1109/ICACC-202152719.2021.9708141","DOIUrl":null,"url":null,"abstract":"Sign Language is an essential means of communication for people with speech and hearing impairment. In spite of this, there are no effective tools to assist the social interaction between Bangla sign language speakers and non-sign language speakers. Our main objective is to implement an automated translation system that is capable of translating Bangla sign language to Bangla text using common computing environments such as a computer and a generic webcam. The dataset has been created for this project with 1500 images for 10 signs. A seven-layered custom sequential CNN model has been trained and validated with the processed dataset. For real-time detection, we have extracted the region of interest and then detected the specified sign. The system runs in real-time and can provide output from a video feed with a time delay of 120.6 ms. Our system has been tested for an accuracy of 97.0%.","PeriodicalId":198810,"journal":{"name":"2021 International Conference on Advances in Computing and Communications (ICACC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Real-time Vision-based Bangla Sign Language Detection using Convolutional Neural Network\",\"authors\":\"Riyad Bin Rafiq, S. Hakim, Thasina Tabashum\",\"doi\":\"10.1109/ICACC-202152719.2021.9708141\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sign Language is an essential means of communication for people with speech and hearing impairment. In spite of this, there are no effective tools to assist the social interaction between Bangla sign language speakers and non-sign language speakers. Our main objective is to implement an automated translation system that is capable of translating Bangla sign language to Bangla text using common computing environments such as a computer and a generic webcam. The dataset has been created for this project with 1500 images for 10 signs. A seven-layered custom sequential CNN model has been trained and validated with the processed dataset. For real-time detection, we have extracted the region of interest and then detected the specified sign. The system runs in real-time and can provide output from a video feed with a time delay of 120.6 ms. Our system has been tested for an accuracy of 97.0%.\",\"PeriodicalId\":198810,\"journal\":{\"name\":\"2021 International Conference on Advances in Computing and Communications (ICACC)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Advances in Computing and Communications (ICACC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACC-202152719.2021.9708141\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Advances in Computing and Communications (ICACC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACC-202152719.2021.9708141","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Real-time Vision-based Bangla Sign Language Detection using Convolutional Neural Network
Sign Language is an essential means of communication for people with speech and hearing impairment. In spite of this, there are no effective tools to assist the social interaction between Bangla sign language speakers and non-sign language speakers. Our main objective is to implement an automated translation system that is capable of translating Bangla sign language to Bangla text using common computing environments such as a computer and a generic webcam. The dataset has been created for this project with 1500 images for 10 signs. A seven-layered custom sequential CNN model has been trained and validated with the processed dataset. For real-time detection, we have extracted the region of interest and then detected the specified sign. The system runs in real-time and can provide output from a video feed with a time delay of 120.6 ms. Our system has been tested for an accuracy of 97.0%.