S. Akash, Debobrata Chakraborty, Mehedi Mahmud Kaushik, Barsan Saha Babu, Md. Saniat Rahman Zishan
Published in: 2023 3rd International Conference on Robotics, Electrical and Signal Processing Techniques (ICREST). Publication date: 2023-01-07. DOI: 10.1109/ICREST57604.2023.10070072 (https://doi.org/10.1109/ICREST57604.2023.10070072)
Action Recognition Based Real-time Bangla Sign Language Detection and Sentence Formation
Sign language is a system of communication based on visual motions and signs, used by people who are deaf or unable to speak because of a hearing or speech impairment. This paper proposes a real-time Bangla Sign Language (BdSL) detection system that generates Bangla sentences from a sequence of images or a video feed, which can help people who are not familiar with sign language. The BlazePose algorithm was used to identify the sequence of body postures in each sign. The detected posture data were then saved as NumPy files. A Long Short-Term Memory (LSTM) network was trained on these NumPy files, since this type of network can make predictions from sequential data. After 85 epochs of training, the model's training accuracy was 93.85% and its validation accuracy was 87.14%, which indicates that the model's ability to recognize BdSL sentences in real time is adequate.
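The pipeline described in the abstract (per-frame pose keypoints, stored as NumPy arrays, classified by an LSTM, then assembled into a sentence) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The abstract does not give the feature layout, sequence length, network size, or sentence-formation rule, so the following are all assumptions made for illustration: 33 BlazePose landmarks with 4 values each (132 features per frame), a hypothetical 30-frame clip length, a single LSTM layer with randomly initialized (untrained) weights, and a simple rule that keeps confident, non-repeated predictions. The function names and placeholder labels are likewise hypothetical.

```python
import numpy as np

NUM_LANDMARKS = 33        # BlazePose body landmarks per frame
VALUES_PER_LANDMARK = 4   # x, y, z, visibility (assumed feature layout)
FEATURES = NUM_LANDMARKS * VALUES_PER_LANDMARK  # 132 values per frame
SEQ_LEN = 30              # hypothetical number of frames per sign clip


def frames_to_sequence(frames, seq_len=SEQ_LEN):
    """Stack per-frame keypoint vectors into a (seq_len, FEATURES) array,
    truncating long clips and zero-padding short ones."""
    seq = np.zeros((seq_len, FEATURES), dtype=np.float32)
    n = min(len(frames), seq_len)
    if n:
        seq[:n] = np.asarray(frames[:n], dtype=np.float32)
    return seq


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def lstm_last_hidden(seq, W, U, b):
    """Run one LSTM layer over seq and return the final hidden state.
    W: (FEATURES, 4H), U: (H, 4H), b: (4H,); gates ordered i, f, g, o."""
    hidden = U.shape[0]
    h = np.zeros(hidden)
    c = np.zeros(hidden)
    for x_t in seq:
        z = x_t @ W + h @ U + b
        i, f, g, o = np.split(z, 4)
        c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)  # update cell state
        h = sigmoid(o) * np.tanh(c)                   # update hidden state
    return h


def init_params(rng, hidden, num_classes):
    """Random, untrained weights; a real system would learn these."""
    return {
        "W": rng.normal(0.0, 0.1, (FEATURES, 4 * hidden)),
        "U": rng.normal(0.0, 0.1, (hidden, 4 * hidden)),
        "b": np.zeros(4 * hidden),
        "W_out": rng.normal(0.0, 0.1, (hidden, num_classes)),
        "b_out": np.zeros(num_classes),
    }


def classify(seq, params):
    """Return softmax class probabilities for one keypoint sequence."""
    h = lstm_last_hidden(seq, params["W"], params["U"], params["b"])
    logits = h @ params["W_out"] + params["b_out"]
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()


def form_sentence(predictions, labels, threshold=0.8):
    """Assumed rule: keep confident predictions, drop immediate repeats."""
    words = []
    for probs in predictions:
        k = int(np.argmax(probs))
        if probs[k] >= threshold and (not words or words[-1] != labels[k]):
            words.append(labels[k])
    return " ".join(words)
```

In use, each clip's frames would pass through `frames_to_sequence`, the resulting array could be written with `np.save` and read back with `np.load`, and `classify` would be applied to each loaded sequence before `form_sentence` joins the outputs. Because the weights here are random, the probabilities are meaningless until the network is trained; the sketch only shows the data shapes and the flow between stages.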