Hao Xian Chung, Nazia Hameed, Jérémie Clos, M. Hasan
{"title":"面向实时手语翻译的集成CNN模型框架","authors":"Hao Xian Chung, Nazia Hameed, Jérémie Clos, M. Hasan","doi":"10.1109/SKIMA57145.2022.10029673","DOIUrl":null,"url":null,"abstract":"American sign language (ASL) is a natural language to minimize communication barrier for the people suffering from hearing and speech impairment. However, sign languages are not a common form of communication in society, and it is lacking a sign language translator to ease the communication. Hence, this research proposes an improved visual-based communication framework to translate ASL alphabets in real-time. The proposed framework consists of different steps i.e., image pre-processing, hand segmentation with U-Net, data-augmentation techniques, and classification of American sign language. A n ensemble classification model i s proposed where V GG19, R esNet-50 and MobileNet are used as base models, and 2 fully connected layers are used as meta model. The proposed approach achieved 99.86% accuracy and performed better when compared with existing literature.","PeriodicalId":277436,"journal":{"name":"2022 14th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)","volume":"106 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Framework of Ensemble CNN Models for Real-Time Sign Language Translation\",\"authors\":\"Hao Xian Chung, Nazia Hameed, Jérémie Clos, M. Hasan\",\"doi\":\"10.1109/SKIMA57145.2022.10029673\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"American sign language (ASL) is a natural language to minimize communication barrier for the people suffering from hearing and speech impairment. However, sign languages are not a common form of communication in society, and it is lacking a sign language translator to ease the communication. Hence, this research proposes an improved visual-based communication framework to translate ASL alphabets in real-time. The proposed framework consists of different steps i.e., image pre-processing, hand segmentation with U-Net, data-augmentation techniques, and classification of American sign language. A n ensemble classification model i s proposed where V GG19, R esNet-50 and MobileNet are used as base models, and 2 fully connected layers are used as meta model. The proposed approach achieved 99.86% accuracy and performed better when compared with existing literature.\",\"PeriodicalId\":277436,\"journal\":{\"name\":\"2022 14th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)\",\"volume\":\"106 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 14th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SKIMA57145.2022.10029673\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SKIMA57145.2022.10029673","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Framework of Ensemble CNN Models for Real-Time Sign Language Translation
American sign language (ASL) is a natural language to minimize communication barrier for the people suffering from hearing and speech impairment. However, sign languages are not a common form of communication in society, and it is lacking a sign language translator to ease the communication. Hence, this research proposes an improved visual-based communication framework to translate ASL alphabets in real-time. The proposed framework consists of different steps i.e., image pre-processing, hand segmentation with U-Net, data-augmentation techniques, and classification of American sign language. A n ensemble classification model i s proposed where V GG19, R esNet-50 and MobileNet are used as base models, and 2 fully connected layers are used as meta model. The proposed approach achieved 99.86% accuracy and performed better when compared with existing literature.