Neural Network Image Segmentation for Sign Language Interpretation
Authors: Vegim Shala, E. Bytyçi
Journal: International Journal of Emerging Technology and Advanced Engineering
Published: 2023-03-01
DOI: 10.46338/ijetae0323_11 (https://doi.org/10.46338/ijetae0323_11)
Abstract
Using neural networks to recognize and classify objects in images is an active area of computer science. The object to be classified is highly likely to occupy significantly fewer pixels in the image's representation matrix than the background or other elements of the image. A natural first step is therefore to segment that object from the portions of the image that are not essential for classification. This is the objective of the study: we use segmentation to isolate the components essential to classification and assess how much the final classification outcome can improve. A Mask Region-based Convolutional Neural Network (Mask R-CNN) was used for segmentation, and a Convolutional Neural Network (CNN) was used for classification. The findings show a notable improvement in classification in the case of sign language. Further advances in image segmentation models imply more accurate results for classification models once the two are combined.
Keywords: Neural network, Image segmentation, Sign language, Classification, Mask Regional Convolutional Neural Network.
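The segment-then-classify idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the binary mask is hard-coded here, whereas in the study it would be predicted by the Mask R-CNN segmentation model, and the function names (`apply_mask`, `bounding_box`, `crop`, `isolate_object`) are hypothetical helpers introduced only for this example. The sketch shows how background pixels could be suppressed and the object cropped before the result is passed to a classifier.

```python
from typing import List, Tuple

Image = List[List[int]]  # grayscale image as rows of pixel intensities
Mask = List[List[int]]   # binary mask: 1 = object pixel, 0 = background
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), inclusive


def apply_mask(image: Image, mask: Mask, fill: int = 0) -> Image:
    """Keep pixels where the mask is 1; replace all others with `fill`."""
    return [
        [px if m else fill for px, m in zip(img_row, mask_row)]
        for img_row, mask_row in zip(image, mask)
    ]


def bounding_box(mask: Mask) -> Box:
    """Return the tightest box that contains every object pixel."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    if not rows:
        raise ValueError("mask contains no object pixels")
    cols = [c for c in range(len(mask[0])) if any(row[c] for row in mask)]
    return rows[0], cols[0], rows[-1], cols[-1]


def crop(image: Image, box: Box) -> Image:
    """Cut the image down to the given inclusive bounding box."""
    top, left, bottom, right = box
    return [row[left:right + 1] for row in image[top:bottom + 1]]


def isolate_object(image: Image, mask: Mask) -> Image:
    """Suppress the background, then crop to the object's bounding box."""
    return crop(apply_mask(image, mask), bounding_box(mask))


# Toy 4x4 image: the "object" is the small region of values 5, 6, 7;
# every other pixel (value 9) is background.
image = [
    [9, 9, 9, 9],
    [9, 5, 6, 9],
    [9, 7, 9, 9],
    [9, 9, 9, 9],
]
# Assumed mask; in the study this would come from the segmentation model.
mask = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]

isolated = isolate_object(image, mask)
print(isolated)  # [[5, 6], [7, 0]]
```

The cropped, background-free array is what a downstream classifier would receive in place of the full image, so the object's pixels are no longer outnumbered by background pixels.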