{"title":"Algorithm for Counting the Number of Students in Class Based on Mask-RCNN Optimization","authors":"Jinbin Li, Jia-kun Xie","doi":"10.1109/ISET55194.2022.00059","DOIUrl":null,"url":null,"abstract":"In order to solve the problems of low efficiency, small scale, and data reliability in the traditional manual counting method, a method using mask region convolutional neural network (Mask R-CNN) to automatically calculate attendance was proposed. In order to extract deeper image information, the feature extraction network is designed as ResNet101, and feature map fusion is performed on multi-level feature maps. In order to make up for the lack of recognition of the objects whose body parts are occluded, the Mask R-CNN algorithm is used for the second recognition. The experimental results on the self-built classroom monitoring screenshot data set show that compared with the method of directly using the Mask R-CNN algorithm for recognition, the secondary recognition method can identify more targets and improve the accuracy of identifying the number of people.","PeriodicalId":365516,"journal":{"name":"2022 International Symposium on Educational Technology (ISET)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Symposium on Educational Technology (ISET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISET55194.2022.00059","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In order to solve the problems of low efficiency, small scale, and data reliability in the traditional manual counting method, a method using mask region convolutional neural network (Mask R-CNN) to automatically calculate attendance was proposed. In order to extract deeper image information, the feature extraction network is designed as ResNet101, and feature map fusion is performed on multi-level feature maps. In order to make up for the lack of recognition of the objects whose body parts are occluded, the Mask R-CNN algorithm is used for the second recognition. The experimental results on the self-built classroom monitoring screenshot data set show that compared with the method of directly using the Mask R-CNN algorithm for recognition, the secondary recognition method can identify more targets and improve the accuracy of identifying the number of people.