Po-Yung Chou, Cheng-Hung Lin, W. Kao, Yi-Fang Lee, Chen-Chien James Hsu
{"title":"一种用于篮球犯规分类的时间分数网络","authors":"Po-Yung Chou, Cheng-Hung Lin, W. Kao, Yi-Fang Lee, Chen-Chien James Hsu","doi":"10.1109/ICCE-Berlin56473.2022.9937110","DOIUrl":null,"url":null,"abstract":"Deep learning has developed rapidly in recent years, not only in image recognition, but now also in action recognition. The research on action recognition started with 3D-CNN, which has achieved good results on many tasks. But most action recognition networks have room for improvement in fine-grained action recognition. The reason is that there is only a slight difference between categories in the fine-grained classification task. e.g. basketball fouls only occur in a few frames and a small region. This situation may lead to some errors with 3D-CNN methods because these models tend to merge all temporal features. To identify these fouls, it is necessary to strengthen the detection of small periods. In this paper, we propose a temporal score network suitable for existing networks, including 3D-Resnet50, 3D-wide-Resnet50, $\\mathbf{R}\\mathbf{(}\\mathbf{2}\\mathbf{+}\\mathbf{1}\\mathbf{)}$ D-Resnet50, and I3D-50 to improve the accuracy of fine-grained action recognition. The experimental results show that the accuracy of various models is improved by 3.85% to 6% after adding the proposed network. Since there is no relevant public dataset, we collect the data ourselves to create a basketball foul dataset.","PeriodicalId":138931,"journal":{"name":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Temporal Scores Network for Basketball Foul Classification\",\"authors\":\"Po-Yung Chou, Cheng-Hung Lin, W. Kao, Yi-Fang Lee, Chen-Chien James Hsu\",\"doi\":\"10.1109/ICCE-Berlin56473.2022.9937110\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning has developed rapidly in recent years, not only in image recognition, but now also in action recognition. The research on action recognition started with 3D-CNN, which has achieved good results on many tasks. But most action recognition networks have room for improvement in fine-grained action recognition. The reason is that there is only a slight difference between categories in the fine-grained classification task. e.g. basketball fouls only occur in a few frames and a small region. This situation may lead to some errors with 3D-CNN methods because these models tend to merge all temporal features. To identify these fouls, it is necessary to strengthen the detection of small periods. In this paper, we propose a temporal score network suitable for existing networks, including 3D-Resnet50, 3D-wide-Resnet50, $\\\\mathbf{R}\\\\mathbf{(}\\\\mathbf{2}\\\\mathbf{+}\\\\mathbf{1}\\\\mathbf{)}$ D-Resnet50, and I3D-50 to improve the accuracy of fine-grained action recognition. The experimental results show that the accuracy of various models is improved by 3.85% to 6% after adding the proposed network. Since there is no relevant public dataset, we collect the data ourselves to create a basketball foul dataset.\",\"PeriodicalId\":138931,\"journal\":{\"name\":\"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCE-Berlin56473.2022.9937110\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE-Berlin56473.2022.9937110","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Temporal Scores Network for Basketball Foul Classification
Deep learning has developed rapidly in recent years, not only in image recognition, but now also in action recognition. The research on action recognition started with 3D-CNN, which has achieved good results on many tasks. But most action recognition networks have room for improvement in fine-grained action recognition. The reason is that there is only a slight difference between categories in the fine-grained classification task. e.g. basketball fouls only occur in a few frames and a small region. This situation may lead to some errors with 3D-CNN methods because these models tend to merge all temporal features. To identify these fouls, it is necessary to strengthen the detection of small periods. In this paper, we propose a temporal score network suitable for existing networks, including 3D-Resnet50, 3D-wide-Resnet50, $\mathbf{R}\mathbf{(}\mathbf{2}\mathbf{+}\mathbf{1}\mathbf{)}$ D-Resnet50, and I3D-50 to improve the accuracy of fine-grained action recognition. The experimental results show that the accuracy of various models is improved by 3.85% to 6% after adding the proposed network. Since there is no relevant public dataset, we collect the data ourselves to create a basketball foul dataset.