Muhammad Zeeshan Khan, Summra Saleem, Muhammad A. Hassan, Muhammad Usman Ghanni Khan
{"title":"学习深度C3D功能的足球视频事件检测","authors":"Muhammad Zeeshan Khan, Summra Saleem, Muhammad A. Hassan, Muhammad Usman Ghanni Khan","doi":"10.1109/ICET.2018.8603644","DOIUrl":null,"url":null,"abstract":"Soccer video event identification has been an interesting task in research community during past few decades. Numerous machine learning techniques and C2D (Convolution 2-dimensional) have been used for this problem, but C3D has not been implemented for this task. By taking advantage from C3D (Convolution 3-dimensional), to completely exploit spatio-temporal relation, deep convolution network is developed to highlight distinct video events in proposed research work. Initially, we detect soccer video event marks by pixel differencing and edge change ratio. After this semantic features of segmented frames are extracted followed by CNN to map soccer event categories: Corner, Shoot, Goal Attempt, Penalty Kick. Because no effective and suitable dataset is available, we categorized soccer videos into four classes and developed soccer videos dataset for training CNN network. Evaluation results on soccer match clips generated results with high efficiency.","PeriodicalId":443353,"journal":{"name":"2018 14th International Conference on Emerging Technologies (ICET)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":"{\"title\":\"Learning Deep C3D Features For Soccer Video Event Detection\",\"authors\":\"Muhammad Zeeshan Khan, Summra Saleem, Muhammad A. Hassan, Muhammad Usman Ghanni Khan\",\"doi\":\"10.1109/ICET.2018.8603644\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Soccer video event identification has been an interesting task in research community during past few decades. Numerous machine learning techniques and C2D (Convolution 2-dimensional) have been used for this problem, but C3D has not been implemented for this task. By taking advantage from C3D (Convolution 3-dimensional), to completely exploit spatio-temporal relation, deep convolution network is developed to highlight distinct video events in proposed research work. Initially, we detect soccer video event marks by pixel differencing and edge change ratio. After this semantic features of segmented frames are extracted followed by CNN to map soccer event categories: Corner, Shoot, Goal Attempt, Penalty Kick. Because no effective and suitable dataset is available, we categorized soccer videos into four classes and developed soccer videos dataset for training CNN network. Evaluation results on soccer match clips generated results with high efficiency.\",\"PeriodicalId\":443353,\"journal\":{\"name\":\"2018 14th International Conference on Emerging Technologies (ICET)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"16\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 14th International Conference on Emerging Technologies (ICET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICET.2018.8603644\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 14th International Conference on Emerging Technologies (ICET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICET.2018.8603644","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Learning Deep C3D Features For Soccer Video Event Detection
Soccer video event identification has been an interesting task in research community during past few decades. Numerous machine learning techniques and C2D (Convolution 2-dimensional) have been used for this problem, but C3D has not been implemented for this task. By taking advantage from C3D (Convolution 3-dimensional), to completely exploit spatio-temporal relation, deep convolution network is developed to highlight distinct video events in proposed research work. Initially, we detect soccer video event marks by pixel differencing and edge change ratio. After this semantic features of segmented frames are extracted followed by CNN to map soccer event categories: Corner, Shoot, Goal Attempt, Penalty Kick. Because no effective and suitable dataset is available, we categorized soccer videos into four classes and developed soccer videos dataset for training CNN network. Evaluation results on soccer match clips generated results with high efficiency.