Deep Learning for Semantic Segmentation of Football Match Image

Yutian Wu, Wuqi Zhao, Chen-Chun Huang, Yaming Xi, Qing Li, Heng Wang
{"title":"Deep Learning for Semantic Segmentation of Football Match Image","authors":"Yutian Wu, Wuqi Zhao, Chen-Chun Huang, Yaming Xi, Qing Li, Heng Wang","doi":"10.1109/IDITR57726.2023.10145987","DOIUrl":null,"url":null,"abstract":"As one of the most popular sports, football has been a subject to growth and advancements in technology. The combination of football and artificial intelligence is expected to be used for intelligent football analysis. Image semantic segmentation is an important basis for image analysis and understanding. This paper proposes a deep learning-based image segmentation model for pixel-level classification of the video recordings frames of football matches. Every pixel of football video frame is classified into one of the 10 classes, e.g., players, ball, goal bar and several background scenes. In this paper, we first test a variety of CNN architectures and pre-trained models and select the MobileNet-UNet architecture as our baseline. We note the severe unbalanced data distribution in football scene segmentation. To solve this problem, the weighted multi-class cross-entropy loss is adopted in training of MobileNet-UNet to redistribute the weights of classification loss, focusing on smaller foreground object classes and improving segmentation accuracy. We also propose to use image transformations and a random mixture sampling technique for training data augmentation to reduce model overfitting. The model is trained and validated in the well-annotated Football Semantic Segmentation Open Dataset. The proposed best model achieves 0.96 frequency weighted IoU and 0.90 mean IoU segmentation accuracy on validation set.","PeriodicalId":272880,"journal":{"name":"2023 2nd International Conference on Innovations and Development of Information Technologies and Robotics (IDITR)","volume":"593 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Innovations and Development of Information Technologies and Robotics (IDITR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IDITR57726.2023.10145987","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

As one of the most popular sports, football has been a subject to growth and advancements in technology. The combination of football and artificial intelligence is expected to be used for intelligent football analysis. Image semantic segmentation is an important basis for image analysis and understanding. This paper proposes a deep learning-based image segmentation model for pixel-level classification of the video recordings frames of football matches. Every pixel of football video frame is classified into one of the 10 classes, e.g., players, ball, goal bar and several background scenes. In this paper, we first test a variety of CNN architectures and pre-trained models and select the MobileNet-UNet architecture as our baseline. We note the severe unbalanced data distribution in football scene segmentation. To solve this problem, the weighted multi-class cross-entropy loss is adopted in training of MobileNet-UNet to redistribute the weights of classification loss, focusing on smaller foreground object classes and improving segmentation accuracy. We also propose to use image transformations and a random mixture sampling technique for training data augmentation to reduce model overfitting. The model is trained and validated in the well-annotated Football Semantic Segmentation Open Dataset. The proposed best model achieves 0.96 frequency weighted IoU and 0.90 mean IoU segmentation accuracy on validation set.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度学习的足球比赛图像语义分割
作为最受欢迎的运动之一,足球一直是技术发展和进步的主题。足球与人工智能的结合有望用于智能足球分析。图像语义分割是图像分析和理解的重要基础。提出了一种基于深度学习的图像分割模型,用于足球比赛录像帧的像素级分类。将足球视频帧的每个像素分成10个类中的一个,如球员、球、门柱和几个背景场景。在本文中,我们首先测试了各种CNN架构和预训练模型,并选择MobileNet-UNet架构作为我们的基线。我们注意到足球场景分割中数据分布严重不平衡。为了解决这一问题,在MobileNet-UNet的训练中采用加权的多类交叉熵损失,重新分配分类损失的权重,关注较小的前景目标类,提高分割精度。我们还建议使用图像变换和随机混合采样技术进行训练数据增强,以减少模型过拟合。该模型在注释良好的足球语义分割开放数据集上进行了训练和验证。该模型在验证集上的频率加权IoU分割精度为0.96,平均IoU分割精度为0.90。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Research on Network Privacy Information Protection Technology and Strategy Deep Learning for Semantic Segmentation of Football Match Image Design of a Constant Flow Control System for Cut Tobacco Feeder A Comparative Study of Cross-Sentence Features for Named Entity Recognition Analysis of New Distribution Network Planning Using Artificial Intelligence Semantic Recognition
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1