基于机器学习的韩国职业棒球联赛观众发展预测模型

Jung-Hwan Cho, Boo-Gil Seok
{"title":"基于机器学习的韩国职业棒球联赛观众发展预测模型","authors":"Jung-Hwan Cho, Boo-Gil Seok","doi":"10.35159/kjss.2023.10.32.5.547","DOIUrl":null,"url":null,"abstract":"[Purpose] The purpose of this study is to identify the main factors related to the prediction of the number of spectators in Korean professional baseball by using machine learning. [Methods] For the purpose of the study, the daily numbers of spectators for professional baseball from 2017 to 2019 were collected. External factors such as the weather and holidays on the day of the match and the internal situation of the match, such as the away team factor, were input as observation variables. The collected data was analyzed with Python ver 3.6, and the predictive power was cross-validated using three machine learning models: Lasso regression, random forest, and XGboost. [Results] As a result of the analysis, the XGboost model showed the highest predictive power and showed 58.4% accuracy when predicting the number of spectators for the entire KBO league. The most frequently used factor in the entire league was the ‘Date’ factor, and as a single-factor, holidays were the most frequently used in prediction. As for the factors for predicting the total number of spectators by team, the ‘Away team’ factor and the ‘Date’ factor were most frequently used. [Conclusions] Based on the results of this study, it is decided that teams and league will be able to suggest various marketing strategies if the number of spectators is predicted considering the game performance, opponent team, and weather.","PeriodicalId":497986,"journal":{"name":"The Korean Society of Sports Science","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Development prediction model of Korea Professional Baseball league spectator using machine learning\",\"authors\":\"Jung-Hwan Cho, Boo-Gil Seok\",\"doi\":\"10.35159/kjss.2023.10.32.5.547\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"[Purpose] The purpose of this study is to identify the main factors related to the prediction of the number of spectators in Korean professional baseball by using machine learning. [Methods] For the purpose of the study, the daily numbers of spectators for professional baseball from 2017 to 2019 were collected. External factors such as the weather and holidays on the day of the match and the internal situation of the match, such as the away team factor, were input as observation variables. The collected data was analyzed with Python ver 3.6, and the predictive power was cross-validated using three machine learning models: Lasso regression, random forest, and XGboost. [Results] As a result of the analysis, the XGboost model showed the highest predictive power and showed 58.4% accuracy when predicting the number of spectators for the entire KBO league. The most frequently used factor in the entire league was the ‘Date’ factor, and as a single-factor, holidays were the most frequently used in prediction. As for the factors for predicting the total number of spectators by team, the ‘Away team’ factor and the ‘Date’ factor were most frequently used. [Conclusions] Based on the results of this study, it is decided that teams and league will be able to suggest various marketing strategies if the number of spectators is predicted considering the game performance, opponent team, and weather.\",\"PeriodicalId\":497986,\"journal\":{\"name\":\"The Korean Society of Sports Science\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Korean Society of Sports Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.35159/kjss.2023.10.32.5.547\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Korean Society of Sports Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35159/kjss.2023.10.32.5.547","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

【目的】本研究的目的是利用机器学习识别与韩国职业棒球观众人数预测相关的主要因素。【方法】为研究目的,收集2017 - 2019年职业棒球比赛的每日观众人数。输入比赛当天的天气和节假日等外部因素,以及比赛的内部情况,如客场球队因素,作为观察变量。使用Python ver 3.6对收集到的数据进行分析,并使用Lasso回归、随机森林和XGboost三种机器学习模型交叉验证预测能力。[结果]分析结果显示,XGboost模型在预测整个KBO联赛的观众人数时具有最高的预测能力,准确率达到58.4%。整个联盟中最常用的因素是“日期”因素,作为一个单一因素,假期是预测中最常用的因素。至于预测球队总观众人数的因素,“客场球队”因素和“日期”因素是最常用的。[结论]基于本研究的结果,决定球队和联赛将能够提出各种营销策略,如果预测观众的数量,考虑比赛成绩,对手球队和天气。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
The Development prediction model of Korea Professional Baseball league spectator using machine learning
[Purpose] The purpose of this study is to identify the main factors related to the prediction of the number of spectators in Korean professional baseball by using machine learning. [Methods] For the purpose of the study, the daily numbers of spectators for professional baseball from 2017 to 2019 were collected. External factors such as the weather and holidays on the day of the match and the internal situation of the match, such as the away team factor, were input as observation variables. The collected data was analyzed with Python ver 3.6, and the predictive power was cross-validated using three machine learning models: Lasso regression, random forest, and XGboost. [Results] As a result of the analysis, the XGboost model showed the highest predictive power and showed 58.4% accuracy when predicting the number of spectators for the entire KBO league. The most frequently used factor in the entire league was the ‘Date’ factor, and as a single-factor, holidays were the most frequently used in prediction. As for the factors for predicting the total number of spectators by team, the ‘Away team’ factor and the ‘Date’ factor were most frequently used. [Conclusions] Based on the results of this study, it is decided that teams and league will be able to suggest various marketing strategies if the number of spectators is predicted considering the game performance, opponent team, and weather.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Attractive Factors in the CrossFit Participation Process and Sustainable Participation Plans The Relationship between Golf Course Caddy"s Social Servicescape, Golf Course Satisfaction, and Golf Course Loyalty A Study on the Revitalize Creating Shared Value(CSV) through Productivity Redefinition in the Value Chain: Focusing on Sports Public Interest Corporations Effects of 8 weeks of pilates exercise on chronic low back pain, lower muscle strength and static・dynamic balance in female healthcare workers Effects of swimming exercise on health fitness and immune-related inflammatory changes in sedentary middle-aged women
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1