机器学习在COVID-19病例预测中的应用研究

Maleerat Maliyaem, Nguyen Minh Tuan, Demontray Lockhart, S. Muenthong
{"title":"机器学习在COVID-19病例预测中的应用研究","authors":"Maleerat Maliyaem, Nguyen Minh Tuan, Demontray Lockhart, S. Muenthong","doi":"10.37256/ccds.3220221488","DOIUrl":null,"url":null,"abstract":"With an unprecedented challenge to combat COVID-19, the prediction of confirmed cases is very important to ensure medical aid and healthy living conditions. In order to predict confirmed cases, the current study uses a dataset prepared by the White House Office of Science and Technology Policy which brought together companies and research to address questions concerning COVID-19. The importance of this was to identify factors that seem to affect the transmission rate of COVID-19. The focus of the current research, however, is to predict global cases of COVID-19. There have been many papers written about the prediction of confirmed cases and fatalities, but they failed to show promising results. Our research applies machine learning for predicting fatalities in the world using the COVID-19 Forecasting dataset from Kaggle. After trying several algorithms, our findings reveal that Logistic Regression, Decision Tree, KNeighbors, GaussianNB, and Random Forest algorithms provide the best predictions. Thus, the results show Random Forest as having the highest accuracy followed by Logistic Regression and Decision Tree. The results are promising opening up the door for further research.","PeriodicalId":158315,"journal":{"name":"Cloud Computing and Data Science","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Study of Using Machine Learning in Predicting COVID-19 Cases\",\"authors\":\"Maleerat Maliyaem, Nguyen Minh Tuan, Demontray Lockhart, S. Muenthong\",\"doi\":\"10.37256/ccds.3220221488\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With an unprecedented challenge to combat COVID-19, the prediction of confirmed cases is very important to ensure medical aid and healthy living conditions. In order to predict confirmed cases, the current study uses a dataset prepared by the White House Office of Science and Technology Policy which brought together companies and research to address questions concerning COVID-19. The importance of this was to identify factors that seem to affect the transmission rate of COVID-19. The focus of the current research, however, is to predict global cases of COVID-19. There have been many papers written about the prediction of confirmed cases and fatalities, but they failed to show promising results. Our research applies machine learning for predicting fatalities in the world using the COVID-19 Forecasting dataset from Kaggle. After trying several algorithms, our findings reveal that Logistic Regression, Decision Tree, KNeighbors, GaussianNB, and Random Forest algorithms provide the best predictions. Thus, the results show Random Forest as having the highest accuracy followed by Logistic Regression and Decision Tree. The results are promising opening up the door for further research.\",\"PeriodicalId\":158315,\"journal\":{\"name\":\"Cloud Computing and Data Science\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cloud Computing and Data Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.37256/ccds.3220221488\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cloud Computing and Data Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37256/ccds.3220221488","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

面对前所未有的抗疫挑战,确诊病例预测对于确保医疗救助和健康生活条件至关重要。为了预测确诊病例,目前的研究使用了白宫科技政策办公室准备的数据集,该数据集汇集了公司和研究人员,以解决与COVID-19有关的问题。这样做的重要性在于确定似乎影响COVID-19传播率的因素。然而,目前的研究重点是预测全球新冠肺炎病例。关于预测确诊病例和死亡人数的论文有很多,但它们都没有显示出令人鼓舞的结果。我们的研究利用Kaggle的COVID-19预测数据集,应用机器学习来预测世界上的死亡人数。在尝试了几种算法之后,我们的研究结果表明,逻辑回归、决策树、KNeighbors、GaussianNB和随机森林算法提供了最好的预测。因此,结果显示随机森林具有最高的准确性,其次是逻辑回归和决策树。这些结果很有希望,为进一步的研究打开了大门。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Study of Using Machine Learning in Predicting COVID-19 Cases
With an unprecedented challenge to combat COVID-19, the prediction of confirmed cases is very important to ensure medical aid and healthy living conditions. In order to predict confirmed cases, the current study uses a dataset prepared by the White House Office of Science and Technology Policy which brought together companies and research to address questions concerning COVID-19. The importance of this was to identify factors that seem to affect the transmission rate of COVID-19. The focus of the current research, however, is to predict global cases of COVID-19. There have been many papers written about the prediction of confirmed cases and fatalities, but they failed to show promising results. Our research applies machine learning for predicting fatalities in the world using the COVID-19 Forecasting dataset from Kaggle. After trying several algorithms, our findings reveal that Logistic Regression, Decision Tree, KNeighbors, GaussianNB, and Random Forest algorithms provide the best predictions. Thus, the results show Random Forest as having the highest accuracy followed by Logistic Regression and Decision Tree. The results are promising opening up the door for further research.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
DeepMetaDroid: Real-Time Android Malware Detection Using Deep Learning and Metadata Features Advancing Stock Market Predictions with Time Series Analysis including LSTM and ARIMA Geochemical and Geospatial Distribution of Organic Contaminants in the Flood Plain of Ekpetiama, Niger Delta Region of Nigeria Smart Contracts Security Application and Challenges: A Review A Review on Current Trends and Applications of Social Media Research in Sri Lanka
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1