新冠肺炎研究中的智力数据分析方法

O. Senko, A. Kuznetsova, E. M. Voronin, O. Kravtsova, Ludmila R. Borisova, I. L. Kirilyuk, V. Akimkin
{"title":"新冠肺炎研究中的智力数据分析方法","authors":"O. Senko, A. Kuznetsova, E. M. Voronin, O. Kravtsova, Ludmila R. Borisova, I. L. Kirilyuk, V. Akimkin","doi":"10.33581/2520-6508-2022-1-83-96","DOIUrl":null,"url":null,"abstract":"The paper presents an original method for solving the problem of finding a connection between the course of the epidemic and socio-economic, demographic and climatic factors. The method was applied to solve this problem for 110 countries of the world using a set of corresponding curves of the COVID-19 growth rate for the period from January 2020 to August 2021. Hierarchical agglomerative clustering was applied. Four large clusters with uniform curves were identified – 11, 39, 17 and 13 countries, respectively. Another 30 countries were not included in any cluster. Using machine learning methods, we identified the differences in socio-economic, demographic and geographical and climatic indicators in the selected clusters of countries of the world. The most important indicators by which the clusters differ from each other are amplitude of temperatures throughout the year, high-tech exports, Gini coefficient, size of the urban population and the general population, index of net barter terms of trade, population growth, average January temperature, territory (land area), number of deaths due to natural disasters, birth rate, coastline length, oil reserves, population in urban agglomerations with a population of more than 1 million etc. This approach (the use of clustering in combination with classification by methods of logical-statistical analysis) has not been used by anyone before. The found patterns will make it possible to more accurately predict the epidemiological process in countries belonging to different clusters. Supplementing this approach with autoregressive models will automate the forecast and improve its accuracy.","PeriodicalId":36323,"journal":{"name":"Zhurnal Belorusskogo Gosudarstvennogo Universiteta. Matematika. Informatika","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Methods of intellectual data analysis in COVID-19 research\",\"authors\":\"O. Senko, A. Kuznetsova, E. M. Voronin, O. Kravtsova, Ludmila R. Borisova, I. L. Kirilyuk, V. Akimkin\",\"doi\":\"10.33581/2520-6508-2022-1-83-96\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper presents an original method for solving the problem of finding a connection between the course of the epidemic and socio-economic, demographic and climatic factors. The method was applied to solve this problem for 110 countries of the world using a set of corresponding curves of the COVID-19 growth rate for the period from January 2020 to August 2021. Hierarchical agglomerative clustering was applied. Four large clusters with uniform curves were identified – 11, 39, 17 and 13 countries, respectively. Another 30 countries were not included in any cluster. Using machine learning methods, we identified the differences in socio-economic, demographic and geographical and climatic indicators in the selected clusters of countries of the world. The most important indicators by which the clusters differ from each other are amplitude of temperatures throughout the year, high-tech exports, Gini coefficient, size of the urban population and the general population, index of net barter terms of trade, population growth, average January temperature, territory (land area), number of deaths due to natural disasters, birth rate, coastline length, oil reserves, population in urban agglomerations with a population of more than 1 million etc. This approach (the use of clustering in combination with classification by methods of logical-statistical analysis) has not been used by anyone before. The found patterns will make it possible to more accurately predict the epidemiological process in countries belonging to different clusters. Supplementing this approach with autoregressive models will automate the forecast and improve its accuracy.\",\"PeriodicalId\":36323,\"journal\":{\"name\":\"Zhurnal Belorusskogo Gosudarstvennogo Universiteta. Matematika. Informatika\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Zhurnal Belorusskogo Gosudarstvennogo Universiteta. Matematika. Informatika\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.33581/2520-6508-2022-1-83-96\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Zhurnal Belorusskogo Gosudarstvennogo Universiteta. Matematika. Informatika","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33581/2520-6508-2022-1-83-96","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种解决流行病过程与社会经济、人口和气候因素之间联系问题的独创方法。该方法被应用于解决世界110个国家的这一问题,使用了2020年1月至2021年8月期间新冠肺炎增长率的一组对应曲线。采用层次聚集聚类方法。确定了四个具有统一曲线的大集群,分别为11个、39个、17个和13个国家。另有30个国家未被列入任何一组。使用机器学习方法,我们确定了世界上选定的国家集群在社会经济、人口、地理和气候指标方面的差异。集群之间差异的最重要指标是全年气温幅度、高科技出口、基尼系数、城市人口和普通人口规模、净易货贸易条件指数、人口增长、1月平均气温、领土(陆地面积)、自然灾害死亡人数、出生率、海岸线长度,石油储量、人口超过100万的城市群人口等。这种方法(使用聚类与逻辑统计分析方法分类相结合)以前从未被任何人使用过。所发现的模式将有可能更准确地预测属于不同集群的国家的流行病学过程。用自回归模型补充这种方法将使预测自动化并提高其准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Methods of intellectual data analysis in COVID-19 research
The paper presents an original method for solving the problem of finding a connection between the course of the epidemic and socio-economic, demographic and climatic factors. The method was applied to solve this problem for 110 countries of the world using a set of corresponding curves of the COVID-19 growth rate for the period from January 2020 to August 2021. Hierarchical agglomerative clustering was applied. Four large clusters with uniform curves were identified – 11, 39, 17 and 13 countries, respectively. Another 30 countries were not included in any cluster. Using machine learning methods, we identified the differences in socio-economic, demographic and geographical and climatic indicators in the selected clusters of countries of the world. The most important indicators by which the clusters differ from each other are amplitude of temperatures throughout the year, high-tech exports, Gini coefficient, size of the urban population and the general population, index of net barter terms of trade, population growth, average January temperature, territory (land area), number of deaths due to natural disasters, birth rate, coastline length, oil reserves, population in urban agglomerations with a population of more than 1 million etc. This approach (the use of clustering in combination with classification by methods of logical-statistical analysis) has not been used by anyone before. The found patterns will make it possible to more accurately predict the epidemiological process in countries belonging to different clusters. Supplementing this approach with autoregressive models will automate the forecast and improve its accuracy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
0.50
自引率
0.00%
发文量
21
审稿时长
16 weeks
期刊最新文献
Algorithm for solving the knapsack problem with certain properties of Pareto layers Numerical study of the relative equilibrium of a droplet with a simply connected free surface on a rotating plane On the Hosoya polynomial of the third type of the chain hex-derived network Algebraic equations and polynomials over the ring of p-complex numbers On the theory of operator interpolation in spaces of rectangular matrixes
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1