用C-SMOTE预测COVID-19的传播

IF 7.4 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Business & Information Systems Engineering Pub Date : 2021-07-02 DOI:10.52825/bis.v1i.45
Alessio Bernardo, Emanuele Della Valle
{"title":"用C-SMOTE预测COVID-19的传播","authors":"Alessio Bernardo, Emanuele Della Valle","doi":"10.52825/bis.v1i.45","DOIUrl":null,"url":null,"abstract":"Data continuously gathered monitoring the spreading of the COVID-19 pandemic form an unbounded flow of data. Accurately forecasting if the infections will increase or decrease has a high impact, but it is challenging because the pandemic spreads and contracts periodically. Technically, the flow of data is said to be imbalanced and subject to concept drifts because signs of decrements are the minority class during the spreading periods, while they become the majority class in the contraction periods and the other way round. In this paper, we propose a case study applying the Continuous Synthetic Minority Oversampling Technique (C-SMOTE), a novel meta-strategy to pipeline with Streaming Machine Learning (SML) classification algorithms, to forecast the COVID-19 pandemic trend. Benchmarking SML pipelinesthat use C-SMOTE against state-of-the-art methods on a COVID-19 dataset, we bring statistical evidence that models learned using C-SMOTE are better.","PeriodicalId":56020,"journal":{"name":"Business & Information Systems Engineering","volume":"12 1","pages":"27-38"},"PeriodicalIF":7.4000,"publicationDate":"2021-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predict COVID-19 Spreading With C-SMOTE\",\"authors\":\"Alessio Bernardo, Emanuele Della Valle\",\"doi\":\"10.52825/bis.v1i.45\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data continuously gathered monitoring the spreading of the COVID-19 pandemic form an unbounded flow of data. Accurately forecasting if the infections will increase or decrease has a high impact, but it is challenging because the pandemic spreads and contracts periodically. Technically, the flow of data is said to be imbalanced and subject to concept drifts because signs of decrements are the minority class during the spreading periods, while they become the majority class in the contraction periods and the other way round. In this paper, we propose a case study applying the Continuous Synthetic Minority Oversampling Technique (C-SMOTE), a novel meta-strategy to pipeline with Streaming Machine Learning (SML) classification algorithms, to forecast the COVID-19 pandemic trend. Benchmarking SML pipelinesthat use C-SMOTE against state-of-the-art methods on a COVID-19 dataset, we bring statistical evidence that models learned using C-SMOTE are better.\",\"PeriodicalId\":56020,\"journal\":{\"name\":\"Business & Information Systems Engineering\",\"volume\":\"12 1\",\"pages\":\"27-38\"},\"PeriodicalIF\":7.4000,\"publicationDate\":\"2021-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Business & Information Systems Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.52825/bis.v1i.45\",\"RegionNum\":3,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Business & Information Systems Engineering","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.52825/bis.v1i.45","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

持续收集的监测COVID-19大流行传播的数据形成了无界的数据流。准确预测感染增加或减少的影响很大,但由于大流行的周期性传播和收缩,这一预测具有挑战性。从技术上说,数据的流动是不平衡的,并受到概念漂移的影响,因为减量的迹象是在扩大期是少数阶层,而在收缩期则是多数阶层,反之亦然。在本文中,我们提出了一个应用连续合成少数过采样技术(C-SMOTE)的案例研究,这是一种基于流机器学习(SML)分类算法的流水线元策略,用于预测COVID-19大流行趋势。通过在COVID-19数据集上对使用C-SMOTE的SML管道与最先进的方法进行基准测试,我们提供了统计证据,表明使用C-SMOTE学习的模型更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Predict COVID-19 Spreading With C-SMOTE
Data continuously gathered monitoring the spreading of the COVID-19 pandemic form an unbounded flow of data. Accurately forecasting if the infections will increase or decrease has a high impact, but it is challenging because the pandemic spreads and contracts periodically. Technically, the flow of data is said to be imbalanced and subject to concept drifts because signs of decrements are the minority class during the spreading periods, while they become the majority class in the contraction periods and the other way round. In this paper, we propose a case study applying the Continuous Synthetic Minority Oversampling Technique (C-SMOTE), a novel meta-strategy to pipeline with Streaming Machine Learning (SML) classification algorithms, to forecast the COVID-19 pandemic trend. Benchmarking SML pipelinesthat use C-SMOTE against state-of-the-art methods on a COVID-19 dataset, we bring statistical evidence that models learned using C-SMOTE are better.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Business & Information Systems Engineering
Business & Information Systems Engineering Computer Science-Information Systems
CiteScore
13.60
自引率
7.60%
发文量
44
审稿时长
3 months
期刊介绍: Business & Information Systems Engineering (BISE) is a double-blind peer-reviewed journal with a primary focus on the design and utilization of information systems for social welfare. The journal aims to contribute to the understanding and advancement of information systems in ways that benefit societal well-being.
期刊最新文献
The Design of Citizen-Centric Green IS in Sustainable Smart Districts A Maturity Model for Assessing the Digitalization of Public Health Agencies IT Professionals in the Gig Economy A Reference System Architecture with Data Sovereignty for Human-Centric Data Ecosystems Analyzing Medical Data with Process Mining: a COVID-19 Case Study
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1