Segmentation of Time Series Based on Kinetic Characteristics for Storage Consumption Prediction

Beibei Miao, Chen Yu, Jin Xuebo, Wang Bo, Xianping Qu, Shimin Tao, Wang Dong, Zang Zhi
{"title":"Segmentation of Time Series Based on Kinetic Characteristics for Storage Consumption Prediction","authors":"Beibei Miao, Chen Yu, Jin Xuebo, Wang Bo, Xianping Qu, Shimin Tao, Wang Dong, Zang Zhi","doi":"10.1109/ICDCS.2017.254","DOIUrl":null,"url":null,"abstract":"The Internet services generate huge amount of data, which require large space for storage. Determining device purchase plan turns out to be very important for the service providers. Under-purchasing might lead to data loss, while over-purchasing would result in waste. In this paper, we propose a linear regression based approach to predict the storage demand according to the time series of the storage consumption. We partitioned the storage con-sumption time series into several linear segments, and perform prediction on the last segment using linear regression. Since the position of turning points between adjacent segments and the total number of the segments are both unknown, how to achieve the online segmentation becomes a big challenge. Aiming to solve this problem, we carried out the Kalman-Anova segmentation method. Experiment results show that our method has good accuracy in precision, recall and F-measure values. Moreover, the method is able to segment nonlinear time series as well, suggesting a potential wider application. The proposed method has been deployed in Baidu Inc. and saves about 45 thousand dollars in one of its device purchase program.","PeriodicalId":6300,"journal":{"name":"2012 IEEE 32nd International Conference on Distributed Computing Systems","volume":"147 1","pages":"2559-2560"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 32nd International Conference on Distributed Computing Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDCS.2017.254","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The Internet services generate huge amount of data, which require large space for storage. Determining device purchase plan turns out to be very important for the service providers. Under-purchasing might lead to data loss, while over-purchasing would result in waste. In this paper, we propose a linear regression based approach to predict the storage demand according to the time series of the storage consumption. We partitioned the storage con-sumption time series into several linear segments, and perform prediction on the last segment using linear regression. Since the position of turning points between adjacent segments and the total number of the segments are both unknown, how to achieve the online segmentation becomes a big challenge. Aiming to solve this problem, we carried out the Kalman-Anova segmentation method. Experiment results show that our method has good accuracy in precision, recall and F-measure values. Moreover, the method is able to segment nonlinear time series as well, suggesting a potential wider application. The proposed method has been deployed in Baidu Inc. and saves about 45 thousand dollars in one of its device purchase program.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于动力特性的时间序列分割用于电量预测
互联网服务产生了大量的数据,这些数据需要很大的存储空间。确定设备采购计划对服务提供商来说是非常重要的。采购不足可能导致数据丢失,而采购过多则会造成浪费。在本文中,我们提出了一种基于线性回归的方法,根据存储消耗的时间序列来预测存储需求。我们将存储消耗时间序列划分为几个线性段,并使用线性回归对最后一个段进行预测。由于相邻线段之间的拐点位置和线段总数都是未知的,因此如何实现在线分割成为一个很大的挑战。针对这一问题,我们进行了Kalman-Anova分割方法。实验结果表明,该方法在精密度、召回率和f测量值等方面都具有较好的准确性。此外,该方法还能对非线性时间序列进行分割,具有广阔的应用前景。该方法已在百度公司得到应用,并在其设备采购计划中节省了约4.5万美元。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Design and Simulation of Multiple Quantum well based InGaN/GaN Light Emitting Diode for High power applications Virtual Reality based System for Training and Monitoring Fire Safety Awareness for Children with Autism Spectrum Disorder A Cognitive Based Channel Assortment Using Ant-Colony Optimized Stable Path Selection in an IoTN Design and Implementation of DNA Based Cryptographic Algorithm A Compact Wearable 2.45 GHz Antenna for WBAN Applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1