Fitting heavy-tailed HTTP traces with the new stratified EM-algorithm

2008 4th International Telecommunication Networking Workshop on QoS in Multiservice IP Networks. Publication date: 2008-04-11. DOI: 10.1109/ITNEWS.2008.4488162

R. Sadre, B. Haverkort

Citations: 13

Abstract

A typical step in the model-based evaluation of communication systems is to fit analytically tractable distributions to measured data. Due to the increased speed of today's networks, even basic measurements, such as logging the requests at a Web server, can quickly generate large data traces with millions of entries. Employing complex fitting algorithms on such traces can take a significant amount of time. In this paper, we focus on the Expectation Maximization-based fitting of hyper-exponential distributions to heavy-tailed data. We present a data aggregation algorithm that accelerates the fitting by several orders of magnitude. The aggregation algorithm is derived from a sampling stratification technique and adapts dynamically to the distribution of the data. We illustrate the performance of the algorithm by applying it to empirical and artificial data traces.
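The general idea of running EM on aggregated data rather than on the raw trace can be illustrated with a minimal Python sketch. This is not the authors' stratified algorithm, which the abstract does not specify. As a stand-in, the hypothetical helper `aggregate_log_bins` uses fixed log-spaced bins, and the hypothetical `fit_hyperexp_em` runs a standard weighted EM for a hyper-exponential density. The bin width, the number of phases, the initial rates, and the synthetic two-phase trace are all assumptions chosen for illustration.

```python
import math
import random


def aggregate_log_bins(samples, bins_per_decade=20):
    """Collapse positive samples into (bin mean, count) pairs.

    Assumption: fixed log-spaced bins stand in for the paper's adaptive,
    stratification-based aggregation, which the abstract does not specify.
    """
    sums, counts = {}, {}
    for x in samples:
        key = math.floor(math.log10(x) * bins_per_decade)
        sums[key] = sums.get(key, 0.0) + x
        counts[key] = counts.get(key, 0) + 1
    return [(sums[k] / counts[k], counts[k]) for k in sorted(sums)]


def fit_hyperexp_em(points, n_phases=2, iterations=100):
    """Weighted EM for the density f(x) = sum_j p_j * r_j * exp(-r_j * x)."""
    total = sum(w for _, w in points)
    mean = sum(x * w for x, w in points) / total
    # Initial guess (assumed): equal phase probabilities, rates around 1/mean.
    probs = [1.0 / n_phases] * n_phases
    rates = [10.0 ** (j - (n_phases - 1) / 2.0) / mean for j in range(n_phases)]
    for _ in range(iterations):
        resp = [0.0] * n_phases    # sum of weight * responsibility
        resp_x = [0.0] * n_phases  # sum of weight * responsibility * x
        for x, w in points:
            # E-step in log space to avoid underflow for large x.
            logs = [math.log(max(p, 1e-300)) + math.log(r) - r * x
                    for p, r in zip(probs, rates)]
            top = max(logs)
            dens = [math.exp(v - top) for v in logs]
            norm = sum(dens)
            for j in range(n_phases):
                q = w * dens[j] / norm
                resp[j] += q
                resp_x[j] += q * x
        # M-step: closed-form updates for a mixture of exponentials.
        probs = [s / total for s in resp]
        rates = [s / sx if sx > 0.0 else r
                 for s, sx, r in zip(resp, resp_x, rates)]
    return probs, rates


# Synthetic trace (assumed): 90% short requests, 10% long ones.
rng = random.Random(1)
samples = [rng.expovariate(10.0) if rng.random() < 0.9 else rng.expovariate(0.1)
           for _ in range(20000)]
points = aggregate_log_bins(samples)
probs, rates = fit_hyperexp_em(points)
print(len(samples), "samples ->", len(points), "aggregated points")
print("phase probabilities:", probs)
print("phase rates:", rates)
```

Each EM iteration in this sketch touches one entry per occupied bin instead of one entry per sample, which is where a speed-up from aggregation would come from. Because each bin is represented by its mean and count, the weighted sample mean is preserved, while the per-sample responsibilities are only approximated.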


