Assessing the overhead and scalability of system monitors for large data centers

CloudCP '11 Pub Date : 2011-04-10 DOI:10.1145/1967422.1967425

M. Andreolini, M. Colajanni, R. Lancellotti

引用次数: 4

Abstract

Current data centers are shifting towards cloud-based architectures as a means to obtain a scalable, cost-effective, robust service platform. In spite of this, the underlying management infrastructure has grown in terms of hardware resources and software complexity, making automated resource monitoring a necessity. There are several infrastructure monitoring tools designed to scale to a very high number of physical nodes. However, these tools either collect performance measure at a low frequency (missing the chance to capture the dynamics of a short-term management task) or are simply not equipped with instrumentation specific to cloud computing and virtualization. In this scenario, monitoring the correctness and efficiency of live migrations can become a nightmare. This situation will only worsen in the future, with the increased service demand due to spreading of the user base. In this paper, we assess the scalability of a prototype monitoring subsystem for different user scenarios. We also identify all the major bottlenecks and give insight on how to remove them.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

评估大型数据中心系统监视器的开销和可伸缩性

当前的数据中心正在转向基于云的架构，以此作为获得可扩展、经济高效、健壮的服务平台的一种手段。尽管如此，底层管理基础设施在硬件资源和软件复杂性方面已经增长，使得自动化资源监控成为必要。有几种基础设施监控工具设计用于扩展到非常多的物理节点。然而，这些工具要么以较低的频率收集性能度量(错失捕捉短期管理任务动态的机会)，要么根本没有配备特定于云计算和虚拟化的工具。在这种情况下，监视实时迁移的正确性和效率可能会成为一场噩梦。随着用户群的扩大，服务需求的增加，这种情况在未来只会恶化。在本文中，我们评估了原型监控子系统在不同用户场景下的可扩展性。我们还确定了所有主要瓶颈，并提供了如何消除它们的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

CloudCP '11

自引率

0.00%

发文量