Pub Date: 2009-12-28 | DOI: 10.1109/MASCOT.2009.5366756
J. M. Ewing, D. Menascé
Autonomic computing systems are able to adapt to changing environments (such as changes in workload intensity or component failures) in a way that preserves high-level operational goals, such as service level objectives. This paper focuses on autonomic computing systems that are self-optimizing and self-configuring. More specifically, the paper presents the detailed design of an autonomic load balancer (LB) for multi-tiered Web sites. It is assumed that customers can be categorized into distinct classes (gold, silver, and bronze) according to their business value to the site. While the example used in the paper is that of an auction site, the approach can easily be applied to any other Web site. The autonomic LB is able to dynamically change its request redirection policy as well as its resource allocation policy, which determines the allocation of servers to server clusters, in a way that maximizes a business-oriented utility function. The autonomic LB was evaluated through detailed and comprehensive simulation experiments and was compared against a round-robin LB and against a configuration in which each customer category has a dedicated set of servers. The results show that the autonomic LB outperforms the other load balancing approaches by providing a higher utility for highly dynamic workloads.
Title: "Business-oriented autonomic load balancing for multitiered Web sites"
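A minimal sketch of the kind of utility-maximizing server-allocation search such an autonomic LB could perform. The class parameters, the M/M/m-style response-time approximation, and the sigmoid reward below are illustrative assumptions, not the paper's actual controller:

```python
from itertools import product
import math

# Hypothetical per-class parameters: arrival rate (req/s), mean service
# demand (s), response-time SLO (s), and business weight.
CLASSES = {
    "gold":   {"lam": 40.0, "demand": 0.05, "slo": 0.2, "weight": 5.0},
    "silver": {"lam": 60.0, "demand": 0.05, "slo": 0.5, "weight": 2.0},
    "bronze": {"lam": 80.0, "demand": 0.05, "slo": 1.0, "weight": 1.0},
}
TOTAL_SERVERS = 12

def response_time(lam, demand, servers):
    """Crude M/M/m-style approximation of a cluster's mean response time."""
    util = lam * demand / servers
    if util >= 1.0:
        return math.inf                          # saturated cluster
    return demand / (1.0 - util)

def utility(allocation):
    """Business-oriented utility: weighted SLO satisfaction over all classes."""
    total = 0.0
    for cls, servers in allocation.items():
        p = CLASSES[cls]
        r = response_time(p["lam"], p["demand"], servers)
        if math.isfinite(r):
            x = min(4.0 * (r - p["slo"]) / p["slo"], 50.0)   # cap to avoid overflow
            reward = 1.0 / (1.0 + math.exp(x))   # ~1 when r << SLO, ~0 when r >> SLO
        else:
            reward = 0.0
        total += p["weight"] * reward
    return total

def best_allocation():
    """Exhaustively search ways to split TOTAL_SERVERS among the clusters."""
    names = list(CLASSES)
    best, best_u = None, -math.inf
    for split in product(range(1, TOTAL_SERVERS + 1), repeat=len(names)):
        if sum(split) != TOTAL_SERVERS:
            continue
        alloc = dict(zip(names, split))
        u = utility(alloc)
        if u > best_u:
            best, best_u = alloc, u
    return best, best_u

if __name__ == "__main__":
    alloc, u = best_allocation()
    print(f"best allocation: {alloc}, utility = {u:.3f}")
```

In a real controller the search would be re-run whenever measured arrival rates change, which is what makes the allocation policy adapt to highly dynamic workloads.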
Pub Date: 2009-12-28 | DOI: 10.1109/MASCOT.2009.5366812
Sébastien Doirieux, B. Baynat, Thomas Begin
In this paper, we explore a way to find the right scheduling policy for WiMAX networks, one that achieves the best compromise between efficient use of the resource and relative fairness among users. This problem is of primary importance, as no scheduling policy is recommended in the WiMAX standard. To do so, we develop an extension of our previous analytical model for WiMAX networks that takes into account a more general scheduling policy than those previously studied (i.e., instantaneous throughput fairness, slot-sharing fairness, and opportunistic scheduling). We show that this general policy covers the two extreme cases, namely the instantaneous throughput fairness policy and the opportunistic policy, and offers intermediate policies that are good candidates for finding the right trade-off. In order to formulate the decision criterion, we introduce a new performance parameter: the mean throughput obtained by a user as a function of how efficiently it uses the resource. The model has a closed-form solution, and all performance parameters can be obtained instantaneously. This allows us to carry out dimensioning studies that require several thousand evaluations, which would not be tractable with any simulation tool.
Title: "On finding the right balance between fairness and efficiency in WiMAX scheduling through analytical modeling"
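As an illustration only, the sketch below shows one parametric family of slot-sharing policies whose extremes correspond to instantaneous throughput fairness and opportunistic scheduling; the parameter beta and the slot/bit constants are assumptions and do not reproduce the paper's analytical model:

```python
def slot_shares(efficiencies, beta):
    """Share of frame slots per user: s_i proportional to e_i ** beta.

    beta = -1 -> equal instantaneous throughput (more slots to weaker users)
    beta =  0 -> equal slot sharing
    beta large -> opportunistic (slots concentrate on the most efficient user)
    """
    weights = [e ** beta for e in efficiencies]
    total = sum(weights)
    return [w / total for w in weights]

def throughputs(efficiencies, beta, slots_per_frame=100, bits_per_slot=48):
    """Per-user throughput (bits/frame) under the chosen slot-sharing policy."""
    shares = slot_shares(efficiencies, beta)
    return [s * slots_per_frame * e * bits_per_slot
            for s, e in zip(shares, efficiencies)]

if __name__ == "__main__":
    # Hypothetical per-user radio efficiencies (relative bits-per-slot factors).
    eff = [1.0, 0.6, 0.3]
    for beta in (-1.0, 0.0, 1.0, 4.0):
        tp = throughputs(eff, beta)
        print(f"beta={beta:+.1f}  throughputs={[round(t, 1) for t in tp]}")
```

Intermediate beta values are exactly the kind of compromise policy the abstract refers to: more total throughput than strict throughput fairness, less starvation of weak users than pure opportunism.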
Pub Date: 2009-12-28 | DOI: 10.1109/MASCOT.2009.5366151
Q. Wei, B. Veeravalli, Zhixiang Li
Disk idle periods in server workloads are short, which significantly limits the effectiveness of underlying disk power management. To overcome this limitation, we present a Cooperative Power Management (CPM) scheme that saves energy with performance guarantees for object-based storage clusters. CPM reclaims idle memory of neighboring Object-based Storage Devices (OSDs) over a high-speed network as a remote cache to store evicted objects. Requests that miss in the local cache can then be served from the remote cache, so the local disk does not necessarily have to spin back up to service them. Hence, CPM can artificially create long idle periods that provide more opportunities for the underlying disk power management. CPM minimizes the risk of performance and energy penalties by spinning down disks only when the predicted idle period is long enough to justify the state-transition energy. Our experimental results demonstrate that CPM can dynamically adapt to workload changes and outperforms existing solutions in terms of energy saving and performance for large-scale OSD clusters.
Title: "CPM: Cooperative power management for object-based storage cluster"
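A minimal sketch of the break-even spin-down decision described in the abstract, assuming hypothetical disk power figures and a given idle-period prediction; CPM's actual prediction mechanism and remote-cache logic are not modeled here:

```python
def break_even_time(idle_power, standby_power, transition_energy):
    """Idle time (s) beyond which spinning down saves energy.

    transition_energy: extra energy (J) spent spinning the disk down and back up.
    """
    return transition_energy / (idle_power - standby_power)

def should_spin_down(predicted_idle_s, idle_power=8.0, standby_power=1.0,
                     transition_energy=70.0):
    """Spin down only if the predicted idle period exceeds the break-even time."""
    return predicted_idle_s > break_even_time(idle_power, standby_power,
                                              transition_energy)

if __name__ == "__main__":
    # Hypothetical disk power figures (watts) and transition energy (joules):
    # break-even time is 70 / (8 - 1) = 10 seconds.
    for idle in (5, 15, 30):
        print(f"predicted idle {idle:>2}s -> spin down: {should_spin_down(idle)}")
```

The remote caching in CPM matters precisely because it stretches the predicted idle periods past this break-even point far more often than local caching alone would.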
Pub Date: 2009-12-28 | DOI: 10.1109/MASCOT.2009.5367047
E. Lynch, G. Riley
Historically, large-scale low-lookahead parallel simulation has been a difficult problem. As a solution, we have designed a Global Synchronization Unit (GSU) that would reside centrally on a multi-core chip and asynchronously compute the Lower Bound on Time Stamps (LBTS), the minimum timestamp of all unprocessed events in the simulation, on demand to synchronize conservative parallel simulators. Our GSU also accounts for transient messages, messages that have been sent but not yet processed by their recipient, eliminating the need for the simulator to acknowledge received messages. In this paper we analyze the sensitivity of simulation performance to the time required to access the GSU. The sensitivity analysis revealed that with GSU access times as high as hundreds of cycles, there was still a significant performance advantage over the baseline shared-memory implementation.
Title: "A sensitivity analysis of a new hardware-supported Global Synchronization Unit"
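A simplified software sketch of the LBTS computation such a GSU performs; the per-LP report fields and the send/receive counters used to detect transient messages are illustrative assumptions (the real GSU is an asynchronous hardware unit on a multi-core chip):

```python
import math
from dataclasses import dataclass

@dataclass
class LPState:
    """Per-logical-process state reported to the GSU (illustrative fields)."""
    next_event_time: float = math.inf   # earliest unprocessed local event
    sent_count: int = 0                 # messages sent so far
    recv_count: int = 0                 # messages received so far
    min_sent_time: float = math.inf     # min timestamp among recent sends

class GlobalSyncUnit:
    """Software sketch of the LBTS logic a hardware GSU might implement."""

    def __init__(self, num_lps):
        self.lps = [LPState() for _ in range(num_lps)]

    def report(self, lp_id, next_event_time, sent_count, recv_count, min_sent_time):
        self.lps[lp_id] = LPState(next_event_time, sent_count, recv_count, min_sent_time)

    def lbts(self):
        """Lower Bound on Time Stamps over local events and transient messages.

        If total sends exceed total receives, some messages are still in flight,
        so their minimum send timestamp must also bound the LBTS; no per-message
        acknowledgements are needed.
        """
        bound = min(lp.next_event_time for lp in self.lps)
        in_flight = (sum(lp.sent_count for lp in self.lps)
                     - sum(lp.recv_count for lp in self.lps))
        if in_flight > 0:
            bound = min(bound, min(lp.min_sent_time for lp in self.lps))
        return bound

if __name__ == "__main__":
    gsu = GlobalSyncUnit(num_lps=3)
    gsu.report(0, next_event_time=12.0, sent_count=5, recv_count=5, min_sent_time=math.inf)
    gsu.report(1, next_event_time=9.5,  sent_count=4, recv_count=3, min_sent_time=8.0)
    gsu.report(2, next_event_time=11.0, sent_count=2, recv_count=2, min_sent_time=math.inf)
    print("LBTS =", gsu.lbts())   # 8.0: a transient message bounds the LBTS
```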
Pub Date: 2009-12-28 | DOI: 10.1109/MASCOT.2009.5366358
D. Lugones, Daniel Franco, Eduardo Argollo, E. Luque
Modeling interconnection networks is an important research topic, enabling the study of interconnection behavior and its significance in telecommunication applications and distributed systems. However, the complexity of large-scale networks makes the development of models and simulation tools a prohibitively difficult task. In this paper we explore the network modeling design space to provide models following two different approaches: accurate simulation models based on finite state machines (FSMs), and analytical models that provide a profitable speedup with minimal accuracy loss. Experimental results show that the proposed analytical model provides a faithful abstraction for the scale of systems that are of interest in the foreseeable future, reaching an 8% error and a speedup of around 30x over an FSM model.
Title: "Models for high-speed interconnection networks performance analysis"
Pub Date: 2009-12-28 | DOI: 10.1109/MASCOT.2009.5366184
Dinesh Kumar, D. Olshefski, Li Zhang
Managing client-perceived pageview response time for multiple classes of service is essential in today's highly competitive e-commerce environment. We present Connection and Performance Model Driven Optimization (CP-MDO), a novel approach for providing optimal QoS as defined by a cost objective based on client-perceived pageview response time and pageview drop rate. Our approach combines two vital models: 1) a latency model for connection establishment that captures the interactions between web browsers and web servers across network protocol layers, and 2) a server performance model based on queueing theory that models performance across all tiers of a server complex. An algorithm that enforces the optimal admission control based on the inter-arrival time between pageview admissions is given. Our approach has been implemented and evaluated in an experimental setting, demonstrating how CP-MDO achieves the minimal cost while providing minimal pageview response times under minimal drop rates across multiple classes of service.
Title: "Connection and performance model driven optimization of pageview response time"
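A minimal sketch of an admission gate driven by a per-class minimum inter-admission time, the mechanism the abstract describes; the class names and gap values are hypothetical, and in CP-MDO the gaps would come from the optimization over the connection and performance models:

```python
import time

class AdmissionGate:
    """Pace pageview admissions so that, per service class, admitted requests
    are at least `min_gap` seconds apart (the gap would come from the optimizer)."""

    def __init__(self, min_gap_by_class):
        self.min_gap = dict(min_gap_by_class)
        self.last_admit = {cls: float("-inf") for cls in self.min_gap}

    def try_admit(self, cls, now=None):
        now = time.monotonic() if now is None else now
        if now - self.last_admit[cls] >= self.min_gap[cls]:
            self.last_admit[cls] = now
            return True        # admit the pageview
        return False           # drop (or defer) the pageview

if __name__ == "__main__":
    # Hypothetical per-class minimum inter-admission times (seconds).
    gate = AdmissionGate({"gold": 0.01, "bronze": 0.05})
    admitted = [gate.try_admit("bronze", now=t) for t in (0.00, 0.02, 0.06, 0.07, 0.12)]
    print(admitted)   # [True, False, True, False, True]
```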
Pub Date: 2009-12-28 | DOI: 10.1109/MASCOT.2009.5366143
Marga Nácher, C. Calafate, Juan-Carlos Cano, P. Manzoni
Eavesdropping is an important threat in mobile ad-hoc networks because the open air is used as the transmission medium. As a consequence, several works aiming to prevent this threat have been proposed, some of which focus on the use of anonymous routing protocols. In this paper we analyze two of the most popular: ANODR and MASK. We evaluate their performance through simulation in terms of throughput and routing overhead in order to measure the cost of providing anonymity. Simulation results show that these anonymous routing protocols reduce performance to inefficient levels.
Title: "Anonymous routing protocols: Impact on performance in MANETs"
Pub Date: 2009-12-28 | DOI: 10.1109/MASCOT.2009.5366825
Avani Wildani, T. Schwarz, E. L. Miller, D. Long
Digital archives are growing rapidly, necessitating stronger reliability measures than RAID to avoid data loss from device failure. Mirroring, a popular solution, is too expensive over time. We present a compromise solution that uses multi-level redundancy coding to reduce the probability of data loss from multiple simultaneous device failures. This approach handles small-scale failures of one or two devices efficiently while still allowing the system to survive rare-event, larger-scale failures of four or more devices. In our approach, each disk is split into a set of fixed-size disklets, which are used to construct reliability stripes. To protect against rare-event failures, reliability stripes are grouped into larger super-groups, each of which has a corresponding super-parity; super-parity is used to recover data only when disk failures overwhelm the redundancy in a single reliability stripe. Super-parity can be stored on a variety of devices, such as NV-RAM and always-on disks, to offset write bottlenecks while still keeping the number of active devices low. Our calculations of failure probabilities show that adding super-parity allows our system to absorb many more disk failures without data loss. Through discrete-event simulation, we found that adding super-groups has a significant impact on mean time to data loss and that rebuilds are slow but not unmanageable. Finally, we showed that robustness against rare events can be achieved for a fraction of the total system cost.
Title: "Protecting against rare event failures in archival systems"
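A toy sketch of two-level parity over a super-group, assuming simple XOR codes and byte-string disklets; the real system's placement, device choices for super-parity, and recovery paths are more involved, and the recovery shown here assumes all other disklets in the super-group survive:

```python
from functools import reduce

def xor_blocks(blocks):
    """Bytewise XOR of equal-length byte strings."""
    return reduce(lambda a, b: bytes(x ^ y for x, y in zip(a, b)), blocks)

def stripe_parity(disklets):
    """First-level parity for one reliability stripe (covers a single failure)."""
    return xor_blocks(disklets)

def super_parity(stripes):
    """Second-level parity over all disklets of the stripes in one super-group.

    Consulted only when failures exceed a single stripe's own redundancy.
    """
    return xor_blocks([d for stripe in stripes for d in stripe])

def recover_with_super_parity(stripes, missing, sp):
    """Rebuild one lost disklet (stripe i, position j) from the super-parity
    and all surviving disklets in the super-group."""
    survivors = [d for i, stripe in enumerate(stripes)
                 for j, d in enumerate(stripe) if (i, j) != missing]
    return xor_blocks(survivors + [sp])

if __name__ == "__main__":
    # Two reliability stripes of three 4-byte disklets each (toy super-group).
    stripes = [[b"\x01\x02\x03\x04", b"\x10\x20\x30\x40", b"\xaa\xbb\xcc\xdd"],
               [b"\x0f\x0e\x0d\x0c", b"\x55\x66\x77\x88", b"\x00\xff\x00\xff"]]
    p0 = stripe_parity(stripes[0])          # per-stripe parity (first level)
    sp = super_parity(stripes)              # super-group parity (second level)
    lost = (0, 2)                           # pretend this disklet is unreadable
    rebuilt = recover_with_super_parity(stripes, lost, sp)
    print(rebuilt == stripes[0][2])         # True
```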
Pub Date: 2009-12-28 | DOI: 10.1109/MASCOT.2009.5366188
Fei Yang, I. Augé-Blum
Duty-cycling prolongs the lifetime of battery-powered wireless sensor networks (WSNs). However, it incurs additional delay because nodes may be asleep. In addition to energy constraints, many applications have real-time constraints, which means the sink has to be informed before a deadline when an event occurs. Moreover, wireless links among low-power radios are highly unreliable. These factors pose significant challenges for designing protocols for real-time applications. In this paper, a novel forwarding scheme based on distributed wakeup scheduling is proposed that guarantees a bounded delay and achieves a higher delivery ratio for ultra-low duty-cycle WSNs under unreliable links. The proposed wakeup scheduling algorithm schedules the wakeup time of each node according to its hop number and expected delivery ratio to the sink. We model the forwarding scheme and analyze its properties. Simulation results show that the proposed algorithm performs better in terms of delivery ratio and end-to-end delay.
Title: "On maximizing the delivery ratio of ultra low duty-cycle WSNs under real-time constraints"
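A simplified sketch of hop-based staggered wakeups and the resulting delay bound; the period, slot, and hop parameters are assumptions, and the paper's scheme additionally weighs the expected delivery ratio over unreliable links, which is not modeled here:

```python
def wakeup_offset(hop, max_hop, slot=0.02):
    """Within each period, a node at `hop` hops from the sink wakes
    (max_hop - hop) slots after the period start, so a packet forwarded
    hop-by-hop toward the sink always finds its next-hop node awake."""
    return (max_hop - hop) * slot

def worst_case_delay(src_hop, period=2.0, slot=0.02):
    """Loose delay bound: at most one period waiting for the source's next
    wakeup, plus one slot per hop as the packet rides the wave to the sink."""
    return period + src_hop * slot

if __name__ == "__main__":
    MAX_HOP, PERIOD, SLOT = 10, 2.0, 0.02   # hypothetical network parameters
    for hop in (10, 5, 1):
        print(f"hop {hop:>2}: wake offset {wakeup_offset(hop, MAX_HOP, SLOT):.2f}s, "
              f"worst-case delay {worst_case_delay(hop, PERIOD, SLOT):.2f}s")
```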
Pub Date: 2009-12-28 | DOI: 10.1109/MASCOT.2009.5366603
Zhibin Yu, Hai Jin, Jing Chen, L. John
Accelerating micro-architecture simulation is becoming increasingly urgent as the complexity of workloads and simulated processors increases. This paper presents a novel two-stage sampling (TSS) scheme to accelerate sampling-based simulation. It first selects large samples from the dynamic instruction stream as candidates for detailed simulation, and then samples small groups from each selected first-stage sample for detailed simulation. Since the distribution of the standard deviation of cycles per instruction (CPI) is insensitive to microarchitecture, TSS can be used to speed up design-space exploration by splitting the sampling process into two stages, which removes redundant instruction samples from detailed simulation when the program is in a stable phase (the standard deviation of CPI is near zero). It also adopts systematic sampling to accelerate the functional warm-up in sampling simulation. Experimental results show that, by combining these two techniques, TSS achieves an average and maximum speedup of 1.3 and 2.29 over SMARTS, with an average CPI relative error of less than 3%. TSS can significantly accelerate the time-consuming iterative early design evaluation process.
Title: "TSS: Applying two-stage sampling in micro-architecture simulations"
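A toy sketch of the two-stage selection idea, assuming window sizes, group counts, and a CPI-stability threshold that are purely illustrative; the actual TSS derives CPI variability from the functional warm-up pass and applies systematic sampling there as well:

```python
import random

def two_stage_sample(num_instructions, stage1_size, stage1_count,
                     stage2_size, stage2_count, cpi_std_estimate, seed=0):
    """Pick instruction ranges for detailed simulation in two stages.

    Stage 1: choose large candidate windows systematically across the stream.
    Stage 2: within each candidate, choose small groups for detailed simulation;
    fewer groups are taken when the estimated CPI standard deviation of that
    window is near zero (a stable program phase).
    """
    rng = random.Random(seed)
    stride = num_instructions // stage1_count
    detailed_ranges = []
    for i in range(stage1_count):
        window_start = i * stride
        # Stable phases need fewer detailed groups; cpi_std_estimate[i] would
        # come from a cheap functional-simulation pass in a real setting.
        groups = 1 if cpi_std_estimate[i] < 0.05 else stage2_count
        for _ in range(groups):
            offset = rng.randrange(0, stage1_size - stage2_size)
            start = window_start + offset
            detailed_ranges.append((start, start + stage2_size))
    return detailed_ranges

if __name__ == "__main__":
    # Hypothetical 100M-instruction stream, 10 candidate windows of 1M each,
    # up to 4 detailed groups of 10k instructions per window.
    std = [0.01, 0.30, 0.02, 0.02, 0.40, 0.01, 0.01, 0.25, 0.02, 0.01]
    ranges = two_stage_sample(100_000_000, 1_000_000, 10, 10_000, 4, std)
    print(f"{len(ranges)} detailed-simulation ranges, e.g. {ranges[:3]}")
```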