Adaptive Distributed Time-Slot Based Scheduling for Fairness in Multi-Hop Wireless Networks
A. Rao, I. Stoica. doi:10.1109/ICDCS.2008.108

Recent research indicates that multi-hop wireless networks can suffer from extreme imbalances in the throughput achieved by simultaneously competing flows. We address this problem by designing a practical distributed algorithm to compute a time-slot-based schedule that provides end-to-end max-min fairness. Our system uses randomized priorities based on local weights to arbitrate access between nodes that directly compete with each other; we call this weighted slot allocation (WSA). The local weights are in turn computed by a higher layer called end-to-end fairness using local weights (EFLoW). EFLoW implements an additive-increase multiplicative-decrease (AIMD) algorithm that automatically adapts to changes in traffic demands and network conditions. In each iteration, EFLoW uses only state obtained from within a given node's contention region. We have implemented WSA and EFLoW in both a simulator and a real system using the overlay MAC layer (OML). Unlike previous work on end-to-end fairness, our approach does not use a centralized coordinator and works for traffic patterns with any number of sources and sinks. Also, since we compute both the fair allocation and a schedule to achieve it, we make no assumptions about the efficiency of carrier-sense (CS) based MACs; this is important in light of recent work showing that current CS-based MACs can be very unfair even when all nodes are limited to sending at their fair rate. Our results show that WSA and EFLoW can prevent starvation of flows and improve fairness without sacrificing efficiency for a wide variety of traffic patterns.
Routing of XML and XPath Queries in Data Dissemination Networks
Guoli Li, Shuang Hou, H. Jacobsen. doi:10.1109/ICDCS.2008.31

XML-based data dissemination networks are rapidly gaining momentum. In these networks, XML content is routed from data producers to data consumers through an overlay network of content-based routers. Routing decisions are based on XPath expressions (XPEs) stored at each router. To enable efficient routing while keeping the routing state small, we introduce an advertisement-based routing algorithm for XML content, present a novel data structure for managing XPEs that is especially apt for the hierarchical nature of XPEs and XML, and develop several optimizations for reducing the number of XPEs required to manage the routing state. The experimental evaluation shows that our algorithms and optimizations reduce the routing table size by up to 90%, improve the routing time by roughly 85%, and reduce overall network traffic by about 35%. Experiments running on PlanetLab show the scalability of our approach.
Can We Really Recover Data if Storage Subsystem Fails?
Weijun Xiao, Qing Yang. doi:10.1109/ICDCS.2008.38

This paper presents a theoretical and experimental study on the limitations of copy-on-write snapshots and incremental backups in terms of data recoverability. We provide mathematical proofs of our new findings as well as implementation experiments to show how data recovery is done in case of various failures. Based on our study, we propose a new system architecture that overcomes the problems of existing technologies. The new architecture can provide two-way data recovery capability with the same storage overheads and can be implemented fairly easily on existing systems. We show that the new architecture has maximum data recoverability and is practically feasible.
Correlation-Aware Object Placement for Multi-Object Operations
Ming Zhong, Kai Shen, J. Seiferas. doi:10.1109/ICDCS.2008.60

A multi-object operation incurs communication or synchronization overhead when the requested objects are distributed over different nodes. The object pair correlations (the probability for a pair of objects to be requested together in an operation) are often highly skewed and yet stable over time for real-world distributed applications. Thus, placing strongly correlated objects on the same node (subject to node space constraints) tends to reduce communication overhead for multi-object operations. This paper studies the optimization of correlation-aware data placement. First, we formalize a restricted form of the problem as a variant of the classic Quadratic Assignment problem and show that it is NP-hard. Based on a linear programming relaxation, we then propose a polynomial-time approximation algorithm that finds an object placement with communication overhead at most two times that of the optimal placement. We further show that the computation cost can be reduced by limiting the optimization scope to a relatively small number of most important objects. We quantitatively evaluate our approach on keyword index placement for full-text search engines using real traces of 3.7 million web pages and 6.8 million search queries. Compared to the correlation-oblivious random object placement, our approach achieves 37-86% communication overhead reduction on a range of optimization scopes and system sizes. The communication reduction is 30-78% compared to a correlation-aware greedy approach.
PAS: A Wireless-Enabled, Cell-Phone-Incorporated Personal Assistant System for Independent and Assisted Living
Zheng Zeng, Sammy Yu, Wook Shin, J. Hou. doi:10.1109/ICDCS.2008.99

Advances in networking, sensors, medical devices, and smart phones have made it feasible to monitor people and provide medical and other assistance either in their homes or outside. Aging populations will benefit from reduced costs and improved healthcare through assisted living based on these technologies. However, these systems challenge current state-of-the-art techniques for usability, reliability, and security. In this paper we present the PAS open architecture for assisted living, which allows independently developed third-party components to collaborate. Furthermore, we incorporate cell phones in PAS as the local intelligence in order to enhance robustness and ubiquity. We discuss key technological issues in assisted living systems, such as software architecture layout, power conservation, security, and privacy, and we present results from our pilot study in a real assisted-living facility.
Measurement Manipulation and Space Selection in Network Coordinates
C. Lumezanu, N. Spring. doi:10.1109/ICDCS.2008.27

Internet coordinate systems have emerged as an efficient method to estimate the latency between pairs of nodes without any communication between them. However, most coordinate systems have been evaluated solely on data sets built by their authors from measurements gathered over long periods of time. Although they show good prediction results, it is unclear whether the accuracy is a result of the system design properties or is more connected to the characteristics of the data sets. In this paper, we revisit a simple question: how do the features of the embedding space and the inherent attributes of the data sets interact in producing good embeddings? We adapt the Vivaldi algorithm to embed into hyperbolic space and evaluate both Euclidean and Hyperbolic Vivaldi on seven sets of real-world latencies. Our results show that node filtering and latency distributions can significantly influence the accuracy of the predictions. For example, although Euclidean Vivaldi performs well on data sets that were chosen, constructed, and filtered by the designers of the algorithm, its performance and robustness decrease considerably when run on third-party data sets that were not filtered a priori. Our results offer important insight into designing and building coordinate systems that are both robust and accurate in Internet-like environments.
On Detection of Malicious Users Using Group Testing Techniques
M. Thai, Ying Xuan, Incheol Shin, T. Znati. doi:10.1109/ICDCS.2008.75

Despite decades of research, concrete defense solutions have not been developed for most current attacks on Internet services, let alone for new attack types. An essential problem to overcome is that malicious traffic can closely resemble legitimate traffic. What is needed, therefore, is a more fundamental model based on the overall performance of servers/subnets, without inspecting individual traffic flows. Based on this observation, we propose a novel system framework, called detection of malicious users (DMU), that addresses various attack types. Motivated by DMU, we introduce a new theoretical model, called size-constraint group testing (SCGT). We propose several SCGT-based algorithms for various networking scenarios and provide several fundamental results on SCGT, revealing necessary conditions for obtaining an O(1) detection-time algorithm.
End-to-End Congestion Control for High Speed Networks Based on Population Ecology Models
Xiaomeng Huang, Fengyuan Ren, Guangwen Yang, Yongwei Wu, W. Zhen, Chuang Lin. doi:10.1109/ICDCS.2008.26

Since TCP congestion control is ill-suited for high speed networks, designing a replacement for TCP has become a challenge. To address this problem, we extend population ecology theory to design a novel congestion control algorithm. We treat network flows as species in nature, the throughput of the flows as the population size, and the bottleneck bandwidth as the food resources. We then use the key idea behind population ecology models to develop a novel congestion control model, and implement the corresponding measurement-based end-to-end transport protocol, which we call Population Ecology TCP (PE-TCP). Theoretical analysis and simulation results validate that PE-TCP achieves high utilization, fast convergence, fair bandwidth allocation, and near-zero packet drops. These qualities are desirable for high speed networks.
Relative Network Positioning via CDN Redirections
Ao-Jan Su, D. Choffnes, F. Bustamante, A. Kuzmanovic. doi:10.1109/ICDCS.2008.54

Many large-scale distributed systems can benefit from a service that allows them to select among alternative nodes based on their relative network positions. A variety of approaches propose new measurement infrastructures that attempt to scale this service to large numbers of nodes by reducing the number of direct measurements to end hosts. In this paper, we introduce a new approach to relative network positioning that eliminates direct probing by leveraging pre-existing infrastructure. Specifically, we exploit the dynamic association of nodes with replica servers from large content distribution networks (CDNs) to determine relative position information; we call this approach CDN-based relative network positioning (CRP). We demonstrate how CRP can support two common examples of location information used by distributed applications: server selection and dynamic node clustering. After describing CRP in detail, we present results from an extensive wide-area evaluation that demonstrates its effectiveness.
PFC: Transparent Optimization of Existing Prefetching Strategies for Multi-Level Storage Systems
Zhe Zhang, Kyuhyung Lee, Xiaosong Ma, Yuanyuan Zhou. doi:10.1109/ICDCS.2008.89

The multi-level storage architecture has been widely adopted in servers and data centers. However, while prefetching has been shown to be a crucial technique for exploiting the access sequentiality common in such systems and for hiding the increasing relative cost of disk I/O, existing multi-level storage studies have focused mostly on cache replacement strategies. In this paper, we show that prefetching algorithms designed for single-level systems may have their limitations magnified when applied to multi-level systems. Overly conservative prefetching cannot effectively use the lower-level cache space, while overly aggressive prefetching is compounded across levels and generates large amounts of wasted prefetch. We take an innovative approach to this problem: rather than designing a new multi-level prefetching algorithm, we developed the prefetching coordinator (PFC), a hierarchy-aware optimization applicable to any existing prefetching algorithm. PFC requires no application hints, no a priori knowledge of the application access pattern or the native prefetching algorithm, and no modification to the I/O interface. Instead, it monitors the upper-level access patterns as well as the lower-level cache status, and dynamically adjusts the aggressiveness of the lower-level prefetching activities. We evaluated PFC in an extensive simulation study using a verified multi-level storage simulator, an accurate disk simulator, and access traces with different access patterns. Our results indicate that PFC dynamically controls lower-level prefetching in reaction to multiple system and workload parameters, improving the overall system performance in all 96 test cases. Working with four well-known prefetching algorithms adopted in real systems, PFC achieves an improvement of up to 35% in average request response time, with an average improvement of 14.6% over all cases.